ID A0BKI6_PARTE Unreviewed; 435 AA. AC A0BKI6; DT 28-NOV-2006, integrated into UniProtKB/TrEMBL. DT 28-NOV-2006, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAK59053.1}; GN ORFNames=GSPATT00029684001 {ECO:0000313|EMBL:CAK59053.1}; OS Paramecium tetraurelia. OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium. OX NCBI_TaxID=5888 {ECO:0000313|EMBL:CAK59053.1, ECO:0000313|Proteomes:UP000000600}; RN [1] {ECO:0000313|EMBL:CAK59053.1, ECO:0000313|Proteomes:UP000000600} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Stock d4-2 {ECO:0000313|EMBL:CAK59053.1, RC ECO:0000313|Proteomes:UP000000600}; RX PubMed=17086204; DOI=10.1038/nature05230; RG Genoscope; RA Aury J.-M., Jaillon O., Duret L., Noel B., Jubin C., Porcel B.M., RA Segurens B., Daubin V., Anthouard V., Aiach N., Arnaiz O., Billaut A., RA Beisson J., Blanc I., Bouhouche K., Camara F., Duharcourt S., RA Guigo R., Gogendeau D., Katinka M., Keller A.-M., Kissmehl R., RA Klotz C., Koll F., Le Mouel A., Lepere G., Malinsky S., Nowacki M., RA Nowak J.K., Plattner H., Poulain J., Ruiz F., Serrano V., Zagulski M., RA Dessen P., Betermier M., Weissenbach J., Scarpelli C., Schaechter V., RA Sperling L., Meyer E., Cohen J., Wincker P.; RT "Global trends of whole-genome duplications revealed by the ciliate RT Paramecium tetraurelia."; RL Nature 444:171-178(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CT868000; CAK59053.1; -; Genomic_DNA. DR RefSeq; XP_001426451.1; XM_001426414.1. DR STRING; 412030.XP_001426451.1; -. DR EnsemblProtists; CAK59053; CAK59053; GSPATT00029684001. DR GeneID; 5012235; -. DR KEGG; ptm:GSPATT00029684001; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A0BKI6; -. DR Proteomes; UP000000600; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000600}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000600}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 351 375 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 234 254 {ECO:0000256|SAM:Coils}. FT COILED 312 332 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 435 AA; 50937 MW; 60D3F2F2A85A3BDA CRC64; MQTYHTIKTF INHQEKLSDQ LSFYDQFLLM AIQSISLNWI NTITNIDNWF ENIWSSSQTK ITQKNPPSAQ NFASQFGGAI ILTKSSALKQ VDNVLVDSVE VYMITECNQK MCFLLYVQKK RFLQKQQHLQ IKSCIPQLLK TFKQVYGSVV YPTRVWELLG NFYAEDINEW QIFNLDQRFL RYLKILIVDF HNAEFHCTLT QIRVFGKTVI GDLIDSHKRD KVIEPETKTK NLTQEQQEIK LNEVSEEEDR SKNDTCSVVD YFYSNQISKR IKNQYINVLP YESRQSLFKV TAQNILILSH NVELFKNEIN QIKHLDIQYQ NEQDQIKAFQ QQLITSISEQ KLINERLESE LYFINLKLLT MFIILVGITS ILLFISFCNQ NKQSSIQQQR VSIKAQSAIF KSQPELMTPK FSENNANTNT TKQTKNSNGK SKKSH // ID A0BT48_PARTE Unreviewed; 451 AA. AC A0BT48; DT 28-NOV-2006, integrated into UniProtKB/TrEMBL. DT 28-NOV-2006, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAK61715.1}; GN ORFNames=GSPATT00031947001 {ECO:0000313|EMBL:CAK61715.1}; OS Paramecium tetraurelia. OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium. OX NCBI_TaxID=5888 {ECO:0000313|EMBL:CAK61715.1, ECO:0000313|Proteomes:UP000000600}; RN [1] {ECO:0000313|EMBL:CAK61715.1, ECO:0000313|Proteomes:UP000000600} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Stock d4-2 {ECO:0000313|EMBL:CAK61715.1, RC ECO:0000313|Proteomes:UP000000600}; RX PubMed=17086204; DOI=10.1038/nature05230; RG Genoscope; RA Aury J.-M., Jaillon O., Duret L., Noel B., Jubin C., Porcel B.M., RA Segurens B., Daubin V., Anthouard V., Aiach N., Arnaiz O., Billaut A., RA Beisson J., Blanc I., Bouhouche K., Camara F., Duharcourt S., RA Guigo R., Gogendeau D., Katinka M., Keller A.-M., Kissmehl R., RA Klotz C., Koll F., Le Mouel A., Lepere G., Malinsky S., Nowacki M., RA Nowak J.K., Plattner H., Poulain J., Ruiz F., Serrano V., Zagulski M., RA Dessen P., Betermier M., Weissenbach J., Scarpelli C., Schaechter V., RA Sperling L., Meyer E., Cohen J., Wincker P.; RT "Global trends of whole-genome duplications revealed by the ciliate RT Paramecium tetraurelia."; RL Nature 444:171-178(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CT868015; CAK61715.1; -; Genomic_DNA. DR RefSeq; XP_001429113.1; XM_001429076.1. DR STRING; 412030.XP_001429113.1; -. DR EnsemblProtists; CAK61715; CAK61715; GSPATT00031947001. DR GeneID; 5014897; -. DR KEGG; ptm:GSPATT00031947001; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A0BT48; -. DR Proteomes; UP000000600; Partially assembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000600}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000600}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 360 383 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 301 332 {ECO:0000256|SAM:Coils}. FT COILED 336 356 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 451 AA; 53217 MW; D6356B6CA0027AF3 CRC64; MEDKSMIYFF STAMKMIDSN LRTAYYTLED TVTSLSNLPA LLYNRVIGDS RIQVKSIPSQ INFANYFGGA TILTKSKQLV GVDNILVDNQ ETYMITECNQ EKLFFIICLK EEIQLETIYF INKEFYSSTI KNFKVFGSIV QPTQAWDFLQ AFESEDINDW QSFEFESHFL RYLKIEIIDF HQAEYHCTLT QIRYLGYDIK FRVFGQTVIG DLIQSHKRYK LQLPKIEPIK EEIKQPQRIK MPCSEIINNY SSTNNSTCSY FDYIFDIQQS RADEIENQNQ YLDVIPFESN QSLFKVTAQN IVILSHNLQL LKEQLQSIRN QKMKEMENQK QHDYIQQQLL YELSQQQNIN KNLEEQIKKL YFLTDICVIL ICIILFIGFC WILRKQQNHY NNNRNQSIVI ENKKNVPDIT TLTPILVNGY SINNDENCHH IVKSKSNNNN QKNKKKSNFV N // ID A0BZC4_PARTE Unreviewed; 466 AA. AC A0BZC4; DT 28-NOV-2006, integrated into UniProtKB/TrEMBL. DT 28-NOV-2006, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAK63891.1}; GN ORFNames=GSPATT00033744001 {ECO:0000313|EMBL:CAK63891.1}; OS Paramecium tetraurelia. OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium. OX NCBI_TaxID=5888 {ECO:0000313|EMBL:CAK63891.1, ECO:0000313|Proteomes:UP000000600}; RN [1] {ECO:0000313|EMBL:CAK63891.1, ECO:0000313|Proteomes:UP000000600} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Stock d4-2 {ECO:0000313|EMBL:CAK63891.1, RC ECO:0000313|Proteomes:UP000000600}; RX PubMed=17086204; DOI=10.1038/nature05230; RG Genoscope; RA Aury J.-M., Jaillon O., Duret L., Noel B., Jubin C., Porcel B.M., RA Segurens B., Daubin V., Anthouard V., Aiach N., Arnaiz O., Billaut A., RA Beisson J., Blanc I., Bouhouche K., Camara F., Duharcourt S., RA Guigo R., Gogendeau D., Katinka M., Keller A.-M., Kissmehl R., RA Klotz C., Koll F., Le Mouel A., Lepere G., Malinsky S., Nowacki M., RA Nowak J.K., Plattner H., Poulain J., Ruiz F., Serrano V., Zagulski M., RA Dessen P., Betermier M., Weissenbach J., Scarpelli C., Schaechter V., RA Sperling L., Meyer E., Cohen J., Wincker P.; RT "Global trends of whole-genome duplications revealed by the ciliate RT Paramecium tetraurelia."; RL Nature 444:171-178(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CT868029; CAK63891.1; -; Genomic_DNA. DR RefSeq; XP_001431289.1; XM_001431252.1. DR STRING; 412030.XP_001431289.1; -. DR EnsemblProtists; CAK63891; CAK63891; GSPATT00033744001. DR GeneID; 5017073; -. DR KEGG; ptm:GSPATT00033744001; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A0BZC4; -. DR Proteomes; UP000000600; Partially assembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000600}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000600}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 406 426 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 344 375 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 466 AA; 55587 MW; 50FBF46ED0627AA2 CRC64; MKYYYAFFIV LVFVYFKRED IQKLSHIDHI EDTAEKNKQL AEFINGRQIH DLILFYAVKM IDSNLQAAYN TLEETVSSLQ KLSVLLFNRI TRESRVQVKS IPSQINFANY FGGATILTKS KQLVGVDNIL VDNQETYMIT ECNQQKLYII ICLKEEIQLE SIYFINKEFY SSTMKNFRVF GSVVYPTESW DFLQAFESED INEWQNFEFE SHFLRYLKIE IIDFHQAEYH CTLTQIRFKN YKCFRVFGQT VIGDLIQSHK RYKLQLPKIE PIREEVKKPQ RIKMPCNEII TNYANSNNST CSYFNYLFDV QQSNIDEIKS QNEFLDVIPF ESNQSLFKVT AQNIVILSHN LQLLKEQLQS INNNKLKEMQ NQKQNDDIQQ QLLFEFSQQQ NININLERQI YQLNVFTRYC IIGILILFIG LCSVIIKMQY HSRNIRHQQI DVQNQKDIQR YYYSDSNFGE WVFNEQ // ID A0E2N0_PARTE Unreviewed; 331 AA. AC A0E2N0; DT 28-NOV-2006, integrated into UniProtKB/TrEMBL. DT 28-NOV-2006, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAK89547.1}; GN ORFNames=GSPATT00022719001 {ECO:0000313|EMBL:CAK89547.1}; OS Paramecium tetraurelia. OC Eukaryota; Alveolata; Ciliophora; Intramacronucleata; OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium. OX NCBI_TaxID=5888 {ECO:0000313|EMBL:CAK89547.1, ECO:0000313|Proteomes:UP000000600}; RN [1] {ECO:0000313|EMBL:CAK89547.1, ECO:0000313|Proteomes:UP000000600} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Stock d4-2 {ECO:0000313|EMBL:CAK89547.1, RC ECO:0000313|Proteomes:UP000000600}; RX PubMed=17086204; DOI=10.1038/nature05230; RG Genoscope; RA Aury J.-M., Jaillon O., Duret L., Noel B., Jubin C., Porcel B.M., RA Segurens B., Daubin V., Anthouard V., Aiach N., Arnaiz O., Billaut A., RA Beisson J., Blanc I., Bouhouche K., Camara F., Duharcourt S., RA Guigo R., Gogendeau D., Katinka M., Keller A.-M., Kissmehl R., RA Klotz C., Koll F., Le Mouel A., Lepere G., Malinsky S., Nowacki M., RA Nowak J.K., Plattner H., Poulain J., Ruiz F., Serrano V., Zagulski M., RA Dessen P., Betermier M., Weissenbach J., Scarpelli C., Schaechter V., RA Sperling L., Meyer E., Cohen J., Wincker P.; RT "Global trends of whole-genome duplications revealed by the ciliate RT Paramecium tetraurelia."; RL Nature 444:171-178(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CT868655; CAK89547.1; -; Genomic_DNA. DR RefSeq; XP_001456944.1; XM_001456907.1. DR STRING; 412030.XP_001456944.1; -. DR EnsemblProtists; CAK89547; CAK89547; GSPATT00022719001. DR GeneID; 5042729; -. DR KEGG; ptm:GSPATT00022719001; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A0E2N0; -. DR Proteomes; UP000000600; Partially assembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000600}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000600}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 250 271 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 220 247 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 331 AA; 38977 MW; 52AA059C792B1D74 CRC64; MCILYVCLKE EISLETITFI NKELYSSTIK NFQVYGSVVY PTKEWELLGN FYAEDSNEWQ IFNLEQRFLR YLKIHILDFH SAEFHCTLTQ IRVFGQTVIG DLIDSHKRDQ KVTKPESKTA NSTQEQQEIN LKEVSEEEDR SKIDTCSVVD YFYHTQPTKR VETQYINVLP YESRQSLFKV TAQNILILSH NVELFKNEIN QIKQQDLQHL QEQQEIRTFQ QQLMTSIQEQ KQQNIKLQDE LDFLNKKLSI ILFIIFMFIL LLGVAIILFL MNCCNQNKEQ KLEQPRASAK THSIIVKSYP ELLTHQLIEN DATRKHISQV KTSNGKTKKS N // ID A0NF12_ANOGA Unreviewed; 1200 AA. AC A0NF12; DT 09-JAN-2007, integrated into UniProtKB/TrEMBL. DT 23-OCT-2007, sequence version 2. DT 11-NOV-2015, entry version 48. DE SubName: Full=AGAP008473-PA {ECO:0000313|EMBL:EAU76307.2}; DE Flags: Fragment; GN ORFNames=AgaP_AGAP008473 {ECO:0000313|EMBL:EAU76307.2}; OS Anopheles gambiae (African malaria mosquito). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Anophelinae; Anopheles. OX NCBI_TaxID=7165 {ECO:0000313|Proteomes:UP000007062}; RN [1] {ECO:0000313|EMBL:EAU76307.2, ECO:0000313|Proteomes:UP000007062} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PEST {ECO:0000313|EMBL:EAU76307.2, RC ECO:0000313|Proteomes:UP000007062}; RX PubMed=12364791; DOI=10.1126/science.1076181; RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R., RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R., RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., RA Lai Z., Kraft C.L., Abril J.F., Anthouard V., Arensburger P., RA Atkinson P.W., Baden H., de Berardinis V., Baldwin D., Benes V., RA Biedler J., Blass C., Bolanos R., Boscus D., Barnstead M., Cai S., RA Center A., Chaturverdi K., Christophides G.K., Chrystal M.A.M., RA Clamp M., Cravchik A., Curwen V., Dana A., Delcher A., Dew I., RA Evans C.A., Flanigan M., Grundschober-Freimoser A., Friedli L., Gu Z., RA Guan P., Guigo R., Hillenmeyer M.E., Hladun S.L., Hogan J.R., RA Hong Y.S., Hoover J., Jaillon O., Ke Z., Kodira C.D., Kokoza E., RA Koutsos A., Letunic I., Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., RA Lopez J.R., Malek J.A., McIntosh T.C., Meister S., Miller J.R., RA Mobarry C., Mongin E., Murphy S.D., O'Brochta D.A., Pfannkoch C., RA Qi R., Regier M.A., Remington K., Shao H., Sharakhova M.V., RA Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J., Thomasova D., RA Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B., Wang A.H., RA Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., Yao A., RA Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I., RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., RA Mural R.J., Myers E.W., Adams M.D., Smith H.O., Broder S., RA Gardner M.J., Fraser C.M., Birney E., Bork P., Brey P.T., Venter J.C., RA Weissenbach J., Kafatos F.C., Collins F.H., Hoffman S.L.; RT "The genome sequence of the malaria mosquito Anopheles gambiae."; RL Science 298:129-149(2002). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAU76307.2}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAAB01008964; EAU76307.2; -; Genomic_DNA. DR RefSeq; XP_001237874.2; XM_001237873.2. DR ProteinModelPortal; A0NF12; -. DR GeneID; 4577947; -. DR KEGG; aga:AgaP_AGAP008473; -. DR VectorBase; AGAP008473; Anopheles gambiae. DR HOGENOM; HOG000044781; -. DR InParanoid; A0NF12; -. DR OMA; FEAFETD; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; A0NF12; -. DR Proteomes; UP000007062; Chromosome 3R. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007062}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007062}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 1200 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002627577. FT TRANSMEM 978 997 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 820 840 {ECO:0000256|SAM:Coils}. FT COILED 926 960 {ECO:0000256|SAM:Coils}. FT NON_TER 1200 1200 {ECO:0000313|EMBL:EAU76307.2}. SQ SEQUENCE 1200 AA; 129687 MW; 511982DA83F52E66 CRC64; MKPSLCYTYC TLLLVSLVSS CTLFLIVASE KLKPTTDDQL QQHGAGTREP LDGSNLVQQH TDGAGERLPS NGGVNEAKEA FPHKRVLSSG NGTPPGHVRS KNGPGEEGNG AKPLKRPAPG RAGGRNTLKK DILTAKPHQQ QPQEEPPDLK DVIFLETISK NVGVTNSGLQ DIPGPPSIIT LVDNQHHELN NIESKLEENL EATLKNLSHN NVHTVQTTVP LHGNDHGQLA GEEEAPANGT TAGPASSSTP SGEPAATQET TAEATLTDAT APTVLPPSKQ LEVNLTEENP MPVFSEWAQK QMAEAEKKLG EVVNASAMKK GTKPAGSKAG TGSMKLRAKN YAAPECGAKI IASNPEAQST GSVLTAPKDE YLLNPCTSKI WFVVELCEPV QAERIELANF ELFSSSPKEF SVSVSNRFPT RDWANVGQFT AKDERDVQSF LLHPHLFGKF VRVEILSHYN QEHFCPVSLF RVYGTSEFEA FETDNTPLQP DEDDDDDELV ARDGQTVEPQ QGVDATTGAP KGGKNPNNIL KSAGEVVMNM VKKAAEALGK SGNESNDAAT PGSHPSPGGK IPLTDVLTGL PSPASPSCVT LAYTIRCVNC TDHFRARLES LMNCKHNLLT SLLGVARIEH AFDSAQHFLC ANVLGFNLPR SLQGESEGAG ELVTVYPSVC LNMRYSVLNL LPDELVAGLC NTVAFDLKLL AKVEASDAEG REIDGSVVAG VDGPSQNALD LPQGEHTKQG EHHQQHQVEQ TLDVETNAQN GNEKGEQQEQ NAVQEDSEKG SPPGDAAIDP SGSENASKED VNMFATEPTP PAATPAEPER DEASQNVDGQ KEQEQQQQQE DFANNQHWET IDDQSEMAGT GPSGSGAFTT GPAATTVTPG MATGQQKVQP ESVFLRLSNR IKALERNMSL SGQYLEELSR RYRKQVEELQ HSYAKTLHEI QEQNQRMADS ETKLREVNER LRQELVDFQA TATDWRNIAL AVLTMIVLML FTLLAMVRSV ARSVNRLTAG HPVIDRELAQ VGSDAKPIRG RMLRRKSIDG MPTSVATSLS PGRLRKKRPS EEALNISGTY VNLLIDDDAR EVGVPPQSSV PPLAMERKKS KQKHRKVSAP SMMGVISSST PLATGQGVAF SSHEPAKRSL SMIEARQMVN GGGQQTPEKD TSSRIDELPY LEDNDEFIIP TASDLSYDEY MPGSNASTAE // ID A1C9Z0_ASPCL Unreviewed; 847 AA. AC A1C9Z0; DT 23-JAN-2007, integrated into UniProtKB/TrEMBL. DT 23-JAN-2007, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAW12558.1}; GN ORFNames=ACLA_009810 {ECO:0000313|EMBL:EAW12558.1}; OS Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 / DSM 816 / NCTC OS 3887 / NRRL 1). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=344612 {ECO:0000313|Proteomes:UP000006701}; RN [1] {ECO:0000313|Proteomes:UP000006701} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1 RC {ECO:0000313|Proteomes:UP000006701}; RX PubMed=18404212; DOI=10.1371/journal.pgen.1000046; RA Fedorova N.D., Khaldi N., Joardar V.S., Maiti R., Amedeo P., RA Anderson M.J., Crabtree J., Silva J.C., Badger J.H., Albarraq A., RA Angiuoli S., Bussey H., Bowyer P., Cotty P.J., Dyer P.S., Egan A., RA Galens K., Fraser-Liggett C.M., Haas B.J., Inman J.M., Kent R., RA Lemieux S., Malavazi I., Orvis J., Roemer T., Ronning C.M., RA Sundaram J.P., Sutton G., Turner G., Venter J.C., White O.R., RA Whitty B.R., Youngman P., Wolfe K.H., Goldman G.H., Wortman J.R., RA Jiang B., Denning D.W., Nierman W.C.; RT "Genomic islands in the pathogenic filamentous fungus Aspergillus RT fumigatus."; RL PLoS Genet. 4:E1000046-E1000046(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS027049; EAW12558.1; -; Genomic_DNA. DR RefSeq; XP_001273984.1; XM_001273983.1. DR EnsemblFungi; CADACLAT00001317; CADACLAP00001298; CADACLAG00001317. DR GeneID; 4706444; -. DR KEGG; act:ACLA_009810; -. DR EuPathDB; FungiDB:ACLA_009810; -. DR HOGENOM; HOG000172520; -. DR OMA; RNTREVQ; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000006701; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006701}; KW Reference proteome {ECO:0000313|Proteomes:UP000006701}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 28 {ECO:0000256|SAM:SignalP}. FT CHAIN 29 847 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002633022. SQ SEQUENCE 847 AA; 91787 MW; 344B4C03A5063E62 CRC64; MIRDRCAGAF LLIHLTILFF IAIGGVAGKS SSSQPVCLAR TWRDTEADFI RWPVCVESRW SKTTAIVPGS PATVSTVSDA VSTPSGGTAS SSASQEQEPE HVHDPELDTD SPLDNVQFLS FEDWKKQNLA KAGQSAENVG GSRRGGVAGA EDRHRPTGIS NALDSLGEDA EIELDFGGFG ADAPEATRPP PFGAAGVQKG DRLGAIGSGG GAEASAPSPA VLRAGTMRRK DAGTTCKERF NYASFDCAAT VLKTNPECQG SSSVLIENKD SYMLNECRAS NKFLILELCD DILVDTVVLA NYEFFSSIFH TFRISVSDRY PAKMDQWREL GVYEARNTRE VQAFAVENPL IWARYLKIEF LTHYGNEFYC PLSLIRVHGT TMLEEYKHDG EAARGDDEMV EESLEPGQVT DDVESAKPLA TPSIDPGVVK EPEVLARGSC LNPAKAVEAL LTMGHPDKEV CGQDEVPVDT AGHNGATFAA ENGSLPRDPR PSHPDVPAAN EPSLNASMQE TADARRAAGQ PGTDAVPSPS STTSLSETAQ QDVTFEADQR ATAPPQEEHT PPSESTKTTV AQPPSPTPPT QESFFKSVNK RLQMLESNST LSLLYIEEQS RILRDAFSKV EKRQLAKTST FLETLNVTVL NELRDFRDQY DQVWKTVALE FENQRIQYHQ ELFSLSAQLG VLADELVFQK RVAVIQSIMV LFCFGLVLFS RGAVSSYIEL PSVQNMVSRS YSLRSSSPPF RFGSPSASPG STRPASSYHG GHRRNASEDS QDSPPSPPIA YAPPTPTSET SSPLESIEKR EPSPSPNTLE MPEIEPPHLR SQSSPPVLKT EEEDGDGNSA SSESMGS // ID A1CBP9_ASPCL Unreviewed; 721 AA. AC A1CBP9; DT 23-JAN-2007, integrated into UniProtKB/TrEMBL. DT 23-JAN-2007, sequence version 1. DT 11-NOV-2015, entry version 27. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAW13167.1}; GN ORFNames=ACLA_016130 {ECO:0000313|EMBL:EAW13167.1}; OS Aspergillus clavatus (strain ATCC 1007 / CBS 513.65 / DSM 816 / NCTC OS 3887 / NRRL 1). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=344612 {ECO:0000313|Proteomes:UP000006701}; RN [1] {ECO:0000313|Proteomes:UP000006701} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 1007 / CBS 513.65 / DSM 816 / NCTC 3887 / NRRL 1 RC {ECO:0000313|Proteomes:UP000006701}; RX PubMed=18404212; DOI=10.1371/journal.pgen.1000046; RA Fedorova N.D., Khaldi N., Joardar V.S., Maiti R., Amedeo P., RA Anderson M.J., Crabtree J., Silva J.C., Badger J.H., Albarraq A., RA Angiuoli S., Bussey H., Bowyer P., Cotty P.J., Dyer P.S., Egan A., RA Galens K., Fraser-Liggett C.M., Haas B.J., Inman J.M., Kent R., RA Lemieux S., Malavazi I., Orvis J., Roemer T., Ronning C.M., RA Sundaram J.P., Sutton G., Turner G., Venter J.C., White O.R., RA Whitty B.R., Youngman P., Wolfe K.H., Goldman G.H., Wortman J.R., RA Jiang B., Denning D.W., Nierman W.C.; RT "Genomic islands in the pathogenic filamentous fungus Aspergillus RT fumigatus."; RL PLoS Genet. 4:E1000046-E1000046(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS027049; EAW13167.1; -; Genomic_DNA. DR RefSeq; XP_001274593.1; XM_001274592.1. DR EnsemblFungi; CADACLAT00001034; CADACLAP00001015; CADACLAG00001034. DR GeneID; 4706556; -. DR KEGG; act:ACLA_016130; -. DR EuPathDB; FungiDB:ACLA_016130; -. DR HOGENOM; HOG000176993; -. DR OMA; CWCSAPR; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000006701; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006701}; KW Reference proteome {ECO:0000313|Proteomes:UP000006701}. SQ SEQUENCE 721 AA; 80693 MW; EF937F234F6E5F1C CRC64; MPPKRAGTRR AGATARSDVS IILGHSSPAV TNTPLPDVPT QPSWAYGSPA PAVLPRRLTA KQMGLAEVAE SIDQTIREAQ KRDQHNNPDE SNEGSDRPHM KTRLRGRSMA ANQSPVRRRA KREPTPDQVQ LLEGLREATL SPNRGYREPQ DQFERSTATP TPPIPHTLST ASSPTSQLLA DPKYPSLPAG QLYPSPLQRV GSPARNDMPL ENSTQSAGFD DNESVISWMV ERDVHDDDLQ RTSSNRYRNE PQGRNITAPP RRFSGLAFAN ETIHEEDEPD SRLSLSKARS REPTVESQAR SEPRPDLNPS LEPQQPEPEV STAPARTIIP HSFTKETSFQ DSTVPLSEQS FTNEARSIPT ARFIPNFSFG HPGKQALRIV GLMILTTLSL FTLYSFSDRI VELSRDIISW SPFHSQTPFI PLNTSDYEVV NHLNSQMVKL GAQVSSMSKE LKTIKSEVHD VAAPTTVLEP VRTLPKKPNF LNINMGIIVD QHMTSPTIED RRDHSDDPRP EENGPLAALL PWDDHGDCWC SAPRNGISQL ALHLARGIVP EDVVVEHIPK HAAITPETAP KDMELWVRYT VDINTDSDLP SEAGSASWYS RHLDKLLSSF SSKSEAFEKE YQSPMLAGRF SLHDYIIGIL RPAYSKEPET AYWNDTLLGP SFYRVSKWRY NIHGAHHIQE NSLDVIIDQP DIRVDQVVFR VKSNWGANYT CLYRLKLHGH L // ID A1CZC2_NEOFI Unreviewed; 844 AA. AC A1CZC2; DT 23-JAN-2007, integrated into UniProtKB/TrEMBL. DT 23-JAN-2007, sequence version 1. DT 11-NOV-2015, entry version 36. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAW24092.1}; GN ORFNames=NFIA_036640 {ECO:0000313|EMBL:EAW24092.1}; OS Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL OS 181) (Aspergillus fischerianus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Neosartorya. OX NCBI_TaxID=331117 {ECO:0000313|Proteomes:UP000006702}; RN [1] {ECO:0000313|Proteomes:UP000006702} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181 RC {ECO:0000313|Proteomes:UP000006702}; RX PubMed=18404212; DOI=10.1371/journal.pgen.1000046; RA Fedorova N.D., Khaldi N., Joardar V.S., Maiti R., Amedeo P., RA Anderson M.J., Crabtree J., Silva J.C., Badger J.H., Albarraq A., RA Angiuoli S., Bussey H., Bowyer P., Cotty P.J., Dyer P.S., Egan A., RA Galens K., Fraser-Liggett C.M., Haas B.J., Inman J.M., Kent R., RA Lemieux S., Malavazi I., Orvis J., Roemer T., Ronning C.M., RA Sundaram J.P., Sutton G., Turner G., Venter J.C., White O.R., RA Whitty B.R., Youngman P., Wolfe K.H., Goldman G.H., Wortman J.R., RA Jiang B., Denning D.W., Nierman W.C.; RT "Genomic islands in the pathogenic filamentous fungus Aspergillus RT fumigatus."; RL PLoS Genet. 4:E1000046-E1000046(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS027686; EAW24092.1; -; Genomic_DNA. DR RefSeq; XP_001265989.1; XM_001265988.1. DR EnsemblFungi; CADNFIAT00003780; CADNFIAP00003691; CADNFIAG00003780. DR GeneID; 4592279; -. DR KEGG; nfi:NFIA_036640; -. DR EuPathDB; FungiDB:NFIA_036640; -. DR HOGENOM; HOG000172520; -. DR OMA; RNTREVQ; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000006702; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006702}; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 844 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002633307. FT TRANSMEM 691 708 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 844 AA; 91840 MW; B6503266406DE711 CRC64; MMLWGRCAGA SLIHLAALAL LTAGQAAREH SQPLCLARGW RDTEAEFIRW PVCIETRWSR SGTTVTGGPS VTVSTISDNS SPTSTASHAA QQQHPGQEQE QEQDTDSPLD NAKFLSFEDW KKQNLAKVGQ SAENVGGNRR GGVTGNESRR RPTGISNALD SLGEDAEIEL DFGGFGADAP EAARPPSFGS GVQVGESAGS VDSGLGGDGS APSPGVIRSG SSRRKDAGTT CKERFNYASF DCAATVLKTN PECQGSSSVL IENKDSYMLN ECRAKNKFLI LELCDDILVD TVVLANYEFF SSIFHTFRVS VSDRYPAKPD QWRELGVFGA RNTREVQAFA VENPLIWARY LKIEFLTHYG NEFYCPLSLI RVHGTTMLEE YKHDGEASRV DDEIVDETLE PDHAVTDAEI AESSENSSDV GAETCEGMGR QLQDGLQDTS PNPAQGLERL LANYLDSETC SAQATPTRLA GQERADAAVQ HDSPSTDTTP PGPEDSVPIV PGAGNGTKFA PDARRSAGQS GVDGISSPAS TAIMSEPVQH DTTSEADQKS TASSQEEQVP PVDSAKFSAT QPPSPNPTTQ ESFFKSVNKR LQMLESNSTL SLLYIEEQSR ILRDAFSKVE KRQLSKTSTF LENLNVTVLN ELRQFREQYD QVWKTVALEF ETQRIQYHQE IFSLSAQLGV LADELVFQKR VAVIQSIMVL FCFGLVLFSR GAVSSYMEFP SVQNMVSRSY SLRSSSPPFS SPSMSPSSTR PTSSYRSRHR RNITDDTQDS APSPTIAYSP PTPTSETSVP LESIKKRESS PSPGDLELPD IEPPQFRSQS SPPVLKSGED SDDEDSEASG SMEV // ID A1DDL5_NEOFI Unreviewed; 737 AA. AC A1DDL5; DT 23-JAN-2007, integrated into UniProtKB/TrEMBL. DT 23-JAN-2007, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAW17472.1}; GN ORFNames=NFIA_073880 {ECO:0000313|EMBL:EAW17472.1}; OS Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL OS 181) (Aspergillus fischerianus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Neosartorya. OX NCBI_TaxID=331117 {ECO:0000313|Proteomes:UP000006702}; RN [1] {ECO:0000313|Proteomes:UP000006702} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 1020 / DSM 3700 / FGSC A1164 / NRRL 181 RC {ECO:0000313|Proteomes:UP000006702}; RX PubMed=18404212; DOI=10.1371/journal.pgen.1000046; RA Fedorova N.D., Khaldi N., Joardar V.S., Maiti R., Amedeo P., RA Anderson M.J., Crabtree J., Silva J.C., Badger J.H., Albarraq A., RA Angiuoli S., Bussey H., Bowyer P., Cotty P.J., Dyer P.S., Egan A., RA Galens K., Fraser-Liggett C.M., Haas B.J., Inman J.M., Kent R., RA Lemieux S., Malavazi I., Orvis J., Roemer T., Ronning C.M., RA Sundaram J.P., Sutton G., Turner G., Venter J.C., White O.R., RA Whitty B.R., Youngman P., Wolfe K.H., Goldman G.H., Wortman J.R., RA Jiang B., Denning D.W., Nierman W.C.; RT "Genomic islands in the pathogenic filamentous fungus Aspergillus RT fumigatus."; RL PLoS Genet. 4:E1000046-E1000046(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS027696; EAW17472.1; -; Genomic_DNA. DR RefSeq; XP_001259369.1; XM_001259368.1. DR EnsemblFungi; CADNFIAT00006703; CADNFIAP00006536; CADNFIAG00006703. DR GeneID; 4586025; -. DR KEGG; nfi:NFIA_073880; -. DR EuPathDB; FungiDB:NFIA_073880; -. DR HOGENOM; HOG000176993; -. DR OMA; CWCSAPR; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000006702; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006702}. FT COILED 438 458 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 737 AA; 82323 MW; B4C5753E23EC86DC CRC64; MPPKRADTRR AGATARSEAS IIFGHSSPSV SNQPLPDVPT QPSWAYGSPA APVLPRRLVA KNMGLAEVAE SIDQTIRDAE KRDRRNDPDE SNDTDDRPHM NTRSRRRSSA ANASPIRRRT KREPTPDQVQ LLDALRDATV SPNQRNGENE TQAERSTATP TPPIPHTLST MSSPTSQILT EPKYPSLPIE QLYPSPLQRI GSPTRNDVSL EMSQNAGIDD NESVISWMVE RDIHDDDLQR TRSTRYRKEP VGKNITAPPR RFSGLAFANE TIVEEDEPDS RLSVSKTPRE PTVESEAQSD HQTEPDQPLE SPEPQPEPQK EPIPQVEVSS APARTIIPDF FMKEQPFNNS TTQPSDQSFT DHARSTPAAS FIPRVSVSLP WTQILRIAGA ILLTAISLLT IYSFSDRIAN IPHDIASHFP FRNPAPSIPL NTSDIEALNS LNNQVMRLGA QVSSISKELS VVKSEVKNVA GPTTIIEPVK VPKKPNFLSI GAGVLVDPRM TSPTYGEKKS RLPKWLQDRV SDWGEDPRPK PNPPLTALVP WDSVGDCWCS APRNGVSQLA LHLSRPIVPE EVVVEHIPKH ATLNPGAAPR EMELWVQYTI NKSTSGDLPT DAGSAGWYKS YLNWLLSFES EVLETEYQSP MLSERFSLHD YIMGYLRPAY HNEPESAYWN ATTLGPTFYR VGKWKYDIHG QHHVQEFSLD AIIDQPDIRV DQVAFRVNSN WGANFTCFYR LKLYGHL // ID A1Z6Q1_DROME Unreviewed; 881 AA. AC A1Z6Q1; DT 06-FEB-2007, integrated into UniProtKB/TrEMBL. DT 01-APR-2015, sequence version 2. DT 11-NOV-2015, entry version 72. DE SubName: Full=Klaroid, isoform B {ECO:0000313|EMBL:AAF57400.3}; GN Name=koi {ECO:0000313|EMBL:AAF57400.3, GN ECO:0000313|FlyBase:FBgn0265003}; GN ORFNames=CG44154 {ECO:0000313|FlyBase:FBgn0265003}, GN Dmel_CG44154 {ECO:0000313|EMBL:AAF57400.3}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AAF57400.3, ECO:0000313|Proteomes:UP000000803}; RN [1] {ECO:0000313|EMBL:AAF57400.3, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=10731132; DOI=10.1126/science.287.5461.2185; RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., RA Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., RA Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., RA Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; RT "The genome sequence of Drosophila melanogaster."; RL Science 287:2185-2195(2000). RN [2] {ECO:0000313|EMBL:AAF57400.3, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537568; RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A., RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A., RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R., RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J., RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C., RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.; RT "Finishing a whole-genome shotgun: release 3 of the Drosophila RT melanogaster euchromatic genome sequence."; RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002). RN [3] {ECO:0000313|EMBL:AAF57400.3, ECO:0000313|Proteomes:UP000000803} RP GENOME REANNOTATION. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083; RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., RA Lewis S.E.; RT "Annotation of the Drosophila melanogaster euchromatic genome: a RT systematic review."; RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). RN [4] {ECO:0000313|EMBL:AAF57400.3, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537573; RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., RA Ashburner M., Celniker S.E.; RT "The transposable elements of the Drosophila melanogaster euchromatin: RT a genomics perspective."; RL Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002). RN [5] {ECO:0000313|EMBL:AAF57400.3, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537574; RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., RA Karpen G.H.; RT "Heterochromatic sequences in a Drosophila whole-genome shotgun RT assembly."; RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002). RN [6] {ECO:0000313|EMBL:AAF57400.3, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022; RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., RA Ashburner M., Anxolabehere D.; RT "Combined evidence annotation of transposable elements in genome RT sequences."; RL PLoS Comput. Biol. 1:166-175(2005). RN [7] {ECO:0000313|EMBL:AAF57400.3, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569856; DOI=10.1126/science.1139815; RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.; RT "The Release 5.1 annotation of Drosophila melanogaster RT heterochromatin."; RL Science 316:1586-1591(2007). RN [8] {ECO:0000313|EMBL:AAF57400.3, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569867; DOI=10.1126/science.1139816; RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M., RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A., RA Dimitri P., Karpen G.H., Celniker S.E.; RT "Sequence finishing and mapping of Drosophila melanogaster RT heterochromatin."; RL Science 316:1625-1628(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE013599; AAF57400.3; -; Genomic_DNA. DR RefSeq; NP_610240.4; NM_136396.4. DR UniGene; Dm.7280; -. DR ProteinModelPortal; A1Z6Q1; -. DR SMR; A1Z6Q1; 685-878. DR IntAct; A1Z6Q1; 9. DR STRING; 7227.FBpp0292403; -. DR PaxDb; A1Z6Q1; -. DR PRIDE; A1Z6Q1; -. DR GeneID; 35594; -. DR UCSC; CG18584-RA; d. melanogaster. DR CTD; 35594; -. DR FlyBase; FBgn0265003; koi. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A1Z6Q1; -. DR GenomeRNAi; 35594; -. DR NextBio; 794196; -. DR Proteomes; UP000000803; Chromosome 2R. DR Bgee; A1Z6Q1; -. DR ExpressionAtlas; A1Z6Q1; differential. DR Genevisible; A1Z6Q1; DM. DR GO; GO:0034399; C:nuclear periphery; IDA:FlyBase. DR GO; GO:0048471; C:perinuclear region of cytoplasm; IDA:FlyBase. DR GO; GO:0007097; P:nuclear migration; IMP:FlyBase. DR GO; GO:0051647; P:nucleus localization; IMP:FlyBase. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000803}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:A1Z6Q1}; KW Reference proteome {ECO:0000313|Proteomes:UP000000803}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 167 186 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 259 278 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 37 57 {ECO:0000256|SAM:Coils}. FT COILED 501 528 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 881 AA; 100144 MW; E63BD95C6E9B7115 CRC64; MTPDAKRKQN SITATVTSIL TKRSGGATST PRNRSQLETT QNTLNSAQEK LNQSNGNLSS GNVSDYLAYI EYRDAGEYWN KTPKTDYTYS ELSPHRRQLA PGIVAMPNMS RKSLENHNDR VNYMVQQNPA QEEFIRRRYQ SKYTQQVNYD SADELDATFG QQKQSWWLIR LIQLVVSSIT TVWSRVTNLS ATETTAYQNY HAKRQQSQQV GLWWKIVQTI GGGLASLLRY LYVFIGSVLS LDTWLLRSSD AENKSKKRFL IFLLILLPLL LLSGWLLLQE DQRSAYVQRA EALLPLPLSI FGSLRSRFSN AGATLKSWME VPTVRSPQRE AEAIKVNMAS IEQNIQKALT AEEYENILNH VNSYVQQLVE LKMQQHSKEL APQQIELFVK LMKENLKQIM YKTELSEKDL SDLAIKLKLE LQSSGGWQDG AKLSQANLEE ITKLIKAEVH LHESHYTIQL DRIDFASLLE RILAAPALAD FVDARISLRV GELEPKESSG SSDAEVQIER LNREIAFIKL ALSDKQAENA DLHQSISNLK LGQEDLLERI QQHELSQDRR FHGLLAEIEN KLSALNDSQF ALLNKQIKLS LVEILGFKQS TAGGSAGQLD DFDLQTWVRS MFVAKDYLEQ QLLELNKRTN NNIRDEIERS SILLMSDISQ RLKREILLVV EAKHNESTKA LKGHIREEEV RQIVKTVLAI YDADKTGLVD FALESAGGQI LSTRCTESYQ TKSAQISVFG IPLWYPTNTP RVAISPNVQP GECWAFQGFP GFLVLKLNSL VYVTGFTLEH IPKSLSPTGR IESAPRNFTV WGLEQEKDQE PVLFGDYQFE DNGASLQYFA VQNLDIKRPY EIVELRIETN HGHPTYTCLY RFRVHGKPPA T // ID A2DRZ3_TRIVA Unreviewed; 493 AA. AC A2DRZ3; DT 20-FEB-2007, integrated into UniProtKB/TrEMBL. DT 20-FEB-2007, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAY16763.1}; GN ORFNames=TVAG_447210 {ECO:0000313|EMBL:EAY16763.1}; OS Trichomonas vaginalis. OC Eukaryota; Parabasalia; Trichomonadida; Trichomonadidae; Trichomonas. OX NCBI_TaxID=5722 {ECO:0000313|Proteomes:UP000001542}; RN [1] {ECO:0000313|Proteomes:UP000001542} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC PRA-98 / G3 {ECO:0000313|Proteomes:UP000001542}; RX PubMed=17218520; DOI=10.1126/science.1132894; RA Carlton J.M., Hirt R.P., Silva J.C., Delcher A.L., Schatz M., Zhao Q., RA Wortman J.R., Bidwell S.L., Alsmark U.C.M., Besteiro S., RA Sicheritz-Ponten T., Noel C.J., Dacks J.B., Foster P.G., Simillion C., RA Van de Peer Y., Miranda-Saavedra D., Barton G.J., Westrop G.D., RA Mueller S., Dessi D., Fiori P.L., Ren Q., Paulsen I., Zhang H., RA Bastida-Corcuera F.D., Simoes-Barbosa A., Brown M.T., Hayes R.D., RA Mukherjee M., Okumura C.Y., Schneider R., Smith A.J., Vanacova S., RA Villalvazo M., Haas B.J., Pertea M., Feldblyum T.V., Utterback T.R., RA Shu C.L., Osoegawa K., de Jong P.J., Hrdy I., Horvathova L., RA Zubacova Z., Dolezal P., Malik S.B., Logsdon J.M. Jr., Henze K., RA Gupta A., Wang C.C., Dunne R.L., Upcroft J.A., Upcroft P., White O., RA Salzberg S.L., Tang P., Chiu C.-H., Lee Y.-S., Embley T.M., RA Coombs G.H., Mottram J.C., Tachezy J., Fraser-Liggett C.M., RA Johnson P.J.; RT "Draft genome sequence of the sexually transmitted pathogen RT Trichomonas vaginalis."; RL Science 315:207-212(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS113238; EAY16763.1; -; Genomic_DNA. DR RefSeq; XP_001328986.1; XM_001328951.1. DR EnsemblProtists; EAY16763; EAY16763; TVAG_447210. DR GeneID; 4774782; -. DR KEGG; tva:TVAG_447210; -. DR EuPathDB; TrichDB:TVAG_447210; -. DR InParanoid; A2DRZ3; -. DR Proteomes; UP000001542; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR011333; POZ_dom. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF54695; SSF54695; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001542}; KW Reference proteome {ECO:0000313|Proteomes:UP000001542}. SQ SEQUENCE 493 AA; 56342 MW; 7629C51C3AC43B61 CRC64; MTAPELRKLS LESLAYYPFP PNFTFIIGGV SYPCHLVQIL PISGTIQQLF LNDKSISSYT FEHLKDPYNN FPLFIDFING HTIDINDDNL ILLYNIGLIL DIPYLINGAG KLANCEINSE NAVSFCQKYY DHGVDYEIPA TFIAVNWDTL SGLESVMNLP VEILNTIIQI DGFKVSSETE LFEWIENLVN TKGRQYIPLF GHVLFSRLRR HHIKHLIEIL SKDTIDPFVW QELDDRLIME INPDENLEPS DEKNIDANQK PESETSQMTN TQPYSTYGSA LYSQNNFIPI NLKEKANNEK EFADAYSEES LIDLSYEPGY RLNGVVAYIK NQLGPNYTDA VIATGGGTKI KKIGNIFDYD DTKKAWWDNF DLGINRCTKE NAWCMIELRG YLLNLQSYTL ASPANRISFH QPKSWRIEVS ADGVNFETVH EVSKCPEMNV QYPILTFSLP KGETQPIRFI KLVMLENYAS AQSSNQYELS LSAFELYGKL RQL // ID A2R3Z2_ASPNC Unreviewed; 810 AA. AC A2R3Z2; DT 06-MAR-2007, integrated into UniProtKB/TrEMBL. DT 06-MAR-2007, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Aspergillus niger contig An14c0180, genomic contig {ECO:0000313|EMBL:CAK42160.1}; GN ORFNames=An14g05990 {ECO:0000313|EMBL:CAK42160.1}; OS Aspergillus niger (strain CBS 513.88 / FGSC A1513). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=425011 {ECO:0000313|Proteomes:UP000006706}; RN [1] {ECO:0000313|EMBL:CAK42160.1, ECO:0000313|Proteomes:UP000006706} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 513.88 / FGSC A1513 {ECO:0000313|Proteomes:UP000006706}; RX PubMed=17259976; DOI=10.1038/nbt1282; RA Pel H.J., de Winde J.H., Archer D.B., Dyer P.S., Hofmann G., RA Schaap P.J., Turner G., de Vries R.P., Albang R., Albermann K., RA Andersen M.R., Bendtsen J.D., Benen J.A.E., van den Berg M., RA Breestraat S., Caddick M.X., Contreras R., Cornell M., Coutinho P.M., RA Danchin E.G.J., Debets A.J.M., Dekker P., van Dijck P.W.M., RA van Dijk A., Dijkhuizen L., Driessen A.J.M., d'Enfert C., Geysens S., RA Goosen C., Groot G.S.P., de Groot P.W.J., Guillemette T., RA Henrissat B., Herweijer M., van den Hombergh J.P.T.W., RA van den Hondel C.A.M.J.J., van der Heijden R.T.J.M., RA van der Kaaij R.M., Klis F.M., Kools H.J., Kubicek C.P., RA van Kuyk P.A., Lauber J., Lu X., van der Maarel M.J.E.C., RA Meulenberg R., Menke H., Mortimer M.A., Nielsen J., Oliver S.G., RA Olsthoorn M., Pal K., van Peij N.N.M.E., Ram A.F.J., Rinas U., RA Roubos J.A., Sagt C.M.J., Schmoll M., Sun J., Ussery D., Varga J., RA Vervecken W., van de Vondervoort P.J.J., Wedler H., Woesten H.A.B., RA Zeng A.-P., van Ooyen A.J.J., Visser J., Stam H.; RT "Genome sequencing and analysis of the versatile cell factory RT Aspergillus niger CBS 513.88."; RL Nat. Biotechnol. 25:221-231(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM270325; CAK42160.1; -; Genomic_DNA. DR EnsemblFungi; CADANGAT00011536; CADANGAP00011314; CADANGAG00011536. DR HOGENOM; HOG000176993; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000006706; Chromosome 1R. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006706}; KW Reference proteome {ECO:0000313|Proteomes:UP000006706}. SQ SEQUENCE 810 AA; 89724 MW; 1EBE81CD789BC694 CRC64; MRNLVYPKYH VLESLNRPCD AVDRHSTRVF SRRVAFALEG RQLPRIAIDT SELVILVAAI VALASPALDL CLRLQASYSV PSSHTNRNIK PILLRIELPM PARRGATRRA GSTRSDIGSA STYFQSKLGP EARTQALPNL PTKQSFAYGS AETPILPREL KIQPHMDLTE MADAIDKGIE DAKDRQMKEK ETTQDKSRRQ KSPSITRSPV RRSRREPTPD ELQLLDNLRE ATKSPTPVRG NYSNNDQSTA TPTPPIPHTL STASSPAQSL PVPRYPHVPA ENLYPSPMGR FGPQLHDGPP LGSSPLPDDS SLYSFTVERA INSDELTRTL SDGKNIKAPP RRFSGLAFAN EPIHEEEEPD SRLLKTKSRS PSLQPSYEDF QIEPSPEPEP QSEPESVQEL ELEPTPEPEP IPELEPMPEP TPEPEVIREK SPAAQFTAPT KTLIPNAYAR RTPSQEPSVD DGQQNIRQTG QSWSWVGSLS AQLPSVSTVA RILAGIALAA ATVYLVAFGG IPSLSRPPQY IPMDENNMLA VSSLTDQMSR IGAQVSSLAK EMRTVKWDVN EVQSEVRSSP TPIMPPSRGS TDLGPPTEQK TNFLSIGLGV IVIPGLTSPT VGHKLSAWQW AYVNLWRGSH YRPASPPLAA LVPWEDYGDC WCSTPRDGMS QIGIDLGQKI VPEEVAVEHM PKTATLKPEN APREMELWAQ YVLVQKGTSR PARTQAERFS IHKPIMDALR SAWPTEDPTA YSDDPLLGPT YYRVGKFTYD IHGSHHVQRF ELDAVIDSPE VRVDRVVFRA TSNWGGNHTC IYRLKLFGHV // ID A2R9F4_ASPNC Unreviewed; 829 AA. AC A2R9F4; DT 06-MAR-2007, integrated into UniProtKB/TrEMBL. DT 06-MAR-2007, sequence version 1. DT 14-OCT-2015, entry version 37. DE SubName: Full=Aspergillus niger contig An17c0040, genomic contig {ECO:0000313|EMBL:CAK48819.1}; GN ORFNames=An17g01170 {ECO:0000313|EMBL:CAK48819.1}; OS Aspergillus niger (strain CBS 513.88 / FGSC A1513). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=425011 {ECO:0000313|Proteomes:UP000006706}; RN [1] {ECO:0000313|EMBL:CAK48819.1, ECO:0000313|Proteomes:UP000006706} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 513.88 / FGSC A1513 {ECO:0000313|Proteomes:UP000006706}; RX PubMed=17259976; DOI=10.1038/nbt1282; RA Pel H.J., de Winde J.H., Archer D.B., Dyer P.S., Hofmann G., RA Schaap P.J., Turner G., de Vries R.P., Albang R., Albermann K., RA Andersen M.R., Bendtsen J.D., Benen J.A.E., van den Berg M., RA Breestraat S., Caddick M.X., Contreras R., Cornell M., Coutinho P.M., RA Danchin E.G.J., Debets A.J.M., Dekker P., van Dijck P.W.M., RA van Dijk A., Dijkhuizen L., Driessen A.J.M., d'Enfert C., Geysens S., RA Goosen C., Groot G.S.P., de Groot P.W.J., Guillemette T., RA Henrissat B., Herweijer M., van den Hombergh J.P.T.W., RA van den Hondel C.A.M.J.J., van der Heijden R.T.J.M., RA van der Kaaij R.M., Klis F.M., Kools H.J., Kubicek C.P., RA van Kuyk P.A., Lauber J., Lu X., van der Maarel M.J.E.C., RA Meulenberg R., Menke H., Mortimer M.A., Nielsen J., Oliver S.G., RA Olsthoorn M., Pal K., van Peij N.N.M.E., Ram A.F.J., Rinas U., RA Roubos J.A., Sagt C.M.J., Schmoll M., Sun J., Ussery D., Varga J., RA Vervecken W., van de Vondervoort P.J.J., Wedler H., Woesten H.A.B., RA Zeng A.-P., van Ooyen A.J.J., Visser J., Stam H.; RT "Genome sequencing and analysis of the versatile cell factory RT Aspergillus niger CBS 513.88."; RL Nat. Biotechnol. 25:221-231(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM270386; CAK48819.1; -; Genomic_DNA. DR RefSeq; XP_001398346.1; XM_001398309.1. DR EnsemblFungi; CADANGAT00013513; CADANGAP00013264; CADANGAG00013513. DR GeneID; 4989440; -. DR KEGG; ang:ANI_1_454154; -. DR HOGENOM; HOG000172520; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000006706; Chromosome 5L. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006706}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006706}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 25 {ECO:0000256|SAM:SignalP}. FT CHAIN 26 829 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002645689. FT TRANSMEM 674 695 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 829 AA; 89738 MW; CCD789C6042C4779 CRC64; MTSWTASQWI PWMTLISTWI DGTTADPSQT ICPAPRWQVA EAEFIQWPQC PETRWEAEPA TPIPAEQQPL LKAPEETLSA VSVSMASSES SARPDHELDT ESPLDNANFL SFEDWKKQNL AKVGQSAENV GGRGAAAAAA GKEGRRRPTG INNALDSLGE DVEIELDFGG FGADTPEAAK PTSWGARVST GVTGGEAGSA GDVDSLAHGV PPAGGVSRSK DAGTTCKERF NYASFDCAAT VLKTNPECTG SSSVLIENKD SYMLNECRAN NKFLILELCD DILVDTVVLA NYEFFSSIFH TFRVSVSDRY PAKLDQWREL GVYEARNTRE VQAFAVENPL IWARYVKIEF LTHYGNEFFC PLSLIRVHGT TMLEEYKHDG EVSRTDDVVA DEELEPAPVA AEIETIPTVD AAAAAGPIEQ KVDEQTPETC PNPGPVVDEA VMMQLWGVPW TCSIHDSPAA GDEGTQASLN RPSATDATPP KGDDAAPLGN EAPVKEAGEQ KMTVSPNVDS APSSATTAGP ETTSQGEADS RSTGFTKEEQ SVAAETTRST ATQPPSANPT TQESFFKSVN KRLQMLESNS SLSLLYIEEQ SRILRDAFNK VEKRQLAKTS TFLEQLNVTV LHELKQFREQ YDNVWKSVAL EFEHQRIQYH QEVHSLSAQL GVLADELVFQ KRVAVIQSIM ILFCFGLVLF SRGAVSSYIE LPSMQNMVSR SYSLRSSSPP FGSPSVSPTS SGRRAGGHRR NLSEDSQEDG PISPTLAYSP PTPVSDVMSS SEEAENQRGN SLALPEVAPP VRSRSSPPDL KGGEESIEES SSSGDSPVSH GRNAAVTEA // ID A2RVD8_DROME Unreviewed; 221 AA. AC A2RVD8; DT 06-MAR-2007, integrated into UniProtKB/TrEMBL. DT 06-MAR-2007, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=IP10153p {ECO:0000313|EMBL:ABM92803.1}; GN Name=CG6589 {ECO:0000313|EMBL:ABM92803.1}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:ABM92803.1}; RN [1] {ECO:0000313|EMBL:ABM92803.1} RP NUCLEOTIDE SEQUENCE. RA Stapleton M., Carlson J., Frise E., Kapadia B., Park S., Wan K., RA Yu C., Celniker S.; RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BT029929; ABM92803.1; -; mRNA. DR STRING; 7227.FBpp0079852; -. DR PaxDb; A2RVD8; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR Bgee; A2RVD8; -. DR ExpressionAtlas; A2RVD8; differential. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 221 221 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 221 AA; 24581 MW; FC4D7E218B0FCE7C CRC64; MCNNRDVSAY VDTLFKRKIG HLMDDVYNLK KQVMSADCSS KSAQSTPKPE SVALAKPRIN YASEELGARI INVKAHSIDG TNIIRSLLGL DFSTNPPVNM IRTGLSPGSC FGFNGSRATV TLHLARTIIV EAITLTHVAR EMTPDLCVKS APKNFDVYGL RSENSKRELL GQWSYDNAAN KRTQSYSVRS DTFFRNLDFS FNSNHGANST CIYRVEVYGR L // ID A2WN90_ORYSI Unreviewed; 455 AA. AC A2WN90; DT 20-MAR-2007, integrated into UniProtKB/TrEMBL. DT 20-MAR-2007, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAY73436.1}; GN ORFNames=OsI_01316 {ECO:0000313|EMBL:EAY73436.1}; OS Oryza sativa subsp. indica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39946 {ECO:0000313|EMBL:EAY73436.1, ECO:0000313|Proteomes:UP000007015}; RN [1] {ECO:0000313|EMBL:EAY73436.1, ECO:0000313|Proteomes:UP000007015} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. 93-11 {ECO:0000313|Proteomes:UP000007015}; RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038; RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S., RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., RA Cong L., Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., RA Wang J., Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., RA Wang J., Wang X., Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., RA Zhang Z., Bao J., Han Y., Dong L., Ji J., Chen P., Wu S., Liu J., RA Xiao Y., Bu D., Tan J., Yang L., Ye C., Zhang J., Xu J., Zhou Y., RA Yu Y., Zhang B., Zhuang S., Wei H., Liu B., Lei M., Yu H., Li Y., RA Xu H., Wei S., He X., Fang L., Zhang Z., Zhang Y., Huang X., Su Z., RA Tong W., Li J., Tong Z., Li S., Ye J., Wang L., Fang L., Lei T., RA Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F., Xu H., RA Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q., Li W., RA Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J., Gao L., RA Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M., McDermott J., RA Samudrala R., Wang J., Wong G.K.-S., Yang H.; RT "The genomes of Oryza sativa: a history of duplications."; RL PLoS Biol. 3:266-281(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000126; EAY73436.1; -; Genomic_DNA. DR STRING; 39946.BGIOSGA001927-PA; -. DR EnsemblPlants; BGIOSGA001927-TA; BGIOSGA001927-PA; BGIOSGA001927. DR Gramene; A2WN90; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000237750; -. DR OMA; RVSGWYQ; -. DR Proteomes; UP000007015; Chromosome 1. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007015}; KW Reference proteome {ECO:0000313|Proteomes:UP000007015}. FT COILED 180 214 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 455 AA; 48738 MW; 503593BD441F5580 CRC64; MASPSLAAAA ASPLTSLDRA TSPDTGSRPD GADAVARRKR PVLLLDQRQH LSSPNLDSSV DVDAAAAAAG VAQAQSETPR RKKPGHTSSS TRPRCQTALS VAAKNAVLLA VLLYVGDLAW RAARPAPPRP VDQAAMAGYD ARVADVEASL ARAFRMLQVQ LEAVDRKIDG EVGAVRGELA ALLEEKRLEL EGQLKRLDAR ADDLSDALGA LKRMEFLRKD EFDKFWNEVK ESLGSGPGTE VDLDQVRALA REITMGEIEK HAADGIGRVD YAVASAGGKV VRHSDAYDAG KRGGFFSSLL SGDTAASPKK ILQPSFGEPG QCFPLQGSSG FVEIKLRKGI VPDAITLEHV SKDVAYDMST APKDCRVSGW YQEAHNEAYS GHAASAKMYV LTEFTYDLDK KNVQTFDITA PDVGIINMVR LDFTSNHGSS ALTCIYRIRV HGHEPVSPGM SVSQS // ID A2Y2L0_ORYSI Unreviewed; 453 AA. AC A2Y2L0; DT 20-MAR-2007, integrated into UniProtKB/TrEMBL. DT 20-MAR-2007, sequence version 1. DT 11-NOV-2015, entry version 36. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAY97320.1}; GN ORFNames=OsI_19241 {ECO:0000313|EMBL:EAY97320.1}; OS Oryza sativa subsp. indica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39946 {ECO:0000313|EMBL:EAY97320.1, ECO:0000313|Proteomes:UP000007015}; RN [1] {ECO:0000313|EMBL:EAY97320.1, ECO:0000313|Proteomes:UP000007015} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. 93-11 {ECO:0000313|Proteomes:UP000007015}; RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038; RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S., RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., RA Cong L., Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., RA Wang J., Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., RA Wang J., Wang X., Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., RA Zhang Z., Bao J., Han Y., Dong L., Ji J., Chen P., Wu S., Liu J., RA Xiao Y., Bu D., Tan J., Yang L., Ye C., Zhang J., Xu J., Zhou Y., RA Yu Y., Zhang B., Zhuang S., Wei H., Liu B., Lei M., Yu H., Li Y., RA Xu H., Wei S., He X., Fang L., Zhang Z., Zhang Y., Huang X., Su Z., RA Tong W., Li J., Tong Z., Li S., Ye J., Wang L., Fang L., Lei T., RA Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F., Xu H., RA Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q., Li W., RA Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J., Gao L., RA Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M., McDermott J., RA Samudrala R., Wang J., Wong G.K.-S., Yang H.; RT "The genomes of Oryza sativa: a history of duplications."; RL PLoS Biol. 3:266-281(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000130; EAY97320.1; -; Genomic_DNA. DR STRING; 39946.BGIOSGA018402-PA; -. DR PRIDE; A2Y2L0; -. DR EnsemblPlants; BGIOSGA018402-TA; BGIOSGA018402-PA; BGIOSGA018402. DR Gramene; A2Y2L0; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000237750; -. DR OMA; VKHSEPF; -. DR Proteomes; UP000007015; Chromosome 5. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007015}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007015}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 113 136 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 190 224 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 453 AA; 50342 MW; D1B73097E07B4E9B CRC64; MSVSTAAVPT ANTNGNHALS MDSHSSQDVR RRTVVVARKK ASPELLADGG FNGTSSVDKI TDKKDLSHTI RGESVLGKSK YPLEARKDAI ASAAAADRQK KSGAKQEKAK WEIALSVLMK LCLLISAVAW MGQLFWRWQN GDLSFTTLDM ESRLSKVEGF KKTTKMLQVQ LDILDKKLGN EIDKTRRDIT KQFEDKGNKL EIKMKALEGK TDKLDKSLAE LRDMGFVSKK EFDEIVEQLK KKKGLDGTVG DISLDDIRLF AKEIVEMEIE RHAADGLGMV DYALASGGGK VVKHSEAFRK AKSFMPSRNS LLEQAKKMLE PSFGQPGECF ALQGSSGYVE IKLRTGIIPE AVSLEHVDKS VAYDRSSAPK DFQVSGWYEG PEDDSDKESR VVTNLGEFSY DLEKNNAQTF QLERTADSRV INMVRLDFCS NHGNSELTCI YRFRVHGREP GSP // ID A3LQA0_PICST Unreviewed; 443 AA. AC A3LQA0; DT 03-APR-2007, integrated into UniProtKB/TrEMBL. DT 24-JUL-2007, sequence version 2. DT 11-NOV-2015, entry version 37. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ABN65170.2}; DE Flags: Fragment; GN ORFNames=PICST_3580 {ECO:0000313|EMBL:ABN65170.2}; OS Scheffersomyces stipitis (strain ATCC 58785 / CBS 6054 / NBRC 10063 / OS NRRL Y-11545) (Yeast) (Pichia stipitis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Scheffersomyces. OX NCBI_TaxID=322104 {ECO:0000313|EMBL:ABN65170.2, ECO:0000313|Proteomes:UP000002258}; RN [1] {ECO:0000313|EMBL:ABN65170.2, ECO:0000313|Proteomes:UP000002258} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545 RC {ECO:0000313|Proteomes:UP000002258}; RX PubMed=17334359; DOI=10.1038/nbt1290; RA Jeffries T.W., Grigoriev I.V., Grimwood J., Laplaza J.M., Aerts A., RA Salamov A., Schmutz J., Lindquist E., Dehal P., Shapiro H., Jin Y.-S., RA Passoth V., Richardson P.M.; RT "Genome sequence of the lignocellulose-bioconverting and xylose- RT fermenting yeast Pichia stipitis."; RL Nat. Biotechnol. 25:319-326(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP000496; ABN65170.2; -; Genomic_DNA. DR RefSeq; XP_001383199.2; XM_001383162.1. DR STRING; 322104.XP_001383199.2; -. DR EnsemblFungi; ABN65170; ABN65170; PICST_3580. DR GeneID; 4837225; -. DR KEGG; pic:PICST_3580; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A3LQA0; -. DR OMA; IDECHFM; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002258; Chromosome 2. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002258}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002258}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 420 437 Helical. {ECO:0000256|SAM:Phobius}. FT NON_TER 1 1 {ECO:0000313|EMBL:ABN65170.2}. FT NON_TER 443 443 {ECO:0000313|EMBL:ABN65170.2}. SQ SEQUENCE 443 AA; 50995 MW; 016E0D173841C41D CRC64; NSTDIDECHF MSFEEWKKSK KEADVVQSEL QQATNTSKNT TKHKFKNKNS TSKELSIRRN DSTNSTDIES LEHVITPEEG RVYKDRFNYA SSGCGANIIK TNSEAKGASA ILAENKDSYL LNRCSASNRF VVIELCQEIL VDSVVVGNFE FFSSMFKEVR VSVSDKFPTT NWRVLGEFEA ENVRDVQTFK IQNPLIWARY FKLEVLSHYG DEFYCPITLV RVHGKTMMEE VKENEESSQT QDEDEELLID TTTLNQFDND TLDECRVFMP HLGLNEFLSD FISTVPDYCD IKSNEQEQVH TTEAHTTTQE SVYRTIMNRL SLLESNATLS LLYIEEQSKL LSTAFTNLER RQSMNFESLI DSVNSTLINQ LINFKDSYLS MHSEYAKLYK LQELNHQDLL SNSKQKLGSL GNELTFQKRM AVFNTIIILC LLVYVIVTRD AYI // ID A4D2Q0_HUMAN Unreviewed; 974 AA. AC A4D2Q0; DT 03-APR-2007, integrated into UniProtKB/TrEMBL. DT 03-APR-2007, sequence version 1. DT 11-NOV-2015, entry version 62. DE SubName: Full=Unc-84 homolog A (C. elegans) {ECO:0000313|EMBL:EAL23707.1}; GN Name=UNC84A {ECO:0000313|EMBL:EAL23707.1}; GN ORFNames=tcag7.1079 {ECO:0000313|EMBL:EAL23707.1}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] {ECO:0000313|EMBL:EAL23707.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12690205; DOI=10.1126/science.1083423; RA Scherer S.W., Cheung J., MacDonald J.R., Osborne L.R., Nakabayashi K., RA Herbrick J.A., Carson A.R., Parker-Katiraee L., Skaug J., Khaja R., RA Zhang J., Hudek A.K., Li M., Haddad M., Duggan G.E., Fernandez B.A., RA Kanematsu E., Gentles S., Christopoulos C.C., Choufani S., RA Kwasnicka D., Zheng X.H., Lai Z., Nusskern D., Zhang Q., Gu Z., Lu F., RA Zeesman S., Nowaczyk M.J., Teshima I., Chitayat D., Shuman C., RA Weksberg R., Zackai E.H., Grebe T.A., Cox S.R., Kirkpatrick S.J., RA Rahman N., Friedman J.M., Heng H.H., Pelicci P.G., Lo-Coco F., RA Belloni E., Shaffer L.G., Pober B., Morton C.C., Gusella J.F., RA Bruns G.A., Korf B.R., Quade B.J., Ligon A.H., Ferguson H., RA Higgins A.W., Leach N.T., Herrick S.R., Lemyre E., Farra C.G., RA Kim H.G., Summers A.M., Gripp K.W., Roberts W., Szatmari P., RA Winsor E.J., Grzeschik K.H., Teebi A., Minassian B.A., Kere J., RA Armengol L., Pujana M.A., Estivill X., Wilson M.D., Koop B.F., RA Tosi S., Moore G.E., Boright A.P., Zlotorynski E., Kerem B., RA Kroisel P.M., Petek E., Oscier D.G., Mould S.J., Dohner H., Dohner K., RA Rommens J.M., Vincent J.B., Venter J.C., Li P.W., Mural R.J., RA Adams M.D., Tsui L.C.; RT "Human chromosome 7: DNA sequence and biology."; RL Science 300:767-772(2003). RN [2] {ECO:0000313|EMBL:EAL23707.1} RP NUCLEOTIDE SEQUENCE. RA Scherer S.W., Cheung J., MacDonald J.R., Osborne L.R., Nakabayashi K., RA Herbrick J.-A., Carson A.R., Parker-Katiraee L., Skaug J., Khaja R., RA Zhang J., Hudek A.K., Li M., Haddad M., Duggan G.E., Fernandez B.A., RA Kanematsu E., Gentles S., Christopoulos C.C., Choufani S., RA Kwasnicka D., Zheng X.H., Nusskern D., Zhang Q., Gu Z., Lu F., RA Zeesman S., Teshima I., Chitayat D., Shuman C., Weksberg R., RA Zackai E.H., Grebe T.A., Cox S.R., Kirkpatrick S.J., Rahman N., RA Friedman J.M., Heng H.H.Q., Pelicci P., Lococo F., Belloni E., RA Shaffer L.G., Morton C.C., Pober B., Gusella J., Bruns G., Korf B.R., RA Quade B.J., Ligon A.H., Ferguson H., Higgins A.W., Leach N.T., RA Herrick S.R., Lemyre E., Farra C.G., Kim H.-G., Summers A.M., RA Gripp K.W., Roberts W., Szatmari P., Winsor E.J.T., Grzeschik K.-H., RA Teebi A., Minassian B.A., Kere J., Armengol L., Pujana M.Angel., RA Estivill X., Wilson M.D., Koop B.F., Tosi S., Moore G.E., RA Boright A.P., Zlotorynski E., Kerem B., Kroisel P.M., Petek E., RA Oscier D.G., Mould S.J., Doehner H., Doehner K., Rommens J.M., RA Vincent J.B., Venter J.C., Li P.W., Mural R.J., Adams M.D., RA Tsui L.-C.; RL Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH236965; EAL23707.1; -; Genomic_DNA. DR UniGene; Hs.438072; -. DR ProteinModelPortal; A4D2Q0; -. DR STRING; 9606.ENSP00000384015; -. DR MaxQB; A4D2Q0; -. DR PaxDb; A4D2Q0; -. DR PRIDE; A4D2Q0; -. DR H-InvDB; HIX0167829; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG104132; -. DR PhylomeDB; A4D2Q0; -. DR ChiTaRS; SUN1; human. DR NextBio; 35461708; -. DR Bgee; A4D2Q0; -. DR GO; GO:0043231; C:intracellular membrane-bounded organelle; IDA:HPA. DR GO; GO:0031965; C:nuclear membrane; IDA:HPA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 448 471 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 478 497 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 572 592 {ECO:0000256|SAM:Coils}. FT COILED 617 651 {ECO:0000256|SAM:Coils}. FT COILED 664 684 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 974 AA; 108455 MW; C7B7AFC1381E0FAE CRC64; MGDVRKAKEA LAEVQIPPPD AGPSPRCKPP PAERLLGPGR RRRGLRRRRE AVWFEVVNMD FSRLHMYSPP QCVPENTGYT YALSSSYSSD ALDFETEHKL DPVFDSPRMS RRSLRLATTA CTLGDGEAVG ADSGTSSAVS LKNRAARTTK QRRSTNKSAF SINHVSRQVT SSGVSHGGTV SLQDAVTRRP PVLDESWIRE QTTVDHFWGL DDDGDLKGGN KAAIQGNGDV GAAAATAHNG FSCSNCSMLS ERKDVLTAHP AAPGPVSRVY SRDRNQKCGA SFYVNRILWL ARYTASSFSS FLVQLFQVVL MKLSYESENY KLKTHESKDC ESESYKSKSH ESKAHASYYG RMNVREVLRE DGHLSVNGEA LCDDCKGKRH LDAHTAAHSQ SPRLPGRAGT LWHIWACAGY FLLQILRRIG AVGQAVSRTA WSALWLAVVA PGKAASGVFW WLGIGWYQFV TLISWLNVFL LTRCLRNICK FLVLLIPLFL LLAGLSLRGQ GNFFSFLPVL NWASMHRTQR VDDPQDVFKP TTSRLKQPLQ GDSEAFPWHW MSGVEQQVAS LSGQCHHHGE NLRELTTLLQ KLQARVDQME GGAAGPSASV RDAVGQPPRE TDFMAFHQEH EVRMSHLEDI LGKLREKSEA IQKELEQTKQ KTISAVGEQL LPTVEHLQLE LDQLKSELSS WRHVKTGCET VDAVQERVDV QVREMVKLLF SEDQQGGSLE QLLQRFSSQF VSKGDLQTML RDLQLQILRN VTHHVSVTKQ LPTSEAVVSA VSEAGASGIT EAQARAIVNS ALKLYSQDKT GMVDFALESG GGSILSTRCS ETYETKTALM SLFGIPLWYF SQSPRVVIQP DIYPGNCWAF KGSQGYLVVR LSMMIHPAAF TLEHIPKTLS PTGNISSAPK DFAVYGLENE YQEEGQLLGQ FTYDQDGESL QMFQALKRPD DTAFQIVELR IFSNWGHPEY TCLYRFRVHG EPVK // ID A4HHX6_LEIBR Unreviewed; 575 AA. AC A4HHX6; DT 01-MAY-2007, integrated into UniProtKB/TrEMBL. DT 01-MAY-2007, sequence version 1. DT 11-NOV-2015, entry version 39. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAM40182.1}; GN ORFNames=LBRM_30_0330 {ECO:0000313|EMBL:CAM40182.1}; OS Leishmania braziliensis. OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmaniinae; Leishmania; Leishmania braziliensis species complex. OX NCBI_TaxID=5660 {ECO:0000313|Proteomes:UP000007258}; RN [1] {ECO:0000313|EMBL:CAM40182.1, ECO:0000313|Proteomes:UP000007258} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MHOM/BR/75/M2904 {ECO:0000313|EMBL:CAM40182.1, RC ECO:0000313|Proteomes:UP000007258}; RX PubMed=17572675; DOI=10.1038/ng2053; RA Peacock C.S., Seeger K., Harris D., Murphy L., Ruiz J.C., Quail M.A., RA Peters N., Adlem E., Tivey A., Aslett M., Kerhornou A., Ivens A., RA Fraser A., Rajandream M.-A., Carver T., Norbertczak H., RA Chillingworth T., Hance Z., Jagels K., Moule S., Ormond D., Rutter S., RA Sqaures R., Whitehead S., Rabbinowitsch E., Arrowsmith C., White B., RA Thurston S., Bringaud F., Baldauf S.L., Faulconbridge A., Jeffares D., RA Depledge D.P., Oyola S.O., Hilley J.D., Brito L.O., Tosi L.R.O., RA Barrell B., Cruz A.K., Mottram J.C., Smith D.F., Berriman M.; RT "Comparative genomic analysis of three Leishmania species that cause RT diverse human disease."; RL Nat. Genet. 39:839-847(2007). RN [2] {ECO:0000313|EMBL:CAM40182.1, ECO:0000313|Proteomes:UP000007258} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MHOM/BR/75/M2904 {ECO:0000313|EMBL:CAM40182.1, RC ECO:0000313|Proteomes:UP000007258}; RX PubMed=22038252; DOI=10.1101/gr.122945.111; RA Rogers M.B., Hilley J.D., Dickens N.J., Wilkes J., Bates P.A., RA Depledge D.P., Harris D., Her Y., Herzyk P., Imamura H., Otto T.D., RA Sanders M., Seeger K., Dujardin J.C., Berriman M., Smith D.F., RA Hertz-Fowler C., Mottram J.C.; RT "Chromosome and gene copy number variation allow major structural RT change between species and strains of Leishmania."; RL Genome Res. 21:2129-2142(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR799005; CAM40182.1; -; Genomic_DNA. DR RefSeq; XP_001566666.1; XM_001566616.1. DR STRING; 420245.XP_001566666.1; -. DR EnsemblProtists; CAM40182; CAM40182; LBRM_30_0330. DR GeneID; 5417558; -. DR KEGG; lbz:LBRM_30_0330; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A4HHX6; -. DR OMA; CTITSFQ; -. DR Proteomes; UP000007258; Chromosome 30. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007258}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007258}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 516 535 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 452 486 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 575 AA; 62388 MW; C457D3541E3CC6E7 CRC64; MRPEERLTVL CALLLLYALF SVPLEFLTVF RHRTTPPAVG VSSPSPLKTF STGSAPGLSI NYASSYLGAT VVSVEPPSCH GGSALISDSA DNYVLCPCNA ARKQFVMQLI RDVEVRSVMV RNAEHFSSGV RNFTLLGSLQ YPTSTWLVLG HFEAEQRRGR QYFDVTPGRR VRFIKLQWAT SYGPEPWCTI TSFQVYGIDL LETLTRFDES DDLVATEGAA EVSGGHRCSP DMDCFHRPAL PSAPGKVVAP SAAGSSAVAS GSGATPAVSI DELAAEMWNG VTATARASKE DDTDVLLFAP VDVAVSADRG SPSQSGARAK RPSSTSSIAL VKIARELQSP SCSSVNSMYW NASLQCTFSD LTALWGPCAV TRCGTSSHVT AVGTTPTHAS ASSVNTASIK GFPASRSIYQ SVAASLLTQL LRQQRSLHHE LTLLTRRQRH LTEELNHTRS LLSNFYAKYK ETEREFSQYR NQLRGLHAEL QLLQERFLLR RHSSFCGEGV DTSGSGGSVM RSDTTLTVIL LVVLALMVVL VLIYSSSSSF PGVRRPSDWE RYYSVSRGGG SGSPPWLQPQ RGRAK // ID A4I526_LEIIN Unreviewed; 586 AA. AC A4I526; DT 01-MAY-2007, integrated into UniProtKB/TrEMBL. DT 01-MAY-2007, sequence version 1. DT 11-NOV-2015, entry version 38. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAM69894.1}; GN ORFNames=LINJ_30_0320 {ECO:0000313|EMBL:CAM69894.1}; OS Leishmania infantum. OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmaniinae; Leishmania. OX NCBI_TaxID=5671 {ECO:0000313|Proteomes:UP000008153}; RN [1] {ECO:0000313|EMBL:CAM69894.1, ECO:0000313|Proteomes:UP000008153} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JPCM5 {ECO:0000313|EMBL:CAM69894.1, RC ECO:0000313|Proteomes:UP000008153}; RX PubMed=17572675; DOI=10.1038/ng2053; RA Peacock C.S., Seeger K., Harris D., Murphy L., Ruiz J.C., Quail M.A., RA Peters N., Adlem E., Tivey A., Aslett M., Kerhornou A., Ivens A., RA Fraser A., Rajandream M.-A., Carver T., Norbertczak H., RA Chillingworth T., Hance Z., Jagels K., Moule S., Ormond D., Rutter S., RA Sqaures R., Whitehead S., Rabbinowitsch E., Arrowsmith C., White B., RA Thurston S., Bringaud F., Baldauf S.L., Faulconbridge A., Jeffares D., RA Depledge D.P., Oyola S.O., Hilley J.D., Brito L.O., Tosi L.R.O., RA Barrell B., Cruz A.K., Mottram J.C., Smith D.F., Berriman M.; RT "Comparative genomic analysis of three Leishmania species that cause RT diverse human disease."; RL Nat. Genet. 39:839-847(2007). RN [2] {ECO:0000313|EMBL:CAM69894.1, ECO:0000313|Proteomes:UP000008153} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JPCM5 {ECO:0000313|EMBL:CAM69894.1, RC ECO:0000313|Proteomes:UP000008153}; RX PubMed=22038252; DOI=10.1101/gr.122945.111; RA Rogers M.B., Hilley J.D., Dickens N.J., Wilkes J., Bates P.A., RA Depledge D.P., Harris D., Her Y., Herzyk P., Imamura H., Otto T.D., RA Sanders M., Seeger K., Dujardin J.C., Berriman M., Smith D.F., RA Hertz-Fowler C., Mottram J.C.; RT "Chromosome and gene copy number variation allow major structural RT change between species and strains of Leishmania."; RL Genome Res. 21:2129-2142(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR796462; CAM69894.1; -; Genomic_DNA. DR RefSeq; XP_001466845.1; XM_001466808.1. DR STRING; 435258.XP_001466845.1; -. DR EnsemblProtists; CAM69894; CAM69894; LINJ_30_0320. DR GeneID; 5070887; -. DR KEGG; lif:LINJ_30_0320; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A4I526; -. DR OMA; CTITSFQ; -. DR Proteomes; UP000008153; Chromosome 30. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008153}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008153}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 518 538 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 457 491 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 586 AA; 61939 MW; 25F41AFACEC83CA4 CRC64; MRPQERLTVL CTLLLLYVLF SAPVELLTVF WHRTTSAAVG ISSPSPLKRL STGSAPGLST NYASLYLGAA VVSMEPSSCH GGVALISESV DKYVLCPCDA PRKQFVVQLI RDVQVRSVMV RNAEHFSSGV RNFTLLGSLQ YPTSTWLVLG HFEAEQRRGR QYFDVAPRSR VRFIKLQWAT SYGPEPWCTI TSFQVYGIDV LETLTRYDGG DDLVAGEDAA GASGGLRGTP DMHRFHLPAL PPTPGEVAAP PLAGSSAVPS RNGATSANDA PAPAVFIDEL AAGMWAGAAA TVGASRGADA DDLLLAPVDV GASAETGPLS QPDADAKRSS PTNSIALAAT APALQSVNCS AAQPIGRNAS VKCTITDLTA LWGPCAVATS GASDFTAVTT PTSAPALSVS TPSSKGLSAS GSIYQSAAGS LLTNLLRQQR STHHELTLLM QRERHLAQEL NRTRILLSDF YARYKATERE ADEYRDRLHG LQSKLQLLQE RFLREHSSCC GEGGGAGRSG GSIMRSDTAM AVASFALLAL AVILMLMYSS SSSRSVVGPP SGWGCYYNIG RGSGGVASGG GNRPPLWPRP QRGRAR // ID A4RXN9_OSTLU Unreviewed; 1676 AA. AC A4RXN9; DT 15-MAY-2007, integrated into UniProtKB/TrEMBL. DT 15-MAY-2007, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ABO96083.1}; GN ORFNames=OSTLU_24577 {ECO:0000313|EMBL:ABO96083.1}; OS Ostreococcus lucimarinus (strain CCE9901). OC Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; OC Mamiellales; Bathycoccaceae; Ostreococcus. OX NCBI_TaxID=436017 {ECO:0000313|EMBL:ABO96083.1, ECO:0000313|Proteomes:UP000001568}; RN [1] {ECO:0000313|EMBL:ABO96083.1, ECO:0000313|Proteomes:UP000001568} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCE9901 {ECO:0000313|EMBL:ABO96083.1, RC ECO:0000313|Proteomes:UP000001568}; RX PubMed=17460045; DOI=10.1073/pnas.0611046104; RA Palenik B., Grimwood J., Aerts A., Rouze P., Salamov A., Putnam N., RA Dupont C., Jorgensen R., Derelle E., Rombauts S., Zhou K., Otillar R., RA Merchant S.S., Podell S., Gaasterland T., Napoli C., Gendler K., RA Manuell A., Tai V., Vallon O., Piganeau G., Jancek S., Heijde M., RA Jabbari K., Bowler C., Lohr M., Robbens S., Werner G., Dubchak I., RA Pazour G.J., Ren Q., Paulsen I., Delwiche C., Schmutz J., Rokhsar D., RA Van de Peer Y., Moreau H., Grigoriev I.V.; RT "The tiny eukaryote Ostreococcus provides genomic insights into the RT paradox of plankton speciation."; RL Proc. Natl. Acad. Sci. U.S.A. 104:7705-7710(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP000585; ABO96083.1; -; Genomic_DNA. DR RefSeq; XP_001417790.1; XM_001417753.1. DR STRING; 436017.A4RXN9; -. DR EnsemblPlants; ABO96083; ABO96083; OSTLU_24577. DR GeneID; 5001735; -. DR KEGG; olu:OSTLU_24577; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR KO; K19347; -. DR Proteomes; UP000001568; Chromosome 5. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001568}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001568}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 86 108 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 136 156 {ECO:0000256|SAM:Coils}. FT COILED 185 205 {ECO:0000256|SAM:Coils}. FT COILED 212 246 {ECO:0000256|SAM:Coils}. FT COILED 283 310 {ECO:0000256|SAM:Coils}. FT COILED 327 361 {ECO:0000256|SAM:Coils}. FT COILED 398 425 {ECO:0000256|SAM:Coils}. FT COILED 442 476 {ECO:0000256|SAM:Coils}. FT COILED 513 540 {ECO:0000256|SAM:Coils}. FT COILED 557 591 {ECO:0000256|SAM:Coils}. FT COILED 628 655 {ECO:0000256|SAM:Coils}. FT COILED 732 752 {ECO:0000256|SAM:Coils}. FT COILED 765 785 {ECO:0000256|SAM:Coils}. FT COILED 825 845 {ECO:0000256|SAM:Coils}. FT COILED 898 925 {ECO:0000256|SAM:Coils}. FT COILED 1041 1061 {ECO:0000256|SAM:Coils}. FT COILED 1177 1197 {ECO:0000256|SAM:Coils}. FT COILED 1313 1333 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1676 AA; 180147 MW; 1BF1441441F078D9 CRC64; MPARAPARTR APATSAETRA KSATRARGDA RDTRGADARD DDLNEKDDRS DDDDDDDDDD ACDDAWGVAR GERDVRAGMI RVRSTVPLIV STLCVVVTAL IAVCAQSWHV RETAMRLASV RLSATSVKLG PTTRRLTALE DAVRRLESEK ASVRAVDGVK ATVEAERAAN AANAATTTTD AKVVKTEMKA ELKSVLAQID ALSKRQNSFV TTAELKKSSG ALKAELAELE KTAAALEKKQ ANFAAKTELA SLEKTTAALD KGQANFVTKA QLQKVNVAND LALSAERLRI DDAIKEAKSL RDAIDAVSKS LAQIDALSKR QNSFVTTAEL KKSSGALKAE LAELEKTAAA LEKKQANFAA KTELASLEKT TAALDKGQAN FVTKAQLQKV NVANDLALSA ERLRIDDAIK EAKSLRDAID AVSKSLAQID ALSKRQNSFV TTAELKKSSG ALKAELAELE KTAAALEKKQ ANFAAKTELA SLEKTTAALD KGQANFVTKA QLQKVNVAND LALSAERLRI DDAIKEAKSL RDAIDAVSKS LAQIDALSKR QNSFVTTAEL KKSSGALKAE LAELEKTAAA LEKKQANFAA KTELASLEKT TAALDKGQAN FVTKAQLQKV NVANDLALSA ERLRIDDAIK EAKSLRDAID AVSKSQDVFI TNTTLRRVES QIEALKKMQA KKGFFSTRPK GPVVSDEQIA MLEATVATFA THLADLEKNQ ADFVTTAQLQ KFEGASEEIK SLQREIHVVS KKQANYVSVA QLQKLEVVTD EIKSLQSEIS ALAKTQDEFA ILSASLKDVE SKIAALADGK KKTGLLSKLS KKTDTKVTNK QLVELENAVA TLEKTQTKFA SLASSQISTL ESTLDAISQQ QSSALTAADL KSIQKAITAL EKSQESFATT AQLQKVDASR EIKALEEALA EILQSQHSFA NSTAVEELET KVMALEARQN EKRGGMFKRS ASTDTSVTNK QLKALEDTIA ALTKSQADFA TVTQLKESAQ ALSQQQAGAL TAADLKGIQK AVASLEKSQN GFATTAQLQK LEVVTDEIKS LQSEISALAK TQDEFAILSA SLKDVESKIA ALADGKKKTG LLSKLSKKTD TKVTNKQLKA LEDAIAALTK SQADFATVTQ LKESAQALSQ QQAGALTAAD LKGIQKAVAS LEKSQNGFAT TAQLQKLEVV TDEIKSLQSE ISALAKTQDE FAILSASLKD VESKIAALAD GKKKTGLLSK LSKKTDTKVT NKQLKALEDA IAALTKSQAD FATVTQLKES AQALSQQQAG ALTAADLKGI QKAVASLEKS QNGFATTAQL QKLEVVTDEI KSLQSEISAL AKTQDEFAIL SASLKDVESK IAALADGKKK TGLLSKLSKK TDTKVTNKQL KALEDAIAAL TKSQADFATV TQLKKVAATL DALKNVHDNY ATTKLVKNLE LKLNQVIKTS GGLSTTQTNL VELEQRAMER VESYIKTIPK DTSRKLTKHI EEASSLWFAD RTGRQDFALA SGGGRVVGHS QLSPFVGRGD GPITSVLSFL QSGVHPKSDE WLLTPSLEQP GDCIALHSST GYVDVRLRQS VKVDAVTLEH ANSLNAYDLH SAPRDVQVYG WHARKKSCKH SKPPKSLIPL GNYTYSINRG SVQTFDVVSP QTVDHVRLVV KNNQGHQKWT CLYRFRVHGV PERVEA // ID A4RZI8_OSTLU Unreviewed; 666 AA. AC A4RZI8; DT 15-MAY-2007, integrated into UniProtKB/TrEMBL. DT 15-MAY-2007, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ABO96635.1}; GN ORFNames=OSTLU_32403 {ECO:0000313|EMBL:ABO96635.1}; OS Ostreococcus lucimarinus (strain CCE9901). OC Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; OC Mamiellales; Bathycoccaceae; Ostreococcus. OX NCBI_TaxID=436017 {ECO:0000313|EMBL:ABO96635.1, ECO:0000313|Proteomes:UP000001568}; RN [1] {ECO:0000313|EMBL:ABO96635.1, ECO:0000313|Proteomes:UP000001568} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCE9901 {ECO:0000313|EMBL:ABO96635.1, RC ECO:0000313|Proteomes:UP000001568}; RX PubMed=17460045; DOI=10.1073/pnas.0611046104; RA Palenik B., Grimwood J., Aerts A., Rouze P., Salamov A., Putnam N., RA Dupont C., Jorgensen R., Derelle E., Rombauts S., Zhou K., Otillar R., RA Merchant S.S., Podell S., Gaasterland T., Napoli C., Gendler K., RA Manuell A., Tai V., Vallon O., Piganeau G., Jancek S., Heijde M., RA Jabbari K., Bowler C., Lohr M., Robbens S., Werner G., Dubchak I., RA Pazour G.J., Ren Q., Paulsen I., Delwiche C., Schmutz J., Rokhsar D., RA Van de Peer Y., Moreau H., Grigoriev I.V.; RT "The tiny eukaryote Ostreococcus provides genomic insights into the RT paradox of plankton speciation."; RL Proc. Natl. Acad. Sci. U.S.A. 104:7705-7710(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP000586; ABO96635.1; -; Genomic_DNA. DR RefSeq; XP_001418342.1; XM_001418305.1. DR STRING; 436017.A4RZI8; -. DR EnsemblPlants; ABO96635; ABO96635; OSTLU_32403. DR GeneID; 5002200; -. DR KEGG; olu:OSTLU_32403; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000001568; Chromosome 6. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001568}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001568}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 639 662 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 36 82 {ECO:0000256|SAM:Coils}. FT COILED 333 367 {ECO:0000256|SAM:Coils}. FT COILED 549 569 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 666 AA; 72720 MW; FE9554BDBEC43A8A CRC64; MCRCFPTWSR GVGAAEEARR GESDVKKTKK EPRAPLLSLK EYKENMEEKM AQKQREKEAK QREKEEKERK RKKEEDEALR RLVEVNVTHG AVSENEDAET KGETLEPNST ETTTVDEEPA PSEVSIEVEG GQQQAETMDG ASTAETGELA RDAVSETSQA APAPVEPPVM EEEDTDGEAT FAELVIKPER LTEADAEMYN YAASFNGAKV VASDKDSKHA SAALKEDKDV YYISPCASEK FVTVELSEEV TVTSLVLGNF EFHSSRVKDF EVWGTDGHHA IEEGWKRLMI GRADNTQNYQ KFAVPSPAWV RYVQIRMTGH HDQQHFCTLS LLRIHGKDAK ETLKEEMERL QAEVQEVESL LSDEDEDEDE DEDVDVRESS AEVVLDVEEQ NREETNASAV VGEENERAST GDDRDVSTSS ETDHSANVNT SIAEGAPSET TSNSDEDDNA AQERATKIDA STTSRPNATA AGATAVNATN SNATGVATAK PKMATSTNEL AKGGGDANVF RLLAQKIKDL ELNQSLLSRY VESLNVRYGE TLEDFGKEID EIEESVSNST GKLDEASRQA RASSKACDDA VARVNDSSEK LVAAAVSELD AYRTTVAKRD TVLALALALT AGALVASRRS SGAIERVLSA LSSFALLVIV VANIVLIAQN FLLKSM // ID A5AFQ0_VITVI Unreviewed; 529 AA. AC A5AFQ0; DT 12-JUN-2007, integrated into UniProtKB/TrEMBL. DT 12-JUN-2007, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CAN68972.1}; GN ORFNames=VITISV_043156 {ECO:0000313|EMBL:CAN68972.1}; OS Vitis vinifera (Grape). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; Vitales; Vitaceae; Vitis. OX NCBI_TaxID=29760 {ECO:0000313|EMBL:CAN68972.1}; RN [1] {ECO:0000313|EMBL:CAN68972.1} RP NUCLEOTIDE SEQUENCE. RA Velasco R., Zharkikh A., Troggio M., Cartwright D.A., Cestaro A., RA Pruss D., Pindo M., FitzGerald L.M., Vezzulli S., Reid J., RA Malacarne G., Iliev D., Coppola G., Wardell B., Micheletti D., RA Macalma T., Facci M., Mitchell J.T., Perazzolli M., Eldredge G., RA Gatto P., Oyzerski R., Moretto M., Gutin N., Stefanini M., Chen Y., RA Segala C., Davenport C., Dematte L., Mraz A., Battilana J., Stormo K., RA Costa F., Tao Q., Si-Ammour A., Harkins T., Lackey A., Perbost C., RA Taillon B., Stella A., Solovyev V., Fawcett J.A., Sterck L., RA Vandepoele K., Grando S.M., Toppo S., Moser C., Lanchbury J., RA Bogden R., Skolnick M., Sgaramella V., Bhatnagar S.K., Fontana P., RA Gutin A., Van de Peer Y., Salamini F., Viola R.; RT "The first genome sequence of an elite grapevine cultivar (Pinot noir RT Vitis vinifera L.): coping with a highly heterozygous genome."; RL PLoS ONE 2:e1326-e1326(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM425559; CAN68972.1; -; Genomic_DNA. DR ProteinModelPortal; A5AFQ0; -. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 24 46 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 478 498 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 450 477 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 529 AA; 59364 MW; D2F6918DBE5B90A0 CRC64; MQRSRRALLQ RRALEKAIIG RSRLYKVSLS LVFVLWGLVF LLSLWISHGD GYQDGSGMPL IGISTWDEAK QGLNLGSCSV DEHSLIETNS DNSYEGSRND AETKDFTNEL HSKGNVKSTL PVEEGSEVEK SSSDVKSEKD TPKNDRLSRA VPPGLDEFKS KAISYKSKSV TGQAGNVIHR VEPGGADYNY ASASKGAKVL ASNKEAKGAS NILGKDKDKY LRNPCSAEEK FVVIELSEET LVDTIEIANF EHYSSNPKDF ELLGSSVFPT DEWVKLGNFT AANVKHAQRF ALHEPKWVRY LKLNLLSHHG TEFYCTLSVV EVYGVDAVER MLEDLISVQD NPFVPEEITA EKKSIPSQPE PTEGNNLYQK PVSETESDPL LDKPEAIKSN XPDPVEEIRH STEFDKEIEE KDVLLENIRS DIRNFLDSKE IITKDVSDLI SWKSLVSLQL DNLLKDNALL RAEVQKVQED QTHMENKGIA VFLICLIFGF WAFARLLVDM MLSVYMAVDG LMSRYELRAP FGCDSRKYL // ID A5BJ94_VITVI Unreviewed; 640 AA. AC A5BJ94; DT 12-JUN-2007, integrated into UniProtKB/TrEMBL. DT 12-JUN-2007, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CAN63550.1}; GN ORFNames=VITISV_043049 {ECO:0000313|EMBL:CAN63550.1}; OS Vitis vinifera (Grape). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; Vitales; Vitaceae; Vitis. OX NCBI_TaxID=29760 {ECO:0000313|EMBL:CAN63550.1}; RN [1] {ECO:0000313|EMBL:CAN63550.1} RP NUCLEOTIDE SEQUENCE. RA Velasco R., Zharkikh A., Troggio M., Cartwright D.A., Cestaro A., RA Pruss D., Pindo M., FitzGerald L.M., Vezzulli S., Reid J., RA Malacarne G., Iliev D., Coppola G., Wardell B., Micheletti D., RA Macalma T., Facci M., Mitchell J.T., Perazzolli M., Eldredge G., RA Gatto P., Oyzerski R., Moretto M., Gutin N., Stefanini M., Chen Y., RA Segala C., Davenport C., Dematte L., Mraz A., Battilana J., Stormo K., RA Costa F., Tao Q., Si-Ammour A., Harkins T., Lackey A., Perbost C., RA Taillon B., Stella A., Solovyev V., Fawcett J.A., Sterck L., RA Vandepoele K., Grando S.M., Toppo S., Moser C., Lanchbury J., RA Bogden R., Skolnick M., Sgaramella V., Bhatnagar S.K., Fontana P., RA Gutin A., Van de Peer Y., Salamini F., Viola R.; RT "The first genome sequence of an elite grapevine cultivar (Pinot noir RT Vitis vinifera L.): coping with a highly heterozygous genome."; RL PLoS ONE 2:e1326-e1326(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM461301; CAN63550.1; -; Genomic_DNA. DR STRING; 29760.VIT_13s0175g00100.t01; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 202 222 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 640 AA; 71104 MW; 5CC5F4CA2B5DF224 CRC64; MSASTVSITA NTAARRRPVV IGEKKPNIEL LSGDAGVSQF NGIAGEBKLT GGGGKDLSHS IRGETILERS KEVVQIKKTS ANAATEPRRT RKVVSKSERP RWVTAVSIFT KNLVLLVVIL GLVQMIRKLA LKSADSSGGS LVAVPDFERR IAEVESFLKT TTKMMQVQVE VVDRKIESEV GGLRRELSKK IEEKAGDFNN HLEKLDSKSE TLEKKLGELG AMEFLRKEDF DKIFDELKNA KSADYGDREM SLDEIRGIAR EIVEKEIERH AADGLGRVDY ALSSSGAMVV RHSEPYILGK GSGWFPKTSL TGVHRDSERM LKPSFGEPGQ CFPLKGDSGF VQIRLRTTII PEAITLEHVD KAASGLRINL AKSEIIPXGE VEEIEEMAAE LXCRFKVGKG TKVNFWTDXW CGNATLSQSF PQLYALAVXR NATXNEVWDS NFGQGGWNLR FXRGFNDWEL DLIGDLLTML RDFRISXEEE SVFWKGGENG KFGVKEAYNL LIAPNEFAFP KKTLRVSIKA LATYIKCLFF IGVCYARGKM VAYDRSSAPK DCRVYGWHQG HDTDIAAETG SMFLLAEFSY DLEKSNAQTF NVLDLVGSGX VDMVRFDFAS NHGXPSHTCI YRLRVHGHEP DSVSMLAMQS // ID A5DEZ7_PICGU Unreviewed; 666 AA. AC A5DEZ7; DT 12-JUN-2007, integrated into UniProtKB/TrEMBL. DT 22-JUL-2008, sequence version 2. DT 11-NOV-2015, entry version 31. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDK37750.2}; GN ORFNames=PGUG_01848 {ECO:0000313|EMBL:EDK37750.2}; OS Meyerozyma guilliermondii (strain ATCC 6260 / CBS 566 / DSM 6381 / JCM OS 1539 / NBRC 10279 / NRRL Y-324) (Yeast) (Candida guilliermondii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; Meyerozyma. OX NCBI_TaxID=294746 {ECO:0000313|EMBL:EDK37750.2, ECO:0000313|Proteomes:UP000001997}; RN [1] {ECO:0000313|Proteomes:UP000001997} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC 10279 / NRRL RC Y-324 {ECO:0000313|Proteomes:UP000001997}; RX PubMed=19465905; DOI=10.1038/nature08064; RA Butler G., Rasmussen M.D., Lin M.F., Santos M.A.S., Sakthikumar S., RA Munro C.A., Rheinbay E., Grabherr M., Forche A., Reedy J.L., RA Agrafioti I., Arnaud M.B., Bates S., Brown A.J.P., Brunke S., RA Costanzo M.C., Fitzpatrick D.A., de Groot P.W.J., Harris D., RA Hoyer L.L., Hube B., Klis F.M., Kodira C., Lennard N., Logue M.E., RA Martin R., Neiman A.M., Nikolaou E., Quail M.A., Quinn J., RA Santos M.C., Schmitzberger F.F., Sherlock G., Shah P., RA Silverstein K.A.T., Skrzypek M.S., Soll D., Staggs R., Stansfield I., RA Stumpf M.P.H., Sudbery P.E., Srikantha T., Zeng Q., Berman J., RA Berriman M., Heitman J., Gow N.A.R., Lorenz M.C., Birren B.W., RA Kellis M., Cuomo C.A.; RT "Evolution of pathogenicity and sexual reproduction in eight Candida RT genomes."; RL Nature 459:657-662(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH408156; EDK37750.2; -; Genomic_DNA. DR RefSeq; XP_001486177.1; XM_001486127.1. DR STRING; 294746.XP_001486177.1; -. DR EnsemblFungi; EDK37750; EDK37750; PGUG_01848. DR GeneID; 5127590; -. DR KEGG; pgu:PGUG_01848; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A5DEZ7; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001997; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001997}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001997}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 610 627 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 561 585 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 666 AA; 74627 MW; 8734A94A79629A0A CRC64; MARLFLASRE YSSCAGKITE KLHYFLTNDM GCKGAWFLLV AAVSLTCVSG DTSLSESQIL NQTDESTINS LTNTAGSVSN THLSHSKNIE TKEIRPKTTN LIEKITSIAS SQISTGFPDS LIPTPSYPFC TTGPGCNVFQ RPSSTSPTSG VEMRVTDEVA STFLELETQN SHTSISIDHQ VPTNISVSKN SSNITNEDID ECRFMSFEEW KRQKQEEAAA ESVSASSASA ASSADSLSSH TLAKVNTSTV DTTYSSPSNI AQEEDQGRVY KKKFNYASSD CAATIMKANS EAKGASAILH ENKDSYLLNQ CSSSNKYVII ELCQDILVSE VVMGNFEFFS SMFKDIRISV SDQFPATKWE VLGEFEAENV RDLQVFTVRN PLIWARYLKL EIVSHYGNEF YCPISVVRVH GKTMMEEFKE EQKGVDQVPS MNITIATDVA NITTQNSNNY STRDDECRVI LPHLLLHEFL QDINGTDQYC AAVAVDREVK TIETTATTQE SIYKNIMKRL SLLESNATLS LLYIEEQSKL LSTAFTNLER RQTSTFNDLV QSFNITMLSQ VKIMKEAYER IQKEATMLLQ NQENRYRGGL SDANYKLGML ANDLRFHKRI IIFNTLIIIC LLVYVVLTRD TAIGDYITKE KRSQRWGQFT VSKKSALKRK SRKRAN // ID A5DTK8_LODEL Unreviewed; 656 AA. AC A5DTK8; DT 12-JUN-2007, integrated into UniProtKB/TrEMBL. DT 12-JUN-2007, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDK42516.1}; GN ORFNames=LELG_00694 {ECO:0000313|EMBL:EDK42516.1}; OS Lodderomyces elongisporus (strain ATCC 11503 / CBS 2605 / JCM 1781 / OS NBRC 1676 / NRRL YB-4239) (Yeast) (Saccharomyces elongisporus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Candida/Lodderomyces clade; Lodderomyces. OX NCBI_TaxID=379508 {ECO:0000313|EMBL:EDK42516.1, ECO:0000313|Proteomes:UP000001996}; RN [1] {ECO:0000313|Proteomes:UP000001996} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 11503 / CBS 2605 / JCM 1781 / NBRC 1676 / NRRL YB-4239 RC {ECO:0000313|Proteomes:UP000001996}; RX PubMed=19465905; DOI=10.1038/nature08064; RA Butler G., Rasmussen M.D., Lin M.F., Santos M.A.S., Sakthikumar S., RA Munro C.A., Rheinbay E., Grabherr M., Forche A., Reedy J.L., RA Agrafioti I., Arnaud M.B., Bates S., Brown A.J.P., Brunke S., RA Costanzo M.C., Fitzpatrick D.A., de Groot P.W.J., Harris D., RA Hoyer L.L., Hube B., Klis F.M., Kodira C., Lennard N., Logue M.E., RA Martin R., Neiman A.M., Nikolaou E., Quail M.A., Quinn J., RA Santos M.C., Schmitzberger F.F., Sherlock G., Shah P., RA Silverstein K.A.T., Skrzypek M.S., Soll D., Staggs R., Stansfield I., RA Stumpf M.P.H., Sudbery P.E., Srikantha T., Zeng Q., Berman J., RA Berriman M., Heitman J., Gow N.A.R., Lorenz M.C., Birren B.W., RA Kellis M., Cuomo C.A.; RT "Evolution of pathogenicity and sexual reproduction in eight Candida RT genomes."; RL Nature 459:657-662(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH981524; EDK42516.1; -; Genomic_DNA. DR RefSeq; XP_001528174.1; XM_001528124.1. DR ProteinModelPortal; A5DTK8; -. DR STRING; 379508.XP_001528174.1; -. DR EnsemblFungi; EDK42516; EDK42516; LELG_00694. DR GeneID; 5235691; -. DR KEGG; lel:LELG_00694; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A5DTK8; -. DR OMA; IDECHFM; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001996; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001996}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001996}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 30 {ECO:0000256|SAM:SignalP}. FT CHAIN 31 656 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002680301. FT TRANSMEM 588 605 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 371 391 {ECO:0000256|SAM:Coils}. FT COILED 561 581 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 656 AA; 73722 MW; FB44466008696B03 CRC64; MVRCCYARVN ILPLLLFVLQ FFQQCRLINA NETQSVWDAY EQSPLMPSEF PVFAREEEQG SQSSKVSMKT RDQSNEYLQS HLQFAEETQP ASSNDSVSYN DSVIDECHFM SFEEWKKHKI GSLNNGSRKH QNNSKTTGQA KKGKLLSLAS ASVSASGSSS KSILPLSSSS QLSASAASLG LSASASSLSY SSLSSSSLSS KKSSAESVDT PKEEKPLNGK VYKDKFNFAS IDCGATVVET NAQAKGASAI LKENKDTYLL NECSVSNQYV IIELCQDILV GQVALANYEF FSSMFRDVKI SVSDRFPASS WKVIGNYMAL NTRELHVFNI KNPLIWARYL KVEITSHYGN EFYCPISLIR VHGKTMIDEL KEDEENNKQG HEFVVEELEE VLEETLTTKP SSVGRDDNDS LLPNESYDEC RVILPHLRLN EFLKDFNASD SDTNLLCLPT DMASGATGSS VSLPISTAHT SSITTTQDSI YKNFMKRLSL LESNATLSLL YIEEQLKLLS TAFSNLERRQ NKKFNQLVTS VNATLMHQLI SFKDSYDLLH RQYRDVLKTQ AQSYKQHLVD TTREFEQFKE ELTFQRRIVI FNSLLIIFLL TYLVLTRDVD FDVQQQKNVH LVNHEPASIS DTIVVPSSTN HRSRNSSLHK NGKTTT // ID A5HL83_DROME Unreviewed; 304 AA. AC A5HL83; DT 12-JUN-2007, integrated into UniProtKB/TrEMBL. DT 12-JUN-2007, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Sperm-associated SUN domain protein {ECO:0000313|EMBL:ABQ08586.1}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:ABQ08586.1}; RN [1] {ECO:0000313|EMBL:ABQ08586.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=W1118 {ECO:0000313|EMBL:ABQ08586.1}; RC TISSUE=50-hour old pupae {ECO:0000313|EMBL:ABQ08586.1}; RA Kracklauer M.P., Wiora H.M., Chen X.; RT "Cloning of dspag sequence in Drosophila."; RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EF537040; ABQ08586.1; -; mRNA. DR STRING; 7227.FBpp0079852; -. DR PaxDb; A5HL83; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR Bgee; A5HL83; -. DR ExpressionAtlas; A5HL83; differential. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 12 30 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 304 AA; 33995 MW; 2239BFF31A953AD8 CRC64; MDGCRRARKR VYVTYVISFV LLSTFFYYLM AHNSRNNLGI MRLREDVDDI SHILRQQQID SKVAQGSCKF NCLGGEPKGV GSGMCNNRDV SAYVDTLFKR KIGHLMDDVY NLKKQVMSAD CSSKSAQSTP KPESVALAKP RINYASEELG ARIINVKAHS IDGTNIIRSL LGLDFSTNPP VNMIRTGLSP GSCFGFNGSR ATVTLHLART IIVEAITLTH VAREMTPDLC VKSAPKNFDV YGLRSENSKR ELLGQWSYDN AANKRTQSYS VRSDTFFRNL DFSFNSNHGA NSTCIYRVEV YGRL // ID A5JZW1_PLAVS Unreviewed; 926 AA. AC A5JZW1; DT 10-JUL-2007, integrated into UniProtKB/TrEMBL. DT 10-JUL-2007, sequence version 1. DT 14-OCT-2015, entry version 34. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EDL47522.1}; GN ORFNames=PVX_123440 {ECO:0000313|EMBL:EDL47522.1}; OS Plasmodium vivax (strain Salvador I). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Plasmodium). OX NCBI_TaxID=126793 {ECO:0000313|Proteomes:UP000008333}; RN [1] {ECO:0000313|EMBL:EDL47522.1, ECO:0000313|Proteomes:UP000008333} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Salvador I {ECO:0000313|EMBL:EDL47522.1, RC ECO:0000313|Proteomes:UP000008333}; RX PubMed=18843361; DOI=10.1038/nature07327; RA Carlton J.M., Adams J.H., Silva J.C., Bidwell S.L., Lorenzi H., RA Caler E., Crabtree J., Angiuoli S.V., Merino E.F., Amedeo P., RA Cheng Q., Coulson R.M.R., Crabb B.S., del Portillo H.A., Essien K., RA Feldblyum T.V., Fernandez-Becerra C., Gilson P.R., Gueye A.H., Guo X., RA Kang'a S., Kooij T.W.A., Korsinczky M., Meyer E.V.-S., Nene V., RA Paulsen I., White O., Ralph S.A., Ren Q., Sargeant T.J., RA Salzberg S.L., Stoeckert C.J., Sullivan S.A., Yamamoto M.M., RA Hoffman S.L., Wortman J.R., Gardner M.J., Galinski M.R., RA Barnwell J.W., Fraser-Liggett C.M.; RT "Comparative genomics of the neglected human malaria parasite RT Plasmodium vivax."; RL Nature 455:757-763(2008). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EDL47522.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKM01000001; EDL47522.1; -; Genomic_DNA. DR RefSeq; XP_001617249.1; XM_001617199.1. DR ProteinModelPortal; A5JZW1; -. DR GeneID; 5476558; -. DR KEGG; pvx:PVX_123440; -. DR EuPathDB; PlasmoDB:PVX_123440; -. DR HOGENOM; HOG000281004; -. DR InParanoid; A5JZW1; -. DR Proteomes; UP000008333; Chromosome 14. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008333}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008333}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 244 266 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 339 366 {ECO:0000256|SAM:Coils}. FT COILED 382 402 {ECO:0000256|SAM:Coils}. FT COILED 436 470 {ECO:0000256|SAM:Coils}. FT COILED 531 562 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 926 AA; 105219 MW; 8F528633890DD379 CRC64; MSANATNAGG NPRGRGGRGS KLAKNSHIAQ NAASAASSGV SSNAANSSNA ANSSNVANIA NSANGRNAPH GLSPPSHKAF AEGEISDAGP PATDRSDDNN STVDNDERNT LIQVLHSYED FQKKNMHYGK VNPYRKRESQ LSKMRRSIVK LFTLSDIRLD ESDSNRNVDG STYGSSVSKR RNNANGSNFL TSRAAMWNEV EKNDDPEYDL KLESSKNKSF RDAITHQLSV LINDTINDKK GMTYIAGLMI ILSVLITCIS GVITLFSNSA GEKTNLNLHT TKNNYDDINK FMNYIKLGNE ENNRSDFLRI YELLEEFKMS MNQSMSENMN TILNSKKENH ALYELNASLE NKLKELEKKL LVNTKDIDYF KIHSKREVEN FKKILQENYE SFQSKLKDYV KTVDTIKKDI HKKSSLINDV EKKMNKSQMD IKKDVSDRVE NEKRGLLNTI SELQKKIQSI ESKLSSHGAN SRDSLQMDSE AQGGSTKRGW TTLGWTTPEA AQAGWTPPEA TQPGSTQPEV KHKDDQGEAR ERQIEQRIAE WNEQHAGLMQ EIQKELELLK ESAKKSTDFL DDVFPSFEHK ILKNVENKIK YYLEMYKKDI LSEITESKVI YNEEKYKAMA LKQEKMQSEL LKTISSQIKA QTKVIKDDLS KSLHTMVDQK QIKIDSDYPV KAAKISYDSI DMLQKKVDEL YNEFILDYNQ IDWALESLGA RIVYKMTSSP LNRNDFIEKF LNQIASFLPS EEIYGMIKPM GKDPSIILKP SNFPGDCFSF NGSKGKITIH LPATIDVSSI SIQHVHENIS NNSNATPKYF SVYGVVDSNW PEHFESQDIN YDDFKNSSLY SCLHSVYGNL QPREILDKWL KGNKNPGLLH LGDFYFDRKK RISTYPTKHC FPVKRIIFEF TENYGAPYTC VYRLKVHGKR CIRKFK // ID A5K417_PLAVS Unreviewed; 1697 AA. AC A5K417; DT 10-JUL-2007, integrated into UniProtKB/TrEMBL. DT 10-JUL-2007, sequence version 1. DT 14-OCT-2015, entry version 34. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EDL46271.1}; GN ORFNames=PVX_118525 {ECO:0000313|EMBL:EDL46271.1}; OS Plasmodium vivax (strain Salvador I). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Plasmodium). OX NCBI_TaxID=126793 {ECO:0000313|Proteomes:UP000008333}; RN [1] {ECO:0000313|EMBL:EDL46271.1, ECO:0000313|Proteomes:UP000008333} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Salvador I {ECO:0000313|EMBL:EDL46271.1, RC ECO:0000313|Proteomes:UP000008333}; RX PubMed=18843361; DOI=10.1038/nature07327; RA Carlton J.M., Adams J.H., Silva J.C., Bidwell S.L., Lorenzi H., RA Caler E., Crabtree J., Angiuoli S.V., Merino E.F., Amedeo P., RA Cheng Q., Coulson R.M.R., Crabb B.S., del Portillo H.A., Essien K., RA Feldblyum T.V., Fernandez-Becerra C., Gilson P.R., Gueye A.H., Guo X., RA Kang'a S., Kooij T.W.A., Korsinczky M., Meyer E.V.-S., Nene V., RA Paulsen I., White O., Ralph S.A., Ren Q., Sargeant T.J., RA Salzberg S.L., Stoeckert C.J., Sullivan S.A., Yamamoto M.M., RA Hoffman S.L., Wortman J.R., Gardner M.J., Galinski M.R., RA Barnwell J.W., Fraser-Liggett C.M.; RT "Comparative genomics of the neglected human malaria parasite RT Plasmodium vivax."; RL Nature 455:757-763(2008). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EDL46271.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKM01000004; EDL46271.1; -; Genomic_DNA. DR RefSeq; XP_001615998.1; XM_001615948.1. DR ProteinModelPortal; A5K417; -. DR GeneID; 5475296; -. DR KEGG; pvx:PVX_118525; -. DR EuPathDB; PlasmoDB:PVX_118525; -. DR HOGENOM; HOG000282163; -. DR InParanoid; A5K417; -. DR Proteomes; UP000008333; Chromosome 12. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008333}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008333}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 1697 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002685009. FT TRANSMEM 1655 1678 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 386 406 {ECO:0000256|SAM:Coils}. FT COILED 1288 1308 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1697 AA; 196883 MW; 2CA8197721C0B03C CRC64; MIWWFLISIN FLFFLIKSFF TPKDTTYMND INNNQDSSST EKYILGENYN LTSLKLKIDF GSLDTGTKII EYSKGIINIR SIQQYDYDSY MLTPCNSDIW WIYSFSDIIH IEKIGLVSLE HYASNFKVIE ILGSDVYPTK KWKKLGKIAT NFTKSFELFN IYDYCKNYDE DNCWVKYMKF VVLSHHNLEK NYYCTLTHLQ IFASSGVDML SDKIYSDDNI TQIEKDFHDN YKHGKNENIR DHEVIENLEV LLEDKERIQG GGTAPGLGDG RNGNRHIDKK QNGEALHSPE MQTSENITTE QPPMGAPPHS ATPPTVPNFT PFEGSFLDPE TIEEELIDTD LIDSKLMDTE LIKKELMDTE LIEKELMNAE LVERDSKGGH INEPSLDHLT EQIGNAFDDK RDHERNTSKE DEVNTSQQKE EEKDRASLSP SLATKQNLLS KKKSHQGVSP HSLVILNKAM KKKYALLQNF NKLRISKLLK NYPFVNYKNV VNKYMPYVAY LMRHKDQLSY DHHMEDAIQV DHFFNPVDDA LLGGSPPGEH VPTHRRFLSP FRVHMRRVFP KLHSCGDIQR RVPQHSGTTP RTSVKRTTSS NPSGAPPRKP PKLHCYNYGA IKNVQNSILQ NRRMYPLLTI WKPRLLCSHD ITCLKKHAGD INKLLTFRRN LIKSTIRKYL VFISGKRLPP PRRRRHKFLK KKKRKTRKKR GIFRNVFLSK RFNKVGHIKK LHVFYSPIFR FPTCDASRYF PHEEEEKKDP LMTYILEKMQ QQSRIKDPQI LTPLKEQSQD GLIMVYLFSE LRKGRETHNL ESLKWCFDFM TKWKNTFDSV LLMYFVRCAP LASLRNRGRC VDLMKNGLLL KKRDKSTLEK ELHGLFFAHS DRSRTCKRQD RLERTDLSYY AYREEGSEED TEEGAEEDAV EGAEEDADTG EYYPDGEPFP VEPPHLRDPP INHQVSVEPP PTINTLTLYL HMLRGKNVHS RYHTMALMRS PRDVKLLPKL MADICSSLSS RRIAEDPAKS ESYVREVVSD HVANWAEIFT AKFTPNCSPN CRPNHPFSEK LPTAKSKLVY TIAARMEKAT LVDLPHSVRN RGGLLRPPQE VPSVQEKEEV HLQYRKKAQC EAITALFVDL IISRISTWRI TLEREKKDVS KKEQSAKAGG GTDHLEEQSK RAYNEKPIQR SQHGHLEEAK MCLTLGEMEN MIYSDRYLKQ YVEEAGIHKE MQKKDKTKNI INVKYREKNN DKSIKILNEI KETEESNSKL VNEVYDIISE YDDNNESKIY SVKLKSGKQI PLIINKPENK MKEKAINSEA KKKNMKENKL QYLDEFDHVL NDHIQYDHYE QNCIREEKIE EKAKNTRGHA LLTLVDKVKT IENKNNYVIS KLKDVIKITN NKTKIIYHML SNFKILQNTI SLLLKYIMIN EKNMKNLHKN RNKSESFFKI LKDICILQIN EKKKPFDSLQ YICKYLQDLL YDEIEKIYLF EKPTRGGAAG VGPSTSGGRR TTAANGGSTS ASMGGKFPLC GEEEENMLLH NKNSFHDKNP NFIFKEKKKS IFNFLYYENH CHNDIFKTPL IYYNSSVDSL QTFYFKIYNF FRNLPFFHHL VYKFRHYKRV LISYLSGGAA GDAAAQFGSA QFGGAPPTGA QFGSSHFGAS HFGGAYGDPQ NRGNLYAFLL GLLLILFLVN NFFCFLLYKH LSNKLNRFVQ GCTCHRK // ID A6QLV1_BOVIN Unreviewed; 728 AA. AC A6QLV1; DT 21-AUG-2007, integrated into UniProtKB/TrEMBL. DT 21-AUG-2007, sequence version 1. DT 11-NOV-2015, entry version 50. DE SubName: Full=UNC84B protein {ECO:0000313|EMBL:AAI48096.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSBTAP00000004813}; GN Name=UNC84B {ECO:0000313|EMBL:AAI48096.1}; GN Synonyms=SUN2 {ECO:0000313|Ensembl:ENSBTAP00000004813}; OS Bos taurus (Bovine). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; OC Pecora; Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913; RN [1] {ECO:0000313|EMBL:AAI48096.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=L1 Hereford {ECO:0000313|EMBL:AAI48096.1}; RC TISSUE=Fetal skin {ECO:0000313|EMBL:AAI48096.1}; RA Moore S., Alexander L., Brownstein M., Guan L., Lobo S., Meng Y., RA Tanaguchi M., Wang Z., Yu J., Prange C., Schreiber K., Shenmen C., RA Wagner L., Bala M., Barbazuk S., Barber S., Babakaiff R., Beland J., RA Chun E., Del Rio L., Gibson S., Hanson R., Kirkpatrick R., Liu J., RA Matsuo C., Mayo M., Santos R.R., Stott J., Tsai M., Wong D., RA Siddiqui A., Holt R., Jones S.J., Marra M.A.; RL Submitted (JUN-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSBTAP00000004813, ECO:0000313|Proteomes:UP000009136} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000004813, RC ECO:0000313|Proteomes:UP000009136}; RX PubMed=19393038; DOI=10.1186/gb-2009-10-4-r42; RA Zimin A.V., Delcher A.L., Florea L., Kelley D.R., Schatz M.C., RA Puiu D., Hanrahan F., Pertea G., Van Tassell C.P., Sonstegard T.S., RA Marcais G., Roberts M., Subramanian P., Yorke J.A., Salzberg S.L.; RT "A whole-genome assembly of the domestic cow, Bos taurus."; RL Genome Biol. 10:R42.01-R42.10(2009). RN [3] {ECO:0000313|Ensembl:ENSBTAP00000004813} RP IDENTIFICATION. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000004813}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DAAA02014665; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BC148095; AAI48096.1; -; mRNA. DR RefSeq; NP_001095789.1; NM_001102319.1. DR UniGene; Bt.61384; -. DR STRING; 9913.ENSBTAP00000004813; -. DR Ensembl; ENSBTAT00000004813; ENSBTAP00000004813; ENSBTAG00000003693. DR GeneID; 618392; -. DR KEGG; bta:618392; -. DR CTD; 25777; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG056957; -. DR KO; K19347; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 20901159; -. DR Proteomes; UP000009136; Chromosome 5. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009136}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009136}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 173 191 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 197 214 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 221 243 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 282 302 {ECO:0000256|SAM:Coils}. FT COILED 384 404 {ECO:0000256|SAM:Coils}. FT COILED 415 442 {ECO:0000256|SAM:Coils}. FT COILED 489 509 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 728 AA; 81446 MW; 422426DE3FCB325C CRC64; MSRRSQRLTR YSSQGEDDGG SSSSGGSSVM GSQSTLFKDS PLRTLKKKTN NSKRLSPAPQ LGPSSDTHTY YSESVVRESY IGSPRAASIA ASLTRHSLLD DPYWSEDLRV RRRRGTGGTD SSKPNGLPEN KLSEDFLGSS SGYSSEDDYV GYSETDHQGS GSRLRNAVSR VGSLLWMVVT SPGRLFGLLY WWIGTTWYRL TTAASLLDVF VLTRRFSSLK MFLWFLLLLL LLTGLTYGAW YFYPFGLQTL HPAVVSWWAS KGSIGQREVW ESRDSSPHFQ AEQRILSRVH SLERRLDALA AEFSSSWQKE AMRLERLELQ QGAGGQGGGG GGLSHEDTLA LLEGLVSRRE AALKEDLRRD TAARIQEELV TLRSEHQQDS EDLFKKIVQA SQESEARLQQ LKSEWQSRMT QESFQENAMK ELGRLEGQLA GLRQELAALT LKQSLVEDQV GLLPQQLQAV RDDVESQFPA WVSQFLLRGG GTRTGLLQQE EMEAQLRDLE SRILTHVAEM QGKSAREAVA SLGLTLQREG VIGVTEEQVH RIVNQALKRY SEDRIGMVDY ALESGGASVI STRCSETYET KTALLSLFGI PLWYHSQSPR VILQPDVHPG NCWAFQGPQG FAVVRLSARI RPTAVTLEHV PKSLSPNSTI SSAPKDFAIF GLDEDLQQEG TPLGQFTYDQ DGEPIQTFYF QDPKMATYQV VELRILTNWG HPEYTCIYRF RVHGEPAH // ID A6QSD1_AJECN Unreviewed; 731 AA. AC A6QSD1; DT 21-AUG-2007, integrated into UniProtKB/TrEMBL. DT 21-AUG-2007, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDN02423.1}; GN ORFNames=HCAG_00287 {ECO:0000313|EMBL:EDN02423.1}; OS Ajellomyces capsulatus (strain NAm1 / WU24) (Darling's disease fungus) OS (Histoplasma capsulatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Histoplasma. OX NCBI_TaxID=339724 {ECO:0000313|EMBL:EDN02423.1, ECO:0000313|Proteomes:UP000009297}; RN [1] {ECO:0000313|Proteomes:UP000009297} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=NAm1 / WU24 {ECO:0000313|Proteomes:UP000009297}; RX PubMed=19717792; DOI=10.1101/gr.087551.108; RA Sharpton T.J., Stajich J.E., Rounsley S.D., Gardner M.J., RA Wortman J.R., Jordar V.S., Maiti R., Kodira C.D., Neafsey D.E., RA Zeng Q., Hung C.-Y., McMahan C., Muszewska A., Grynberg M., RA Mandel M.A., Kellner E.M., Barker B.M., Galgiani J.N., Orbach M.J., RA Kirkland T.N., Cole G.T., Henn M.R., Birren B.W., Taylor J.W.; RT "Comparative genomic analyses of the human fungal pathogens RT Coccidioides and their relatives."; RL Genome Res. 19:1722-1731(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH476655; EDN02423.1; -; Genomic_DNA. DR RefSeq; XP_001543241.1; XM_001543191.1. DR EnsemblFungi; EDN02423; EDN02423; HCAG_00287. DR GeneID; 5449627; -. DR KEGG; aje:HCAG_00287; -. DR EuPathDB; FungiDB:HCAG_00287; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000009297; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009297}; KW Reference proteome {ECO:0000313|Proteomes:UP000009297}. FT COILED 161 181 {ECO:0000256|SAM:Coils}. FT COILED 408 428 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 731 AA; 79079 MW; 304828646DD5F7AC CRC64; MTGRKTASVH SGSRAQNTRG TRAAPTENPT AATEGNLSNP DLGNPSLPDV RTQQSFAYGS TKTPALPRQL EVDPSMGLSE MIDTLDDGLR QAQDRELARV DVEDPTHPVP ERRQTRSMSA SVRSSISPAP GPVSRRASSR NATTRSRAGP RRAVSRQTTP EEQLLETLRE VSEETEGVKR EEDPSVSVLH DTPSFNGSAS VSWTTERAIH GILPRETNAG TRPNYYLHDP YGSRPSSSQE PSGLRLPPTR RPIFEEAFRA NPPLPGPIDV PNVSTSAAAR RTLPPVPAFN QLRNKSASKS SASSAFSASI HTPGSSTHSS PVLVAATPAG VRVTSKQRLS GIAKTPSALL VTIGLILMTV LTYFCRDHAC MFPQSLQNTM SHYLCSPAST FATDNSTSMY SEAFHKLSSR LDQRLSDMAK EVAILKNEWN RRLPHLKEAL SGSPAAAINP LKPPKVNYAS IGMGAVVDPY LTSPTMATSA GLVSRIGQYL AKVPRGSPPV AALQPWDGVG ECWCAATRSN ASQLTILLGR AIVPEEVVIE HIPKGATLDP GSAPREMELW VQYMARPPTA AAAYPQGSGS SNPSPPPSSA SSPHAPSPFP PSSAPSQPLP PPPATPHLRN PPFSHLRPSY YPHHLLPSWL RDAILTTLRQ VYPNEPTTAY SDDALLGPSF FRVGRWQYNI HGGHHIQRFD LDAVIDMPAV RVEKVVFRVK SNWGAAHTCL YRIYRRALGV E // ID A6ZNZ8_YEAS7 Unreviewed; 587 AA. AC A6ZNZ8; DT 11-SEP-2007, integrated into UniProtKB/TrEMBL. DT 11-SEP-2007, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Integral membrane protein {ECO:0000313|EMBL:EDN64013.1}; GN ORFNames=SCY_5216 {ECO:0000313|EMBL:EDN64013.1}; OS Saccharomyces cerevisiae (strain YJM789) (Baker's yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=307796 {ECO:0000313|EMBL:EDN64013.1, ECO:0000313|Proteomes:UP000007060}; RN [1] {ECO:0000313|EMBL:EDN64013.1, ECO:0000313|Proteomes:UP000007060} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=YJM789 {ECO:0000313|EMBL:EDN64013.1, RC ECO:0000313|Proteomes:UP000007060}; RX PubMed=17652520; DOI=10.1073/pnas.0701291104; RA Wei W., McCusker J.H., Hyman R.W., Jones T., Ning Y., Cao Z., Gu Z., RA Bruno D., Miranda M., Nguyen M., Wilhelmy J., Komp C., Tamse R., RA Wang X., Jia P., Luedi P., Oefner P.J., David L., Dietrich F.S., RA Li Y., Davis R.W., Steinmetz L.M.; RT "Genome sequencing and comparative analysis of Saccharomyces RT cerevisiae strain YJM789."; RL Proc. Natl. Acad. Sci. U.S.A. 104:12825-12830(2007). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EDN64013.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAFW02000030; EDN64013.1; -; Genomic_DNA. DR EnsemblFungi; EDN64013; EDN64013; SCY_5216. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007060; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007060}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 587 AA; 67254 MW; 8C28CE40124071A8 CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGLHDTTV ITTGSTTNVQ KEHSSPLSTG SLRTHDFGQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGPERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNEWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWTILG EFEAGNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKQMISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IVQQAIR // ID A7AW08_BABBO Unreviewed; 888 AA. AC A7AW08; DT 11-SEP-2007, integrated into UniProtKB/TrEMBL. DT 11-SEP-2007, sequence version 1. DT 14-OCT-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EDO05236.1}; GN ORFNames=BBOV_I001520 {ECO:0000313|EMBL:EDO05236.1}; OS Babesia bovis. OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida; OC Babesiidae; Babesia. OX NCBI_TaxID=5865 {ECO:0000313|EMBL:EDO05236.1, ECO:0000313|Proteomes:UP000002173}; RN [1] {ECO:0000313|EMBL:EDO05236.1, ECO:0000313|Proteomes:UP000002173} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=T2Bo {ECO:0000313|EMBL:EDO05236.1, RC ECO:0000313|Proteomes:UP000002173}; RX PubMed=17953480; DOI=10.1371/journal.ppat.0030148; RA Brayton K.A., Lau A.O.T., Herndon D.R., Hannick L., Kappmeyer L.S., RA Berens S.J., Bidwell S.L., Brown W.C., Crabtree J., Fadrosh D., RA Feldblum T., Forberger H.A., Haas B.J., Howell J.M., Khouri H., RA Koo H., Mann D.J., Norimine J., Paulsen I.T., Radune D., Ren Q., RA Smith R.K. Jr., Suarez C.E., White O., Wortman J.R., Knowles D.P. Jr., RA McElwain T.F., Nene V.M.; RT "Genome sequence of Babesia bovis and comparative analysis of RT apicomplexan hemoprotozoa."; RL PLoS Pathog. 3:1401-1413(2007). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EDO05236.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAXT01000005; EDO05236.1; -; Genomic_DNA. DR RefSeq; XP_001608804.1; XM_001608754.1. DR EnsemblProtists; EDO05236; EDO05236; BBOV_I001520. DR GeneID; 5477009; -. DR KEGG; bbo:BBOV_I001520; -. DR EuPathDB; PiroplasmaDB:BBOV_I001520; -. DR InParanoid; A7AW08; -. DR Proteomes; UP000002173; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002173}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002173}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 12 32 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 843 864 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 616 636 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 888 AA; 102547 MW; 7418A2A694B5AD78 CRC64; MSILLTHKHK KRYVVQVIFC ILTLYGCLRH IFNSYQSGYL SANSVKDAKR YNSKTVKGKP FDSQTFKLKV DFLSEDAGAI IVAQSKDISH LKSIQNDDPN SYLLAPCQKT DWFIISFPER ISVKYVAFIS NEYYASTYKT IRISRSSVYP SEKWHILAEL ETEMGQSEIF DLSLLCDKGG TKGCWTKYIK VEFLDYHRFE DNYYCSLTSM KVYGSTAVDL LESEITDDIN PYKGTNVPYG QNTDKVTEIA TIHIKEPSND NKLQPGWPDN YSGRVVFGKS KRISKLVCPA TITNKYDTIH VLLFKFMASL AKRKDLHGIN HRLYPRMKRF IYEKDCGVRS LLSFWSNNAL HHVTNCKSKT RCWKMLFHVQ LPLDHTWFEK VIYRMIKQGW IKFKVPWTFR GILSTPILLC VRRYGFLGRF TCYYYFSYTS FTLTSNQKIV GYIPTRVPDL SFKCIYILFS DRKLKKVAPV SKKDISEVFN GHTFVKGTRM LIADDSISSI ILSRDLCALT INAHMFQNAD HMKYLMDLVL KSYVAYKDNV VSRVTKDLGL VRQIANSLDR SNTNFTRLHI DNMDGNHVGA KGDGSGGIWE NIPDSNFISD QRIKAMKETK GHEHVLLQIS ERVKALENIV AEFRKKQDNS DVTLGSCLEH LQYVANKVQR RTYQTLNVTT YGRDIDSEFS KILNILGIKR YHIIYVDDLP RKPVFNKMLQ ERKIAGSFIL QTVIVEKGSS AFNKLKGQCC HITGVVPKQN KGRCHVIIAL IKLSKYMPIL RRLQVLDCCR RFGCVCIRYN PPDSSYLVWL PHFSGADGTK IGCSLMGPLY SFIVSCSNHI FAPVIAFFEH ISLAIFNIYT LFLCFFFFQA IWFYRERNTR VMIYEMSKHI KKLNHHTA // ID A7E4T9_SCLS1 Unreviewed; 1017 AA. AC A7E4T9; DT 11-SEP-2007, integrated into UniProtKB/TrEMBL. DT 11-SEP-2007, sequence version 1. DT 11-NOV-2015, entry version 32. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDN90911.1}; GN ORFNames=SS1G_00311 {ECO:0000313|EMBL:EDN90911.1}; OS Sclerotinia sclerotiorum (strain ATCC 18683 / 1980 / Ss-1) (White OS mold) (Whetzelinia sclerotiorum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Leotiomycetes; OC Helotiales; Sclerotiniaceae; Sclerotinia. OX NCBI_TaxID=665079 {ECO:0000313|Proteomes:UP000001312}; RN [1] {ECO:0000313|Proteomes:UP000001312} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 18683 / 1980 / Ss-1 {ECO:0000313|Proteomes:UP000001312}; RX PubMed=21876677; DOI=10.1371/journal.pgen.1002230; RA Amselem J., Cuomo C.A., van Kan J.A.L., Viaud M., Benito E.P., RA Couloux A., Coutinho P.M., de Vries R.P., Dyer P.S., Fillinger S., RA Fournier E., Gout L., Hahn M., Kohn L., Lapalu N., Plummer K.M., RA Pradier J.-M., Quevillon E., Sharon A., Simon A., ten Have A., RA Tudzynski B., Tudzynski P., Wincker P., Andrew M., Anthouard V., RA Beever R.E., Beffa R., Benoit I., Bouzid O., Brault B., Chen Z., RA Choquer M., Collemare J., Cotton P., Danchin E.G., Da Silva C., RA Gautier A., Giraud C., Giraud T., Gonzalez C., Grossetete S., RA Gueldener U., Henrissat B., Howlett B.J., Kodira C., Kretschmer M., RA Lappartient A., Leroch M., Levis C., Mauceli E., Neuveglise C., RA Oeser B., Pearson M., Poulain J., Poussereau N., Quesneville H., RA Rascle C., Schumacher J., Segurens B., Sexton A., Silva E., Sirven C., RA Soanes D.M., Talbot N.J., Templeton M., Yandava C., Yarden O., RA Zeng Q., Rollins J.A., Lebrun M.-H., Dickman M.; RT "Genomic analysis of the necrotrophic fungal pathogens Sclerotinia RT sclerotiorum and Botrytis cinerea."; RL PLoS Genet. 7:E1002230-E1002230(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH476621; EDN90911.1; -; Genomic_DNA. DR RefSeq; XP_001598225.1; XM_001598175.1. DR EnsemblFungi; EDN90911; EDN90911; SS1G_00311. DR GeneID; 5494871; -. DR KEGG; ssl:SS1G_00311; -. DR EuPathDB; FungiDB:SS1G_00311; -. DR InParanoid; A7E4T9; -. DR OMA; SEDIWID; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001312; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001312}; KW Reference proteome {ECO:0000313|Proteomes:UP000001312}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 27 {ECO:0000256|SAM:SignalP}. FT CHAIN 28 1017 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002707511. SQ SEQUENCE 1017 AA; 112537 MW; 0B9630E7E7EB1388 CRC64; MVFAGHRGMV AATALLFLCN LPSRIIASSV TTATSVNSII SVPNTTGTCE FRTINYITDI LPQQCLRSSW SGTESPLTTV TRATDAVGAS ESQGNDGLTD TVSNTPFSEY VAQTNDGGIA RETSRTSESR QSTSSSPSPN IPTPAASSIT VDEGELNHAS FLSFEEWKKQ ILEQAGQQDL NLGKRRSAEA ARKREAEAFQ NNLESLGDDG EIDLDFRAFR SGAAEQTSRM TEDNHVDSSK GSQKEKSDSG HRRDQHRSKD AGKTCKERFS YASFDAGATV LKTHQGAKNS KAVLIENKDS YMLSECKTPN KFLIIELSED IWIDTLVLAN YEFFSSMLRT FRVSVSDRWP VKTDKWKDLG IYEARNSREI QAFLIENPQI WARYIRIEFL THYGKEYYCP LSLVRVHGTR MLESWKDTEA NNDDDEEADE DPDDGFVPEA VAEAIQVTST EVQTVHVTTS GQKETTRSYA PRDVQKDEAP LETYSKPPPT PTSPWRKSVV GQSEILIAQA RGLCFPSDAP EHILTSQAAV ENESTDFKTT MQVPPSPEMI STAFTDNWIT SSSSAFTGPS AQQTLSEDKA SLASTSLTQE THESSRISHG SLHSTTPTSF TTTHIHKPQD ATSTNKTRST NTASASASLP TIQESFFKAV SRRLQLLETN STLSLKYIEE QSKMLREAFL KVEKRQLQKT TGFLENLNST VLTELRVFRQ QYDEIWQSTV ISLESQREES RREILAISAR LNILADEVVF QKRMSIIQSI LLLLCLGLVI FSRVSSAEPL SFSLHNRRLR RSANSMGSPN DTPGYTSRDH EDYVGGASFP VNAWKNQHRR QPSDESVNSR SRSRGWGPPT PISTYSRSDN ELTPPRSFDE ATANTVTGAS TGTFSRLRRS ISMRYQNSNP LLSASKEQEL LRTSSFGPSL RSQNSSPASF LSVADAKERG GKERISSGLA SPPPSDNDSH DPMDDVNSAP TGVQHETLER QSVNDIRESQ APQTLTQEEP DNEQKPLPAL PVGNPSP // ID A7ESM0_SCLS1 Unreviewed; 760 AA. AC A7ESM0; DT 11-SEP-2007, integrated into UniProtKB/TrEMBL. DT 11-SEP-2007, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDN92462.1}; GN ORFNames=SS1G_08325 {ECO:0000313|EMBL:EDN92462.1}; OS Sclerotinia sclerotiorum (strain ATCC 18683 / 1980 / Ss-1) (White OS mold) (Whetzelinia sclerotiorum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Leotiomycetes; OC Helotiales; Sclerotiniaceae; Sclerotinia. OX NCBI_TaxID=665079 {ECO:0000313|Proteomes:UP000001312}; RN [1] {ECO:0000313|Proteomes:UP000001312} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 18683 / 1980 / Ss-1 {ECO:0000313|Proteomes:UP000001312}; RX PubMed=21876677; DOI=10.1371/journal.pgen.1002230; RA Amselem J., Cuomo C.A., van Kan J.A.L., Viaud M., Benito E.P., RA Couloux A., Coutinho P.M., de Vries R.P., Dyer P.S., Fillinger S., RA Fournier E., Gout L., Hahn M., Kohn L., Lapalu N., Plummer K.M., RA Pradier J.-M., Quevillon E., Sharon A., Simon A., ten Have A., RA Tudzynski B., Tudzynski P., Wincker P., Andrew M., Anthouard V., RA Beever R.E., Beffa R., Benoit I., Bouzid O., Brault B., Chen Z., RA Choquer M., Collemare J., Cotton P., Danchin E.G., Da Silva C., RA Gautier A., Giraud C., Giraud T., Gonzalez C., Grossetete S., RA Gueldener U., Henrissat B., Howlett B.J., Kodira C., Kretschmer M., RA Lappartient A., Leroch M., Levis C., Mauceli E., Neuveglise C., RA Oeser B., Pearson M., Poulain J., Poussereau N., Quesneville H., RA Rascle C., Schumacher J., Segurens B., Sexton A., Silva E., Sirven C., RA Soanes D.M., Talbot N.J., Templeton M., Yandava C., Yarden O., RA Zeng Q., Rollins J.A., Lebrun M.-H., Dickman M.; RT "Genomic analysis of the necrotrophic fungal pathogens Sclerotinia RT sclerotiorum and Botrytis cinerea."; RL PLoS Genet. 7:E1002230-E1002230(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH476631; EDN92462.1; -; Genomic_DNA. DR RefSeq; XP_001590585.1; XM_001590535.1. DR EnsemblFungi; EDN92462; EDN92462; SS1G_08325. DR GeneID; 5486705; -. DR KEGG; ssl:SS1G_08325; -. DR EuPathDB; FungiDB:SS1G_08325; -. DR InParanoid; A7ESM0; -. DR OMA; YDNASIM; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001312; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001312}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001312}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 254 272 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 458 489 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 760 AA; 84396 MW; D42C60FD5AFAEF08 CRC64; MSTRRTTRAG SRAASSVGGT HAVGPEDIPA ARTPGRGRPR KSGGSVAGSE RANGLPPMAA STSTAYGTNT LALPNYAPRG APLANDISSV IEGLLEPEPL NRGARPRPVS PQAPVPDIDA PTAQPKSGAP AHIPTMRSRS GPTKRRVPHA FDRNFQNESQ LYDNASIMSG SYQQDEFRVS GPSYDEGELN CCTALRGIAE EQQYMRGGAQ GWNFLDLGDE TSFDFRQVLR SPLSFFKKLG KAFFRLFGDV TKSLFPLLLL GFIIFISWMA YLNSGGPSKH WYGADVSANI KQFIPSQIRH PSRIFAPEDL KDVYRRLDIA ESDIVFLKHR SSIDKNALAE IQKILPDFIA LDKDYNGKPA LPASLWQAMR ERIRADPSLI PPPVIMTTGR PSDTSTKSDS HDNEVFNSKQ FDRYLESNRA KIQSWAGSEF DVLHQTRLQQ LIKDGKIATR ENVVELIKQS YRDEAQEVKS ELEKVTKALE DKVASLGKQA LTAERVKQLA NEVVIAQVEA ITKANVNKKT VQSLQRMDHF AKRSHSAVIP RLTSPSYRFP HMDFGFIRRS IALVANHPIP LPNPADAALN SWEDIGDCWC SAQQNGNGPT LGVITANYIW PDQVVVEHYP QSGYVNSGLS PLSAPRQMEL LVYIPHKPTY LKVKSMSDQI FPEANYRDLE SGWVQVGAWS YDAYGHTQQA FPLQIELKEF ISEPENNLKN DTSATNKFLV RSKGNHGRGA VPYTCIYRVR LHGDVRPDEE TFYTLDSSKT // ID A7MCZ7_HUMAN Unreviewed; 812 AA. AC A7MCZ7; DT 02-OCT-2007, integrated into UniProtKB/TrEMBL. DT 02-OCT-2007, sequence version 1. DT 11-NOV-2015, entry version 40. DE SubName: Full=UNC84A protein {ECO:0000313|EMBL:AAI52419.1}; GN Name=UNC84A {ECO:0000313|EMBL:AAI52419.1}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|EMBL:AAI52419.1}; RN [1] {ECO:0000313|EMBL:AAI52419.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RA Gerhard D.S., Wagner L., Feingold E.A., Shenmen C.M., Grouse L.H., RA Schuler G., Klein S.L., Old S., Rasooly R., Good P., Guyer M., RA Peck A.M., Derge J.G., Lipman D., Collins F.S., Jang W., Sherry S., RA Feolo M., Misquitta L., Lee E., Rotmistrovsky K., Greenhut S.F., RA Schaefer C.F., Buetow K., Bonner T.I., Haussler D., Kent J., RA Kiekhaus M., Furey T., Brent M., Prange C., Schreiber K., Shapiro N., RA Bhat N.K., Hopkins R.F., Hsie F., Driscoll T., Soares M.B., RA Casavant T.L., Scheetz T.E., Brown-stein M.J., Usdin T.B., RA Toshiyuki S., Carninci P., Piao Y., Dudekula D.B., Ko M.S., RA Kawakami K., Suzuki Y., Sugano S., Gruber C.E., Smith M.R., RA Simmons B., Moore T., Waterman R., Johnson S.L., Ruan Y., Wei C.L., RA Mathavan S., Gunaratne P.H., Wu J., Garcia A.M., Hulyk S.W., Fuh E., RA Yuan Y., Sneed A., Kowis C., Hodgson A., Muzny D.M., McPherson J., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madari A., Young A.C., Wetherby K.D., RA Granite S.J., Kwong P.N., Brinkley C.P., Pearson R.L., Bouffard G.G., RA Blakesly R.W., Green E.D., Dickson M.C., Rodriguez A.C., Grimwood J., RA Schmutz J., Myers R.M., Butterfield Y.S., Griffith M., Griffith O.L., RA Krzywinski M.I., Liao N., Morin R., Morrin R., Palmquist D., RA Petrescu A.S., Skalska U., Smailus D.E., Stott J.M., Schnerch A., RA Schein J.E., Jones S.J., Holt R.A., Baross A., Marra M.A., Clifton S., RA Makowski K.A., Bosak S., Malek J.; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC152418; AAI52419.1; -; mRNA. DR UniGene; Hs.438072; -. DR STRING; 9606.ENSP00000384015; -. DR PaxDb; A7MCZ7; -. DR PRIDE; A7MCZ7; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOVERGEN; HBG104132; -. DR ChiTaRS; SUN1; human. DR NextBio; 35463415; -. DR InterPro; IPR012919; SUN_dom. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 286 309 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 316 335 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 410 430 {ECO:0000256|SAM:Coils}. FT COILED 455 489 {ECO:0000256|SAM:Coils}. FT COILED 502 522 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 812 AA; 90118 MW; 5ECF0C21A97AB4E5 CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADSGTSSA VSLKNRAART TKQRRSTNKS AFSINHVSRQ VTSSGVSYGG TVSLQDAVTR RPPVLDESWI REQTTVDHFW GLDDDGDLKG GNKAAIQGNG DVGVAAATAH NGFSCSNCSM LSERKDVLTA HPAAPGPVSR VYSRDRNQKC DDCKGKRHLD AHPGRAGTLW HIWACAGYFL LQILRRIGAV GQAVSRTAWS ALWLAVVAPG KAASGVFWWL GIGWYQFVTL ISWLNVFLLT RCLRNICKFL VLLIPLFLLL AGLSLRGQGN FFSFLPVLNW ASMHRTQRVD DPQDVFKPTT SRLKQPLQGD SEAFPWHWMS GVEQQVASLS GQCHHHGENL RELTTLLQKL QARVDQMEGG AAGPSASVRD AVGQPPRETD FMAFHQEHEV RMSHLEDILG KLREKSEAIQ KELEQTKQKT ISAVGEQLLP TVEHLQLELD QLKSELSSWR HVKTGCETVD AVQERVDVQV REMVKLLFSE DQQGGSLEQL LQRFSSQFVS KGDLQTMLRD LQLQILRNVT HHVSVTKQLP TSEAVVSAVS EAGASGITEA QARAIVNSAL KLYSQDKTGM VDFALESGGG SILSTRCSET YETKTALMSL FGIPLWYFSQ SPRVVIQPDI YPGNCWAFKG SQGYLVVRLS MMIHPAAFTL EHIPKTLSPT GNISSAPKDF AVYGLENEYQ EEGQLLGQFT YDQDGESLQM FQALKRPDDT AFQIVELRIF SNWGHPEYTC LYRFRVHGEP VK // ID A7S8D4_NEMVE Unreviewed; 201 AA. AC A7S8D4; DT 02-OCT-2007, integrated into UniProtKB/TrEMBL. DT 02-OCT-2007, sequence version 1. DT 11-NOV-2015, entry version 34. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDO40082.1}; GN ORFNames=v1g108621 {ECO:0000313|EMBL:EDO40082.1}; OS Nematostella vectensis (Starlet sea anemone). OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Actiniaria; OC Edwardsiidae; Nematostella. OX NCBI_TaxID=45351 {ECO:0000313|Proteomes:UP000001593}; RN [1] {ECO:0000313|EMBL:EDO40082.1, ECO:0000313|Proteomes:UP000001593} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CH2 X CH6 {ECO:0000313|Proteomes:UP000001593}; RX PubMed=17615350; DOI=10.1126/science.1139158; RA Putnam N.H., Srivastava M., Hellsten U., Dirks B., Chapman J., RA Salamov A., Terry A., Shapiro H., Lindquist E., Kapitonov V.V., RA Jurka J., Genikhovich G., Grigoriev I.V., Lucas S.M., Steele R.E., RA Finnerty J.R., Technau U., Martindale M.Q., Rokhsar D.S.; RT "Sea anemone genome reveals ancestral eumetazoan gene repertoire and RT genomic organization."; RL Science 317:86-94(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS469597; EDO40082.1; -; Genomic_DNA. DR RefSeq; XP_001632145.1; XM_001632095.1. DR UniGene; Nve.40381; -. DR STRING; 45351.NEMVEDRAFT_v1g108621-PA; -. DR EnsemblMetazoa; EDO40082; EDO40082; NEMVEDRAFT_v1g108621. DR GeneID; 5511738; -. DR KEGG; nve:NEMVE_v1g108621; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000007503; -. DR InParanoid; A7S8D4; -. DR KO; K19347; -. DR OMA; FPLWYFS; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000001593; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001593}; KW Reference proteome {ECO:0000313|Proteomes:UP000001593}. SQ SEQUENCE 201 AA; 22543 MW; 9BB10C9A02099343 CRC64; MLEKMLRDNM EAALDKYSAD KLGIPDYALE SDGGAIHFPH HSTTFNTGTE RGWFGLSFWY DTVSPRVIIQ PDNKPGQCWA FQGQQGYVVI KLSRAIIPTM FTLEHIPSSL STNGHGKIPT APKDFSVWGW ADPEGTEKVM LGNFTYKKEG KSLQTFQVKG TPSDAVFRYV ELRVLSNHGQ ATHTCIYRLR VHGDLPPTTR P // ID A7TF77_VANPO Unreviewed; 683 AA. AC A7TF77; DT 02-OCT-2007, integrated into UniProtKB/TrEMBL. DT 02-OCT-2007, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDO19102.1}; GN ORFNames=Kpol_2000p70 {ECO:0000313|EMBL:EDO19102.1}; OS Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294) OS (Kluyveromyces polysporus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Vanderwaltozyma. OX NCBI_TaxID=436907 {ECO:0000313|Proteomes:UP000000267}; RN [1] {ECO:0000313|EMBL:EDO19102.1, ECO:0000313|Proteomes:UP000000267} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 22028 / DSM 70294 {ECO:0000313|Proteomes:UP000000267}; RX PubMed=17494770; DOI=10.1073/pnas.0608218104; RA Scannell D.R., Frank A.C., Conant G.C., Byrne K.P., Woolfit M., RA Wolfe K.H.; RT "Independent sorting-out of thousands of duplicated gene pairs in two RT yeast species descended from a whole-genome duplication."; RL Proc. Natl. Acad. Sci. U.S.A. 104:8397-8402(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS480382; EDO19102.1; -; Genomic_DNA. DR RefSeq; XP_001646960.1; XM_001646910.1. DR STRING; 436907.XP_001646960.1; -. DR EnsemblFungi; EDO19102; EDO19102; Kpol_2000p70. DR GeneID; 5547431; -. DR KEGG; vpo:Kpol_2000p70; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A7TF77; -. DR OrthoDB; EOG7SBNXT; -. DR PhylomeDB; A7TF77; -. DR Proteomes; UP000000267; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000267}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000267}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 683 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002712765. FT TRANSMEM 546 563 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 683 AA; 77136 MW; 837F8D3F114E35C4 CRC64; MINTILICLG LIFSIISATN STGNETDYLK HRQNLSNQSN HTTWENIQSL SNISSIDIGV NNGSVPVDAS IKHQSSSLVA VPTLNSLRCP TIQVDKPDQT VSNNLTTNLD NKTDSNSTFL SFNEWREAKL SEIPSILDRP LKTKAPVDAS CYKENNVIGE EMEIDVGVFT DSSDEDEKED GPAVKIYKDK FNYASIDCAA TIMKSNSDAI GAGSILIENK DSYLLNPCSA PNKFVIIELC QDILVEEIVM ANFEFFSSTF KDIKFLVSNR YPVSKSEWKT LGTFQGENSR DIQKFKIENP QIWARYLRIE ILSHYDDEFY CPISIVRVHG KTMMDEYKMS NIKETSEDAP CIENSEEVSL KETTIVENCD PLPDIPPENI TDISNLSKMS GICTSQIVPL KFDQFLMDFN NSYCPPKATK DMQITSSSVS SSSTEESIFK NIMKRLSILE NNATLTVLYI EEQSKLLYKS FEKLEKNHAT KFSDLIGIFN ATVVSNLDAL GDFANQLKEQ SLKILEEQKL NNDHFTTQTE HRLKRMENQL GYQRRLIYSM LFFVAGLVLF LILNKEITLE DDNDGNDWII TAPPLEKLKK FNSSMSSTYK EGMGTEKTLF RSPSNSSLSA SSIIEPLPEE NDSNSNYNSD IDENVSKSSQ SEDNVNDRIN ETLKLSDIAK EDLMDEEDME WEF // ID A7TK44_VANPO Unreviewed; 721 AA. AC A7TK44; DT 02-OCT-2007, integrated into UniProtKB/TrEMBL. DT 02-OCT-2007, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDO17390.1}; GN ORFNames=Kpol_1060p46 {ECO:0000313|EMBL:EDO17390.1}; OS Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294) OS (Kluyveromyces polysporus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Vanderwaltozyma. OX NCBI_TaxID=436907 {ECO:0000313|Proteomes:UP000000267}; RN [1] {ECO:0000313|EMBL:EDO17390.1, ECO:0000313|Proteomes:UP000000267} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 22028 / DSM 70294 {ECO:0000313|Proteomes:UP000000267}; RX PubMed=17494770; DOI=10.1073/pnas.0608218104; RA Scannell D.R., Frank A.C., Conant G.C., Byrne K.P., Woolfit M., RA Wolfe K.H.; RT "Independent sorting-out of thousands of duplicated gene pairs in two RT yeast species descended from a whole-genome duplication."; RL Proc. Natl. Acad. Sci. U.S.A. 104:8397-8402(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS480405; EDO17390.1; -; Genomic_DNA. DR RefSeq; XP_001645248.1; XM_001645198.1. DR STRING; 436907.XP_001645248.1; -. DR EnsemblFungi; EDO17390; EDO17390; Kpol_1060p46. DR GeneID; 5545608; -. DR KEGG; vpo:Kpol_1060p46; -. DR eggNOG; ENOG410IE9E; Eukaryota. DR eggNOG; ENOG4111CR2; LUCA. DR InParanoid; A7TK44; -. DR OrthoDB; EOG7KM62C; -. DR PhylomeDB; A7TK44; -. DR Proteomes; UP000000267; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000267}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000267}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 194 212 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 721 AA; 83049 MW; 3BBAFFE5367DCDD2 CRC64; MKGGKDRHEV VEVNNGNGSM KKAYKDLLME RISIGNSRVQ RGPLAEGGGS GSSIEDTAEG TTNRYNIGMI EETGESINDY DDFKKNILNN ESISDGNDED YEDDDWLNYS DNDDDDDDVD DDDDEGNSLQ EANESFIEDY DEIDDYTSTD MENISTEDTD YEYEDEYDTM LHDKSVRYNE DITTSNSNKF LRNISLGITM MFILLLTIPL ILNGKTDSSI SEFSTIKPTS LGNIQKQVNH LYKEMNTRND QYQSDLDKTI KVVISQFEKN IKKLIPSNIL DFQAQLELLN TKVNSLSSSL TNWDTMNKDY RSKFSMENIT EWHDKLIEEL NHRLPNEIPV VVNGTSSMLV IPELHEYISG LLSDLIQHVE PTDLKSELKY DLNEYIKEVL ENQLQYVDKD YFITELNRNL QLNKHEIWQE VSTKLEQIEN ENAKYKSSIN ISPDQYSTIL LKKMVNKIYN ANQHQWEDDL DFGSLSMGTR LLNHLTSSTW KYGYGTSPIE LLSTTRGSST YWQCDSTNDC SWAIRFQQPL HLFRISYVHG RLTNNVHMMN SAPKVISVYV KLADDRSSIS KFLDTAKLFK QGQLFAKDST YIKIGQYNYN LAETEIRQQF PLPPWFIKLR PLVHSIVFQV DENYGNREFT SLKKFVVKAI TQSDLEITSA GEFPFKAGDI PDYSSPSYLD DFERLHLKTP RRNNDGDDPD QEPKFDDSDN IPSFGQDELD I // ID A8E4Y1_XENTR Unreviewed; 349 AA. AC A8E4Y1; DT 13-NOV-2007, integrated into UniProtKB/TrEMBL. DT 13-NOV-2007, sequence version 1. DT 14-OCT-2015, entry version 18. DE SubName: Full=LOC100127596 protein {ECO:0000313|EMBL:AAI53375.1}; DE Flags: Fragment; GN Name=LOC100127596 {ECO:0000313|EMBL:AAI53375.1}; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|EMBL:AAI53375.1}; RN [1] {ECO:0000313|EMBL:AAI53375.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Testes {ECO:0000313|EMBL:AAI53375.1}; RG NIH - Xenopus Gene Collection (XGC) project; RL Submitted (SEP-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC153374; AAI53375.1; -; mRNA. DR UniGene; Str.34606; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 84 104 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 127 150 {ECO:0000256|SAM:Coils}. FT COILED 157 177 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:AAI53375.1}. SQ SEQUENCE 349 AA; 40667 MW; 627DD6693160DC8A CRC64; LRRSERNRIN LKSPSSESKE RSKATMSPKP TSGKEKLETA QPPITLQGTR RIKHSNPYVT KDIRKGESME IEVASTSRNN NFDCLYEFVF LFAVFAFVLL LIYIRSQLIS CILLEQKMRQ QNTDMTLKEI RRMKDRFQEI LNDVSEQKRT QMTKMMVQEI KNELKKWEED NVQVKDYALY SLGATIIKDK TSQSLKSDNL HWSFLGILSW PYTSCPEEIL KPDVYPGKCW TFPGSQGQVL IKLSAKIIPV AVTLQHISKT ISPSKNYSSA PRDFSVFGYE HEFQETGKIL GQFTYNPWEA LIQSFKLMND DTSRFQFIQL RILSNWGNEK YTSVYRFQVH QELPVQLRS // ID A8J6A5_CHLRE Unreviewed; 949 AA. AC A8J6A5; DT 04-DEC-2007, integrated into UniProtKB/TrEMBL. DT 04-DEC-2007, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDP00650.1}; GN ORFNames=CHLREDRAFT_176330 {ECO:0000313|EMBL:EDP00650.1}; OS Chlamydomonas reinhardtii (Chlamydomonas smithii). OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. OX NCBI_TaxID=3055 {ECO:0000313|Proteomes:UP000006906}; RN [1] {ECO:0000313|EMBL:EDP00650.1, ECO:0000313|Proteomes:UP000006906} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CC-503 {ECO:0000313|Proteomes:UP000006906}; RX PubMed=17932292; DOI=10.1126/science.1143609; RA Merchant S.S., Prochnik S.E., Vallon O., Harris E.H., Karpowicz S.J., RA Witman G.B., Terry A., Salamov A., Fritz-Laylin L.K., RA Marechal-Drouard L., Marshall W.F., Qu L.H., Nelson D.R., RA Sanderfoot A.A., Spalding M.H., Kapitonov V.V., Ren Q., Ferris P., RA Lindquist E., Shapiro H., Lucas S.M., Grimwood J., Schmutz J., RA Cardol P., Cerutti H., Chanfreau G., Chen C.L., Cognat V., Croft M.T., RA Dent R., Dutcher S., Fernandez E., Fukuzawa H., Gonzalez-Ballester D., RA Gonzalez-Halphen D., Hallmann A., Hanikenne M., Hippler M., Inwood W., RA Jabbari K., Kalanon M., Kuras R., Lefebvre P.A., Lemaire S.D., RA Lobanov A.V., Lohr M., Manuell A., Meier I., Mets L., Mittag M., RA Mittelmeier T., Moroney J.V., Moseley J., Napoli C., Nedelcu A.M., RA Niyogi K., Novoselov S.V., Paulsen I.T., Pazour G.J., Purton S., RA Ral J.P., Riano-Pachon D.M., Riekhof W., Rymarquis L., Schroda M., RA Stern D., Umen J., Willows R., Wilson N., Zimmer S.L., Allmer J., RA Balk J., Bisova K., Chen C.J., Elias M., Gendler K., Hauser C., RA Lamb M.R., Ledford H., Long J.C., Minagawa J., Page M.D., Pan J., RA Pootakham W., Roje S., Rose A., Stahlberg E., Terauchi A.M., Yang P., RA Ball S., Bowler C., Dieckmann C.L., Gladyshev V.N., Green P., RA Jorgensen R., Mayfield S., Mueller-Roeber B., Rajamani S., Sayre R.T., RA Brokstein P., Dubchak I., Goodstein D., Hornick L., Huang Y.W., RA Jhaveri J., Luo Y., Martinez D., Ngau W.C., Otillar B., Poliakov A., RA Porter A., Szajkowski L., Werner G., Zhou K., Grigoriev I.V., RA Rokhsar D.S., Grossman A.R.; RT "The Chlamydomonas genome reveals the evolution of key animal and RT plant functions."; RL Science 318:245-250(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS496138; EDP00650.1; -; Genomic_DNA. DR RefSeq; XP_001696958.1; XM_001696906.1. DR STRING; 3055.EDP00650; -. DR PaxDb; A8J6A5; -. DR PRIDE; A8J6A5; -. DR EnsemblPlants; EDP00650; EDP00650; CHLREDRAFT_176330. DR GeneID; 5722643; -. DR KEGG; cre:CHLREDRAFT_176330; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A8J6A5; -. DR Proteomes; UP000006906; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006906}; KW Reference proteome {ECO:0000313|Proteomes:UP000006906}. FT COILED 517 544 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 949 AA; 100021 MW; 1DE6D3AF027A222E CRC64; MARLVPQTTA GRVDRTAELR LTLLAVSSLA DRDRGPVPAC QQTLVPPHWA CQGLGVLRKI NLAAASDGAS IVAANKEAKR PDRLIDGDDD SYMKNACSAS KWVIVELSQL GRVDEIKITM KEMYSSRVRD FLIKGRQSHP KKDGLADYGR GLESEGWQLL GTFRAENKKG SQTFRLPRKA RVRYLLLQVL THYGSEEMCA LNDAGPSEHN CTSNGSGGGG EGGIDAAAQA AGAPTPAAGT SLAPVVQSST ASAAAAGSCE VPGTCSASGS NELDKDSQPQ QREAVVNETF SASERQAASD EEGAGRKPAA VEAPEPSTIT APGEAAGADT AATAAKPPSE VISAKSDALA GSGSSDADRA NKAAAQGGTG AATAGQGAAG VEQRDPPPST VPAVHQQLPT RLHTPKRLQM RRRRRVALWT QRLRDWLLLQ MPRWAAAQKR LARAATGARY HPQALWITRI YQAPLRTLPL DTLLSMLDGG SASKPRLAGN LFDVIKQEMM MLKLNQTRLW AYIHALVDAL NARHEELEAD AQAMEDKMTQ LDAALALAPA QAAQRAGEAT APAVEALAAR LDALEAPSDG LDSNAFRYDD ESAAPEDGTV APRSQGADLE ELLAEAEAAG GGGALSSAAF YRFRAALQDI GDEAPPLGAT QAPDQLLAID LQQLGDCLGS AVDDGDEDAE LAALLTGRPV VAAAKPGTAA AASGAVGIRP AGAGVSLPAS LQPRPVGLPA ATAVPVTVKT ARAAQLHQLQ GWQQALGKPP PAAGLLVVLL RVRVWMTSFG SCWGYPPQRV ARWGFRWRGR QLQRRRGRGC RKLAAPQLSS SVHAGACSQT GSHMSDNDPA SLAYHKEKAL KGETPAFVPT AEGWHETLAS VSEAVVKAEQ CVDVDTTKVP EKLEALQQHT IHVVQQLHHA GEDGMPATQR NAGRDVSESP AHRANLEHTR NTTTSHDPR // ID A8K129_HUMAN Unreviewed; 717 AA. AC A8K129; DT 04-DEC-2007, integrated into UniProtKB/TrEMBL. DT 04-DEC-2007, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=cDNA FLJ75032, highly similar to Homo sapiens unc-84 homolog B (C. elegans) (UNC84B), mRNA {ECO:0000313|EMBL:BAF82433.1}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|EMBL:BAF82433.1}; RN [1] {ECO:0000313|EMBL:BAF82433.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Brain {ECO:0000313|EMBL:BAF82433.1}; RA Wakamatsu A., Yamamoto J., Kimura K., Ishii S., Watanabe K., RA Sugiyama A., Murakawa K., Kaida T., Tsuchiya K., Fukuzumi Y., RA Kumagai A., Oishi Y., Yamamoto S., Ono Y., Komori Y., Yamazaki M., RA Kisu Y., Nishikawa T., Sugano S., Nomura N., Isogai T.; RT "NEDO human cDNA sequencing project."; RL Submitted (OCT-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK289744; BAF82433.1; -; mRNA. DR STRING; 9606.ENSP00000385616; -. DR PaxDb; A8K129; -. DR PRIDE; A8K129; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOVERGEN; HBG056957; -. DR NextBio; 35463795; -. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 213 234 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 273 293 {ECO:0000256|SAM:Coils}. FT COILED 352 372 {ECO:0000256|SAM:Coils}. FT COILED 374 401 {ECO:0000256|SAM:Coils}. FT COILED 478 498 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 717 AA; 80368 MW; 7582F03D14026BFA CRC64; MSRRSQRLTR YSQGDDDGSS SSGGSSVAGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDAHTSYY SESLVHESWF PPRSSLEERH GDANWGEDLR VRRRRGTGGS ESSRASGLVG RKATEDFLGS SSGYSSEDDY VGYSDVDQQS SSSRLRSAVS RAGSLLWMVA TSPGRLFRLL YWWAGTTWYR LTTAASLLDV FVLTRRFSSL KTFLWFLLPL LLLTCLTYGA WYFYPYGLQT FHPALVSWWA AKDSRRPDEG WEARDSSPHF QAEQRVMSRV HSLERRLEAL AAEFSSNWQK EAMRLERLEL RQGAPGQGGG GGLSHEDTLA LLEGLVSRRE AALKEDFRRE TAARIQEELS ALRAEHQQDS EDLFKKIVRA SQESEARIQQ LKSEWQSMTQ ESFQESSVKE LRRLEDQPAG LQQELAALAL KQSSVAEEVG LLPQQIQAVR DDVESQFPAW ISQFLARGGG GRVGLLQREE MQAQLRELES KILTHVAEMQ GKSAREAAAS LSLTLQKEGV IGVTEEQVHH IVKQALQRYS EDRIGLADYA LESGGASVIS TRCSETYETK TALLSLFGIP LWYHSQSPRV ILQPDVHPGN CWAFQGPQGF AVVRLSARIR PTAVTLEHVP KALSPNSTIS SAPKDFAIFG FDEDLQQEGT LLGKFTYDQD SEPIQTFHFQ APTMATYQVV ELRILTNWGH PEYTCIYRFR VHGEPAH // ID A8NQM7_COPC7 Unreviewed; 964 AA. AC A8NQM7; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 2. DT 11-NOV-2015, entry version 27. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAU86202.2}; GN ORFNames=CC1G_03413 {ECO:0000313|EMBL:EAU86202.2}; OS Coprinopsis cinerea (strain Okayama-7 / 130 / ATCC MYA-4618 / FGSC OS 9003) (Inky cap fungus) (Hormographiella aspergillata). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. OX NCBI_TaxID=240176 {ECO:0000313|EMBL:EAU86202.2, ECO:0000313|Proteomes:UP000001861}; RN [1] {ECO:0000313|EMBL:EAU86202.2, ECO:0000313|Proteomes:UP000001861} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Okayama-7 / 130 / ATCC MYA-4618 / FGSC 9003 RC {ECO:0000313|Proteomes:UP000001861}; RX PubMed=20547848; DOI=10.1073/pnas.1003391107; RA Stajich J.E., Wilke S.K., Ahren D., Au C.H., Birren B.W., RA Borodovsky M., Burns C., Canbaeck B., Casselton L.A., Cheng C.K., RA Deng J., Dietrich F.S., Fargo D.C., Farman M.L., Gathman A.C., RA Goldberg J., Guigo R., Hoegger P.J., Hooker J.B., Huggins A., RA James T.Y., Kamada T., Kilaru S., Kodira C., Kuees U., Kupfer D., RA Kwan H.S., Lomsadze A., Li W., Lilly W.W., Ma L.-J., Mackey A.J., RA Manning G., Martin F., Muraguchi H., Natvig D.O., Palmerini H., RA Ramesh M.A., Rehmeyer C.J., Roe B.A., Shenoy N., Stanke M., RA Ter-Hovhannisyan V., Tunlid A., Velagapudi R., Vision T.J., Zeng Q., RA Zolan M.E., Pukkila P.J.; RT "Insights into evolution of multicellular fungi from the assembled RT chromosomes of the mushroom Coprinopsis cinerea (Coprinus cinereus)."; RL Proc. Natl. Acad. Sci. U.S.A. 107:11889-11894(2010). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAU86202.2}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACS02000008; EAU86202.2; -; Genomic_DNA. DR RefSeq; XP_001835631.2; XM_001835579.2. DR STRING; 240176.XP_001835631.2; -. DR EnsemblFungi; EAU86202; EAU86202; CC1G_03413. DR GeneID; 6012166; -. DR KEGG; cci:CC1G_03413; -. DR EuPathDB; FungiDB:CC1G_03413; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A8NQM7; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001861; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001861}; KW Reference proteome {ECO:0000313|Proteomes:UP000001861}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 964 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002724431. FT COILED 72 92 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 964 AA; 104983 MW; 7C121F753AAA997A CRC64; MLRLQQLPLA LQVISLLFAS TAFAEPTSLN DPLRAIALHA PKRPGPPICC LQSQSSQLEV PEDEVLLSFE EWKAKQQQLQ NNQSNNKQTE QENSSNSGAG NNGATSGNEG NGHDPAAGAN AQDGSLPISH TYEELASIHP AETVLPHFQV PTTDRFNYAS LDCSARVHTA HRGAKSAASI LSSKKDRYML SPCKTKEKKF VVVELCDDIR IDTVQLANYE FFSGVFKDFS VSVAKTYTDS EGWTQAGTYR AKNVRGVQTF RFPETLRDFY RYIRIDFHSH YGNEYYCPVS LLRVYGLTHL EEWKWDIWMA ESKAKQAEAL NIKPLSIEAQ LSDETAESTG REGPSSRTEN EHGGIDATII ETLAALSSKA AELSQPVSSD LASAIPDVPP SNHNIHPYSP PPPKPAEPYS HELHDPPSHH DSYIPSHSTD TFASPGSGSP HSATNTAPSV PQPPPTASAP SSSTAVATSA VSTSTRASPS DSPVHGKAHN NTSSSQRSSS QGPSSVIILS SASAAPSHPT GVPPIHATTG GESVYRTIMN KLTALETNYT LYTRYMDQQN GAIRELIKRL GEDIGRLEGV TRAQRTHSQK MLNEWEKQRL QMLIEYNQLV SRVEHLSEEI LLEKRLGVAQ LCLMLAVLIF MGLTRGSRGE AIVVNGPTSM REWGKRHLSL SGDWTSRFRR KSNAGHSAAI RSRPLTAKPA SPSKKSAAAT DVEGKVEFPS TDDAPLKKRP LEPIQVNNAD TPTSSRVKKL SLTRSRTPSL RTNGTGKRVP HTAHIPRPQT PTPIRPVFHH RDLGSHSLQR SSSHGAQLSS GRSAKSAKKW ARTAHLHPVK TPFYPQTNSE AFAQSYDGSN PSVGAGGSNA SDLGYQDVFS APAGSFAFPK KAVRERDLPG TGNTQELSLT RDSLEIWDSK DHGWRKRRPS AAGRSSENVV AEKLEDADAE NNWIDTDSVV DSSDVDSMGL SGQL // ID A8PE15_COPC7 Unreviewed; 285 AA. AC A8PE15; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 2. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAU81114.2}; GN ORFNames=CC1G_09756 {ECO:0000313|EMBL:EAU81114.2}; OS Coprinopsis cinerea (strain Okayama-7 / 130 / ATCC MYA-4618 / FGSC OS 9003) (Inky cap fungus) (Hormographiella aspergillata). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. OX NCBI_TaxID=240176 {ECO:0000313|EMBL:EAU81114.2, ECO:0000313|Proteomes:UP000001861}; RN [1] {ECO:0000313|EMBL:EAU81114.2, ECO:0000313|Proteomes:UP000001861} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Okayama-7 / 130 / ATCC MYA-4618 / FGSC 9003 RC {ECO:0000313|Proteomes:UP000001861}; RX PubMed=20547848; DOI=10.1073/pnas.1003391107; RA Stajich J.E., Wilke S.K., Ahren D., Au C.H., Birren B.W., RA Borodovsky M., Burns C., Canbaeck B., Casselton L.A., Cheng C.K., RA Deng J., Dietrich F.S., Fargo D.C., Farman M.L., Gathman A.C., RA Goldberg J., Guigo R., Hoegger P.J., Hooker J.B., Huggins A., RA James T.Y., Kamada T., Kilaru S., Kodira C., Kuees U., Kupfer D., RA Kwan H.S., Lomsadze A., Li W., Lilly W.W., Ma L.-J., Mackey A.J., RA Manning G., Martin F., Muraguchi H., Natvig D.O., Palmerini H., RA Ramesh M.A., Rehmeyer C.J., Roe B.A., Shenoy N., Stanke M., RA Ter-Hovhannisyan V., Tunlid A., Velagapudi R., Vision T.J., Zeng Q., RA Zolan M.E., Pukkila P.J.; RT "Insights into evolution of multicellular fungi from the assembled RT chromosomes of the mushroom Coprinopsis cinerea (Coprinus cinereus)."; RL Proc. Natl. Acad. Sci. U.S.A. 107:11889-11894(2010). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAU81114.2}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACS02000007; EAU81114.2; -; Genomic_DNA. DR RefSeq; XP_001840705.2; XM_001840653.2. DR STRING; 240176.XP_001840705.2; -. DR EnsemblFungi; EAU81114; EAU81114; CC1G_09756. DR GeneID; 6017357; -. DR KEGG; cci:CC1G_09756; -. DR EuPathDB; FungiDB:CC1G_09756; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A8PE15; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001861; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001861}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001861}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 66 91 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 123 143 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 285 AA; 31460 MW; ADED41DFE53176FE CRC64; MAPPRAKARK WYNIHEEPQA GIGISLTPFS PDDSIQVAPC RYIRSSSQPL AVGASAHGTS WVKTGFIWLF TIFFTAFQTW CCTFAVSTVL YKFGCTPEGS TTVCIPLEIL PGFGPFFRLQ HRLAQMSGKI DDMQYKLSRL EDQLPLDFHN RDFALWGLGA QVIPELTSRD EGDANAVLPS AVLDDPPLLG ECWEFMGRRG QIGIQLSDAA NITAMSLSQP FDESVLVPLV HAHYNATSPQ SRQIFAVTSP VAKVVRFDIV VVEVLDNWGG ESTCLYHIGV HGLED // ID A8Q880_MALGO Unreviewed; 972 AA. AC A8Q880; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 15-JAN-2008, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EDP42463.1}; GN ORFNames=MGL_3221 {ECO:0000313|EMBL:EDP42463.1}; OS Malassezia globosa (strain ATCC MYA-4612 / CBS 7966) OS (Dandruff-associated fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Malasseziomycetes; Malasseziales; Malasseziaceae; Malassezia. OX NCBI_TaxID=425265 {ECO:0000313|EMBL:EDP42463.1, ECO:0000313|Proteomes:UP000008837}; RN [1] {ECO:0000313|EMBL:EDP42463.1, ECO:0000313|Proteomes:UP000008837} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4612 / CBS 7966 {ECO:0000313|Proteomes:UP000008837}; RX PubMed=18000048; DOI=10.1073/pnas.0706756104; RA Xu J., Saunders C.W., Hu P., Grant R.A., Boekhout T., Kuramae E.E., RA Kronstad J.W., DeAngelis Y.M., Reeder N.L., Johnstone K.R., Leland M., RA Fieno A.M., Begley W.M., Sun Y., Lacey M.P., Chaudhary T., Keough T., RA Chu L., Sears R., Yuan B., Dawson T.L. Jr.; RT "Dandruff-associated Malassezia genomes reveal convergent and RT divergent virulence traits shared with plant and human fungal RT pathogens."; RL Proc. Natl. Acad. Sci. U.S.A. 104:18730-18735(2007). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EDP42463.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAYY01000011; EDP42463.1; -; Genomic_DNA. DR RefSeq; XP_001729677.1; XM_001729625.1. DR STRING; 425265.XP_001729677.1; -. DR GeneID; 5853983; -. DR KEGG; mgl:MGL_3221; -. DR EuPathDB; FungiDB:MGL_3221; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A8Q880; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008837; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008837}; KW Reference proteome {ECO:0000313|Proteomes:UP000008837}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 25 {ECO:0000256|SAM:SignalP}. FT CHAIN 26 972 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002728015. FT COILED 748 768 {ECO:0000256|SAM:Coils}. FT COILED 792 819 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 972 AA; 105889 MW; 2F242632E1DEDDC8 CRC64; MNRTVPYGIW SACVLALWLG ITASANNVDV KITTSPPSPQ HRQPAELPGE PSPLGPVFRC YSPNAPDHEP TSQDVAEVFD SSVTFRTDPR RMLLSNDIFF CEVPTQDELG RWHTRMYAVF GTVMVPRNDA PASLSTSPVH RAHEIDPGAA AAAAGVAQSE AEQRAKMHVD DPLLSFDEWK EQHLEHARKL RKSDKARERA AHKSQVSGSD VRSFDKSAFL AASLSGSSSS PSATSTSEVA RSPDREGDES TRAAHAESAA ADVAQGAAQA AYNQTMLDAS ASMDQNAPAD PGPPHIYAQE DASAGLAELK HRWNYASLDC AAILHKANPF AKSASAILSE KKDKYMLSPC PWSAGYQGDK SGRPESQFVI VELCQQIRVD TIVLSNLEFF SSMFKLFAVR VARSLHAPED EWHTLGFFHA RNARGYQVFK LSSAPQSYFR FLRIDFLEHY GTEFYCPVSL LRVYGRNERE DADDDMMDDI NALDDDDVSD MLDDTVPSEP LLELASTHAI DGNRVVERAC EREPFPGLWR PVCERVLPVV LPPPPLSMPT GLATNDSYSL MASQVPPPLD PFVPLLVDDM PTMCMSTSSV ATTSSPDPVA GRPVSTSQPS SPPSSTVSTD LSVEQQTTWT SATWTATATA ATSSSTMMPP PSSPRTTSTA SLSEPPSPST SLPPTSSSVT AAGGDLHTSS LLDGTSETRR TAEPKGNAAK PKSSNKPQGD TKSGGSESIY RTITKRLVAL EANTSLSMQF LQLNSQKLRD KLLVLEQMQE TRLAEMFAAM NASQARVWGD KVGQQQSALA ALEAQQQSFE EERTILLARI ERLAADVRSE KRWGMAQLSL LLILLLILAL TRSPTNLAHV APQSGEYKSP PSLLHAPNST EASPHPIDNM EYDSGEPLSA MSTPGRSSSF TLTPRPVASL QRARRRYATP STLIRARRLR LNGAKYRRLS DTHTPHMMPD PTATEWTEKD SD // ID A8QCJ3_MALGO Unreviewed; 770 AA. AC A8QCJ3; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 15-JAN-2008, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EDP41634.1}; GN ORFNames=MGL_4015 {ECO:0000313|EMBL:EDP41634.1}; OS Malassezia globosa (strain ATCC MYA-4612 / CBS 7966) OS (Dandruff-associated fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Malasseziomycetes; Malasseziales; Malasseziaceae; Malassezia. OX NCBI_TaxID=425265 {ECO:0000313|EMBL:EDP41634.1, ECO:0000313|Proteomes:UP000008837}; RN [1] {ECO:0000313|EMBL:EDP41634.1, ECO:0000313|Proteomes:UP000008837} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4612 / CBS 7966 {ECO:0000313|Proteomes:UP000008837}; RX PubMed=18000048; DOI=10.1073/pnas.0706756104; RA Xu J., Saunders C.W., Hu P., Grant R.A., Boekhout T., Kuramae E.E., RA Kronstad J.W., DeAngelis Y.M., Reeder N.L., Johnstone K.R., Leland M., RA Fieno A.M., Begley W.M., Sun Y., Lacey M.P., Chaudhary T., Keough T., RA Chu L., Sears R., Yuan B., Dawson T.L. Jr.; RT "Dandruff-associated Malassezia genomes reveal convergent and RT divergent virulence traits shared with plant and human fungal RT pathogens."; RL Proc. Natl. Acad. Sci. U.S.A. 104:18730-18735(2007). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EDP41634.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAYY01000018; EDP41634.1; -; Genomic_DNA. DR RefSeq; XP_001728848.1; XM_001728796.1. DR GeneID; 5853154; -. DR KEGG; mgl:MGL_4015; -. DR EuPathDB; FungiDB:MGL_4015; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A8QCJ3; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000008837; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008837}; KW Reference proteome {ECO:0000313|Proteomes:UP000008837}. FT COILED 341 382 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 770 AA; 86241 MW; A0FC207AF11E0B58 CRC64; MPDMGLTNSS RRRGRRGKSN AMDTSAWTSS LYSTRDPSQV SSYYLRPYEE DHNDLSVDDS LSGLGSLLRG SQRPPHTPRS YVHSATQNAN LSGYDYAEED KFMERVESEW RQHASEHDGL ADADPPSTTP ELEKADDPVF EAQPTPQVPG LFRSVLNSTK RDEFLSSSTD EKHKNIHPEN TRGDSQFESA SGFSLSSSSS VWTVLFRCFL VIAVLFIIYR NVHKFVSSPT PNDPLVESRP WSSAGSGDEL RERISVLEGA LNRVWQSVGD VGQEMRVSQE KLMHRLSSLE QKGLLRSSLD SLERDVRSLK KHHAEGMSLW QSEKARIEAM LSQHIEQGEK SKNKKDNLAA LYDQLSGLEK RLARAEKQVS EASEAALQAK RAFEPLRDFV PDRLPVRYDA RSKQIRIDPA FWHEFRKVMP SPRGAGGDDT TIPPWHVFLA EHRNELEELF SHVARADPDT SMFLDKTTFF DLMEAEISRA KMELTTRFNE HVHGLESDVL EKVRKQQESF MEQQDALSHD ETGSELQVDQ VRELIDAALA VFAADQIGRA DYAQYSAGAR VIPSLTSPTH EVRVQGTNVH SFTSMISYLV PLPLRFGTRG ADSLSYTVRG RMPVVALHHD TSPGMCWPFS GSHGQLGVQL VRRIKVQAIT VDHVPAVLSL DGLASAPREI EVWGIAETSQ DRERVEQWRL SQAWSDEPAP VPPSPSHVFL GSFVYEAHAG SPPIQTFPVG HAVGSLGLAF RTVQFNILSN HGLRDFTCLY RVRVHGEPVG // ID A8WJ76_CAEBR Unreviewed; 330 AA. AC A8WJ76; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 15-JAN-2008, sequence version 1. DT 11-NOV-2015, entry version 36. DE SubName: Full=Protein CBG23752 {ECO:0000313|EMBL:CAP20518.1}; GN ORFNames=CBG23752 {ECO:0000313|EMBL:CAP20518.1, GN ECO:0000313|WormBase:CBG23752}, GN CBG_23752 {ECO:0000313|EMBL:CAP20518.1}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP20518.1, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP20518.1, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP20518.1, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=AF16; RG WormBase Consortium; RL Submitted (OCT-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE601379; CAP20518.1; -; Genomic_DNA. DR RefSeq; XP_002647886.1; XM_002647840.1. DR STRING; 6238.CBG23752; -. DR EnsemblMetazoa; CBG23752; CBG23752; CBG23752. DR GeneID; 8589886; -. DR KEGG; cbr:CBG23752; -. DR CTD; 8589886; -. DR WormBase; CBG23752; CBP12586; WBGene00042021; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A8WJ76; -. DR OMA; CNISSHL; -. DR Proteomes; UP000008549; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}. SQ SEQUENCE 330 AA; 37773 MW; A58207EF61E81BB7 CRC64; MVSLVEHSLS SVYDFGNVVV ISLTMILYRL QTISNQNDRV NSMQSQFGNV ERKIGSLVSR KPYQDINQFE GTKPLKQSIA DVLKSMKIPT QKLTDYMEPT IIPQINQSIS KTAESTLKVP IPKEQFRFNA ADYQRGASVD MDHSSSSNLN PIIGHDQTNL VLLDRPQPPS DNAWCTNDEN PVLTVNLAKY IKPISVSYQH SKWNETIPNG APKTYDVVAC LDFYCEKWKP LVSNCIYSQY VSSEPEQMCN ISSHLDVPSI GKVQFRFREN YGDAQMTCVH LVRVYGETET PVKIKEKKVE SEEICADLRW YYHNSYFRYT WTDKNCTAHL // ID A8WXG2_CAEBR Unreviewed; 103 AA. AC A8WXG2; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 15-JAN-2008, sequence version 1. DT 11-NOV-2015, entry version 41. DE SubName: Full=Protein CBG04402 {ECO:0000313|EMBL:CAP25109.1}; GN ORFNames=CBG04402 {ECO:0000313|EMBL:CAP25109.1, GN ECO:0000313|WormBase:CBG04402}, GN CBG_04402 {ECO:0000313|EMBL:CAP25109.1}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP25109.1, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP25109.1, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP25109.1, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE601251; CAP25109.1; -; Genomic_DNA. DR RefSeq; XP_002634399.1; XM_002634353.1. DR STRING; 6238.CBG04402; -. DR EnsemblMetazoa; CBG04402; CBG04402; CBG04402. DR GeneID; 8576394; -. DR KEGG; cbr:CBG04402; -. DR CTD; 8576394; -. DR WormBase; CBG04402; CBP06739; WBGene00027078; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR OrthoDB; EOG7N37DX; -. DR Proteomes; UP000008549; Chromosome IV. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}. SQ SEQUENCE 103 AA; 11733 MW; 062958495FB396C9 CRC64; MDYSNRSNLK PLIGYDQTNL VLLDRPQPPE DKAWCTFDKN PVLTINLAKH IKPISVSYQH SEWHGTIPSE APIRYDVVSI QVTTTAYPAQ VEFESVLNTN SQF // ID A8WXG3_CAEBR Unreviewed; 189 AA. AC A8WXG3; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 2. DT 11-NOV-2015, entry version 39. DE SubName: Full=Protein CBG04403 {ECO:0000313|EMBL:CAP25110.2}; GN ORFNames=CBG04403 {ECO:0000313|EMBL:CAP25110.2, GN ECO:0000313|WormBase:CBG04403}, GN CBG_04403 {ECO:0000313|EMBL:CAP25110.2}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP25110.2, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP25110.2, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP25110.2, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE601251; CAP25110.2; -; Genomic_DNA. DR STRING; 6238.CBG04403; -. DR EnsemblMetazoa; CBG04403; CBG04403; CBG04403. DR WormBase; CBG04403; CBP28590; WBGene00027079; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A8WXG3; -. DR OrthoDB; EOG7N37DX; -. DR Proteomes; UP000008549; Chromosome IV. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 189 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002729644. FT TRANSMEM 165 188 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 189 AA; 22085 MW; 448E721A1558A64C CRC64; MIKLILFYLI VFNHQQINHC AQMMRIQCSL LIWSSLSNQH SKWHGTIPNG APKTYDVVAC FDYYCKSWRP LVSNCKYSQN KSNEIEQVCN IHLLNVPLIR TVQFRFRENY GDTKMTCVNL LYHNSYSTYF LRKKSCTVLY ENDRCSECPE CCQECLISDY NGETLLFIFS SAILIFMIFG VFAILMSVA // ID A8X1N1_CAEBR Unreviewed; 2727 AA. AC A8X1N1; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 2. DT 11-NOV-2015, entry version 64. DE SubName: Full=Protein CBR-HECD-1 {ECO:0000313|EMBL:CAP26541.2}; GN Name=hecd-1 {ECO:0000313|WormBase:CBG05779}; GN Synonyms=Cbr-hecd-1 {ECO:0000313|EMBL:CAP26541.2}; GN ORFNames=CBG05779 {ECO:0000313|WormBase:CBG05779}, GN CBG_05779 {ECO:0000313|EMBL:CAP26541.2}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP26541.2, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP26541.2, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP26541.2, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). CC -!- SIMILARITY: Contains 2 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE600909; CAP26541.2; -; Genomic_DNA. DR STRING; 6238.CBG05779; -. DR EnsemblMetazoa; CBG05779; CBG05779; CBG05779. DR WormBase; CBG05779; CBP30654; WBGene00028166; Cbr-hecd-1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR HOGENOM; HOG000018061; -. DR InParanoid; A8X1N1; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR Proteomes; UP000008549; Chromosome IV. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 2. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2727 AA; 301008 MW; 4DF91B5822A75906 CRC64; MDGIDPETLL EWLQTGIGDE RDLQLMALEQ LCMLLLMADN IDRCFESCPP RTFIPALCKI FIDETAPDNV LEVTARAITY YLDVSNECTR RITQVDGAVK AICARLAAAE MSDRSSKDLA EQCVKLLEHV CQRETMAVYD AGGINAMLNL VRVHGAQVHK DTMHSAMSVV TRLCGKMEPT DLELAKCAES LGALLEHDDP KVSESALRCF AALTDRFVRK MMDPAELAMH SNLVEHLISI MVSSNDESSP VAASANILSI VLSLIGNLCR GSSLITEKVL TSPNMIKGLK ATLTNKEERV VTDGLRLCDL LLVLLCEGRS ALPLTSVVSG DYAAGSGAER VHRQLIDAIR QKDLTALVDA IESGQVDVNF ADDVGQSLTN WASAFGSIEM VQYLCDKGAD VNKGHKSSSL HYAACFGRPD VSYDAGLLLS QVVKLLLQRG ANPDLRDEDG KTALDKARER SDDDHNQVAN ILESPSTFMR NSTLSQKTKA SASQQPGTST KQELPNPQLV RKVLHQLLPI FCDIFQRSLN RTVRRTSLSL MRKIVENIGD LRQSAGEDNA AVTRSARKMS ADVTAGAESL VAVVVSVLDQ EDDYEGHEQV LLILQSLLEK DSELWVTELI RLGVFERVEA MAKEPPKGLE EVLAAINLEG RSRVVPMEID FQNQPSSSSN DIMDAAPPSA DIAGEEPAQP SIEVLISNLG ISSYMAEPEP STPSTSSQIA APKPRSTTGS SASSAILQVV SKLSGVANLD KSSAADKKPT KIVLNQGSPY RWKEWRIVRG PTSLFIWSDV LLIELPFQSN GWFRYLADND SHVQFVTGTA NVDQQMSDEE KENFKKTERH EMVSRWNAVK GVFDDDWSTV PVSVLGVPSS AKKVSQKLEV PAWELWSSKS SELQIKSVSS SAPSGQANTM VTTIKIQDDA GGFLFESGTG RKTNVMPEHA LPPPFHTGWS SHGVTTRKMK FRQDIQKRKV QELAWKLWND HLKEAHAKPR EALVRLEEAA RLIEVSSCLH SVKTASNYKH RSAKHARIER IQEYTAAIKI LYDSVVDDRR LSTFEFSVSG IVPALFALLS AIEKSPTDFY MRRIFMESFR VGESLSQLSL KIVAVLEASE KFPQYLYDSP GGSSFGLQLL SRRVRSKLEM IPSEEGKENI DENIVNKTGK VIKCEPLASV GAIKTYLMKL VARQWHDRER SQYKYVKEIK ELKEKGKSVV LKHTSDFDEN GVIYWIGTNG KTVPTWTNPA TVKAVKVTCS DPRQPFGKPE DLLSRDSNPI NCHSSDDKNS HFTFDLGVFL IPTSYTMRHS RGYGRSALRN WTLQGSIDSK KWDDIINHVD DKSLGEPGST ATWHVPEKGT VAYRYYRIAQ NGKNSSGQTH YLSCSGFEIY GDIVDAVTEK IFEDAPKKES IAGPSSSGPS SSSSLPPLTK EQVLDMLPAH ENNNRLKSGL TIETVTAMMQ RSRHRVRDSY KLSDSKAKVV RGKDWRWEDQ DGGEGKMGRI ISPPESGWVD VAWENGYSNS YRYGANGNFD IERVSSSGHR YSSPSMPSGI PSSVMDAVRR NRAFYTAKGS GPPSALFGVP SGSSRGGENS SSSSSSPFPN LPIPPWRTSK SSTSPAIASR LISTVTSSGA SPTPPPSSTS STLSSLASGL GFGLNRHKQH NKPGASTLSR FASVKNPAPA GVAASGSSSG VAIGKKSMST TNLVDDRQAS AGPSVASTGQ AASAESLQHQ TPSLENLLAR AIPHAFGRIA ENQEQEEEPM GGEESDSAAS MRSAASSNSQ ISMDSAQQQQ QQQQDSDMTP RDAAGTPSTP RDDKNQTLSV SAPDLAAARQ RQASAEAEGD SNMDESNSED KTVGGDDAME EDDEEEETVE DEEDNDDDDD DDDESSNENQ EKLVELLANE RGLFDKLKEV ITGESLSDAS SSAKDANTNE AQKKGGSKKP KKWFKKMSSY TDVLKGLMQS RYPVTLLDPS AAGMEMDEMM DEDDYYDFSE DGPDDGDSVE DEVAAHLGMP VDSFATMVAA RTPITWRQFS ELMSGSNRER AAMARAVASS RGNPWDDETV VKCSFEALIP AFDPRPGRSN VNQTLEVELP AVVNDFGPSK TSAKRTKDSV RFFIRGPNMS GVDNITLEMD DDCSSVFSYM QKINNNANWA TKSDRGRRIW EPTYSICYCS SDNQKVEVTK IPSEESSTPV QVNQCLETIR LLARLQESIP EAEISPNVFI SDKLTLKITQ VLSDALVVAA RALPEWCSRL IYKYPCLFTV ETRNMYMQVS FFFQNLALSL EYSAKVKLIS QRKFFRQQLS VSRVLSFGFN NAAMQLLNVL EEVLKEVMLL PGNMIVIMNT VLDVSELVSD LHSSSTLSLQ QSCKESLWRY GCVMMMIPML RKMQKREKSI LVKERNRQGI FYVRRMGGLF PAPLPPGTEE TKKASDMFRV LGVFLAKVLL DGRLVDLPLS RPFLKLLVSP QVGEQPHGPN LHNVLTLDDF EEVNPVKGSF LKELIAVSNR KKMIEKEVMD QTMKRRKIAE IKLHIKGSSC KMEDLALNFT VNPPSKVFQY TEMELVDGGG DIDVTVDNVE QYVERCEEFY LNSGIAQQMR AFREGFDRVF PLSSLRGYSP EELQRLLSGE QCPEWSRDDI LNYTEPKLGY TRESPGFLRF VDVMESLTAQ ERKNFLQFAT GCSSLPPGGL ANLHPRLTIV RKVESGDGSY PSVNTCVHYL KLPEYSSAEI LRERLLTAIN EKGFHLN // ID A8X4T7_CAEBR Unreviewed; 1115 AA. AC A8X4T7; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 15-JAN-2008, sequence version 1. DT 11-NOV-2015, entry version 52. DE SubName: Full=Protein CBR-UNC-84 {ECO:0000313|EMBL:CAP27647.1}; GN Name=unc-84 {ECO:0000313|WormBase:CBG07416}; GN Synonyms=Cbr-unc-84 {ECO:0000313|EMBL:CAP27647.1}; GN ORFNames=CBG07416 {ECO:0000313|WormBase:CBG07416}, GN CBG_07416 {ECO:0000313|EMBL:CAP27647.1}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP27647.1, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP27647.1, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP27647.1, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE601041; CAP27647.1; -; Genomic_DNA. DR RefSeq; XP_002645743.1; XM_002645697.1. DR STRING; 6238.CBG07416; -. DR EnsemblMetazoa; CBG07416; CBG07416; CBG07416. DR GeneID; 8587742; -. DR KEGG; cbr:CBG07416; -. DR CTD; 8587742; -. DR WormBase; CBG07416; CBP07622; WBGene00029473; Cbr-unc-84. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000018377; -. DR InParanoid; A8X4T7; -. DR KO; K19347; -. DR OMA; WKSEFAS; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000008549; Chromosome X. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 111 130 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 385 407 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 516 536 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 1115 AA; 127271 MW; C1174C753768A631 CRC64; MPDDIYDHEW KSEFASTRSG RNSPNIFAKV RRKLLLTPPV RNARSPRLTD EELDALTGDL PYATNYTYAY SKIYDPSLPD HWEVPNLTGG MSSGTLAEQE HWSAASLSRQ LLYLLRFPLY LVLHVITYIL EAIYQVVKIS TYTIWDYILY LIRLARNRYA SWQDNRRRTA LIRNRQDPFN VKLARFIRGF FETIVYVIHT PVRLWKSRGN SVSQYDYTSI KDQLENERAS RMVTRSQGLE KSRTFAGLSR SPARRATPVA ATKTTNITRV ITKVFANDES SSEAPATPTV VTTRTVRTRS VTPRFRSTRA AAATRSGIQR AAFDTPDLEV DTSLASFGLR SRAKNHLNTP EPTFDIGDVA ATSTPLMPRT DYIVDTNEKG VIHNVLYYIG YFVFLPFIAA RHIWYTIYDY GKSAYMKWTD YQPEAMEAIH VRDVNEPAPT DDNNASFIAV PWTTRISNAF SALFILIKDI FIYIGEAHEI VFEMFKGAFL ETTSYIGGLF SGLSDAFIEK RNKGTLWPML WTLWVLLLAI FLFGFLHSEN TAIRKEGFIQ EANEAKSTDG GLPTVPFWMG AVNNVKHYTW MTKEYVYDLA FNTYSFIKPV LGRTVTAPKY AWGLIASGCG TVADNFQSFR DYVSEKSYDV GYFLSYGFKE SFANVGNSVL NGTFTYAEKF GQYTNDFFSN AFSGIYDFFA YFFSGLLNIS TNTQTAIISG VKTVVYGISD FFYNYIYAPV AGFFSGNYQE LLRPIWLALR WIYDMVVLGV TSIFDLATFL VTYPVGLITR GWIKISQYAP EDPVQVIPIP QAITPTPDID RVQEQQPPEI LKKKTEVEDE EEELKIIPAP APEPIPIPTV VTTPPPVIIH QTNVVETVDK EAIIKEVTEK LRDELTAQFH QDLSAKFEQN YQTIIEQLKI VNNDVRYDNH QLEAIIRQLI YEYDTDKTGQ VDYALESSGG AVISTRCSET YKSYTRLEKF WDIPIYYHHY SPRVVIQRNS KSLFPGECWC FKDGRGYIAV ELSHFIDVSS ISYEHIGKEV APEGNRSSAP KGVLVWAYKQ IDDLSTRVLI GDYTYDLDGP PLQFFMAKHK PDFPVKFVEL EVTSNYGAQF TCLYRLRVHG KMLKV // ID A8XEJ3_CAEBR Unreviewed; 813 AA. AC A8XEJ3; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 2. DT 11-NOV-2015, entry version 36. DE SubName: Full=Protein CBG12092 {ECO:0000313|EMBL:CAP31128.2}; GN Name=suco-1 {ECO:0000313|WormBase:CBG12092}; GN ORFNames=CBG12092 {ECO:0000313|EMBL:CAP31128.2, GN ECO:0000313|WormBase:CBG12092}, GN CBG_12092 {ECO:0000313|EMBL:CAP31128.2}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP31128.2, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP31128.2, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP31128.2, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=AF16; RG WormBase Consortium; RL Submitted (OCT-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE601540; CAP31128.2; -; Genomic_DNA. DR STRING; 6238.CBG12092; -. DR EnsemblMetazoa; CBG12092; CBG12092; CBG12092. DR WormBase; CBG12092; CBP30494; WBGene00033096; Cbr-suco-1. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A8XEJ3; -. DR OMA; ERCEETQ; -. DR OrthoDB; EOG7MPRDC; -. DR Proteomes; UP000008549; Chromosome I. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 813 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002729891. FT TRANSMEM 659 681 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 553 580 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 813 AA; 91891 MW; 2C41F4CA6811454D CRC64; MKLLQFLIFA VLLALPNINA NREVSLVKNW KEILLTEGED GLICTLSLDA CSRSATPYNV SKKISKTSVN ASEKEPLPEK SIESFDEWTK KRRDAVANQN VHHQKVMEPP SGATARHDEV VISLPPISRP PRNFASRECG AKIIAANPEA ENAKAVLNEK DVDDYMRNPC QSSKEKFIVV ELCEAIQIKK LAIGNFELFA SRPKTIQVFI SERYPPLASW VSLGSFHLQD HHKHLQTFEV PNTNIYAKYV RINLEDHYGK EHYCIVSVVN VMGSTLSDEY DKEEAAAHLL NVIDEKNEEP VTTPPPSEQK VQTQLPVPPR GSNYTNKLKR FSFLEIRSMC SQCSAEKVSY LHCHLLTRQS KPFKPTLTPK PVTVKPPVTE NRNLKVEIGL WAERSRHTIF EQSRRRNLAT IQRLRAEDTA TESTSTPTVP NGPVETPVPK VDEIVKTEEK ISTSDETKPT AQPDIPQQAQ EQTTPAPPKT KSESILPAGG STSQREMVLM KLSKRIAAVE MNLTLSTEYL SELSKQYVSQ MSGYQHELKE TRKASRKSAQ TVEAIMRSKI NNVKRELREL RQSVYLLQQL ENSRYKNAQN EMSRNVFMSS CHISSNVPPS PTLPKLPLII PSVSDKLENF TKFEERMKKI YQTAKSVMLG SITWNTDHLI VALISFNILA LSFLFAGVFY IHRRNKERCE ETQSIVRNEL RTRIAKVGIE NKKFISKGMR RAELAVTAAV SSALKVEKTS SSRKTMTELE TALANLFAAQ QVRIEEQFVQ NQRVLRDVLA EGQRPRADDN LSVEDSESLS ETEKEDTPIS NQD // ID A8XQ39_CAEBR Unreviewed; 520 AA. AC A8XQ39; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 2. DT 11-NOV-2015, entry version 43. DE SubName: Full=Protein CBG16940 {ECO:0000313|EMBL:CAP34764.2}; GN ORFNames=CBG16940 {ECO:0000313|EMBL:CAP34764.2, GN ECO:0000313|WormBase:CBG16940}, GN CBG_16940 {ECO:0000313|EMBL:CAP34764.2}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP34764.2, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP34764.2, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP34764.2, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=AF16; RG WormBase Consortium; RL Submitted (OCT-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE601348; CAP34764.2; -; Genomic_DNA. DR ProteinModelPortal; A8XQ39; -. DR STRING; 6238.CBG16940; -. DR EnsemblMetazoa; CBG16940; CBG16940; CBG16940. DR WormBase; CBG16940; CBP36173; WBGene00036733; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; KOG3159; Eukaryota. DR eggNOG; COG0095; LUCA. DR InParanoid; A8XQ39; -. DR OrthoDB; EOG7N37DX; -. DR Proteomes; UP000008549; Chromosome I. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR GO; GO:0006464; P:cellular protein modification process; IEA:InterPro. DR InterPro; IPR004143; BPL_LPL_catalytic. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51733; BPL_LPL_CATALYTIC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}. SQ SEQUENCE 520 AA; 58636 MW; B200F1EACBCC96B7 CRC64; MIILEGKVVS GKPSENSKPL VQSAADVLKN MVMEQPIIDS KQPKAEELIT PPPPLNLSNN DSYIPKKELI LNAADYLRGA SVGNTHSSRS NLNPIIGYDQ TNLVLLDRPQ PPTHKAWCSN EHNSVLTINL AKNIKPISVS YQHSKWTHHI PMSTPRTYDV VACFDSKCQN WVLLVSNCEY SSQSIGTEQF CNVSSHLNVP LIGTVQFRFK ENYGDSQMTC VSLVRVYGEP KPEINEEEEK SRREYICNKF KWYHHNSFFK NALTHNVERS GEVLLMWSNR PSVVIGRHQN PWVEVNLPFA KETNIEIARR HSGGGTVYHD QGNLNISLLT THAQHCRPKN LKFISDALNS NFTVQIVPNS RDDMELQPGN RKCSGTAARI AKGQAYHHLT LLIDADLEIL KKSLKSPFRD QIESNATRSV RALAVGFLRE DDGNASVEGA EMAISEAYRK LFEQSQFETI DVSSKIAANP EILKILEELK SWKWIYGKSP KFQFSGENGQ EIEVKDGLIM GTDQRFSTDF // ID A8XQ41_CAEBR Unreviewed; 193 AA. AC A8XQ41; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 15-JAN-2008, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Protein CBG16942 {ECO:0000313|EMBL:CAP34766.1}; GN ORFNames=CBG16942 {ECO:0000313|EMBL:CAP34766.1, GN ECO:0000313|WormBase:CBG16942}, GN CBG_16942 {ECO:0000313|EMBL:CAP34766.1}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP34766.1, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP34766.1, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP34766.1, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=AF16; RG WormBase Consortium; RL Submitted (OCT-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE601348; CAP34766.1; -; Genomic_DNA. DR RefSeq; XP_002648828.1; XM_002648782.1. DR STRING; 6238.CBG16942; -. DR EnsemblMetazoa; CBG16942; CBG16942; CBG16942. DR GeneID; 8590839; -. DR KEGG; cbr:CBG16942; -. DR CTD; 8590839; -. DR WormBase; CBG16942; CBP10413; WBGene00036735; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A8XQ41; -. DR OrthoDB; EOG7N37DX; -. DR Proteomes; UP000008549; Chromosome I. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}. SQ SEQUENCE 193 AA; 22325 MW; 8C6680DF359149BD CRC64; MFTGYDQTNL VLLDRPQPPT HKAWCSNEHN SVLTINLAKN IKPISVSYQH SKWTHHIPMS TPRTYDVVAC FDSKCQNWVL LVSNCEYSSQ SIGTEQFCNV SSHLNVPLIG TVQFRFKENY GDSQMTCVSL VRVYGEPKPE INEEEEKSLE SEEICTDLKY YYHNSYYFKY TLANKSCSTL YKNDCCSRLS RVL // ID A8XRE8_CAEBR Unreviewed; 528 AA. AC A8XRE8; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 15-JAN-2008, sequence version 1. DT 11-NOV-2015, entry version 41. DE SubName: Full=Protein CBG17734 {ECO:0000313|EMBL:CAP35222.1}; GN ORFNames=CBG17734 {ECO:0000313|EMBL:CAP35222.1, GN ECO:0000313|WormBase:CBG17734}, GN CBG_17734 {ECO:0000313|EMBL:CAP35222.1}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP35222.1, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP35222.1, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP35222.1, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE600976; CAP35222.1; -; Genomic_DNA. DR RefSeq; XP_002634380.1; XM_002634334.1. DR STRING; 6238.CBG17734; -. DR EnsemblMetazoa; CBG17734; CBG17734; CBG17734. DR GeneID; 8576375; -. DR KEGG; cbr:CBG17734; -. DR CTD; 8576375; -. DR WormBase; CBG17734; CBP10639; WBGene00037288; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A8XRE8; -. DR OMA; HFNAANI; -. DR OrthoDB; EOG7N37DX; -. DR Proteomes; UP000008549; Chromosome IV. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 2. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}. SQ SEQUENCE 528 AA; 60355 MW; 3319E5A5728A3328 CRC64; MGFSMQSQFG AIERKVEMLI SRKSNPDKTD YQRGASVDMD HSSSSNLNPI IGYDQTNLVL LYRPQLLADK AWCSNAENPV LTINLAKYIK PISVSYEHAH WEGIVQNGAP RTYDVVASLD FYCEKWKPLV SNCEYSQYGS NEQMCNISSR IDVPLIGKVQ FRSRENYGGT KMTCVHLVRV YGETKTPAKI EEKNLRSEEI YTDLRWYYHN SYFKYIWVFK LQSQVHKLEK KSEHKNHRFE NSEDIKPSIS DVILGSKIQG QTKPPVPINL KDPMISNEEF QFNAADFLNG ASVDNDHSSS SNLNPIIGYD QTNLVLLDRP QPPTDKAWCS NAENPVLTIY LAKYIKPISV SYQHSKWHGT IPNGAPKTYD VVACLDYNCE KLERLVSNCE YKSYGAGAQE QVCNINPHQN VSSIGKVQFR FRENYGNTEM TCVHLVRVYG ETKTPVKSKE KNLKSEEICS NLKLFIFYNE AKDIRLKMEK EPNPQEVELS ETEPLTSNAP TTSYTEVEET EKPETTPSDK PPRRPGVW // ID A8XRE9_CAEBR Unreviewed; 415 AA. AC A8XRE9; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 2. DT 11-NOV-2015, entry version 37. DE SubName: Full=Protein CBG17732 {ECO:0000313|EMBL:CAP35223.2}; GN ORFNames=CBG17732 {ECO:0000313|EMBL:CAP35223.2, GN ECO:0000313|WormBase:CBG17732}, GN CBG_17732 {ECO:0000313|EMBL:CAP35223.2}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP35223.2, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP35223.2, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP35223.2, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE600976; CAP35223.2; -; Genomic_DNA. DR STRING; 6238.CBG17732; -. DR EnsemblMetazoa; CBG17732; CBG17732; CBG17732. DR WormBase; CBG17732; CBP34804; WBGene00037287; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A8XRE9; -. DR OMA; YNGLASK; -. DR OrthoDB; EOG7N37DX; -. DR Proteomes; UP000008549; Chromosome IV. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 57 76 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 380 406 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 415 AA; 46943 MW; 00B3848675CF264E CRC64; MKPKFGSETK PFLEDVESSK PPNPFQFKKN EYYGSQKPIL KEQSFYRYIK FQVFHQPIVN AVVALFAFLV LVQIYSDRIR IFALEQRLYQ VELQLNSAPR EDSHKANTVI QETAKTVTPT TNIVPTVKPE SSNKKSEFNA ASLILGATVE TELSSNPVPS GDGFLDKLMF KLGSDQSGYV LLDRDPILAG KAWCSDEKNP VLTVKLEEDI KPIAVSYQHS KWNGTVPADA PMTYDVVACI NESCNVTVAL ASNCNYSSSK NEKEQKCQIS NTYPSVNRVQ FRFNKNHGNS SKTCLYLIRV YGEAKEVVRN QKEDNVELKK SRVGLCSRLA WFHKKIPIFY NGLASKNCTT LYSNECCDEC PECCSGCQIN DGDFLNNFQF ILIFFVLFFI LFPIYIAGIS ACCVGLRRLF SRAFY // ID A8XRF8_CAEBR Unreviewed; 482 AA. AC A8XRF8; DT 15-JAN-2008, integrated into UniProtKB/TrEMBL. DT 15-JAN-2008, sequence version 1. DT 11-NOV-2015, entry version 51. DE SubName: Full=Protein CBR-SUN-1 {ECO:0000313|EMBL:CAP35232.1}; GN Name=sun-1 {ECO:0000313|WormBase:CBG17722}; GN Synonyms=Cbr-sun-1 {ECO:0000313|EMBL:CAP35232.1}; GN ORFNames=CBG17722 {ECO:0000313|WormBase:CBG17722}, GN CBG_17722 {ECO:0000313|EMBL:CAP35232.1}; OS Caenorhabditis briggsae. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6238 {ECO:0000313|EMBL:CAP35232.1, ECO:0000313|Proteomes:UP000008549}; RN [1] {ECO:0000313|EMBL:CAP35232.1, ECO:0000313|Proteomes:UP000008549} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AF16 {ECO:0000313|EMBL:CAP35232.1, RC ECO:0000313|Proteomes:UP000008549}; RX PubMed=14624247; DOI=10.1371/journal.pbio.0000045; RA Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., RA Chinwalla A., Clarke L., Clee C., Coghlan A., Coulson A., RA D'Eustachio P., Fitch D.H., Fulton L.A., Fulton R.E., RA Griffiths-Jones S., Harris T.W., Hillier L.W., Kamath R., RA Kuwabara P.E., Mardis E.R., Marra M.A., Miner T.L., Minx P., RA Mullikin J.C., Plumb R.W., Rogers J., Schein J.E., Sohrmann M., RA Spieth J., Stajich J.E., Wei C., Willey D., Wilson R.K., Durbin R., RA Waterston R.H.; RT "The genome sequence of Caenorhabditis briggsae: a platform for RT comparative genomics."; RL PLoS Biol. 1:166-192(2003). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE600976; CAP35232.1; -; Genomic_DNA. DR RefSeq; XP_002634370.1; XM_002634324.1. DR STRING; 6238.CBG17722; -. DR EnsemblMetazoa; CBG17722; CBG17722; CBG17722. DR GeneID; 8576365; -. DR KEGG; cbr:CBG17722; -. DR CTD; 8576365; -. DR WormBase; CBG17722; CBP10634; WBGene00037278; Cbr-sun-1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000021176; -. DR InParanoid; A8XRF8; -. DR OMA; VPNHAPK; -. DR OrthoDB; EOG7BZVX6; -. DR Proteomes; UP000008549; Chromosome IV. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008549}; KW Reference proteome {ECO:0000313|Proteomes:UP000008549}. FT COILED 168 188 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 482 AA; 55064 MW; 00553A62BEB44482 CRC64; MALRHTISPQ LSNRPSPPIT RSVSRNGHRP TFETSTPVTR RSLQPGMEIG TIERVFEQAD DSTIDLNSSK FVYKEHFTVK ERTSMQKEMW YDWLVYHIRM IRRHLFPNTE KIRETLLVLV LLFILHKYAL DCWYDPTTKA EQPINHQSRI EFDSKWTPEI ERIIEESNKN LQMSISIIKD RENQLEDRTT HLETLSDDEK GWKESVVEEI RKIKASQTAI DQLVESLKKD LEENKLSKVI ITEEDRQPTG PPEPPGSSSS ILLHPMHFLH RSPIGVNVAN SLIGASIDYS CSSRPVSAKD GIFYDIMSYF GSFKEGYVLL DREVLTPGEA WCTYDKRPTL TVKLARFVIP TAVSYQHVRW SGIVPNHAPK LYDLVACLDS CCTQSEPLIL NCEYTASEEG PDEQEQFCSI PTINSMQPIN HVQFRFRENH GNMTKTCAYL VRVYGKPVNP QPPLTEQDAA ALDNGTTSHL ESTLVDSVPE SA // ID A9JT00_DANRE Unreviewed; 849 AA. AC A9JT00; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 41. DE SubName: Full=Zgc:92151 protein {ECO:0000313|EMBL:AAI55146.1}; GN Name=sun1 {ECO:0000313|ZFIN:ZDB-GENE-050522-551}; GN ORFNames=zgc:92151 {ECO:0000313|EMBL:AAI55146.1}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955 {ECO:0000313|EMBL:AAI55146.1}; RN [1] {ECO:0000313|EMBL:AAI55146.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Testes {ECO:0000313|EMBL:AAI55146.1}; RG NIH - Zebrafish Gene Collection (ZGC) project; RL Submitted (NOV-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC155145; AAI55146.1; -; mRNA. DR UniGene; Dr.105339; -. DR STRING; 7955.ENSDARP00000104532; -. DR PaxDb; A9JT00; -. DR ZFIN; ZDB-GENE-050522-551; sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG104132; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 275 297 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 309 333 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 483 503 {ECO:0000256|SAM:Coils}. FT COILED 521 548 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 849 AA; 94229 MW; 871939B50411AF24 CRC64; MDFSRLHTYT PPHCTPDNTG YTYSLSSSYS TAALEFEKEH KINPVYDSPK MSRRSLRLQT SSGLYDNSFT EVAGNHSVGS YKRTNTSTTT TTSSSSSVSR SVRGRRQQQD SSIYESQSVT GTPQSTSDLS FTSTDASLIS NLLDQSTLRQ SSTTETYSAT RRRRAVNRSL LENGNVSKTE AHANLANGYF CKDCSFHAEG NEKETSYSVP YSTSESAAYQ TTEAADATMT TMTTSLNSVD GAAHDSYCGS VNVRDVVTAD HLNLNGSLWK AATGAFWWLG TGWYQLVALM SLINVFLLTR CLPKLLKLLL FLLPFLLLFG LWYLGLPIAL SFLPAVNLTE WKTSVTSFAS LPALPSFPSF PSLPALPSFT KEPLLKEQDV PPLVVAQAAS DSINSERLAL LEQRVSALWE SVRQGELKAK QQHEEALGLT QSLQEQIKTQ TDRESLGLWV TELLQPKFTA LEGDMKTETL SRAETEEQHI QHQNILEARL AELEVLLQNL NSRTEDIHLS QQTPVQAPVS VGVSQEKHEA LLSEVQRLEA ELGRIRGDLQ GVMGCQGKCD RLDTIHETVS AQVKEQLYAL LYGRDRGEAV IPEPLLPWLA SQYTSTSDLT ATLVTLERSI LGNLSLQLQE SKHQQASAET VTQTVAHTAE AAGMSEEQVQ LIVQRALKLY SEDRTGQVDY ALESGGGSVL STRCSETYET KTALMSLFGI PLWYFSQSPR VVIQPDMYPG NCWAFKGSQG YLVIRLSLRV IPNGFCLEHI PKSLSPSGNI SSAPRRFSVY GLDDEYQDEG KLLGDYTYQE DGDSLQNFPV MEENDKAFQI IEMRVLSNWG HPEYTCLYRF RVHGKPHAQ // ID A9RVG5_PHYPA Unreviewed; 300 AA. AC A9RVG5; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDQ77053.1}; GN ORFNames=PHYPADRAFT_178574 {ECO:0000313|EMBL:EDQ77053.1}; OS Physcomitrella patens subsp. patens (Moss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Bryophyta; OC Bryophytina; Bryopsida; Funariidae; Funariales; Funariaceae; OC Physcomitrella. OX NCBI_TaxID=3218 {ECO:0000313|Proteomes:UP000006727}; RN [1] {ECO:0000313|EMBL:EDQ77053.1, ECO:0000313|Proteomes:UP000006727} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Gransden 2004 {ECO:0000313|Proteomes:UP000006727}; RX PubMed=18079367; DOI=10.1126/science.1150646; RA Rensing S.A., Lang D., Zimmer A.D., Terry A., Salamov A., Shapiro H., RA Nishiyama T., Perroud P.-F., Lindquist E.A., Kamisugi Y., RA Tanahashi T., Sakakibara K., Fujita T., Oishi K., Shin-I T., RA Kuroki Y., Toyoda A., Suzuki Y., Hashimoto S.-I., Yamaguchi K., RA Sugano A., Kohara Y., Fujiyama A., Anterola A., Aoki S., Ashton N., RA Barbazuk W.B., Barker E., Bennetzen J.L., Blankenship R., Cho S.H., RA Dutcher S.K., Estelle M., Fawcett J.A., Gundlach H., Hanada K., RA Heyl A., Hicks K.A., Hughes J., Lohr M., Mayer K., Melkozernov A., RA Murata T., Nelson D.R., Pils B., Prigge M., Reiss B., Renner T., RA Rombauts S., Rushton P.J., Sanderfoot A., Schween G., Shiu S.-H., RA Stueber K., Theodoulou F.L., Tu H., Van de Peer Y., Verrier P.J., RA Waters E., Wood A., Yang L., Cove D., Cuming A.C., Hasebe M., RA Lucas S., Mishler B.D., Reski R., Grigoriev I.V., Quatrano R.S., RA Boore J.L.; RT "The Physcomitrella genome reveals evolutionary insights into the RT conquest of land by plants."; RL Science 319:64-69(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS544920; EDQ77053.1; -; Genomic_DNA. DR RefSeq; XP_001758231.1; XM_001758179.1. DR UniGene; Ppa.14475; -. DR STRING; 3218.PP1S31_189V6.1; -. DR GeneID; 5921316; -. DR KEGG; ppp:PHYPADRAFT_178574; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A9RVG5; -. DR KO; K19347; -. DR Proteomes; UP000006727; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006727}; KW Reference proteome {ECO:0000313|Proteomes:UP000006727}. SQ SEQUENCE 300 AA; 33246 MW; 7BB67D78A783F91E CRC64; MSFTYLQVQL EVLDMKIEKE TNEVRNEFAQ KLDTQVAEVA SGVRGLRAQV DQLYESGVPL NRHEVMELVK NVVEQRASES TSKSFSLEDV RSVARKIVMS ELEKHAADGI GRTDYALASG GGRVVDHSEG VFLGRGQQWS SLMFGHIVPG GTRKHPLAQK VLQPSFGQPG ECLPLRGSNV FLEISLRTAI RPDAVTLEHV AKSVAYDLSS APKEFQLYGW REMRPTDGIA QPSPEHKLLG QFIYNSDGPS NVQTFQLSKD DVGDEPINMV RLQVLSNHGS PLHTCIYRIR VHGVDPQATL // ID A9RX16_PHYPA Unreviewed; 1132 AA. AC A9RX16; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDQ76548.1}; GN ORFNames=PHYPADRAFT_71659 {ECO:0000313|EMBL:EDQ76548.1}; OS Physcomitrella patens subsp. patens (Moss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Bryophyta; OC Bryophytina; Bryopsida; Funariidae; Funariales; Funariaceae; OC Physcomitrella. OX NCBI_TaxID=3218 {ECO:0000313|Proteomes:UP000006727}; RN [1] {ECO:0000313|EMBL:EDQ76548.1, ECO:0000313|Proteomes:UP000006727} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Gransden 2004 {ECO:0000313|Proteomes:UP000006727}; RX PubMed=18079367; DOI=10.1126/science.1150646; RA Rensing S.A., Lang D., Zimmer A.D., Terry A., Salamov A., Shapiro H., RA Nishiyama T., Perroud P.-F., Lindquist E.A., Kamisugi Y., RA Tanahashi T., Sakakibara K., Fujita T., Oishi K., Shin-I T., RA Kuroki Y., Toyoda A., Suzuki Y., Hashimoto S.-I., Yamaguchi K., RA Sugano A., Kohara Y., Fujiyama A., Anterola A., Aoki S., Ashton N., RA Barbazuk W.B., Barker E., Bennetzen J.L., Blankenship R., Cho S.H., RA Dutcher S.K., Estelle M., Fawcett J.A., Gundlach H., Hanada K., RA Heyl A., Hicks K.A., Hughes J., Lohr M., Mayer K., Melkozernov A., RA Murata T., Nelson D.R., Pils B., Prigge M., Reiss B., Renner T., RA Rombauts S., Rushton P.J., Sanderfoot A., Schween G., Shiu S.-H., RA Stueber K., Theodoulou F.L., Tu H., Van de Peer Y., Verrier P.J., RA Waters E., Wood A., Yang L., Cove D., Cuming A.C., Hasebe M., RA Lucas S., Mishler B.D., Reski R., Grigoriev I.V., Quatrano R.S., RA Boore J.L.; RT "The Physcomitrella genome reveals evolutionary insights into the RT conquest of land by plants."; RL Science 319:64-69(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS544922; EDQ76548.1; -; Genomic_DNA. DR RefSeq; XP_001758570.1; XM_001758518.1. DR UniGene; Ppa.16032; -. DR STRING; 3218.PP1S33_275V6.1; -. DR GeneID; 5921786; -. DR KEGG; ppp:PHYPADRAFT_71659; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A9RX16; -. DR Proteomes; UP000006727; Partially assembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006727}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006727}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 49 72 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 822 842 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1132 AA; 124376 MW; 85CFA94BE65D71BA CRC64; MLKKVAKSKH LKATRKKSER IERCKGDQLS QKELLQPEHG RRRYGSYGIS WFAVARIPAA LLVFVPLVLL FPSIEKLPNL EVIQPDRVIA RLVGGINVVN GVSVLVLGEL VGFFHSTILA ARNRAAPSSL SKTRESHSKK SYASHLSRQW CYLNDTEHLD AGACSVVNGS MAFANGDTQS LGTKIIHTIE GKSLYEELSA SKKHMSFFAS ASSVAEDGHD FFPVESAAFI ADEEAPSRRN GATSLPWSSG LRQPLKFCLV GVAGVDCLRH SSESDASTVQ LLEHGSPNGT ISWLSELPFL DQMCCSRERL STSGDSEAIV IDKPCLICQR DSLTDSVSCE PLCQMPTINH LTQSATRVCL CGPSFVEAFS ADVSKFRLSG NPFNTTPSGE AETTETPSLG QPSEVPIVPL EIVTQSSADQ AVKEELLKSD TSVEVQQELS RPLRVTQVKS LDEYKKTVLE KRRTVNGSGS IHYPQNQNPD GRFNYAAVSH GAKVVASNKD AKGASNLLVP DKDKYLRNPC SAEDKYIVVE LAEETFVDTV LIGNLEYHSS NVKNFELLGS PEVYPTEKWI SLGNFEAENV RHIQNFTLPE PKWVRTLKLH LLTHYGSEFY CTLTVLQIHG VDAIEHLLED WIVGDDVDLG KGGRRVLPNG TTGSSSSGAS AGEGVSDKGV TKSFKPNDTG DDMDPSDLLM DPLEKPMKDE CESNVNPEVR RKEEPAKGVS GGPQEGWLHL SGRPSGESVL KILMQKVKQL ELNHSLLDSY LGDLFEKYKG MFADIDNDLA AVAAQIRNET VIASSLVAHL HKIEMKRELE NEALDAELSA KFDALQNDMD SMRIRVKEME NREALAITIA LICLRSVDAY VADGYCVSLD FGNDPVILGR IYFECSQISY TRTIEFRWFC ALISFHMEVP DRHYLEGAVV RILLNPDANQ QQSCRFLVFS ECPCNFSLSS LKLLDLLEVR LSSIELSSAG TAMALKTRFY LWLCSFVLFV NCLLYTVDGG ELTLGVAAAF SVKHPQTYVN RKLRASIPSC TKADISITQG KSGNSNGIPA FSVQITNLCI NHNCQLRNIH VACAAFASAR PLDSRVFQRI KYNDCLVMGG APLRAGGSVA FEYANSSEYP MHVISADLGP CS // ID A9T9W0_PHYPA Unreviewed; 1300 AA. AC A9T9W0; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 36. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDQ59767.1}; GN ORFNames=PHYPADRAFT_168838 {ECO:0000313|EMBL:EDQ59767.1}; OS Physcomitrella patens subsp. patens (Moss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Bryophyta; OC Bryophytina; Bryopsida; Funariidae; Funariales; Funariaceae; OC Physcomitrella. OX NCBI_TaxID=3218 {ECO:0000313|Proteomes:UP000006727}; RN [1] {ECO:0000313|EMBL:EDQ59767.1, ECO:0000313|Proteomes:UP000006727} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Gransden 2004 {ECO:0000313|Proteomes:UP000006727}; RX PubMed=18079367; DOI=10.1126/science.1150646; RA Rensing S.A., Lang D., Zimmer A.D., Terry A., Salamov A., Shapiro H., RA Nishiyama T., Perroud P.-F., Lindquist E.A., Kamisugi Y., RA Tanahashi T., Sakakibara K., Fujita T., Oishi K., Shin-I T., RA Kuroki Y., Toyoda A., Suzuki Y., Hashimoto S.-I., Yamaguchi K., RA Sugano A., Kohara Y., Fujiyama A., Anterola A., Aoki S., Ashton N., RA Barbazuk W.B., Barker E., Bennetzen J.L., Blankenship R., Cho S.H., RA Dutcher S.K., Estelle M., Fawcett J.A., Gundlach H., Hanada K., RA Heyl A., Hicks K.A., Hughes J., Lohr M., Mayer K., Melkozernov A., RA Murata T., Nelson D.R., Pils B., Prigge M., Reiss B., Renner T., RA Rombauts S., Rushton P.J., Sanderfoot A., Schween G., Shiu S.-H., RA Stueber K., Theodoulou F.L., Tu H., Van de Peer Y., Verrier P.J., RA Waters E., Wood A., Yang L., Cove D., Cuming A.C., Hasebe M., RA Lucas S., Mishler B.D., Reski R., Grigoriev I.V., Quatrano R.S., RA Boore J.L.; RT "The Physcomitrella genome reveals evolutionary insights into the RT conquest of land by plants."; RL Science 319:64-69(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS545080; EDQ59767.1; -; Genomic_DNA. DR RefSeq; XP_001775438.1; XM_001775386.1. DR UniGene; Ppa.14143; -. DR STRING; 3218.PP1S191_3V6.1; -. DR GeneID; 5938605; -. DR KEGG; ppp:PHYPADRAFT_168838; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A9T9W0; -. DR Proteomes; UP000006727; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006727}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006727}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 53 72 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 818 838 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1300 AA; 143661 MW; 00743FBB0996A4D4 CRC64; MQKKGGKSKH HRAAGKEVER IERCEGVALF QKDLQQQQQL QELGKERKGC YRISWCAVVS ISATLFLFLS LFPSLPSIVK LPDLEGVRAS WQHFSLYSRL SHPEFEISGL SHFTPLVSKN WVSPFSLSKT KELHSRKWFA SQSSGQRCHL NDTAHLDADT CLIFNGSAVI GNGGCLKSIY AGDGSRNSHE SFSASRKYMS FFASASSVAE EADHFLPVES PSPSSSDEES SSLGDVIISL PWSSSVGEPL KLCLVGETGA DCLRHSSDGD AFSSQGSPDG TVSWVSELPS LDQICPTQES LPSSSDSEAE LSSAIQINQQ YLVCRRNLGT GSVVCEPLCQ VSTLDYLTQS AARVCPCSPS LVEESNADAS RISFSGSLLS KASSVDELQI SDAPSLEEPS EVPEISFDIV TQTLANEDVS EEIVKFDELV EIQQEVSRPS RVVQVKSLDE YKKAVIEKRR TVNGSGSLHH LQHQNQEGRF NYADVSHGAK VVASNKDAKG ASNLLVPDKD KYLRNPCSAE DKHIVVELAE ETLVDTIIIG NLEYHSSNVK NFELLGSPEV YPTDDWISLG NFEAENVRHI QNFTLPEPKW VRTLKLRLLS HYGSEFYCTL TLLQIHGVDA IEHLLEDWIV GDDVDLGKGV RRIIPNGTSG MGNGGRAGES ISDKGETASL KANDSDNDLD SLDTLMDPLE KPVKDERGIS VGPEERVKKE AAKGVNGGPP EAWLHLSGRP SGESVLKILM QKVKQLELNH SLLDSYIGEL YEKYKEMFAD IDNDLAGVAA QLRNETAIAA TLVAHLQEIE LRREAENEAL NARLSSKFDA LQNDMELMRV RIQNMENREA LAITIALICL HGVETQPVFG LVYHCSRMCH QVSNLERPIC GDEPLDTHMS DEELLDFESA SHSPNQERCS MLITAPCHFD RQYTDAESFM NLSVVSCQQL SALMCYALSS SKIEVDPFLP TCKYLTVLNI YLNRRTSRRA PDPATTRKPY VQVTLVLCCS NFTHVKRNVL EVWSVPTSLS FQIEASHLLR WRSHYAGVPQ TTISEPSSVT LEYTHYDWSS LIDSWTWRYR VQRFVRAEYR RFGQGESHTR MSQRRQSHPI WCCMCPTVLV KPARGCSDEN EAMEIGETLS SIELSSSGTA MALKTRFYLW LCSFMLLGNC LLHTVEGGEL GLGLTAAVSV KHPQTYVNRK LRASTPSCTK ADISITQGKS GNSNGIPAFS VQITNLCVNH NCQLKNIHVA CAAFASARPL DSHVFQRIKY NDCLVMGGAP LRAGGSVAFE YANSSEYPMH VISAELGPCA // ID A9TD16_PHYPA Unreviewed; 293 AA. AC A9TD16; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 14-OCT-2015, entry version 26. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDQ58664.1}; DE Flags: Fragment; GN ORFNames=PHYPADRAFT_143719 {ECO:0000313|EMBL:EDQ58664.1}; OS Physcomitrella patens subsp. patens (Moss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Bryophyta; OC Bryophytina; Bryopsida; Funariidae; Funariales; Funariaceae; OC Physcomitrella. OX NCBI_TaxID=3218 {ECO:0000313|Proteomes:UP000006727}; RN [1] {ECO:0000313|EMBL:EDQ58664.1, ECO:0000313|Proteomes:UP000006727} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Gransden 2004 {ECO:0000313|Proteomes:UP000006727}; RX PubMed=18079367; DOI=10.1126/science.1150646; RA Rensing S.A., Lang D., Zimmer A.D., Terry A., Salamov A., Shapiro H., RA Nishiyama T., Perroud P.-F., Lindquist E.A., Kamisugi Y., RA Tanahashi T., Sakakibara K., Fujita T., Oishi K., Shin-I T., RA Kuroki Y., Toyoda A., Suzuki Y., Hashimoto S.-I., Yamaguchi K., RA Sugano A., Kohara Y., Fujiyama A., Anterola A., Aoki S., Ashton N., RA Barbazuk W.B., Barker E., Bennetzen J.L., Blankenship R., Cho S.H., RA Dutcher S.K., Estelle M., Fawcett J.A., Gundlach H., Hanada K., RA Heyl A., Hicks K.A., Hughes J., Lohr M., Mayer K., Melkozernov A., RA Murata T., Nelson D.R., Pils B., Prigge M., Reiss B., Renner T., RA Rombauts S., Rushton P.J., Sanderfoot A., Schween G., Shiu S.-H., RA Stueber K., Theodoulou F.L., Tu H., Van de Peer Y., Verrier P.J., RA Waters E., Wood A., Yang L., Cove D., Cuming A.C., Hasebe M., RA Lucas S., Mishler B.D., Reski R., Grigoriev I.V., Quatrano R.S., RA Boore J.L.; RT "The Physcomitrella genome reveals evolutionary insights into the RT conquest of land by plants."; RL Science 319:64-69(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS545096; EDQ58664.1; -; Genomic_DNA. DR RefSeq; XP_001776531.1; XM_001776479.1. DR GeneID; 5939702; -. DR KEGG; ppp:PHYPADRAFT_143719; -. DR InParanoid; A9TD16; -. DR KO; K19347; -. DR Proteomes; UP000006727; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006727}; KW Reference proteome {ECO:0000313|Proteomes:UP000006727}. FT COILED 22 42 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EDQ58664.1}. SQ SEQUENCE 293 AA; 32416 MW; 69193036B0EADE42 CRC64; VQLEVLDMKI EKGSNDLRNE FKEKLDSQVA EMESGVKDLK AQVDRLYQGG GPLNRDEVIE LVKKFMEHRA SDSTGKSFSL EDVRSIARKI VMSEVEKHAA DGIGRTDYAL ASGGGRVVDH SEGVFLGRGR QLFSMVFSSI LTGEARKHPL AQKVLEPSFG EPGECLPLKG SNVFVEISLR TAILPDAVTL EHVSKSVAYD LSSAPKEFQL YGWREARPAD RSAQPSPVHR LLGQFVYKTD GPSNIQTFHL SKEDVGDEPI NMVRLHVLSN YGSTLHTCIY RVRVHGVDPQ GTL // ID A9V1D6_MONBE Unreviewed; 936 AA. AC A9V1D6; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDQ88548.1}; GN ORFNames=37385 {ECO:0000313|EMBL:EDQ88548.1}; OS Monosiga brevicollis (Choanoflagellate). OC Eukaryota; Choanoflagellida; Codonosigidae; Monosiga. OX NCBI_TaxID=81824 {ECO:0000313|Proteomes:UP000001357}; RN [1] {ECO:0000313|EMBL:EDQ88548.1, ECO:0000313|Proteomes:UP000001357} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MX1 / ATCC 50154 {ECO:0000313|Proteomes:UP000001357}; RX PubMed=18273011; DOI=10.1038/nature06617; RG JGI Sequencing; RA King N., Westbrook M.J., Young S.L., Kuo A., Abedin M., Chapman J., RA Fairclough S., Hellsten U., Isogai Y., Letunic I., Marr M., Pincus D., RA Putnam N., Rokas A., Wright K.J., Zuzow R., Dirks W., Good M., RA Goodstein D., Lemons D., Li W., Lyons J.B., Morris A., Nichols S., RA Richter D.J., Salamov A., Bork P., Lim W.A., Manning G., Miller W.T., RA McGinnis W., Shapiro H., Tjian R., Grigoriev I.V., Rokhsar D.; RT "The genome of the choanoflagellate Monosiga brevicollis and the RT origin of metazoans."; RL Nature 451:783-788(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH991554; EDQ88548.1; -; Genomic_DNA. DR RefSeq; XP_001746652.1; XM_001746600.1. DR STRING; 431895.XP_001746652.1; -. DR EnsemblProtists; EDQ88548; EDQ88548; MONBRDRAFT_37385. DR GeneID; 5891801; -. DR KEGG; mbr:MONBRDRAFT_37385; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; A9V1D6; -. DR Proteomes; UP000001357; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001357}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001357}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 936 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002745166. FT TRANSMEM 803 826 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 745 765 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 936 AA; 98026 MW; B86B2C07CAB23BE8 CRC64; MARGASIRRL RLALVLTCLL LAAVGGQVDG GHDKSVHERP EDLADKDAGF IVESETAPEK AGPGSNAIHA TVVTPSDVPP APQAELGSAP SPASMPSPPT ASDVAEGVDD LPIHEVPTPD AAEPSQGSDA SPDDGEVSPE ESPSAQVDES LTFDEWKQLI NEDEEIIEAA KASAILKPVK DTYLIAPCEA DIELTIELCE HIIVQQVKLA NYEMFSSMIK TLALYVAERS SPENWHKVAE LQLADVRDLQ TFAIDDVTIF AKFLKLEAID HYGNEFYCPI KYETAQSPYC AVFFKGGQIV LQVFGVTMLE EFEYMVGDPD SGIEPAQPSV AINAAASHTP TNEEASTEDN TGGASATGDA VNKIGQFVAK ALNVGHMLWG EKPNRPAASA PPALAAAPAL SQMMQNHTLD ASLNDPQQHF LWPICSPRAN ETLPPILDSA LAGAETADNS EAQQTHHEAE QGGAGMADMD GMEAGERCEG ACDGPRADSA HPAGAPARQA AADSSDTAES NKEDASTKPQ EQVAPADDTH VSTQVPTAPT DSAAPTLGSG GAELMPAGHA EETGTPVSAS NPEAQPLHHE GDSKDTPGSV DDHRASPEGV IRATLAHVEA EQAAIKNATS AEAAPTPASA PAPAPAPAPT PAPASAPAPA PTPTPASSAA GRSSEGHEKN NTQTGAPANK AGHGKLNSAG KAGANKAGGG VDEHGPHAQP QLKATSLSVK GSALAQLRTM INAVEKNLTM SSTYLNKLSQ KALALDKNVS QMNQTLHAEL QTLTGAVQNL HAQVETLVKV SERLVGVNYH TTFFYWMWPN FLVLLVQIGV ALLFYWTLRR GNTVLLGLQQ QQQQALGVDG RAPPWANVTF PDVSLLPEPV PFSPDRRGSG EAEPRRGRAP QNALYSTPGR GVMSPLSGRS RSGSPLPSPS LAPRRRSFLT GGAHET // ID A9V320_MONBE Unreviewed; 486 AA. AC A9V320; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDQ87986.1}; GN ORFNames=33018 {ECO:0000313|EMBL:EDQ87986.1}; OS Monosiga brevicollis (Choanoflagellate). OC Eukaryota; Choanoflagellida; Codonosigidae; Monosiga. OX NCBI_TaxID=81824 {ECO:0000313|Proteomes:UP000001357}; RN [1] {ECO:0000313|EMBL:EDQ87986.1, ECO:0000313|Proteomes:UP000001357} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MX1 / ATCC 50154 {ECO:0000313|Proteomes:UP000001357}; RX PubMed=18273011; DOI=10.1038/nature06617; RG JGI Sequencing; RA King N., Westbrook M.J., Young S.L., Kuo A., Abedin M., Chapman J., RA Fairclough S., Hellsten U., Isogai Y., Letunic I., Marr M., Pincus D., RA Putnam N., Rokas A., Wright K.J., Zuzow R., Dirks W., Good M., RA Goodstein D., Lemons D., Li W., Lyons J.B., Morris A., Nichols S., RA Richter D.J., Salamov A., Bork P., Lim W.A., Manning G., Miller W.T., RA McGinnis W., Shapiro H., Tjian R., Grigoriev I.V., Rokhsar D.; RT "The genome of the choanoflagellate Monosiga brevicollis and the RT origin of metazoans."; RL Nature 451:783-788(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH991556; EDQ87986.1; -; Genomic_DNA. DR RefSeq; XP_001747062.1; XM_001747010.1. DR STRING; 431895.XP_001747062.1; -. DR EnsemblProtists; EDQ87986; EDQ87986; MONBRDRAFT_33018. DR GeneID; 5892383; -. DR KEGG; mbr:MONBRDRAFT_33018; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; A9V320; -. DR Proteomes; UP000001357; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001357}; KW Reference proteome {ECO:0000313|Proteomes:UP000001357}. SQ SEQUENCE 486 AA; 53294 MW; 7D1EBA6F485F684B CRC64; MSVVRASTPT RQRRRTGPRT PPGFATNTLF SDDEDADRSV ASEYENQAET FASVLHSRIN IATPNTSSSI RHVQTTTELR RVDNDLGILS EDELELGPNE TLLYDDDGPI VYERRVHRVL RNTSEDDSAS AQTESVGRSP HAFAPRIIDS ALLAARYQQE RTSLDDVDKA LQSLIQARTQ AEASAHSQTE RPKGHFTICK QIKACVSSNL IMLRDKLARL EAQQTNFTHQ QLHVRDALNS QAQIMRQTMM TMAEAQFVNK QKASNASRSA AACPKCPSCP TCPTCPTCPN CPTNAPTKAC PETAPAPPSP LVNLASYARG ARIIETHCAR ARSSSWRSRW ESLFSMQPDQ SARAMLSDSM EPKQCWAFRG AAATALIQLA APTHVESVAL SHVAQAALPA SHNQSSAPRR FRVWAAGNSA APAMPAAHKM LLEADFDPAR SPQSFAIPAV HQHEARFIKL EIQSNHGNEY TCVYSFQVNG RATLVA // ID A9VB23_MONBE Unreviewed; 2345 AA. AC A9VB23; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 47. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDQ85263.1}; GN ORFNames=38937 {ECO:0000313|EMBL:EDQ85263.1}; OS Monosiga brevicollis (Choanoflagellate). OC Eukaryota; Choanoflagellida; Codonosigidae; Monosiga. OX NCBI_TaxID=81824 {ECO:0000313|Proteomes:UP000001357}; RN [1] {ECO:0000313|EMBL:EDQ85263.1, ECO:0000313|Proteomes:UP000001357} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MX1 / ATCC 50154 {ECO:0000313|Proteomes:UP000001357}; RX PubMed=18273011; DOI=10.1038/nature06617; RG JGI Sequencing; RA King N., Westbrook M.J., Young S.L., Kuo A., Abedin M., Chapman J., RA Fairclough S., Hellsten U., Isogai Y., Letunic I., Marr M., Pincus D., RA Putnam N., Rokas A., Wright K.J., Zuzow R., Dirks W., Good M., RA Goodstein D., Lemons D., Li W., Lyons J.B., Morris A., Nichols S., RA Richter D.J., Salamov A., Bork P., Lim W.A., Manning G., Miller W.T., RA McGinnis W., Shapiro H., Tjian R., Grigoriev I.V., Rokhsar D.; RT "The genome of the choanoflagellate Monosiga brevicollis and the RT origin of metazoans."; RL Nature 451:783-788(2008). CC -!- SIMILARITY: Contains 2 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH991575; EDQ85263.1; -; Genomic_DNA. DR RefSeq; XP_001749884.1; XM_001749832.1. DR SMR; A9VB23; 1263-1318. DR STRING; 431895.XP_001749884.1; -. DR EnsemblProtists; EDQ85263; EDQ85263; MONBRDRAFT_38937. DR GeneID; 5895203; -. DR KEGG; mbr:MONBRDRAFT_38937; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; A9VB23; -. DR KO; K12231; -. DR OMA; NILEATM; -. DR Proteomes; UP000001357; Unassembled WGS sequence. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0043161; P:proteasome-mediated ubiquitin-dependent protein catabolic process; IBA:GO_Central. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR000421; FA58C. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 2. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50022; FA58C_3; 1. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000001357}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001357}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2345 AA; 258087 MW; 455241D025BED886 CRC64; MEVDPDTLLE WLSLPGDTQL IALEQLCMLL LLSDNVDRVF EHQTPNLPGF HPAALLTVRC PPRRFIPALC NVLRNDDCPI NILEATMRAL TFYLDVSGDC ARRVVGEEGA VANICQRLGC AELLQPSSRD LAIQCIKVLE LISNRDADAV YQAGALRSCL TFVMTGKEIV FKDAINSAMS VVTKCCSRLS AADAFVADAI AHFTLLLNED PDFVPRVLPC LAAIVDKFQR ENADFDILVG SGMVTILIEL LNRSAPQPDL DSSAVGPGSD SDVPQARTIV GLLLALARAS PQVTRLLQAS PILPTAVAAC LCRNDRLAAD AIDLVELLFV VLLEGRGALQ KLRQKRVIAL GGRDASLRSV IEHIRNKDTG AVLEAVEGGI DPNSTDEVGQ TLLNWASAFG TNDMVEFLCE SGADPDRGER SSSLHYAASF GRPQIVRLLL RYNASTELHD DAGKTALERA REKPDDAHGE CVALLENPEA HRDGDDEDED EEEEDQEAEV ARPAGGSDAL VIGASSRTRA DSEHTSSAGP TGTAVSGTGR EEAAAFSGFH EHPDTAQAFL EGMLPVVCQA VSSVQDAALQ SSLLRQLNKV ATTIPKPMLE ALAQSEALAL EQAIIPLPDI LQQDNGWHLH QATTFLQTLM SRLDDRVRDH LLRLGIVPLL QQLAAKQATE TEPVSVAQPG MKLEARHPTL GGFYAATLLA TNSALYDSWH VRLDGSEHQE HDFWTRPMSD AVRPLGMSQA RGQTLQLPEV DGATPSSWQA YLEATAAQSI PAEAFDRATA VKEKDARKRR EAYGIAAAIC EAYFSDADSM RAVVKRLQQL GNRLVTMTKA TPNGCVTPAE PSSPQASMHD SLGMTFALDR QLSVNSQMAL QRKVDMQACL ANLRTLLLDE SISAFELLQA KLVPRCAVQP SCIALSSSRL LEFADAQNLL FFLQHESQDM AVSRVDAAAD RNMRCELFKL AFAVPVDVAP ERNPLRTLVL KVIDVLEHTE SLPILLHESP GSGHGLQVLQ KRLKFKLLRD PRDETLLDLT GNAFKMEPLA TVQSLIDFLE PKVQKQWCDF PRPEISFVQR LVGGERVVCT YQSDFDEQGV LYWIGTNGLT ANWINPSKVA LVFINTSCGR KDSWFSIDLG IYVKPTCYTL RHARGYHRSA LRDWDFQVSE DGDTWTTIRE HRGDTALDEP GSTATFEVSA PEHATGWRHF RIFMRGPTAN SNTHYLSCSG FELYGTITGA SEVSFAKAVL REERKVYALQ RHAKKAAAKF KIGTRVKRGP CWKWGNQDGD PPGPGTVTGL PRNGWIDVKW DAGCSNSYRV GADDKYDLLP LDDEAHNDAP DDSDMRTAPA ASSHEAATAT GAASDPAQET TEMSTVADAL VQDMVFDMSA DGEGMDDDGD DDDDDDDDDD DDDEDDAMDS GEDSYGALAH VFGAPRSAAM ATTSGHMEED SVLEGLRDLE AGFAHDAFPM STPSRLGAHR RLLELMRKRD TIAASPRTRE LSWDENFVLK RQFSALVPAF DPRPGRHNVS ATTDLRVPPP GSPTDSFAAR TMRRRQEQRI CLFLVDSEAN GDPETLRENL TPDHMHRLPS DSTIFRAIQR HWGRGTWVAG SLGQLSLWNQ SSTALCLTPA GFGSFDADNS SDRLKRTWEP TYTIVYRSES VAPEAMDEDG WDLQSVRARL EAGELKRADV FAYLQSKASV AWLRLWNLND QLTSQQVKGW SIKQVVAAFK DFLQTRTQGE DAVDPDAPRP PPLNVSAFNA EALLAATSVA VADDTTTPAL QLIEFLYDMM QAAVVEEFSG AGEAATDTFA LNVDKADFVS HRLTSKLRQQ LMDPIVLASE ALPEWCEELT TRCPVLFPLE ARQLYFSCTA FGVSRAIAWI QSRQDDARGR QDAAEHRIGR LTHERVVLQR GDNFFEWACN AFRLHADHKS ILEVEFEGEE GTGLGPTLEF YSLMASELQR KDLCLWICED ALGASARTVD LGTGNKPPGY YVQRSNGLFP APLPQTGKLI DSVCERFFVL GIFLAKALQD DRRIDLPLSR SFCKLLVGKE LVFSDLYDVA PELATTLRVL EVCVCLSSDM ETASAFSTMC TSIQLFELCI HRSAEEQGVA AQKRLIDAEC EGDTEERDAA YEELTFMWPD TNTAVRVSDL SLSMVYSPSS TVFGLAEIPL RPNGEDEDVA LHNLSEYIER LTAFVLREGV ARQLEALRDG FNSVFPIEKL GSFTADEIPH VLCGDQTVDW SFDELVTNTL PRHGYHTDSK GYLNFLRVLT ELTQDQRKAF LQFATGCPSL PPGGIKNLHP QMTIVRKLAE TGAEDDMFPS VNTCAHYVKL PEYSSAELLR ERLLVAITTK GFHMN // ID A9Z1W8_HUMAN Unreviewed; 354 AA. AC A9Z1W8; DT 05-FEB-2008, integrated into UniProtKB/TrEMBL. DT 05-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 53. DE SubName: Full=SUN domain-containing protein 5 {ECO:0000313|Ensembl:ENSP00000364673}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSP00000364673}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000364673, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000364673, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=11780052; DOI=10.1038/414865a; RA Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., RA Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., RA Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., RA Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M., Brown A.J., RA Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., RA Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., RA Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., RA Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., RA Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., RA Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., RA Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., RA Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., RA Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., RA Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., RA Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., RA Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., RA Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., RA Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., RA Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., RA Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., RA Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., RA Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., RA Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., RA Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., RA Rogers J.; RT "The DNA sequence and comparative analysis of human chromosome 20."; RL Nature 414:865-871(2001). RN [2] {ECO:0000313|Ensembl:ENSP00000364673} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000364673}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AL121756; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL139826; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_011526876.1; XM_011528574.1. DR UniGene; Hs.375186; -. DR ProteinModelPortal; A9Z1W8; -. DR SMR; A9Z1W8; 149-339. DR PRIDE; A9Z1W8; -. DR Ensembl; ENST00000375523; ENSP00000364673; ENSG00000167098. DR GeneID; 140732; -. DR CTD; 140732; -. DR HGNC; HGNC:16252; SUN5. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000007503; -. DR HOVERGEN; HBG055206; -. DR NextBio; 35465459; -. DR Proteomes; UP000005640; Chromosome 20. DR Bgee; A9Z1W8; -. DR ExpressionAtlas; A9Z1W8; baseline and differential. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Proteomics identification {ECO:0000213|PeptideAtlas:A9Z1W8}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}. FT COILED 130 157 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 354 AA; 40398 MW; DEC52E5F5DE91BC7 CRC64; MPRSSRSPGD PGALLEDVAH NPRPRRIAQR GRNTSRMAED TSPNMTWFTC FACSLRTQAQ QVLFNTCRCK LLCQKLMEKT GILLLCAFGF WMFSIHLPSK MKVWQDDSIN GPLQSLRLYQ EKVRHHSGEI QDLRGSMNQL IAKLQEMEAM SDEQKMAQKI MKMIHGDYIE KPDFALKSIG ASIDFEHTSV TYNHEKAHSY WNWIQLWNYA QPPDVILEPN VTPGNCWAFE GDRGQVTIQL AQKVYLSNLT LQHIPKTISL SGSLDTAPKD FVIYGMEGSP KEEVFLGAFQ FQPENIIQMF PLQNQPARAF SAVKVKISSN WGNPGFTCLY RVRVHGSVAP PREQPHQNPY PKRD // ID B0CM82_PAPAN Unreviewed; 438 AA. AC B0CM82; DT 26-FEB-2008, integrated into UniProtKB/TrEMBL. DT 26-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Sperm associated antigen 4 (Predicted) {ECO:0000313|EMBL:ABY64674.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPANP00000016749}; GN Name=SPAG4 {ECO:0000313|EMBL:ABY64674.1}; OS Papio anubis (Olive baboon). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Papio. OX NCBI_TaxID=9555 {ECO:0000313|EMBL:ABY64674.1}; RN [1] {ECO:0000313|EMBL:ABY64674.1} RP NUCLEOTIDE SEQUENCE. RA Antonellis A., Benjamin B., Blakesley R.W., Bouffard G.G., RA Brinkley C., Brooks S., Chu G., Chub I., Coleman H., Fuksenko T., RA Gestole M., Gregory M., Guan X., Gupta J., Gurson N., Han E., Han J., RA Hansen N., Hargrove A., Hines-Harris K., Ho S.-L., Hu P., Hunter G., RA Hurle B., Idol J.R., Johnson T., Knight E., Kwong P., Lee-Lin S.-Q., RA Legaspi R., Madden M., Maduro Q.L., Maduro V.B., Margulies E.H., RA Masiello C., Maskeri B., McDowell J., Merkulov G., Montemayor C., RA Mullikin J.C., Park M., Prasad A., Ramsahoye C., Reddix-Dugue N., RA Riebow N., Schandler K., Schueler M.G., Sison C., Smith L., RA Stantripop S., Thomas J.W., Thomas P.J., Tsipouri V., Young A., RA Green E.D.; RT "NISC Comparative Sequencing Initiative."; RL Submitted (JAN-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000028761} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Liu Y.L., Abraham K.A., Akbar H.A., Ali S.A., Anosike U.A., RA Aqrawi P.A., Arias F.A., Attaway T.A., Awwad R.A., Babu C.B., RA Bandaranaike D.B., Battles P.B., Bell A.B., Beltran B.B., RA Berhane-Mersha D.B., Bess C.B., Bickham C.B., Bolden T.B., RA Carter K.C., Chau D.C., Chavez A.C., Clerc-Blankenburg K.C., RA Coyle M.C., Dao M.D., Davila M.L.D., Davy-Carroll L.D., Denson S.D., RA Dinh H.D., Fernandez S.F., Fernando P.F., Forbes L.F., Francis C.F., RA Francisco L.F., Fu Q.F., Garcia-Iii R.G., Garrett T.G., Gross S.G., RA Gubbala S.G., Hirani K.H., Hogues M.H., Hollins B.H., Jackson L.J., RA Javaid M.J., Jhangiani S.J., Johnson A.J., Johnson B.J., Jones J.J., RA Joshi V.J., Kalu J.K., Khan N.K., Korchina V.K., Kovar C.K., RA Lago L.L., Lara F.L., Le T.-K.L., Lee S.L., Legall-Iii F.L., RA Lemon S.L., Liu J.L., Liu Y.-S.L., Liyanage D.L., Lopez J.L., RA Lorensuhewa L.L., Mata R.M., Mathew T.M., Mercado C.M., Mercado I.M., RA Morales K.M., Morgan M.M., Munidasa M.M., Ngo D.N., Nguyen L.N., RA Nguyen T.N., Nguyen N.N., Obregon M.O., Okwuonu G.O., Ongeri F.O., RA Onwere C.O., Osifeso I.O., Parra A.P., Patil S.P., Perez A.P., RA Perez Y.P., Pham C.P., Pu L.-L.P., Puazo M.P., Quiroz J.Q., RA Rouhana J.R., Ruiz M.R., Ruiz S.-J.R., Saada N.S., Santibanez J.S., RA Scheel M.S., Schneider B.S., Simmons D.S., Sisson I.S., Tang L.-Y.T., RA Thornton R.T., Tisius J.T., Toledanes G.T., Trejos Z.T., Usmani K.U., RA Varghese R.V., Vattathil S.V., Vee V.V., Walker D.W., RA Weissenberger G.W., White C.W., Williams A.W., Woodworth J.W., RA Wright R.W., Zhu Y.Z., Han Y.H., Newsham I.N., Nazareth L.N., RA Worley K.W., Muzny D.M., Rogers J.R., Gibbs R.G.; RT "Whole Genome Assembly of Papio anubis."; RL Submitted (MAR-2012) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|Ensembl:ENSPANP00000016749} RP IDENTIFICATION. RG Ensembl; RL Submitted (OCT-2014) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AHZZ01051316; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; DP000552; ABY64674.1; -; Genomic_DNA. DR RefSeq; NP_001162482.1; NM_001169011.1. DR UniGene; Pan.18075; -. DR Ensembl; ENSPANT00000011816; ENSPANP00000016749; ENSPANG00000001308. DR GeneID; 100137482; -. DR CTD; 6676; -. DR GeneTree; ENSGT00390000011587; -. DR HOVERGEN; HBG079205; -. DR OMA; KHTPNFY; -. DR Proteomes; UP000028761; Chromosome 10. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000028761}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000028761}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 135 158 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 438 AA; 48011 MW; 417E10F9DB3297B1 CRC64; MRRSSRPGSA SSSRKHTPNF FSENSSVSIT SEDSNGLRSA GPGPGDPEGR GARGPSCGEP ALSAGVPGGT TWAGSSRQKP APRSHNWQTA RGAATVRGGA SEPTGSPAVS EEPLDLLPTL DLRQEMPPSR VFKSFLSLLF QVLSVLLSLA GDVLVSMYRE VCSIRFLFTA VSLLSLFLAA IWLGLLYLVS PLENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL QKTSHDYADR NTAYFWNRFS FWNYARPPTV ILEPHVFPGN CWAFEGDQGQ VVIQLPGRVQ LSDITLQHPP PSVEHTGGAN SAPRDFAVFG LQVDDETEVF LGKFTFDVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP RFTCLYRVRA HGVRTSEGAE GSATGGPH // ID B0CV30_LACBS Unreviewed; 908 AA. AC B0CV30; DT 26-FEB-2008, integrated into UniProtKB/TrEMBL. DT 26-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDR13257.1}; GN ORFNames=LACBIDRAFT_292492 {ECO:0000313|EMBL:EDR13257.1}; OS Laccaria bicolor (strain S238N-H82 / ATCC MYA-4686) (Bicoloured OS deceiver) (Laccaria laccata var. bicolor). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. OX NCBI_TaxID=486041 {ECO:0000313|Proteomes:UP000001194}; RN [1] {ECO:0000313|EMBL:EDR13257.1, ECO:0000313|Proteomes:UP000001194} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=S238N-H82 / ATCC MYA-4686 {ECO:0000313|Proteomes:UP000001194}; RX PubMed=18322534; DOI=10.1038/nature06556; RA Martin F., Aerts A., Ahren D., Brun A., Danchin E.G.J., Duchaussoy F., RA Gibon J., Kohler A., Lindquist E., Pereda V., Salamov A., RA Shapiro H.J., Wuyts J., Blaudez D., Buee M., Brokstein P., RA Canbaeck B., Cohen D., Courty P.E., Coutinho P.M., Delaruelle C., RA Detter J.C., Deveau A., DiFazio S., Duplessis S., RA Fraissinet-Tachet L., Lucic E., Frey-Klett P., Fourrey C., RA Feussner I., Gay G., Grimwood J., Hoegger P.J., Jain P., Kilaru S., RA Labbe J., Lin Y.C., Legue V., Le Tacon F., Marmeisse R., Melayah D., RA Montanini B., Muratet M., Nehls U., Niculita-Hirzel H., RA Oudot-Le Secq M.P., Peter M., Quesneville H., Rajashekar B., Reich M., RA Rouhier N., Schmutz J., Yin T., Chalot M., Henrissat B., Kuees U., RA Lucas S., Van de Peer Y., Podila G.K., Polle A., Pukkila P.J., RA Richardson P.M., Rouze P., Sanders I.R., Stajich J.E., Tunlid A., RA Tuskan G., Grigoriev I.V.; RT "The genome of Laccaria bicolor provides insights into mycorrhizal RT symbiosis."; RL Nature 452:88-92(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS547093; EDR13257.1; -; Genomic_DNA. DR RefSeq; XP_001875755.1; XM_001875720.1. DR STRING; 486041.XP_001875755.1; -. DR EnsemblFungi; EDR13257; EDR13257; LACBIDRAFT_292492. DR GeneID; 6071399; -. DR KEGG; lbc:LACBIDRAFT_292492; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B0CV30; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001194; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001194}; KW Reference proteome {ECO:0000313|Proteomes:UP000001194}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 908 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002746948. FT COILED 303 323 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 908 AA; 99300 MW; 6EAA82293BA81A9C CRC64; MLLLTFLTTT LFVILFADAA FSEPTSPNDQ FRAIALHVTR PTEPPVCCLK PLSPIEPVDD EILLSFEDWK AKQTAMQQAQ LNAREREKDT TNRSGNGVGG ANNGGAGTSD GTEGLVSKMD STPSNSGSSS TSEDSAQNPM TEPLSPHFQV PLTDRFNYAS LDCSARVHMA HRSAKSASSI LSSKRDRYML SPCKSSKQEK QFVVVELCDD VRIDTVQLAN FEFFSGVFKD FNVSVAKTWS TGVDGWTLAG SYKAKNVRGV QSFHPPTFLR DFYRYIRIDF LTHYGNEYYC PVSLLRVYGL THLEQWKWDI WEAESRAKQA ELEKAAPPHP AAHREVVAEA TLPAHITVSD ISTSVISSSE RSVDGVASSP PLAADTKAHT QSNNLLPSHG EPSSRLPPST IPKTPSANSI SPSITDTQNE PFTSASNHDH QPSHIPHDIY PQYSDAFVPS PSASSKDTSK SSSIIFDQSK SPSVSSTTAP IPNNHIVNPD VQPPPNIKNP APSPHQGTGN GNPSSITPGS PTTIIVPPPP HPTVAAGGGG ESIYRTIMNR LTALEANHTL YTRYIEQQNS AIRDVIKRLS EDVGRLEGNG RAQALMYQRS IQNSERQGRQ LQMDYGQLMA QIEHLSEEII LEKRLGVAQL CLLLAVLIFM GLTRGSRGEP PFMEAAAPAR INKSMREWGR RHLSFSGDWT NRFKRKNSGV DASRSRSPPR LVRPHPQTAK SHPPASLKSP LPADVRSRTK PTPSTAPLAR SRTPSFRTAT ARLRHPATPT RAVQSAIQRS NSHGAGSLWT GNVPKSAKKW ARTAHLHEVR SAGRVRERVL SAGGRENREN EDVFLSPPPP HLPAVPTGPI LLRSRMWGKG KEMDKGGVVP TSLDLDDILD QDGDSWVDTD SVDDGLEMDQ TIQVPNLT // ID B0D5N3_LACBS Unreviewed; 1012 AA. AC B0D5N3; DT 26-FEB-2008, integrated into UniProtKB/TrEMBL. DT 26-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDR10047.1}; GN ORFNames=LACBIDRAFT_317978 {ECO:0000313|EMBL:EDR10047.1}; OS Laccaria bicolor (strain S238N-H82 / ATCC MYA-4686) (Bicoloured OS deceiver) (Laccaria laccata var. bicolor). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. OX NCBI_TaxID=486041 {ECO:0000313|Proteomes:UP000001194}; RN [1] {ECO:0000313|EMBL:EDR10047.1, ECO:0000313|Proteomes:UP000001194} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=S238N-H82 / ATCC MYA-4686 {ECO:0000313|Proteomes:UP000001194}; RX PubMed=18322534; DOI=10.1038/nature06556; RA Martin F., Aerts A., Ahren D., Brun A., Danchin E.G.J., Duchaussoy F., RA Gibon J., Kohler A., Lindquist E., Pereda V., Salamov A., RA Shapiro H.J., Wuyts J., Blaudez D., Buee M., Brokstein P., RA Canbaeck B., Cohen D., Courty P.E., Coutinho P.M., Delaruelle C., RA Detter J.C., Deveau A., DiFazio S., Duplessis S., RA Fraissinet-Tachet L., Lucic E., Frey-Klett P., Fourrey C., RA Feussner I., Gay G., Grimwood J., Hoegger P.J., Jain P., Kilaru S., RA Labbe J., Lin Y.C., Legue V., Le Tacon F., Marmeisse R., Melayah D., RA Montanini B., Muratet M., Nehls U., Niculita-Hirzel H., RA Oudot-Le Secq M.P., Peter M., Quesneville H., Rajashekar B., Reich M., RA Rouhier N., Schmutz J., Yin T., Chalot M., Henrissat B., Kuees U., RA Lucas S., Van de Peer Y., Podila G.K., Polle A., Pukkila P.J., RA Richardson P.M., Rouze P., Sanders I.R., Stajich J.E., Tunlid A., RA Tuskan G., Grigoriev I.V.; RT "The genome of Laccaria bicolor provides insights into mycorrhizal RT symbiosis."; RL Nature 452:88-92(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS547098; EDR10047.1; -; Genomic_DNA. DR RefSeq; XP_001879432.1; XM_001879397.1. DR STRING; 486041.XP_001879432.1; -. DR EnsemblFungi; EDR10047; EDR10047; LACBIDRAFT_317978. DR GeneID; 6075224; -. DR KEGG; lbc:LACBIDRAFT_317978; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B0D5N3; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001194; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001194}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001194}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 483 503 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 524 545 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 659 679 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1012 AA; 109701 MW; EDBA61367FC61260 CRC64; MSFSGTPLGQ GRRLDHNTFL GKQPSGPTSP HHIIPTSYAY GAPTLASRSP TKPTSPVRDR RFHDDDDDEE NENEPALARF ARQKQREHAS INNLSSRPGG PKTFTSPPKP EKWSVKDTSV NIATAFTQAA NDMLPAHTTP NTSWASSSSS SRTVVPRSTS VEYESQVQST TSRRLAAPNS KLGSRPPNSN SNPSRKPLSK ADSSLHVPDS EDERPNANGR GKSPMEQVAG AANRALQTAT FYLTARSKEP QDTSGAANNQ NGNDSSYDYG AEESIYQASQ GKRTSNAAHK RNRISVDNKA WKPSASDFES DEEYSDDGKK GRGAKKKGPH GGPLTNLPSI GPSKGRKKKT KGTKGSKGNI AGGEHENESD GETQTDIQSQ SAQLRSSAQP SRPSIPRSIP PENYDPEDTS VDVEQGLHSI PEIDDIDLPP EDLSAQQRAR SMEPKPKRRT SRSRTPARER ARFSIGGALG SIVNLTFKGS MSIVTLFLHL LSNVLFLLGR VFGTMFDIVF NRPYLWVKSS RAKGLATFAK YVFLAATLVS AWFVLRSPTA LSYIPKLSFP SSSSTRSSPV YTAPEVPAAN IMELAERLLR IENALSGLSV DSERAKAKAD DGVRGYHDLM GRLGVLETRL GAESRKIVEA ETRARDAAGR GLSGVKQDVE VLQAQIVAQQ KLLEKEREEK EKGQGHKQGH TSDEEARVKL KALEERVGFV EGGVRDALEL GTKASSAAAA AATAGNNKAP AAAIPGTEWW NKRVKSGLQI KSSDGTDVTA LIADLVDTAT SIRSKDTIAK PDYALHSGGA RIIPSLTTPT FEIRPSTLRG QVVGLLTGNG YAIGRPPITA LHHEVNNGHC WPFAGSEGQL GVALAAPTYV EEVSVDHVAK ELAFDMRSAP REMEVWGMVE GKDNVARVKE WKEHSGGQEI LNAIGYPTTL PREPEYIRLA NFTYDVASTR GVQTFPVDED VRRLGVDFGI VVLRVLSNWG QGEFTCLYRF RVHGQRMVDG PPPMNLGEDA SP // ID B0DGD3_LACBS Unreviewed; 377 AA. AC B0DGD3; DT 26-FEB-2008, integrated into UniProtKB/TrEMBL. DT 26-FEB-2008, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDR06128.1}; GN ORFNames=LACBIDRAFT_300275 {ECO:0000313|EMBL:EDR06128.1}; OS Laccaria bicolor (strain S238N-H82 / ATCC MYA-4686) (Bicoloured OS deceiver) (Laccaria laccata var. bicolor). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. OX NCBI_TaxID=486041 {ECO:0000313|Proteomes:UP000001194}; RN [1] {ECO:0000313|EMBL:EDR06128.1, ECO:0000313|Proteomes:UP000001194} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=S238N-H82 / ATCC MYA-4686 {ECO:0000313|Proteomes:UP000001194}; RX PubMed=18322534; DOI=10.1038/nature06556; RA Martin F., Aerts A., Ahren D., Brun A., Danchin E.G.J., Duchaussoy F., RA Gibon J., Kohler A., Lindquist E., Pereda V., Salamov A., RA Shapiro H.J., Wuyts J., Blaudez D., Buee M., Brokstein P., RA Canbaeck B., Cohen D., Courty P.E., Coutinho P.M., Delaruelle C., RA Detter J.C., Deveau A., DiFazio S., Duplessis S., RA Fraissinet-Tachet L., Lucic E., Frey-Klett P., Fourrey C., RA Feussner I., Gay G., Grimwood J., Hoegger P.J., Jain P., Kilaru S., RA Labbe J., Lin Y.C., Legue V., Le Tacon F., Marmeisse R., Melayah D., RA Montanini B., Muratet M., Nehls U., Niculita-Hirzel H., RA Oudot-Le Secq M.P., Peter M., Quesneville H., Rajashekar B., Reich M., RA Rouhier N., Schmutz J., Yin T., Chalot M., Henrissat B., Kuees U., RA Lucas S., Van de Peer Y., Podila G.K., Polle A., Pukkila P.J., RA Richardson P.M., Rouze P., Sanders I.R., Stajich J.E., Tunlid A., RA Tuskan G., Grigoriev I.V.; RT "The genome of Laccaria bicolor provides insights into mycorrhizal RT symbiosis."; RL Nature 452:88-92(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS547109; EDR06128.1; -; Genomic_DNA. DR RefSeq; XP_001882989.1; XM_001882954.1. DR STRING; 486041.XP_001882989.1; -. DR EnsemblFungi; EDR06128; EDR06128; LACBIDRAFT_300275. DR GeneID; 6078675; -. DR KEGG; lbc:LACBIDRAFT_300275; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B0DGD3; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001194; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001194}; KW Reference proteome {ECO:0000313|Proteomes:UP000001194}. SQ SEQUENCE 377 AA; 42084 MW; 006A622049E07101 CRC64; MPPSPSFADY VPRDFQDPFA LECFLTNDSK IPEGFNLIGH SEPCQSGPVA VPHVKQPSSS TSHRIQKFVL VTAVIAIVSM WKIAIVTCRV SRTVCTLYSH IYFPFDLEPI CAFAHSYHQV LTLHPPRSAV FSENDLLGHP TRDEFHNLNA CVTTVEDGFH RFQDWNEMSR ARNFASKGEG AEIIQTHTSE THGIPRETIY SWFQRKVRGF DMNSVLINPP SVVMEKTFEL GECWEFTGPS GHIGISFSGP IEIANVTIHQ SHGIVSAKEH ARVPRDLVLW GQVDISANIT NMSLENRTMD DFLRTGIRGS RSGTLWRLAE MQYAQVSKAQ TFKLSQAAEN VDASFEAIIV EVLSNWGAPT TCLYHISVHG VEPAEGL // ID B0KWM3_CALJA Unreviewed; 438 AA. AC B0KWM3; DT 18-MAR-2008, integrated into UniProtKB/TrEMBL. DT 18-MAR-2008, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Sperm associated antigen 4 (Predicted) {ECO:0000313|EMBL:ABY90119.1}; GN Name=SPAG4 {ECO:0000313|EMBL:ABY90119.1}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|EMBL:ABY90119.1}; RN [1] {ECO:0000313|EMBL:ABY90119.1} RP NUCLEOTIDE SEQUENCE. RA Antonellis A., Benjamin B., Blakesley R.W., Bouffard G.G., RA Brinkley C., Brooks S., Chu G., Chub I., Coleman H., Fuksenko T., RA Gestole M., Gregory M., Guan X., Gupta J., Gurson N., Han E., Han J., RA Hansen N., Hargrove A., Hines-Harris K., Ho S.-L., Hu P., Hunter G., RA Hurle B., Idol J.R., Johnson T., Knight E., Kwong P., Lee-Lin S.-Q., RA Legaspi R., Madden M., Maduro Q.L., Maduro V.B., Margulies E.H., RA Masiello C., Maskeri B., McDowell J., Merkulov G., Montemayor C., RA Mullikin J.C., Park M., Prasad A., Ramsahoye C., Reddix-Dugue N., RA Riebow N., Schandler K., Schueler M.G., Sison C., Smith L., RA Stantripop S., Thomas J.W., Thomas P.J., Tsipouri V., Young A., RA Green E.D.; RT "NISC Comparative Sequencing Initiative."; RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DP000582; ABY90119.1; -; Genomic_DNA. DR RefSeq; XP_002747277.1; XM_002747231.2. DR STRING; 9483.ENSCJAP00000032677; -. DR GeneID; 100389411; -. DR KEGG; cjc:100389411; -. DR CTD; 6676; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000246956; -. DR HOVERGEN; HBG079205; -. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 438 AA; 47962 MW; F19AFECFA7C2F23D CRC64; MRRSSRPGSA SSPRKHTPNF FSENSSMSVT SEDSNGRRSA GPEPGEPEGR IAQGRSCGEP ALSAGVPGGT TWAGSSQQKP APRSHNAETA CGAATVRGGA SEPTGSPVVS EEPLALLPTL DLRQEMPPPR LSKSFLSLLF QVLRVSLSLA GNALVSVYRE VCSIRFLFTA VSLLSLFLAV IWLGLLYLVS PSENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQRLS EDFVRKPDYA LSSVGASIDL EKTSHDYADR NTAYFWNRFR FWNYARPPTV ILEPDVSPGN CWAFEGDQGH VVIRLPSRVQ LSDITLQHPP PSVAHTGGAD SAPRDFAVFG LQVDDETEVF LGKFTFNVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP RFTCLYRVRA HGVRTSEGAE GSATGGPH // ID B0QY60_HUMAN Unreviewed; 198 AA. AC B0QY60; DT 08-APR-2008, integrated into UniProtKB/TrEMBL. DT 08-APR-2008, sequence version 1. DT 11-NOV-2015, entry version 47. DE SubName: Full=SUN domain-containing protein 2 {ECO:0000313|Ensembl:ENSP00000390154}; DE Flags: Fragment; GN Name=SUN2 {ECO:0000313|Ensembl:ENSP00000390154}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000390154, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000390154, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=10591208; DOI=10.1038/990031; RA Dunham I., Hunt A.R., Collins J.E., Bruskiewich R., Beare D.M., RA Clamp M., Smink L.J., Ainscough R., Almeida J.P., Babbage A.K., RA Bagguley C., Bailey J., Barlow K.F., Bates K.N., Beasley O.P., RA Bird C.P., Blakey S.E., Bridgeman A.M., Buck D., Burgess J., RA Burrill W.D., Burton J., Carder C., Carter N.P., Chen Y., Clark G., RA Clegg S.M., Cobley V.E., Cole C.G., Collier R.E., Connor R., RA Conroy D., Corby N.R., Coville G.J., Cox A.V., Davis J., Dawson E., RA Dhami P.D., Dockree C., Dodsworth S.J., Durbin R.M., Ellington A.G., RA Evans K.L., Fey J.M., Fleming K., French L., Garner A.A., RA Gilbert J.G.R., Goward M.E., Grafham D.V., Griffiths M.N.D., Hall C., RA Hall R.E., Hall-Tamlyn G., Heathcott R.W., Ho S., Holmes S., RA Hunt S.E., Jones M.C., Kershaw J., Kimberley A.M., King A., RA Laird G.K., Langford C.F., Leversha M.A., Lloyd C., Lloyd D.M., RA Martyn I.D., Mashreghi-Mohammadi M., Matthews L.H., Mccann O.T., RA Mcclay J., Mclaren S., McMurray A.A., Milne S.A., Mortimore B.J., RA Odell C.N., Pavitt R., Pearce A.V., Pearson D., Phillimore B.J.C.T., RA Phillips S.H., Plumb R.W., Ramsay H., Ramsey Y., Rogers L., Ross M.T., RA Scott C.E., Sehra H.K., Skuce C.D., Smalley S., Smith M.L., RA Soderlund C., Spragon L., Steward C.A., Sulston J.E., Swann R.M., RA Vaudin M., Wall M., Wallis J.M., Whiteley M.N., Willey D.L., RA Williams L., Williams S.A., Williamson H., Wilmer T.E., Wilming L., RA Wright C.L., Hubbard T., Bentley D.R., Beck S., Rogers J., Shimizu N., RA Minoshima S., Kawasaki K., Sasaki T., Asakawa S., Kudoh J., RA Shintani A., Shibuya K., Yoshizaki Y., Aoki N., Mitsuyama S., RA Roe B.A., Chen F., Chu L., Crabtree J., Deschamps S., Do A., Do T., RA Dorman A., Fang F., Fu Y., Hu P., Hua A., Kenton S., Lai H., Lao H.I., RA Lewis J., Lewis S., Lin S.-P., Loh P., Malaj E., Nguyen T., Pan H., RA Phan S., Qi S., Qian Y., Ray L., Ren Q., Shaull S., Sloan D., Song L., RA Wang Q., Wang Y., Wang Z., White J., Willingham D., Wu H., Yao Z., RA Zhan M., Zhang G., Chissoe S., Murray J., Miller N., Minx P., RA Fulton R., Johnson D., Bemis G., Bentley D., Bradshaw H., Bourne S., RA Cordes M., Du Z., Fulton L., Goela D., Graves T., Hawkins J., RA Hinds K., Kemp K., Latreille P., Layman D., Ozersky P., Rohlfing T., RA Scheet P., Walker C., Wamsley A., Wohldmann P., Pepin K., Nelson J., RA Korf I., Bedell J.A., Hillier L.W., Mardis E., Waterston R., RA Wilson R., Emanuel B.S., Shaikh T., Kurahashi H., Saitta S., RA Budarf M.L., McDermid H.E., Johnson A., Wong A.C.C., Morrow B.E., RA Edelmann L., Kim U.J., Shizuya H., Simon M.I., Dumanski J.P., RA Peyrard M., Kedra D., Seroussi E., Fransson I., Tapia I., Bruder C.E., RA O'Brien K.P., Wilkinson P., Bodenteich A., Hartman K., Hu X., RA Khan A.S., Lane L., Tilahun Y., Wright H.; RT "The DNA sequence of human chromosome 22."; RL Nature 402:489-495(1999). RN [2] {ECO:0000213|PubMed:21269460} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21269460; DOI=10.1186/1752-0509-5-17; RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., RA Burckstummer T., Bennett K.L., Superti-Furga G., Colinge J.; RT "Initial characterization of the human central proteome."; RL BMC Syst. Biol. 5:17-17(2011). RN [3] {ECO:0000313|Ensembl:ENSP00000390154} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. RN [4] {ECO:0000213|PubMed:24275569} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., RA Wang L., Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human RT liver phosphoproteome."; RL J. Proteomics 96:253-262(2014). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000390154}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AL008583; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL021707; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL021806; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; KF457451; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; B0QY60; -. DR STRING; 9606.ENSP00000385616; -. DR PaxDb; B0QY60; -. DR Ensembl; ENST00000455125; ENSP00000390154; ENSG00000100242. DR HGNC; HGNC:14210; SUN2. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000007503; -. DR ChiTaRS; SUN2; human. DR NextBio; 35465677; -. DR Proteomes; UP000005640; Chromosome 22. DR Bgee; B0QY60; -. DR ExpressionAtlas; B0QY60; baseline and differential. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Proteomics identification {ECO:0000213|MaxQB:B0QY60, KW ECO:0000213|PeptideAtlas:B0QY60}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSP00000390154}. SQ SEQUENCE 198 AA; 21773 MW; 31F6B28FCAEF9514 CRC64; VHHIVKQALQ RYSEDRIGLA DYALESGGAS VISTRCSETY ETKTALLSLF GIPLWYHSQS PRVILQPDVH PGNCWAFQGP QGFAVVRLSA RIRPTAVTLE HVPKALSPNS TISSAPKDFA IFGFDEDLQQ EGTLLGKFTY DQDGEPIQTF HFQVRDAAVL AEVAQTPHPG SRCSLGTYPA RHGVQRGKPC SPFPQLKM // ID B0W5T7_CULQU Unreviewed; 2813 AA. AC B0W5T7; DT 08-APR-2008, integrated into UniProtKB/TrEMBL. DT 08-APR-2008, sequence version 1. DT 11-NOV-2015, entry version 53. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:EDS35740.1}; GN ORFNames=CpipJ_CPIJ002462 {ECO:0000313|EMBL:EDS35740.1}; OS Culex quinquefasciatus (Southern house mosquito) (Culex pungens). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. OX NCBI_TaxID=7176 {ECO:0000313|Proteomes:UP000002320}; RN [1] {ECO:0000313|Proteomes:UP000002320} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JHB {ECO:0000313|Proteomes:UP000002320}; RG The Broad Institute Genome Sequencing Platform; RA Atkinson P.W., Hemingway J., Christensen B.M., Higgs S., Kodira C.D., RA Hannick L.I., Megy K., O'Leary S.B., Pearson M., Haas B.J., RA Mauceli E., Wortman J.R., Lee N.H., Guigo R., Stanke M., Alvarado L., RA Amedeo P., Antoine C.H., Arensburger P., Bidwell S.L., Crawford M., RA Camaro F., Devon K., Engels R., Hammond M., Howarth C., Koehrsen M., RA Lawson D., Montgomery P., Nene V., Nusbaum C., Puiu D., RA Romero-Severson J., Severson D.W., Shumway M., Sisk P., Stolte C., RA Zeng Q., Eisenstadt E., Fraser-Liggett C.M., Strausberg R., RA Galagan J., Birren B., Collins F.H.; RT "Annotation of Culex pipiens quinquefasciatus."; RL Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS231844; EDS35740.1; -; Genomic_DNA. DR RefSeq; XP_001844071.1; XM_001844019.1. DR UniGene; Cpi.12614; -. DR SMR; B0W5T7; 1461-1529. DR STRING; 7176.CPIJ002462-PA; -. DR EnsemblMetazoa; CPIJ002462-RA; CPIJ002462-PA; CPIJ002462. DR GeneID; 6033644; -. DR KEGG; cqu:CpipJ_CPIJ002462; -. DR VectorBase; CPIJ002462; Culex quinquefasciatus. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR HOGENOM; HOG000018061; -. DR InParanoid; B0W5T7; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B0W5T7; -. DR Proteomes; UP000002320; Partially assembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002320}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:EDS35740.1}; KW Reference proteome {ECO:0000313|Proteomes:UP000002320}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1444 1464 {ECO:0000256|SAM:Coils}. FT COILED 2655 2675 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2813 AA; 305595 MW; 2AAE03E3892E7363 CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSSECTR RIVAIDGAIK AICNRLVVAD LESRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLSF IRDNGSQIHK DTLHSAMAVV SRLCTKVEPQ GANVQTCVES LSTLLQHEDP LVADGALKCF ASVADRFTRK GVDPRPLAEY GLVTELLNRL SNAAGPAGPV PAASSEASGA AASGGSQSNS QEAASTTTTT TTVQLSSSAP KTQGAAEAGR SSQSIATTIS LLSTLCRGSP SITHSLLRSK LAEAMERAFK GDERCVLDCM RLADLILLLL FEGRQALGRV GGSSGQLAPR VRRADSSTER THRQLIDCIR SKDTEALIEA IESGGIDVNC MDDVGQTLLN WASAFGTLEM VEFLCDKGAD VNKGQRSSSL HYAACFGRPG IAKVLLKHGA NPDLRDEDGK TPLDKARERP DEGHREVASI LQSPGEWMTA ATRSELLKSE DSEEGGDAEP RGDPEMAPVY LKFFLPIFCK TFQSTMLASV RRSSLGLIKK MIQYVQPEML SRLCSSEGLQ SHEQSLGTLL VEVVASVLDN EISYSWPSLP ASSPSSTPIF GPLPAPPPPP PPPSMLPITV STPPVVSILG SRLGYAERVS KYVQNLRQQN QNSNRPASTT TTTSFKTTAV DDEDGHLVVL TIIEELMSKT QNDFLDHFAR LGVFSKVQAL MGDSGTGFGS ASGAGGGEGG DNTVIKSSST SSSEEVAPKA VLAVPPTPSP PATTNTTGGD QAGPIPPPAV PVPVAAAGVT TATVEDAKEI LPGKAYHWRD WSICRGRDCL YVWSDSAALE LSNGSNGWFR FILDGKLATM YSSGSPENGS DSSENRGEFL EKLQRARGAV RQGTVSQPIL STPSLSRIVV GNWVLQSQKE HQLHINNSEG HQVTILQDEL PGFIFESNRG TKHTFTAETT LGPDFAAGWI NTKKKKMRSK AEAQKYQVKN IARDLYNRYF KAAQAVPRGA VAKLCAIVRL IESALEEQCA PKPLQQRISP TSSTWQEKLH NALNDLVQLL NEDGVISAYE MHSSGLVQAL VAVLSRNFWE LGMNRSKANK YQKQRISIFK KCMYGDSKNG KNTANILVQK LVAVLESIEK LPVYMYDSPG GSYGLQILTK RLSFRLERAA CEQTLFDRTG RNLKMEPLAT VGQLNKYLLK MVAKQWYDME RTSFLYLKKL KEAKSGPAAQ FKHQHDFDEN GIIYFIGTNG KSTEWVNPAQ YGLVTVTSSE GKQLPYGKLE DILSRDSVSV NCHTKDNKKS WFAIDLGMFI VPTAYTLRHA RGYGRSALRN WMFQMSKDGV NWVTMLTHSD DKSLAEPGST CTWPLECSAD EQQGFRHVRI HQNGRNASGQ THYLSLSGFE IYGKVVSVCE DMGKAAAKEN EAKLRKERRQ IRAQLKYITQ GARVIRGVDW HWDDQDGAHP GEGTVTGEIH NGWIDVKWDH GLRNSYRMGA EGKYDLKLAN SEGLTAPYDI NNSGSGMVPL SGAGTVSSAK KVYDKSLNVL TSRKSSSTPS LPEATENKSA SVASTEQATS VDNLAWKQAV EVIAENVLSC ARSDLANTSG GSSSNDLSTP NANNNNNLNN QEVSVIVHSL GERGNIPDLS QINTSTSTLV SDLATITENL TLSDNIKNNI GAASASSQFV SNFGTQLAAS SSSSSSEENN KANNITAYLP TKLDVLDKMR EGVDMLRNNT NNLLSSELLT QSNLLSSVKI ALPTPAPSTG AGAAAPPPPA GTIFVASTST SSTSTLPPIA DDRDVANNLR NNIVVVPSSG GVTKKVLNEP ASTPDDRDVA NNLRNNIVVV DSESPTVVAA SSSKEVVPDS PSAVVAANPM SVSVPNLTSN ESTTPSESQT PTGLLETFAA IARRRTSGSS VATPPGIVGT SSSGNGSNNA TPNNNQLSSV TSNIQANSSF FPRGPNSVTS LVKLALSSNF HSGLLSTAQS YPSLSSSSNN AAANTNNNPS SGTGANANSG SGQAAASLLN PALTMSLTST NSDSEQVSLE DFLEQCRAPT LLGDLEDDED MEDENDDDEN EDEYEEVGNT LLQVMVSRNL LSFMDEETLE NRLAAAGKRK SWDDEFVLKR QFSALIPAFD PRPGRTNVNQ TSDLDIPAPG SSTDNTAHPS SSNSEHAPLP QPSLALLLRG PNINGVNDVE IPLSQPDWTI FRAVQELILQ TNMTKQDKFR KIWQPTYTIV YREASALTGG REDFSSGEEG RATPVPVSMF SQRSGGSTLS PSSPIPGTPS TPAHCTVEDV LQLLGQLNSI NQSLASSPSN NDKNLESISN VLIPDTFMSK KITNKLQQQI QDPLVLASGS LPKWCEDFNQ SCPFLFPFET RQLYFNCTAF GASRSIVWLQ SQRDVNLERQ RAPGLSPRHA DQHEFRVGRL KHERVKVPRG ENLLEWAQQV MKHHCNRKSV LEVEFQGEEG TGLGPTLEFY ALVAAELQRS DLGMWLCDDE PKLIEDEIDL GEGSKPIGYY VRRSTGLFPA PLPQESEVCD FVASYFWFLG VFLAKVLQDG RLVDLPLSNS FLQLLCHNKS ISRDAGASSK SDDVMISSLM SEESDRDLVD KLANDGCWYD GILSQENLHE IDPIRYEFLK ELQELVQQKQ NIEQNDDLSS EEKLLQISEL KFNTKTGSVA LEDLALTFTY LPSSKNYGYQ SADLIPNGAN IDVTINNVEE YCNLTINFCL QEGISKQLAA FHRGFCEVFP LNKLAAFTPE EIRKMLCGEQ NPEWTREDLM TYTEPKLGYT KER // ID B0W8S5_CULQU Unreviewed; 1282 AA. AC B0W8S5; DT 08-APR-2008, integrated into UniProtKB/TrEMBL. DT 08-APR-2008, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDS39304.1}; GN ORFNames=CpipJ_CPIJ003500 {ECO:0000313|EMBL:EDS39304.1}; OS Culex quinquefasciatus (Southern house mosquito) (Culex pungens). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. OX NCBI_TaxID=7176 {ECO:0000313|Proteomes:UP000002320}; RN [1] {ECO:0000313|Proteomes:UP000002320} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JHB {ECO:0000313|Proteomes:UP000002320}; RG The Broad Institute Genome Sequencing Platform; RA Atkinson P.W., Hemingway J., Christensen B.M., Higgs S., Kodira C.D., RA Hannick L.I., Megy K., O'Leary S.B., Pearson M., Haas B.J., RA Mauceli E., Wortman J.R., Lee N.H., Guigo R., Stanke M., Alvarado L., RA Amedeo P., Antoine C.H., Arensburger P., Bidwell S.L., Crawford M., RA Camaro F., Devon K., Engels R., Hammond M., Howarth C., Koehrsen M., RA Lawson D., Montgomery P., Nene V., Nusbaum C., Puiu D., RA Romero-Severson J., Severson D.W., Shumway M., Sisk P., Stolte C., RA Zeng Q., Eisenstadt E., Fraser-Liggett C.M., Strausberg R., RA Galagan J., Birren B., Collins F.H.; RT "Annotation of Culex pipiens quinquefasciatus."; RL Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS231860; EDS39304.1; -; Genomic_DNA. DR RefSeq; XP_001845140.1; XM_001845088.1. DR STRING; 7176.CPIJ003500-PA; -. DR EnsemblMetazoa; CPIJ003500-RA; CPIJ003500-PA; CPIJ003500. DR GeneID; 6034870; -. DR KEGG; cqu:CpipJ_CPIJ003500; -. DR VectorBase; CPIJ003500; Culex quinquefasciatus. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000044781; -. DR InParanoid; B0W8S5; -. DR OMA; FEAFETD; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B0W8S5; -. DR Proteomes; UP000002320; Partially assembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002320}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002320}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 37 59 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 129 149 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 829 874 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1282 AA; 140195 MW; 53C8475A93405BCD CRC64; MYMVVSLYLV VWFYVEWRLS MKLLETLLDR ARKMRKLTVL LAFCWVLLVS QILAISSLYN DGSKQSGRGT CLNTPAEEID SIGNSCHLNI GRPFQVDNPH PCAAQSRIPV RHGGPLVAPT GRTRQQSTGV VMMMIFLKMF WFYVLEAVLQ DIPGTPSMIT LVDYQTSEMN IESKLEENIE ATLKNITHQL GEPAAAADEV TVEPDEKQAE VADVPAQPSE QPTLLDDKIN DSGGGSAEDR VVEDVQNITT KESNLTEENP MPVFSEWAQK QMAEAEKKLG EHVNASEKKR SAKPPGHKMP PLKLRAKNYA APDCGAKIIA SNPEAQSTGS VLTSHKDEYL LNPCTSKIWF VVELCEPVQA ERVELANFEL FSSSPKDFSV AVSNRFPTRD WSNVGKFTAK DERDVQNFNL HPHLFGKFVR VEIHSHYNSE HYCPVSLFRV YGTSEFEAFE TDNTPSLGDE GDDDEELLVT GGSTGKHVDG VDGNDRNILK SASDAVMNIV KKAAKVLGKT NENNSLSNDT EPSGGDQNET DTELSVVLHG LSQPQCRVHC FTLPYQPRCV SCSPELRDRV EQTMSCKHNL LTSLLAIDLI SASFDESQHL LCANILGFCL DESAQQSSQN INRSVINLLP ADTIAAFCNI RAYEKGLLKT VVTEKPPSAP TTAEQTTSTV TTESPTKVVQ EEPTLTTIPA AKPATKDEPT TSRPATSSET DPETPQKDNV NIFNVPEEAP PLEPPPSARS EQHLEEPLIV PPPSDVAPDD DVHSSQEDLD DSLLLDHHDG GIPMITTTTT PTPGSPAAGQ KVQPESVFLR LSNRIKALER NMSLSGQYLE ELSRRYRKQV EELQQSHAKT LHEIEEQTRR MHESEATLRE ENERLRAEFV GFRDSILSWK NITIGLVGFA VVNIVMVCAL VRSCGGGGSG ARGESERDSI DRELAQVSAS GKPIRERLLR RKSIDGVIGG SVQSVAGGTL RKKRPSEEAL NISGTYENLL IDDGGGDSAK VERKKGRNKH RKVSAPAMTQ SQLPQVNGKS KRAASVEPPA VKAVTKTELM RTESAPEPRK PSPEAVTSLD DTNRIDEIPF LEDNDEFIIP TASDLSYNEF VPDSTSEANK TGNGMLSSTS SIDSKSTKSG KGRRLSSPAF FKSSLLRSSR KSSGKKSTPS QNVSSASSNT SSNAGSSNVR ISINVHSPSA RAIAVEDDAD SCQVDGSTTT NNNNSSNGWE WYKLKKSSSQ DKFTKRKSKS ESPDADVVGN GSALKSSLSF NGGSDSKSAN GGSFRRLFRK VF // ID B0X9T7_CULQU Unreviewed; 232 AA. AC B0X9T7; DT 08-APR-2008, integrated into UniProtKB/TrEMBL. DT 08-APR-2008, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDS43289.1}; GN ORFNames=CpipJ_CPIJ016366 {ECO:0000313|EMBL:EDS43289.1}; OS Culex quinquefasciatus (Southern house mosquito) (Culex pungens). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Culicini; Culex; Culex. OX NCBI_TaxID=7176 {ECO:0000313|Proteomes:UP000002320}; RN [1] {ECO:0000313|Proteomes:UP000002320} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JHB {ECO:0000313|Proteomes:UP000002320}; RG The Broad Institute Genome Sequencing Platform; RA Atkinson P.W., Hemingway J., Christensen B.M., Higgs S., Kodira C.D., RA Hannick L.I., Megy K., O'Leary S.B., Pearson M., Haas B.J., RA Mauceli E., Wortman J.R., Lee N.H., Guigo R., Stanke M., Alvarado L., RA Amedeo P., Antoine C.H., Arensburger P., Bidwell S.L., Crawford M., RA Camaro F., Devon K., Engels R., Hammond M., Howarth C., Koehrsen M., RA Lawson D., Montgomery P., Nene V., Nusbaum C., Puiu D., RA Romero-Severson J., Severson D.W., Shumway M., Sisk P., Stolte C., RA Zeng Q., Eisenstadt E., Fraser-Liggett C.M., Strausberg R., RA Galagan J., Birren B., Collins F.H.; RT "Annotation of Culex pipiens quinquefasciatus."; RL Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS232550; EDS43289.1; -; Genomic_DNA. DR RefSeq; XP_001866409.1; XM_001866374.1. DR STRING; 7176.CPIJ016366-PA; -. DR EnsemblMetazoa; CPIJ016366-RA; CPIJ016366-PA; CPIJ016366. DR GeneID; 6049694; -. DR KEGG; cqu:CpipJ_CPIJ016366; -. DR VectorBase; CPIJ016366; Culex quinquefasciatus. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000002320; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002320}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002320}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 104 126 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 232 AA; 26427 MW; 0DC7751A0A60E79E CRC64; MSQDGARTMR YKCRNGTLQL AHHFSDFILV QLRVSFLFAE ELLLMEDLSA RAPPGSWMML RRDTGNGTKK CACKKLEKFT RKYQIVYEQI YHDVKPKKRN EISIVWIGGS IGVPIFVVIV CIVIQLNSAV VVTGFSLEHI SKLLAPNNQI DSAPKNFSVW GLATEHDPDP VQLGSYVYQD NSAALQYFPV DEPTRPELAG RAFRIVELRI ESNHGNAHYT CLYRFRVHGE RV // ID B0Y1U3_ASPFC Unreviewed; 732 AA. AC B0Y1U3; DT 08-APR-2008, integrated into UniProtKB/TrEMBL. DT 08-APR-2008, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDP52052.1}; GN ORFNames=AFUB_060860 {ECO:0000313|EMBL:EDP52052.1}; OS Neosartorya fumigata (strain CEA10 / CBS 144.89 / FGSC A1163) OS (Aspergillus fumigatus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=451804 {ECO:0000313|Proteomes:UP000001699}; RN [1] {ECO:0000313|Proteomes:UP000001699} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CEA10 / CBS 144.89 / FGSC A1163 RC {ECO:0000313|Proteomes:UP000001699}; RX PubMed=18404212; DOI=10.1371/journal.pgen.1000046; RA Fedorova N.D., Khaldi N., Joardar V.S., Maiti R., Amedeo P., RA Anderson M.J., Crabtree J., Silva J.C., Badger J.H., Albarraq A., RA Angiuoli S., Bussey H., Bowyer P., Cotty P.J., Dyer P.S., Egan A., RA Galens K., Fraser-Liggett C.M., Haas B.J., Inman J.M., Kent R., RA Lemieux S., Malavazi I., Orvis J., Roemer T., Ronning C.M., RA Sundaram J.P., Sutton G., Turner G., Venter J.C., White O.R., RA Whitty B.R., Youngman P., Wolfe K.H., Goldman G.H., Wortman J.R., RA Jiang B., Denning D.W., Nierman W.C.; RT "Genomic islands in the pathogenic filamentous fungus Aspergillus RT fumigatus."; RL PLoS Genet. 4:E1000046-E1000046(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS499597; EDP52052.1; -; Genomic_DNA. DR EnsemblFungi; CADAFUBT00006066; CADAFUBP00005945; CADAFUBG00006066. DR HOGENOM; HOG000176993; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001699; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001699}. FT COILED 433 453 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 732 AA; 81529 MW; 8CE5400CF2B1E43D CRC64; MPPKRTRRAG AAARSEASII FGHSSPSVSN QPLPDVPTQP SWAYGSPAAP VLPRRLVAKD IGLAEVAESI DQTIRDAEKR DRRNDPDEAN DTDDRPHMNT RSRRRPSAAN ASPVRRRTKR EPTPDQVQLL DALREATVSP NQRNGENETQ AERSTATPTP PIPHTLSTMS SPTSQILPDP KYPSLPIEQL YPSPLQRIGS PTRNDASLEM SQNTGIDDNE SVISWMVERD IHDDDLQRTR SARYRREPVG KNITAPPRRF SGLAFANETI VEEDEPDSRL SVSKTPQEST VESEAQSDHQ TESDQPLVPL EPQKEPPPQV EEVSSAPART IIPNFFTKDQ SFNNSTTQPS DQSFTDHARS TAADSFIPRI SVSLPWTQIL RLSGAILLTA ISLLTIYSFS DSIANIPHDI ASHFPFRNPA PSISLNISDI EALNSLNNQV MRLGAQVSSI SKELSVVKSE VKNVGGPTTI IEPVKVPKKP NFLSIGTGVL IDPRMTSPTY GEKKSRLPKW LRDRASVWGE APRPKPNPPL TALVPWDSVG DCWCSAPRNG VSQLALHLSR PIVPEEVVVE HIPKHATLNP GAAPKDMELW VQYTINKSTS GELPTDAGSA GWYKSYLNWL LSFESGVLET EYQSPMLSER FSLHDYIMSY LRPAYHNEPE SAYWNATTLG PTFYRVGKWK YDIHGQHHVQ EFSLDAIIDQ PDIRVDRVAF RVNSNWGANF TCFYRLKLYG HL // ID B0Y3F6_ASPFC Unreviewed; 842 AA. AC B0Y3F6; DT 08-APR-2008, integrated into UniProtKB/TrEMBL. DT 08-APR-2008, sequence version 1. DT 14-OCT-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDP51397.1}; GN ORFNames=AFUB_054030 {ECO:0000313|EMBL:EDP51397.1}; OS Neosartorya fumigata (strain CEA10 / CBS 144.89 / FGSC A1163) OS (Aspergillus fumigatus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=451804 {ECO:0000313|Proteomes:UP000001699}; RN [1] {ECO:0000313|Proteomes:UP000001699} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CEA10 / CBS 144.89 / FGSC A1163 RC {ECO:0000313|Proteomes:UP000001699}; RX PubMed=18404212; DOI=10.1371/journal.pgen.1000046; RA Fedorova N.D., Khaldi N., Joardar V.S., Maiti R., Amedeo P., RA Anderson M.J., Crabtree J., Silva J.C., Badger J.H., Albarraq A., RA Angiuoli S., Bussey H., Bowyer P., Cotty P.J., Dyer P.S., Egan A., RA Galens K., Fraser-Liggett C.M., Haas B.J., Inman J.M., Kent R., RA Lemieux S., Malavazi I., Orvis J., Roemer T., Ronning C.M., RA Sundaram J.P., Sutton G., Turner G., Venter J.C., White O.R., RA Whitty B.R., Youngman P., Wolfe K.H., Goldman G.H., Wortman J.R., RA Jiang B., Denning D.W., Nierman W.C.; RT "Genomic islands in the pathogenic filamentous fungus Aspergillus RT fumigatus."; RL PLoS Genet. 4:E1000046-E1000046(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS499597; EDP51397.1; -; Genomic_DNA. DR EnsemblFungi; CADAFUBT00005383; CADAFUBP00005290; CADAFUBG00005383. DR HOGENOM; HOG000172520; -. DR OrthoDB; EOG7SBNXT; -. DR PhylomeDB; B0Y3F6; -. DR Proteomes; UP000001699; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001699}; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 842 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002760407. FT TRANSMEM 689 706 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 842 AA; 92013 MW; 8A842CB8914CE73C CRC64; MMLSSRCVGA FLINLAALAL LTAGQAVIRE QSQPLCLARG WRDTEAEFIR WPVCIETRWS RSGITVAGGP SVTMSSISSN SPPTSKASHT AVQQHSGKEQ EQDTDSPLDN AKFLSFEDWK KQNLAKVGQS AENVGGNRRS GVTGNESRRR PTGISNALDS LGEDAEIELD FGGFGADAPE AARPPSFGSG VQVGKSAGSV DSKTGGDANG PSPGMIRSGS SRRKDAGTTC KERFNYASFD CAATVLKTNP ECQGSSSVLI ENKDSYMLNE CRAKNKFLIL ELCDDILVDT VVLANYEFFS SIFHTFRVSV SDRYPAKPDQ WRELGVFEAR NTREVQAFAV ENPLIWARYL KIEFLTHYGN EFYCPLSLIR VHGTTMLEEY KHDGEASRVD DEIVDETLEP DHAVTAAIAE PSENSSDLGA ENRESMRRKL QDGLQDACPN PAQGLERLLA NYLDSEICSV QARPTRTAGQ ERADAAVQHD SPSTDTTPPG PEASGPIVPG AGNGTKFAPD ARRAAGQSGA DGNPLPASMA TMSEPVQHDT TSEADQKSTA SSQEEQVPSV DSAKFSATQP PSPNPTTQES FFKSVNKRLQ MLESNSTLSL LYIEEQSRIL RDAFSKVEKR QLSKTSTFLE NLNVTVMNEL RQFREQYDQV WKTVALEFET QRIQYHQEIF SLSAQLGVLA DELVFQKRVA VIQSIMVLFC FGLVLFSRGA MSSYMEFPSV QNMVSRSYSL RSSSPPFSSP SMSPSSTRPS FTYRSRHRRN GTDDTQDSAP SPTISYSPPT PNSETSVPLE SIEKQESPPS PGDLELPDIE LPQFRSQSSP PVLKSGEDSD GEISETSGSM EV // ID B1MTM5_CALMO Unreviewed; 438 AA. AC B1MTM5; DT 29-APR-2008, integrated into UniProtKB/TrEMBL. DT 29-APR-2008, sequence version 1. DT 14-OCT-2015, entry version 8. DE SubName: Full=Sperm associated antigen 4 (Predicted) {ECO:0000313|EMBL:ACA64887.1}; GN Name=SPAG4 {ECO:0000313|EMBL:ACA64887.1}; OS Callicebus moloch (Dusky titi monkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Pitheciidae; Callicebinae; Callicebus. OX NCBI_TaxID=9523 {ECO:0000313|EMBL:ACA64887.1}; RN [1] {ECO:0000313|EMBL:ACA64887.1} RP NUCLEOTIDE SEQUENCE. RA Antonellis A., Benjamin B., Blakesley R.W., Bouffard G.G., RA Brinkley C., Brooks S., Chu G., Chub I., Coleman H., Fuksenko T., RA Gestole M., Gregory M., Guan X., Gupta J., Gurson N., Han E., Han J., RA Hansen N., Hargrove A., Hines-Harris K., Ho S.-L., Hu P., Hunter G., RA Hurle B., Idol J.R., Johnson T., Knight E., Kwong P., Lee-Lin S.-Q., RA Legaspi R., Madden M., Maduro Q.L., Maduro V.B., Margulies E.H., RA Masiello C., Maskeri B., McDowell J., Merkulov G., Montemayor C., RA Mullikin J.C., Park M., Prasad A., Ramsahoye C., Reddix-Dugue N., RA Riebow N., Schandler K., Schueler M.G., Sison C., Smith L., RA Stantripop S., Thomas J.W., Thomas P.J., Tsipouri V., Young A., RA Green E.D.; RT "NISC Comparative Sequencing Initiative."; RL Submitted (MAR-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DP000642; ACA64887.1; -; Genomic_DNA. DR HOVERGEN; HBG079205; -. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 438 AA; 48224 MW; 74931D1B9E2414F2 CRC64; MRRSSRPGSA LSPRKHTPNF FSDNSSMSVT SEDSNGRGSA GPGPGEPEGR RAQGPSCGEP ALSAGVPGGT TCAGSSRQKP APRSHNGETA YSAATVRGGT SEPTGSPVVS EEPFALLPTL DLRQEMPAPR LSKSFLSLLF QVLRVVLSLA GDALVSVYRE VCSIRFLFTA VSLLSLFLAV IWLGLLYLVS PLENEPKEML TLSEYHERVR SQEQQLQQLQ AELDKLHKEV WTVRAVNSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL EKTSHDYADR NTAYFWNRFS FWNYARPPTI ILEPDVFPGN CWAFEGDQGH VVIRLPGRVQ LSDITLQHPP PSVAHTGGAN SAPRDFAVFG LQVDDETEVF LGKFTFDVEK LEIQTFHLQS DPPAAFPKVK IQILSNWGHP RFTCLYRVRV HGVRISERAE GSATGGPH // ID B2AVM5_PODAN Unreviewed; 1006 AA. AC B2AVM5; DT 20-MAY-2008, integrated into UniProtKB/TrEMBL. DT 20-MAY-2008, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Podospora anserina S mat+ genomic DNA chromosome 7, supercontig 1 {ECO:0000313|EMBL:CAP68449.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CDP31921.1}; GN ORFNames=PODANS_7_2430 {ECO:0000313|EMBL:CAP68449.1}; OS Podospora anserina (strain S / ATCC MYA-4624 / DSM 980 / FGSC 10383) OS (Pleurage anserina). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Lasiosphaeriaceae; OC Podospora. OX NCBI_TaxID=515849 {ECO:0000313|EMBL:CAP68449.1, ECO:0000313|Proteomes:UP000001197}; RN [1] {ECO:0000313|EMBL:CAP68449.1, ECO:0000313|Proteomes:UP000001197} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=S / ATCC MYA-4624 / DSM 980 / FGSC 10383 RC {ECO:0000313|Proteomes:UP000001197}, and RC S mat+ {ECO:0000313|EMBL:CAP68449.1}; RX PubMed=18460219; DOI=10.1186/gb-2008-9-5-r77; RA Espagne E., Lespinet O., Malagnac F., Da Silva C., Jaillon O., RA Porcel B.M., Couloux A., Aury J.-M., Segurens B., Poulain J., RA Anthouard V., Grossetete S., Khalili H., Coppin E., RA Dequard-Chablat M., Picard M., Contamine V., Arnaise S., Bourdais A., RA Berteaux-Lecellier V., Gautheret D., de Vries R.P., Battaglia E., RA Coutinho P.M., Danchin E.G.J., Henrissat B., El Khoury R., RA Sainsard-Chanet A., Boivin A., Pinan-Lucarre B., Sellem C.H., RA Debuchy R., Wincker P., Weissenbach J., Silar P.; RT "The genome sequence of the model ascomycete fungus Podospora RT anserina."; RL Genome Biol. 9:R77.1-R77.22(2008). RN [2] {ECO:0000313|EMBL:CAP68449.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=S mat+ {ECO:0000313|EMBL:CAP68449.1}; RA Genoscope - CEA; RL Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EMBL:CDP31921.1} RP NUCLEOTIDE SEQUENCE. RA Genoscope - CEA; RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases. RN [4] {ECO:0000313|EMBL:CDP31921.1} RP NUCLEOTIDE SEQUENCE. RA Grognet P., Bidard F., Kuchly C., Chan Ho Tong L., Coppin E., RA Ait Benkhali J., Couloux A., Wincker P., Debuchy R., Silar P.; RT "Maintaining two mating types: Structure of the mating type locus and RT its role in heterokaryosis in Podospora anserina."; RL Submitted (SEP-2014) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CU633900; CAP68449.1; -; Genomic_DNA. DR EMBL; FO904942; CDP31921.1; -; Genomic_DNA. DR RefSeq; XP_001907776.1; XM_001907741.1. DR STRING; 515849.XP_001907776.1; -. DR EnsemblFungi; CAP68449; CAP68449; PODANS_7_2430. DR GeneID; 6192227; -. DR KEGG; pan:PODANSg4811; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000001197; Chromosome 7. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001197}; KW Reference proteome {ECO:0000313|Proteomes:UP000001197}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 28 {ECO:0000256|SAM:SignalP}. FT CHAIN 29 1006 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002773992. SQ SEQUENCE 1006 AA; 109701 MW; 808CE2AF2A77133B CRC64; MRSSPTPWRT SPALGLVLLG LYATAVVSSG SISETPAVTA PSVTTTAIVT TPTAPATAAL SVTSPGVCEF KTINYITHTL PQQCLRSAWT SPRPAATAAV ESTAAETVTV SITSVVDNAT QQEQAAGTEQ KEEEESVAFM SFEEWKEMML RKSGQDPANI KKAQKQHHGD HHKPDREPGL NNDNMDSLGN DGEILDFDAL TEKVTEITSS SSGDAVAEAS KEVQEEQILY DDNKTPYYRS KDAGKTCKER FSFSSFDAGA TILKTSPGAK NAKAILVENK DTYMLLECHR KNKFVIVELS DDILVDTVVL ANFEFFSSMI RKFRVSASDR YPVKLDKWVD LGTFEARNAR DIQPFLVEHP QIYTKYIRIE FLSHYGNEYY CPVSLLRVHG TRMLDSWKEP REDDEPEQIE GSSPQEVVPE IQEPPSEPTS VMESEQANDT KEASPDTIAI DTGSSPWQPY DSHFVLETCA MRSTTTSDPT PASGPDGAEK HSNSTSKADA GAEKAVPADK AKETPKAEKI FPSPVTGDTA TGQGQPNTAP PASNPPPQPD SGDNAHTGNH QDNQKKPPNR ASDTSPDTTP SQGSAKTPGK EGEKPSNATR SKTTPTSGHP SSSPTVQESF FKQVNKRLQH LESNTSLSLQ YIEEQSRFLQ EVLRTMERKQ LTRVDSFLNT LNQTVFSELR HVRTQYDQMW QSTVIALETQ REQSDRQIVA LTTRLNVLAD EVVFQKRMAI FQSVLILSCL VLVIFTNRGG SDSSFLPPSL SRDPSSAAAA YYRRYAAGFM SNGARSETAS PPPISPVPGS SHFDSIHHRM NLSPSPPSAS SSSAATLAAS ALPRQIYSPT GIHKRPVPAH REKSLPMIPP LTPESSREGT PAIHISNHGS PPDDQDELGS NNNNKLQPLL EGIREESPSP SPSPSPSPGE TATQRRRRQQ QLHQPSTSSL LSLSSTDYHE VSSIETNGED GGKERTPSSP EVEGQESEDE EDDQENVSQD TRGARKPLPA LPDGPT // ID B2G466_ZYGRO Unreviewed; 596 AA. AC B2G466; DT 10-JUN-2008, integrated into UniProtKB/TrEMBL. DT 10-JUN-2008, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein YOR154W {ECO:0000313|EMBL:CAQ43375.1}; GN Name=Zr_YOR154W {ECO:0000313|EMBL:CAQ43375.1}; GN ORFNames=Zrou_1p83 {ECO:0000313|EMBL:CAQ43375.1}; OS Zygosaccharomyces rouxii (Candida mogii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Zygosaccharomyces. OX NCBI_TaxID=4956 {ECO:0000313|EMBL:CAQ43375.1}; RN [1] {ECO:0000313|EMBL:CAQ43375.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=CBS 732 {ECO:0000313|EMBL:CAQ43375.1}; RA Gordon J.L., Wolfe K.H.; RT "Zygosaccharomyces rouxii homologs of Saccharomyces cerevisiae RT chromosome III."; RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM989980; CAQ43375.1; -; Genomic_DNA. DR STRING; 4956.XP_002497932.1; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 596 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002776146. FT TRANSMEM 549 566 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 134 154 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 596 AA; 67999 MW; 12A3E9B4FCC1C38D CRC64; MWLLWNFLVI LAYSGILICC EDNYSASSSV IVNETQSSGS SCQSPKSLNE DFSTLSSKIG WETPSTSKNV TLSSSSRGGS KSFSSNNDRT SMLMQQQAVK SFGPNAADLK QGTDTSNQTF LSFNEWRLAK INQEVIQEQQ RSKAKEQMES LESEPLGDDM EIELSVFSTT DAIKDKQESE PEGKVYNHKF NFASLDCAAT IVKTNSEASG ATSILTENKD KYLLNPCSAP NKFIIIELCQ DILVEEVALA NFEFFSSTFS RIRLSVSDLY PVAKNGWRVL GEFDAENSRN LQSFPIQNPQ IWARYLRIEI LTHHDKEFYC PVSLVRVHGK TMMDEFKMEN TQELPSNQEN SQEVEEPEDD TSEQCINEII EKCNSWPSID PDNITYLPDL PETFSNCQSK LVPLKFEEFL KELNRSHCLP KNKNNSSTFS PSPAFSTEES IFKNIMKRLT TLESNANLTV LYIEEQSKLL AESFEQMERT QFFNFDNLVS IFNQTIMENL NVLRVFANQL KDQSIRILEE QKLNNDQFTT QNTIKLANLE KELRIQQRFA YTITTGLIAV MVYFIFHRES YLDNHKKSIS TDTQAVEENK EIVDSI // ID B2KID0_RHIFE Unreviewed; 440 AA. AC B2KID0; DT 10-JUN-2008, integrated into UniProtKB/TrEMBL. DT 10-JUN-2008, sequence version 1. DT 14-OCT-2015, entry version 10. DE SubName: Full=Sperm-associated antigen 4 protein (Predicted) {ECO:0000313|EMBL:ACC68957.1}; GN Name=SPAG4 {ECO:0000313|EMBL:ACC68957.1}; OS Rhinolophus ferrumequinum (Greater horseshoe bat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; OC Rhinolophidae; Rhinolophinae; Rhinolophus. OX NCBI_TaxID=59479 {ECO:0000313|EMBL:ACC68957.1}; RN [1] {ECO:0000313|EMBL:ACC68957.1} RP NUCLEOTIDE SEQUENCE. RA Antonellis A., Benjamin B., Blakesley R.W., Bouffard G.G., RA Brinkley C., Brooks S., Chu G., Chub I., Coleman H., Fuksenko T., RA Gestole M., Gregory M., Guan X., Gupta J., Gurson N., Han E., Han J., RA Hansen N., Hargrove A., Hines-Harris K., Ho S.-L., Hu P., Hunter G., RA Hurle B., Idol J.R., Johnson T., Knight E., Kwong P., Lee-Lin S.-Q., RA Legaspi R., Madden M., Maduro Q.L., Maduro V.B., Margulies E.H., RA Masiello C., Maskeri B., McDowell J., Merkulov G., Montemayor C., RA Mullikin J.C., Park M., Prasad A., Ramsahoye C., Reddix-Dugue N., RA Riebow N., Schandler K., Schueler M.G., Sison C., Smith L., RA Stantripop S., Thomas J.W., Thomas P.J., Tsipouri V., Young A., RA Green E.D.; RT "NISC Comparative Sequencing Initiative."; RL Submitted (APR-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DP000720; ACC68957.1; -; Genomic_DNA. DR HOVERGEN; HBG079205; -. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 137 160 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 166 191 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 204 238 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 440 AA; 47925 MW; F2B5FAA7F8483A4D CRC64; MRRSPRPGSV ASPHKHTPNF YSGNSNSSGS ATSGDSSGHR SAGPGPGEPE GGRAQGSSCG EPALSSGVPG GTARAGSSRQ KPAPRSHNGR TACGAATVRG GASEPAGSPV VSEEQFDLLS TLDRRQEMPP LRVSKSFLSL LFQVLSALLS RLGDVLVIVY REVCSIRFLL TAVSLLSLFV TALWWGFLCL VPPLENEPKE MLTVSEYHER VRSQGQQLQQ LQAELNKLHR EVSSVRVANS ERVAKLVFQR LNEDFVQKPD YALSSVGASI DLEKTSHDYQ DANTAYFWNR FSFWNYARPP TVILEPDVFP GNCWAFEGDQ GQVVIRLPGR VQLSDITLQH PPASVAHTGG ANSAPRDFVV YGLQVDDKTE VFLGKFTFDV EKSEIQTFHL QNDPPTAFPK VKIQILSNWG HPRFTCLYRV RAHGMRTSEG AGDSATGGPH // ID B2RRF3_MOUSE Unreviewed; 1254 AA. AC B2RRF3; DT 01-JUL-2008, integrated into UniProtKB/TrEMBL. DT 01-JUL-2008, sequence version 1. DT 11-NOV-2015, entry version 50. DE SubName: Full=AI848100 protein {ECO:0000313|EMBL:AAI38380.1}; DE SubName: Full=MCG22687 {ECO:0000313|EMBL:EDL39309.1}; GN Name=Suco {ECO:0000313|MGI:MGI:2138346}; GN Synonyms=AI848100 {ECO:0000313|EMBL:AAI38380.1, GN ECO:0000313|MGI:MGI:2138346}; GN ORFNames=mCG_22687 {ECO:0000313|EMBL:EDL39309.1}; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090 {ECO:0000313|EMBL:AAI38380.1}; RN [1] {ECO:0000313|EMBL:EDL39309.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=Mixed {ECO:0000313|EMBL:EDL39309.1}; RX PubMed=12040188; DOI=10.1126/science.1069193; RA Mural R.J., Adams M.D., Myers E.W., Smith H.O., Miklos G.L., Wides R., RA Halpern A., Li P.W., Sutton G.G., Nadeau J., Salzberg S.L., Holt R.A., RA Kodira C.D., Lu F., Chen L., Deng Z., Evangelista C.C., Gan W., RA Heiman T.J., Li J., Li Z., Merkulov G.V., Milshina N.V., Naik A.K., RA Qi R., Shue B.C., Wang A., Wang J., Wang X., Yan X., Ye J., RA Yooseph S., Zhao Q., Zheng L., Zhu S.C., Biddick K., Bolanos R., RA Delcher A.L., Dew I.M., Fasulo D., Flanigan M.J., Huson D.H., RA Kravitz S.A., Miller J.R., Mobarry C.M., Reinert K., Remington K.A., RA Zhang Q., Zheng X.H., Nusskern D.R., Lai Z., Lei Y., Zhong W., Yao A., RA Guan P., Ji R.R., Gu Z., Wang Z.Y., Zhong F., Xiao C., Chiang C.C., RA Yandell M., Wortman J.R., Amanatides P.G., Hladun S.L., Pratts E.C., RA Johnson J.E., Dodson K.L., Woodford K.J., Evans C.A., Gropman B., RA Rusch D.B., Venter E., Wang M., Smith T.J., Houck J.T., Tompkins D.E., RA Haynes C., Jacob D., Chin S.H., Allen D.R., Dahlke C.E., Sanders R., RA Li K., Liu X., Levitsky A.A., Majoros W.H., Chen Q., Xia A.C., RA Lopez J.R., Donnelly M.T., Newman M.H., Glodek A., Kraft C.L., RA Nodell M., Ali F., An H.J., Baldwin-Pitts D., Beeson K.Y., Cai S., RA Carnes M., Carver A., Caulk P.M., Center A., Chen Y.H., Cheng M.L., RA Coyne M.D., Crowder M., Danaher S., Davenport L.B., Desilets R., RA Dietz S.M., Doup L., Dullaghan P., Ferriera S., Fosler C.R., RA Gire H.C., Gluecksmann A., Gocayne J.D., Gray J., Hart B., Haynes J., RA Hoover J., Howland T., Ibegwam C., Jalali M., Johns D., Kline L., RA Ma D.S., MacCawley S., Magoon A., Mann F., May D., McIntosh T.C., RA Mehta S., Moy L., Moy M.C., Murphy B.J., Murphy S.D., Nelson K.A., RA Nuri Z., Parker K.A., Prudhomme A.C., Puri V.N., Qureshi H., RA Raley J.C., Reardon M.S., Regier M.A., Rogers Y.H., Romblad D.L., RA Schutz J., Scott J.L., Scott R., Sitter C.D., Smallwood M., RA Sprague A.C., Stewart E., Strong R.V., Suh E., Sylvester K., RA Thomas R., Tint N.N., Tsonis C., Wang G., Wang G., Williams M.S., RA Williams S.M., Windsor S.M., Wolfe K., Wu M.M., Zaveri J., RA Chaturvedi K., Gabrielian A.E., Ke Z., Sun J., Subramanian G., RA Venter J.C., Pfannkoch C.M., Barnstead M., Stephenson L.D.; RT "A comparison of whole-genome shotgun-derived mouse chromosome 16 and RT the human genome."; RL Science 296:1661-1671(2002). RN [2] {ECO:0000313|EMBL:AAI38380.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Brain {ECO:0000313|EMBL:AAI38380.1}; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RA Gerhard D.S., Wagner L., Feingold E.A., Shenmen C.M., Grouse L.H., RA Schuler G., Klein S.L., Old S., Rasooly R., Good P., Guyer M., RA Peck A.M., Derge J.G., Lipman D., Collins F.S., Jang W., Sherry S., RA Feolo M., Misquitta L., Lee E., Rotmistrovsky K., Greenhut S.F., RA Schaefer C.F., Buetow K., Bonner T.I., Haussler D., Kent J., RA Kiekhaus M., Furey T., Brent M., Prange C., Schreiber K., Shapiro N., RA Bhat N.K., Hopkins R.F., Hsie F., Driscoll T., Soares M.B., RA Casavant T.L., Scheetz T.E., Brown-stein M.J., Usdin T.B., RA Toshiyuki S., Carninci P., Piao Y., Dudekula D.B., Ko M.S., RA Kawakami K., Suzuki Y., Sugano S., Gruber C.E., Smith M.R., RA Simmons B., Moore T., Waterman R., Johnson S.L., Ruan Y., Wei C.L., RA Mathavan S., Gunaratne P.H., Wu J., Garcia A.M., Hulyk S.W., Fuh E., RA Yuan Y., Sneed A., Kowis C., Hodgson A., Muzny D.M., McPherson J., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madari A., Young A.C., Wetherby K.D., RA Granite S.J., Kwong P.N., Brinkley C.P., Pearson R.L., Bouffard G.G., RA Blakesly R.W., Green E.D., Dickson M.C., Rodriguez A.C., Grimwood J., RA Schmutz J., Myers R.M., Butterfield Y.S., Griffith M., Griffith O.L., RA Krzywinski M.I., Liao N., Morin R., Morrin R., Palmquist D., RA Petrescu A.S., Skalska U., Smailus D.E., Stott J.M., Schnerch A., RA Schein J.E., Jones S.J., Holt R.A., Baross A., Marra M.A., Clifton S., RA Makowski K.A., Bosak S., Malek J.; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [3] {ECO:0000313|EMBL:EDL39309.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=Mixed {ECO:0000313|EMBL:EDL39309.1}; RA Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC138379; AAI38380.1; -; mRNA. DR EMBL; CH466520; EDL39309.1; -; Genomic_DNA. DR RefSeq; NP_766233.2; NM_172645.2. DR UniGene; Mm.170002; -. DR STRING; 10090.ENSMUSP00000044815; -. DR GeneID; 226551; -. DR KEGG; mmu:226551; -. DR UCSC; uc007dft.2; mouse. DR CTD; 51430; -. DR MGI; MGI:2138346; Suco. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000070169; -. DR HOVERGEN; HBG107549; -. DR OMA; SSPWFES; -. DR NextBio; 378226; -. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1254 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5005336362. FT COILED 936 956 {ECO:0000256|SAM:Coils}. FT COILED 986 1006 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1254 AA; 139527 MW; 52330ED0DCE0F0F7 CRC64; MKKYRRALAL VSCLSLCSLV WLPSWHVCCK ESSSASTSYY SQDDNCAIGS EDTQFQKKNE REEPSNAELS GKSNSYLTIS PEGNKIKDDY TVDVQDLETT KLSLPVVEAL PTVDLHEESS SVVVGSETIE NSSSSSTSER TPVSELDEVE KSGTLSIAKP GEVEQPEADC DAGEAPDADA PVEQPAFVSP PESLVGQHIE NVSSSHGKEK VTKSEFESKV SVSEQDGGDP KSALNTSDTL KNESSDYTKP GETDPTSVTS PKDPEDIPTF DEWKKKVMEV EKEKSLSTGQ SLHPSSNGGP HATKKVQKNR NNYASVECGA KILAANPEAK STSAILIENM DLYMLNPCST KIWFVIELCE PIQVKQFDIA NYELFSSTPK DFLVSISDRY PTNKWIKLGT FHGRDERNVQ SFPLDEQMYA KYVKMFIKYI KVELLSHFGS EHFCPLSLIR VFGTSMVEEY EEIADSQYQS ERQELFDEDY DYPLDYNTVE DKSSKNLLGS ATNAILNMVN IAANILGAKT EDLTEGNKSI SENATATTEP KMTESTRVST PVPSPEYVIK EVHTHDREPS TSDPPKESPI VQLVQEEEEE ASPSTVTLLG SGEQEDESSS WFESETHILC SELTSICCIS SFSEYIYKWC SVRIALYRQR SRTVSKGKDF VPPQPSLLLP VESVEVSVPQ PPSGDVDSEN MEREAETVDL DDLSSVHQGH LINHTVDTIE LEPSYPQTLS QSLLLDVTPE MNSLSKVEGS ESVKSEGGYI PSQLMTQESS VEFDDKTEKK TESFSSAEKL SVIYETSKVN EVMDNTVKED ILSTEVVTKF PETVVPPPMN TATVPEGESV ETKPSIADTL KHTVTPVMDP SLPEVKEDEQ SPEDALLRGL QRTATDFYAE LQNSTDLGYG NGNLVHGSNQ KESVFMRLNN RIKALEVNMS LSGRYLEELS QRYRKQMEEM QKAFNKTIVK LQNTSRIAEE QDQRQTEAIH LLQAQLTNMT QLVSNLSATV AELKREVSDR QSYLVMSLVL CVVLGLMLCM QRCRTTSQFD GDYISKLPKS NQYPSPKRCF SSYDDMNLKR RTSFPLIRSK SLQFTGKEVD PNDLYIVEPL KFSPEKKKKR CKYKTEKIET IKPADPLHPI ANGDIKGRKP FTNQRDFSNM GEVYHSSYKG PPSEGSSETS SQSEESYFCG ISACTSLCNG QTQKTKTEKR ALKRRRSKVQ DQGKLIKALI QTKSGSLPSL HDIIKGNKEI TVGAFGVTAV SGHI // ID B2VS46_PYRTR Unreviewed; 973 AA. AC B2VS46; DT 01-JUL-2008, integrated into UniProtKB/TrEMBL. DT 01-JUL-2008, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDU40059.1}; GN ORFNames=PTRG_00621 {ECO:0000313|EMBL:EDU40059.1}; OS Pyrenophora tritici-repentis (strain Pt-1C-BFP) (Wheat tan spot OS fungus) (Drechslera tritici-repentis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Pleosporaceae; Pyrenophora. OX NCBI_TaxID=426418 {ECO:0000313|Proteomes:UP000001471}; RN [1] {ECO:0000313|Proteomes:UP000001471} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Pt-1C-BFP {ECO:0000313|Proteomes:UP000001471}; RX PubMed=23316438; DOI=10.1534/g3.112.004044; RA Manning V.A., Pandelova I., Dhillon B., Wilhelm L.J., Goodwin S.B., RA Berlin A.M., Figueroa M., Freitag M., Hane J.K., Henrissat B., RA Holman W.H., Kodira C.D., Martin J., Oliver R.P., Robbertse B., RA Schackwitz W., Schwartz D.C., Spatafora J.W., Turgeon B.G., RA Yandava C., Young S., Zhou S., Zeng Q., Grigoriev I.V., Ma L.-J., RA Ciuffetti L.M.; RT "Comparative genomics of a plant-pathogenic fungus, Pyrenophora RT tritici-repentis, reveals transduplication and the impact of repeat RT elements on pathogenicity and population divergence."; RL G3 (Bethesda) 3:41-63(2013). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS231615; EDU40059.1; -; Genomic_DNA. DR RefSeq; XP_001930954.1; XM_001930919.1. DR STRING; 426418.XP_001930954.1; -. DR EnsemblFungi; EDU40059; EDU40059; PTRG_00621. DR GeneID; 6340201; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B2VS46; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001471; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001471}; KW Reference proteome {ECO:0000313|Proteomes:UP000001471}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 973 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002784226. SQ SEQUENCE 973 AA; 105948 MW; 60D5C144D2DE0B16 CRC64; MITTGAPLRN WTILLLLCSL PTAVLAEVAN GTATDESSAT AASGATTQHT STPTPLTSSI RRYTDSETTS PYRTINYITH TLRQQCAKAT WSAPHEAAST NGTIVERGIV RLQTPIPIRE GPEGTIKEEE QPASPGTVSE PGATSSGTPS EEPELELETD SPFDNANFLS FEEWKKKNLA EVGQSPENVG QGRAAAANQP ARRRPVNVNA LDSLGDEGEI SIDFSGFGSP EDGSVANSIQ QGRHSAGATK APEGEGKVAP SAWSLSKDAG KTCKERFNYA SFDCAATVLK TNKQAKSSSS ILVENKDSYM LNTCSSDNKF LIVELCDDIL VDTVVLANYE FFSSMFRHFR VSVSDRYPVK MEKWRTLGTF EARNSRDIQP FLITEPQIWA RYLRIEFLTQ YGNEYYCPLS LLRVHGTTMM EQFRREEEGA RGIDDDDDLE AEGVDVKKPA EDSGPLPPEE IPIEAIKGSS FDSGGSAVAQ PVGHQATSQD TAVKSAPTIE PSSSSTSTAA MEASAGKVTD TPQTRSISDS PSESLPSPAT GSSVGRDTNI TAARETQSSD RHGPMSGGVD KPQVHSPMSE ASSTQSPSVS RDDGSPVSLT NTAVSSSSNS AAKASTNNTV VSQQTQSPGR GSATQPNAPT PSTQESFFKS IHKRLQYLEA NSTLSLQYIE EQSRALRDAF VKVEKRQLAK TEKFLDHLNS TVMLELKSFR TMYDQLWQST VIELESMKER QKSEMGEIGT RLSLMADELV WQKRMAVVQS TLLLLCLGLV LFVRSGTLGS NADVPIVQQL GSKYTSFFES SPPRSPPESG MARRRRTFKS MWRSESEQDG QQAPSDTETE GLRSPGQTTY DPLTPDTPSN RHGRDFSPEI NSVKAAPHPT QENMPPTPSF EDQAVRIQVL ETQSGPATPN GTRDSRPSWE EVDRAMDLLK AEEQSQSPPR PKARDRGKKQ KRSPLRRAQS NHESATDEEP PPP // ID B2WGA3_PYRTR Unreviewed; 812 AA. AC B2WGA3; DT 01-JUL-2008, integrated into UniProtKB/TrEMBL. DT 01-JUL-2008, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Spindle pole body-associated protein sad1 {ECO:0000313|EMBL:EDU42010.1}; GN ORFNames=PTRG_08959 {ECO:0000313|EMBL:EDU42010.1}; OS Pyrenophora tritici-repentis (strain Pt-1C-BFP) (Wheat tan spot OS fungus) (Drechslera tritici-repentis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Pleosporaceae; Pyrenophora. OX NCBI_TaxID=426418 {ECO:0000313|Proteomes:UP000001471}; RN [1] {ECO:0000313|Proteomes:UP000001471} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Pt-1C-BFP {ECO:0000313|Proteomes:UP000001471}; RX PubMed=23316438; DOI=10.1534/g3.112.004044; RA Manning V.A., Pandelova I., Dhillon B., Wilhelm L.J., Goodwin S.B., RA Berlin A.M., Figueroa M., Freitag M., Hane J.K., Henrissat B., RA Holman W.H., Kodira C.D., Martin J., Oliver R.P., Robbertse B., RA Schackwitz W., Schwartz D.C., Spatafora J.W., Turgeon B.G., RA Yandava C., Young S., Zhou S., Zeng Q., Grigoriev I.V., Ma L.-J., RA Ciuffetti L.M.; RT "Comparative genomics of a plant-pathogenic fungus, Pyrenophora RT tritici-repentis, reveals transduplication and the impact of repeat RT elements on pathogenicity and population divergence."; RL G3 (Bethesda) 3:41-63(2013). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS231624; EDU42010.1; -; Genomic_DNA. DR RefSeq; XP_001939291.1; XM_001939256.1. DR EnsemblFungi; EDU42010; EDU42010; PTRG_08959. DR GeneID; 6347245; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR InParanoid; B2WGA3; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001471; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001471}; KW Reference proteome {ECO:0000313|Proteomes:UP000001471}. SQ SEQUENCE 812 AA; 89784 MW; DE2DC0D5DED0B90A CRC64; MSSRVNADEP TPYRRSSRLS ARASSVAAES AITNATSSGV KRSKTTLTKV SARRSNAYGA SGRVGNPDKL TAGPTTGFAQ AFQNQRGQST DREDDDSEEE EAKGNDTDEL AGRPQTAFMK QAHHAGQFAP SSKSKAAPGY SFIDSDDLTP SEDELAASSV GNTTKSFGPS HETGMLVSQD PFAGFQIPDE SPFTKPVGPI RRPASRTING SRTQAPTPVQ AQIPAPAKTS IQSKTPTQVK TPTQAQIPIK SFVRSQAHVK PAPAQVPARV TPTGLEQSID EVVAEEQARL QRDGPPSSQP QSQSQSLRQP SRRRPHHKGV AELNAWIGDV EASDDEEDEP VWPWKKLSTW AFWGLALSLL LAWALSSMMA TEHAESSLRT PSLVKAVGDR VVYTYDQVAA YISPPTGPSE MDQEIDRVKA YKANGEDHFL WARMSNMDTK NDRRISELRT ALLELKDQLP DMMLMRREQD GSLRISDEFW HALLSKARSS ESDSEWARFL ADSKGKLRDL FDPSVHHERG NTETWAEAVT RDEFVRHMEK QYHNITSRVD KKVEEAIRAQ SAQIKTTMQA EAKKMMMDQI HLHALAQANL VANYESHLTK PNYFSPGLGA IIDPDMSSTT FYDRPGRLAE VARRLSWLPR RNPPVAALTK WEEPGDCWCS AGQSQGSTGQ AQLAVKLARP VIPKQVTIEH IPMSMVPARN ISNAPRDIEL WVQTDTPINA YYSHRQVSCK DPPPESVSPA ISWKCLGSFK YNIHASNHLQ TFDLAGEPSE PIRNTILRVT SNWGASHTCL YQVRLHGTDA DSDYEYPVGL MD // ID B3LA24_PLAKH Unreviewed; 1705 AA. AC B3LA24; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAQ41747.1}; GN ORFNames=PKH_126700 {ECO:0000313|EMBL:CAQ41747.1}; OS Plasmodium knowlesi (strain H). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Plasmodium). OX NCBI_TaxID=5851 {ECO:0000313|EMBL:CAQ41747.1, ECO:0000313|Proteomes:UP000000622}; RN [1] {ECO:0000313|EMBL:CAQ41747.1, ECO:0000313|Proteomes:UP000031513} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H {ECO:0000313|EMBL:CAQ41747.1, RC ECO:0000313|Proteomes:UP000031513}; RA Pain A., Boehme U., Berry A.E., Mungall K., Finn R., Jackson A.P., RA Mourier T., Mistry J., Pasini E.M., Aslett M., Balasubrammaniam S., RA Borgwardt K., Brooks K., Carret C., Carver T.J., Cherevach I., RA Chillingworth T., Clarke T.G., Galinski M.R., Hall N., Harper D., RA Harris D., Hauser H., Ivens A., Janssen C.S., Keane T., Larke N., RA Lapp S., Marti M., Moule S., Meyer I.M., Ormond D., Peters N., RA Sanders M., Sanders S., Sergeant T.J., Simmonds M., Smith F., RA Squares R., Thurston S., Tivey A.R., Walker D., White B., RA Zuiderwijk E., Churcher C., Quail M.A., Cowman A.F., Turner C.M.R., RA Rajandream M.A., Kocken C.H.M., Thomas A.W., Newbold C.I., RA Barrell B.G., Berriman M.; RT "The genome of Plasmodium knowlesi strain H, a zoonotic malaria RT parasite with host range from monkey to man."; RL Nature 455:799-803(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM910994; CAQ41747.1; -; Genomic_DNA. DR RefSeq; XP_002260480.1; XM_002260444.1. DR EnsemblProtists; CAQ41747; CAQ41747; EBG00001283881. DR EnsemblProtists; PKH_126700; PKH_126700; PKH_126700. DR GeneID; 7322083; -. DR KEGG; pkn:PKH_126700; -. DR EuPathDB; PlasmoDB:PKH_126700; -. DR HOGENOM; HOG000282163; -. DR InParanoid; B3LA24; -. DR Proteomes; UP000031513; Chromosome 12. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000031513}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000031513}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 1705 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002791032. FT TRANSMEM 1663 1686 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 376 396 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1705 AA; 199856 MW; DB87860D675BE99A CRC64; MIWWFLISIN FLFFLIKSFF TPKDTTYMND INNNQDSAST EKYILGENYN LTSLKLKIDF GSLDTGTKII EHSKGIINIR SIQQYDYDSY MLTPCNSDIW WIYSFSDIIH IEKIGLVSLE HYASNFKVIE ILGTDVYPTK KWKKLGKIAT NFTKSFELFN IYDYCKNYDE DNCWVKYMKF VVLSHHNLEN NYYCTLTHLQ IFASSGVDML SDKIYSDDNI TEIEKDFHDN YKHEKNEKIR DHEIIENLEV LLEDKERMQG GGTAAGPSDR IYGQRHVGNN HEEDALRSTE SITIEQSPSG ATLPTLSNFT PFEGSFLDPE TIEQELIDTD LMGSKLMDTE LIKKELMDTE LIERELMNAE LVERDSKSGN FNEPSLENLT EQIGNAFEEK RDHERNTIRE VQVNTSQHKE EEKDKVKEKE RVIVSSSLST EQNVFRQRRY NQGTSSHSLV IMNKAVKKKY AVLQNFNKLR ISKLLKKYPF VNYKNVIKKY MPYLAYLTRS KDELSYHHDS LKDAIQMDRF INPLDEAPLA KSTVGKYVPT SRRFLSPFYL LLRGTFPKLY SRVDISRPYT SYGVTTPHTF DKKTIPASPS GTSQKKSYNM NCYNYGAIKN VQNSILQNRR IYSLLTICKA RLLCHHDMVC LKKHARDISK LLTFRRNLIK STFRKYLIFI SGKRLPPRRR KKYKILKKKK RRKIRKKSAM FRKLFLLKQF NKIGYVKKLH VSYSPIFHFP TCTVSRYLPR KEEDPLMNYI WEELQRQNRI KEQPQDGLIM IYLFNEVRKA QTTRNLGSLK WCFDFMKKWK NTFNSVLLMY FIRCIPLSSL RNRGSCVDII MNGLILKKKN KWTLEKELQN LLFGHSDRSR TCKRHDRLGK TDLSYYAYRE EGQEEEDDVY NEEVDVEEDA EDDTDNGAYY IDEEQPHSKN FLKNDQVNVE RPLLNAESPP TFNTFTLYLH MLRCRNVHSR YHIMVLMRNP RDMKLMAKLM TDIWSNVSSR NMGGDLKKVE RYIRGVILNH VANYTTNNLS SKKLPTVKYK LVYTISPWMG KATLMDLSYR VKNRRRLLRA RQDILNVQEK KEENIQYQRK VQCEAITELF VNLITGRLSS WRTSLERENK GVKENEQSAR TGGGSNGYLK EESKRTYQEK YIQRSQRGHL DEANMCLTLR EMENMIYSDR YLKEYVDEAG IHKEMQKKDK MKNIINAKYE RKNNDKSIKI LNEIKETEES NSKLVNEVYD IISEYDDNNE SKIYSVQLKS GKQIPLIINK PENKMKDKVI KSEAKKKYIK DNKLQYLDEF DHVLNDHIKY DHHEQNCIRE EKIEEKAKNT RGHALLTLVD KVKTIENKNN YVISKLKDVI KITNNKTKII YHMLSNFKIL QNTISLLLKY IMINEKNMKN LHKNRNKSES FFKIFKDICI LQINDKKQHF DSLQYICNYL QDLLYDEVEK IYLFEKSTGA RGTAAAGVGT GPSGGRRTSV PNGGSTSSSI NGKFPLCGEE EENILLHNKN SFHEKNSNFI FKEKKRNIFN FWYYENHCHN DIFKTPLIYY NSSVDSLQTF YFKIYNFFRK LTSFNYVVYK FRHYKRMLIS YLTRGHHAGD SGAQFNSTQF GDAQFSGAQS NGTQSNGTQS NGTQSKGTQS NGTQLNGTAH TGEQFGGSQF VGAYGAPQNT GKLYAFLLGL FLIIFLINNF FCFLLYKHLS NKLNMFVQGC TCHRK // ID B3LCE6_PLAKH Unreviewed; 873 AA. AC B3LCE6; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Nuclear protein, putative {ECO:0000313|EMBL:CAQ42105.1}; GN ORFNames=PKH_143230 {ECO:0000313|EMBL:CAQ42105.1}; OS Plasmodium knowlesi (strain H). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Plasmodium). OX NCBI_TaxID=5851 {ECO:0000313|EMBL:CAQ42105.1, ECO:0000313|Proteomes:UP000000622}; RN [1] {ECO:0000313|EMBL:CAQ42105.1, ECO:0000313|Proteomes:UP000031513} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H {ECO:0000313|EMBL:CAQ42105.1, RC ECO:0000313|Proteomes:UP000031513}; RA Pain A., Boehme U., Berry A.E., Mungall K., Finn R., Jackson A.P., RA Mourier T., Mistry J., Pasini E.M., Aslett M., Balasubrammaniam S., RA Borgwardt K., Brooks K., Carret C., Carver T.J., Cherevach I., RA Chillingworth T., Clarke T.G., Galinski M.R., Hall N., Harper D., RA Harris D., Hauser H., Ivens A., Janssen C.S., Keane T., Larke N., RA Lapp S., Marti M., Moule S., Meyer I.M., Ormond D., Peters N., RA Sanders M., Sanders S., Sergeant T.J., Simmonds M., Smith F., RA Squares R., Thurston S., Tivey A.R., Walker D., White B., RA Zuiderwijk E., Churcher C., Quail M.A., Cowman A.F., Turner C.M.R., RA Rajandream M.A., Kocken C.H.M., Thomas A.W., Newbold C.I., RA Barrell B.G., Berriman M.; RT "The genome of Plasmodium knowlesi strain H, a zoonotic malaria RT parasite with host range from monkey to man."; RL Nature 455:799-803(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM910996; CAQ42105.1; -; Genomic_DNA. DR RefSeq; XP_002262227.1; XM_002262191.1. DR EnsemblProtists; CAQ42105; CAQ42105; EBG00001281778. DR EnsemblProtists; PKH_143230; PKH_143230; PKH_143230. DR GeneID; 7323121; -. DR KEGG; pkn:PKH_143230; -. DR EuPathDB; PlasmoDB:PKH_143230; -. DR HOGENOM; HOG000281004; -. DR InParanoid; B3LCE6; -. DR Proteomes; UP000031513; Chromosome 14. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000031513}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000031513}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 224 248 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 321 348 {ECO:0000256|SAM:Coils}. FT COILED 357 384 {ECO:0000256|SAM:Coils}. FT COILED 418 438 {ECO:0000256|SAM:Coils}. FT COILED 471 509 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 873 AA; 100812 MW; 3F6F2901E1C14C13 CRC64; MSAGASNTSG NPRGRGGRGS KLSKNNHIAQ NASSVANSGN SANNGNAQNT LHGLNPPSHK GLVESEISEV GLPATDRSDD NNSTIDNDER NTLIHVLHSY EDFQKKNMNY GKVNPYRKRE SQLSKMRKSI VKLFTISDIR LDESEPNGNA DNNAYSISAS KRKDKAHSSN FLTSRAAMWN EIENNDDPEY DLKLESSKNK SIIDIVTHYL NVLINDIIND KKGITYIAIL MIILSVLITC ISGVMTLFNN KTGEKNNFNL HTTKNNYDDI NKFMNYIKLG NEENNRSEFL RLYQLLEEFK TSMNQNMNEN MNTILINKKE NHALYELNAN MENKLKELEK KLLLNTKDID YFKIHSKKEV ENFKKILQEN YQSFQNKLKD YVKTVDTIKK DIHKKNSLLN DVEKKMNKSQ MDIKKDVSDR VENEKRGLLN TISELQKKIK WIESKLAFHA SNSHDPLQLD NPAQGGLMHE NDQAEARERQ IEQRIATWNE EHVDLIQDIQ KELNLLKESA KKSTNFLDDV LPSFEHKILK NVESKIKYYL EMYKKDIINE ITESKVIYNE EKYKTITLKQ EKMQSELLKT ISSQIKAQTK IIKDDLNKSL HTMVDQKQIK MDNEYPVKSA KVNYDILDML QKKVDELYNE FILDYNEIDW ALESLGARIV YKMTSSPLNR NDFIEKFLNQ IASYLPSEEI YGMIKPMGKD PSIILKPSNF PGDCFSFNGS KGKITIHLPA TIDVSSISIQ HVHENISNNS NATPKYFSVY GVVDSNWPEH FESQDINYDD FKNSSLYSCL HSVYGNLQPK EILDKWLKGN KNPGLLHLGD FYFDRKKRIS TYPTKHCFPM KRIIFEFTEN YGAPYTCVYR LKVHGKRCIR KFK // ID B3LJI6_YEAS1 Unreviewed; 587 AA. AC B3LJI6; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 14-OCT-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDV10739.1}; GN ORFNames=SCRG_01546 {ECO:0000313|EMBL:EDV10739.1}; OS Saccharomyces cerevisiae (strain RM11-1a) (Baker's yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=285006 {ECO:0000313|EMBL:EDV10739.1, ECO:0000313|Proteomes:UP000008335}; RN [1] {ECO:0000313|Proteomes:UP000008335} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=RM11-1a {ECO:0000313|Proteomes:UP000008335}; RG The Broad Institute Genome Sequencing Platform; RA Birren B.W., Lander E.S., Galagan J.E., Nusbaum C., Devon K., RA Cuomo C., Jaffe D.B., Butler J., Alvarez P., Gnerre S., Grabherr M., RA Kleber M., Mauceli E.W., Brockman W., MacCallum I.A., Rounsley S., RA Young S.K., LaButti K., Pushparaj V., DeCaprio D., Crawford M., RA Koehrsen M., Engels R., Montgomery P., Pearson M., Howarth C., RA Larson L., Luoma S., White J., O'Leary S., Kodira C.D., Zeng Q., RA Yandava C., Alvarado L., Pratt S., Kruglyak L.; RT "Annotation of the Saccharomyces cerevisiae RM11-1a genome."; RL Submitted (MAR-2005) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH408045; EDV10739.1; -; Genomic_DNA. DR EnsemblFungi; EDV10739; EDV10739; SCRG_01546. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008335; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008335}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 587 AA; 67254 MW; 8C28CE40124071A8 CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGLHDTTV ITTGSTTNVQ KEHSSPLSTG SLRTHDFGQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGPERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNEWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWTILG EFEAGNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKQMISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IVQQAIR // ID B3MIM5_DROAN Unreviewed; 548 AA. AC B3MIM5; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 32. DE SubName: Full=GF11073 {ECO:0000313|EMBL:EDV38101.1}; GN Name=Dana\GF11073 {ECO:0000313|EMBL:EDV38101.1}; GN ORFNames=Dana_GF11073 {ECO:0000313|EMBL:EDV38101.1}, GN GF11073 {ECO:0000313|FlyBase:FBgn0088113}; OS Drosophila ananassae (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7217 {ECO:0000313|Proteomes:UP000007801}; RN [1] {ECO:0000313|EMBL:EDV38101.1, ECO:0000313|Proteomes:UP000007801} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14024-0371.13 {ECO:0000313|Proteomes:UP000007801}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH902619; EDV38101.1; -; Genomic_DNA. DR RefSeq; XP_001961279.1; XM_001961243.1. DR STRING; 7217.FBpp0114265; -. DR EnsemblMetazoa; FBtr0115773; FBpp0114265; FBgn0088113. DR GeneID; 6493939; -. DR KEGG; dan:Dana_GF11073; -. DR FlyBase; FBgn0088113; Dana\GF11073. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B3MIM5; -. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B3MIM5; -. DR Proteomes; UP000007801; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007801}; KW Reference proteome {ECO:0000313|Proteomes:UP000007801}. FT COILED 55 75 {ECO:0000256|SAM:Coils}. FT COILED 168 209 {ECO:0000256|SAM:Coils}. FT COILED 544 548 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 548 AA; 61994 MW; 69E7AFE6A1F774EE CRC64; MASIEKNIQK ALTAEEYENI LNHVNSYMQQ LVDLKLQQHS EKQQQLSPQQ LQIIVQLMKE NLQQFSASRT ELSEKDLADL ALKVKLELQN SGVWQSEVKL TPANLEEITK LIKTEVNLHQ SHYTIQLEQV DFGALLERIL GAPELADFVD ARINLRVHQL ETKEGSGALA AEQQIEQLNR EVAFIKLALS DKQAENADLQ QSLSSLRLSQ EDLIERMQQH ELSQDQRFSG LLADIDAKLA ALNDSQFALL NKQVKLSLVE ILGFKQATAG GAGSQLDEVD LQNWVRSMFV AKDYLEQQLL ELNERTNNHI RDEIDRSSIL LMSDISERLK REMLLVVEAK HNESAGALKG HIREEEVRQI VKTVLAIYDA DKTGLVDFAL ESAGGQILST RCTESYQTKS AQISVFGIPL WYPSNTPRVA ISPNVQPGEC WAFQGFPGFL VLKLNSLVYV TGFTLEHIPK SLSPTGRIDS APRNFTVWGL EHEKDFEPVL FGEYEYQDNG ASLQYFAIQN LDIKRPYEVV ELRIETNHGQ PTYTCLYRFR VHGKPPAS // ID B3MJT9_DROAN Unreviewed; 1390 AA. AC B3MJT9; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GF14555 {ECO:0000313|EMBL:EDV31428.1}; GN Name=Dana\GF14555 {ECO:0000313|EMBL:EDV31428.1}; GN ORFNames=Dana_GF14555 {ECO:0000313|EMBL:EDV31428.1}, GN GF14555 {ECO:0000313|FlyBase:FBgn0091582}; OS Drosophila ananassae (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7217 {ECO:0000313|Proteomes:UP000007801}; RN [1] {ECO:0000313|EMBL:EDV31428.1, ECO:0000313|Proteomes:UP000007801} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14024-0371.13 {ECO:0000313|Proteomes:UP000007801}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH902620; EDV31428.1; -; Genomic_DNA. DR RefSeq; XP_001962207.1; XM_001962171.1. DR STRING; 7217.FBpp0117747; -. DR EnsemblMetazoa; FBtr0119255; FBpp0117747; FBgn0091582. DR GeneID; 6497378; -. DR KEGG; dan:Dana_GF14555; -. DR FlyBase; FBgn0091582; Dana\GF14555. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B3MJT9; -. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B3MJT9; -. DR Proteomes; UP000007801; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007801}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007801}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1390 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002792924. FT TRANSMEM 985 1006 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 329 356 {ECO:0000256|SAM:Coils}. FT COILED 921 969 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1390 AA; 151773 MW; 337C721F17FD294C CRC64; MHLRLHLVRF MYINLLLSCC FWLYDNVAAE KAGPDASSED ARAPVTVIPT QQLEEPAKPE TLPPPRPPPD DEAGEPPTIA LELEPGSGPR TGDTSVPVEE EAEVAAHRVN SNSEYVVSES DLGANGSEAP LDEATAASST AQVPNNQLNN HNPNSVEEVD SVSESQPLES PKEKQQEQDS QPPEEKADES FPEIITELPT VTVTELPLDH LKNRLESVIL DDVQVPASNK SEEETPQQQL PQDNQPQQLN EAVKEVPQKD EQVPKINDPG GGIELEATDT MPSIEGSGTS EEQAAGSPGT PPASSNATES TINLTNGNEE VPMPVFSEWA QKQIEAEASR EQAMELEQQV VNKSAQRKNN TGSAKNKLPT LKLRSKNYAS PDCGAKIIAH NTESSHTSSV LTQSRDEYML STCGDRIWLV VELCEAIQAQ KVDVANFELF SSSPKNFSVY VSKRYPTREW SNVGRFEAED KRNIQTFELH PHLFGKFVRL DITSHYASEH FCPLSLFRVF GTSEYEAFET EIRPSDELDD FYDDFGMQDQ AVGSGGNIFQ SASDAVIQMV KKAAEVLAKP TKALKWSAES LLCRTPIFGA YTCSNCNSTL VEKINSLISC QFQHLQVLLS LSRLKYDLVH SRVCQEDFGI SLIASYGSGS QTSKMSKQQS YFLSLLPAEH IGAMCKLLEA EQNVTEMPSV KHHVSEPQQD QDNATAKGVR EDCPAKEVPA KEPATQPSLE VLVPETSQEV PSTQHQSTTS GETASTTNST PPGDVNIFNV PPHVEEVVVK EPLPLPSEPS IVSTLEPSDV ESSTNAPATA QTSSEAPGTG DTPAEEGGPA NWENMELLGT TVASITAGGG AAAAAAAVVN GNGNLGAASA GTPTGSPPAA GTGVNLQQKL TNGAQSESVF IRLSNRIKAL ERNMSLSGQY LEELSRRYKK QVEELQQTLT QQTLTVRQLE DQSRRYVEQE QLYQQQSAEL AGEVRALTYQ VQACIMVIII VGTCIFLMFV LGTVYYRKLR RQQQQLKKDQ PPVAIAKPKL DRRKSYEQML NPSTPKQRRP SEEAMMILKE CGDNVSQEQN SSSRQRKISV CYGSNNNIAT NMIGATTNGG PSVRSSVHKR KGAKHSWHNS LDTTETSCGE QTDKFFDVDT LKSLKQIPGK VPKKKSQPQM GLKRQESAPA TFSQDQTFED PATQSDFDES LMLDDDDLAN FIPNSDLAYN EFMPEGPSGY QILDTVDGKT GSEKSKSKSR RLSSPAFFKS PFSRSKNKGY GFNGIKNSHS VHEPTSWEWY RLKRGEKQQQ IQKDKLASKS LPNASLDSSS LSEVNFPLKS TNEATQNSFR ILGEAILSSG EGRITPNGNG NGIGLASSSS GSGSGGSTTS STTKKKQRAF NNLLRKAFDF // ID B3MK80_DROAN Unreviewed; 280 AA. AC B3MK80; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=GF15387 {ECO:0000313|EMBL:EDV31498.1}; GN Name=Dana\GF15387 {ECO:0000313|EMBL:EDV31498.1}; GN ORFNames=Dana_GF15387 {ECO:0000313|EMBL:EDV31498.1}, GN GF15387 {ECO:0000313|FlyBase:FBgn0092412}; OS Drosophila ananassae (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7217 {ECO:0000313|Proteomes:UP000007801}; RN [1] {ECO:0000313|EMBL:EDV31498.1, ECO:0000313|Proteomes:UP000007801} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14024-0371.13 {ECO:0000313|Proteomes:UP000007801}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH902620; EDV31498.1; -; Genomic_DNA. DR RefSeq; XP_001962277.1; XM_001962241.1. DR STRING; 7217.FBpp0118579; -. DR EnsemblMetazoa; FBtr0120087; FBpp0118579; FBgn0092412. DR GeneID; 6498196; -. DR KEGG; dan:Dana_GF15387; -. DR FlyBase; FBgn0092412; Dana\GF15387. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B3MK80; -. DR OMA; HVAREMT; -. DR OrthoDB; EOG7VQJCX; -. DR PhylomeDB; B3MK80; -. DR Proteomes; UP000007801; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007801}; KW Reference proteome {ECO:0000313|Proteomes:UP000007801}. SQ SEQUENCE 280 AA; 30842 MW; 308E4944114BC8A0 CRC64; MKGSSKNTRD ILRLRDDVED ISHMMAQQQE DCKDSQIPFG SPCKFGCGSS ESKLSGSGKC DNRDLNAYVD TLVKRKLGHL MDDVYNLKKT VMNSQCAAKG GQSGVKSEPA PADKVRLNYA SEELGARILS AVAVPIGGTN IIRKLLGLEF NANPPINMLR PSLAPGACFG FKGRRATVTV QLAKPIKVET ITLSHVAKEM TPRLCSNSAP KDFDVYGLQA DCQKRELLGH WRYDNDAKKR TQSYKAMAKC SFRKLVFVFN TNHGANATCV YRVEVFGKLK // ID B3MPQ1_DROAN Unreviewed; 2704 AA. AC B3MPQ1; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 50. DE SubName: Full=GF14121 {ECO:0000313|EMBL:EDV32299.1}; GN Name=Dana\GF14121 {ECO:0000313|EMBL:EDV32299.1}; GN ORFNames=Dana_GF14121 {ECO:0000313|EMBL:EDV32299.1}, GN GF14121 {ECO:0000313|FlyBase:FBgn0091148}; OS Drosophila ananassae (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7217 {ECO:0000313|Proteomes:UP000007801}; RN [1] {ECO:0000313|EMBL:EDV32299.1, ECO:0000313|Proteomes:UP000007801} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14024-0371.13 {ECO:0000313|Proteomes:UP000007801}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH902620; EDV32299.1; -; Genomic_DNA. DR RefSeq; XP_001963078.1; XM_001963042.1. DR ProteinModelPortal; B3MPQ1; -. DR STRING; 7217.FBpp0117313; -. DR EnsemblMetazoa; FBtr0118821; FBpp0117313; FBgn0091148. DR GeneID; 6496949; -. DR KEGG; dan:Dana_GF14121; -. DR FlyBase; FBgn0091148; Dana\GF14121. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; B3MPQ1; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B3MPQ1; -. DR Proteomes; UP000007801; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007801}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007801}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1287 1314 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2704 AA; 299461 MW; 376FCDCF95737225 CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNHLVVAD LSSRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLTF IRDCGSQVHK DTLHSSMSVV SRLCTKVEPN TPCIQNCVES LSTLLQHEDT MVSDGALKCF ASVADRFTRK WVDPAPLAEY GLTTELLNRL RSVGGNAHTS ANASLPADAI NENVAGAAVA PNANKVKTSE AAASPQSIST TISLLSTLCR GSPSITHDIL RSQLADAIER ALQGDERCVL DCMRFADLLL LLLFEGRQAL NRGSSNPNQG QLAPRPRRNN TNTDRTHRQL IDCIRSKDSE ALREAIETGG IDVNCMDDVG QTLLNWASAF GTLEMVEYLC EKGADVNKGQ RSSSLHYAAC FGRPAIAKIL LKFGAYPDLR DEDGKTPLDK ARERLDDGHR EVAAILQSPG EWMSPDHSLL NKDGKKYTLM EPRGDPEMAP IYLKLLLPIF CRTFLGSMLG SVRRASLALI KKIVQYAYPT VLQSLSESGY SEDAASTSAH NGGNLLIEVV ASVLDNEDDD DGHLIVLNII EEIMCKTQEE FLDHFARLGV FAKVQALMDT DTEDVYAQGS QDESSPTIRS STSAVVAPRS TSDDPMEDAK EILQGKPYHW REWSICRGRD CLYVWSDSVA LELSNGSNGW FRFIIDGKLA TMYSSGSPEN GNDSSENRGE FLEKLMRARS CVITGIVSQP ILPTASALRL VVGNWVLQSH KTNQLQIHNT EGHQVTVLQD DLPGFIFESN RGTKHTFTAE TVLGPDFASG WSTAKKKRNK SKTEGQKFQV RNLSREIYNK YFKSAQTVPR GAVTVLTDIV KQIETSFEEQ NMAPNGGWDT TLSSALTKLS QLIHEDGVVS AYEMHSSGLV QALVAVLSVN PWESNSPRGK RNKMQKQRVA VFRKCILEDN GESASNKPRT KSTASILIQK LVSVLESTEK LPVYLYDSPC TGYSLQILQK RLRFRLERAD CETTLFDRSG RTLKMEPLAT VAQLSKYLLK MVAKQWYDLD RSTYFYLKKI RDHKPGTVFS HSFDFDEEGL IFYIGSNAKT CDWVNPAQYG LVQVTSSEGK TLPYGKLEDI LSRDSISLNC HTKDNKKAWF AIDLGVYIIP TAYSLRHARG YARSALRNWL LQGSKDGIVW TTLSTHVDDK SLVDPGSTAT WPISCPQDDS QRYRHIRIQQ NGRNASGQTH YLSLSGFEIY GRIVGVADDI GKSVKEAEAK IRRERRQIRA QLKHMTTGAK VIRGVDWRWE DQDGCGEGTI TGEIHNGWID VKWDHGVRNS YRMGADGKYD LKLADCEYLS IFEGNPTSIV PSSTTTKVGD KTNTLTSRKS SSTPSLPEAT EKTQNSEGAS NQTVSADNLA WKQAVETIAE NVFASAKTQI ISNQLAMNSS SSREIRNKHK ESGSSQMHKD NISGPSPLSR DLEHISDLSA INNSMPAINS SIVSDLATIS ENLSLAELSK ENICGTISSV SKENVAGGQS LAIADAQSAS PRESDIKNIS NIEENNKTNA NNSVNKTSKE LLANLRTSSA AACQQVTQLS TEALEMIDKM RDGVDMIRNN SNNILATDTF PLPSTNPATS IKKHSKAQGT VNPENANEKQ IIAPGEDYPG KNIKKSSVTL KPAQQPNAVL SIVDIKDQQI TGESVSVPSQ MSISVPNLTT TSASEVPSTS EVATHTGLLE TFAAIARRRT SQGTNIQENQ IMNADVNVNE HGDQNPSGSF LGHSVTSLVK LALSSNFHSG LLSTAQSYPS LSSNNSENIT PSNPTNTSAG QQSASTINHT LTMSLTSTSS DSEQVSLEDF LESCRAPAML GDLDDEDDID EDNDEEENED EYEEVGNTLL QVMVSRNLLT FMDDEALENR LVGVTKRKSW DDEFVLKRQF SALIPAFDPR PGRTNVNQTS DLELPAIGVE PPKPQQSGNE TIEQPLLGLK LRGPGIGGTP EVEIDLNNTD WTIFRAVQEL LQCSQLNKVD KFRKIWEPTY TIVYREVSPE AQEPEEFPQT PDVSSKSGAS TLSPNSPMHI GFNLADNNLC SVDDVLELLT QINALNQSEI ELDGKDHPGP LLSEDLFISK KITNKLQQQI QDPLVLSSNA LPNWCENLNQ SCPFLFPFET RQLYFNCTSF GASRSIVCLQ SQRDVTLERQ RVPIMSPRRD DHDFRIGRLK HERVKVPRNE DLLKWAMQVM KTHCNRKSVL EVEFLDEEGT GLGPTLEFYA LVAAEIQRSD LCMWLCDDQL GEDAESPEDP VEGSPKPVGY YVNRREHGLF PAPLPQNTEA CEKVLKYFWF FGVFVAKVLQ DMRLVDIPLS TSFLQLLCHN KVFSRNLQKV ISDRRNGDLS VVSEESDLLD TCTKLLRTDC NKSNIFGGIL SLENLKEIDP TRYQFLQELQ NLLMRKQSID FDDSLSAEKK QELINELKLH TQNGLEVSLE DLALTFTYLP SSSVYGYTQA ELMPNGSAVN VDINNLEAYC ELLMNFILQD GIAQQMKAFS DGFNEVFPLK KLAAFTPAEA RMMICGEQFP HWSREDIISY TEPKLGYNKD SPGFLRFVNV LLSMSGDERK AFLQFTTGCS SLPPGGLANL HPRLTVVRKV DAGVGSYPSV NTCVHYLKLP DYPTEEIMKE RLLTATKEKG FHLN // ID B3N4H4_DROER Unreviewed; 270 AA. AC B3N4H4; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=GG10304 {ECO:0000313|EMBL:EDV58886.1}; GN Name=Dere\GG10304 {ECO:0000313|EMBL:EDV58886.1}; GN ORFNames=Dere_GG10304 {ECO:0000313|EMBL:EDV58886.1}, GN GG10304 {ECO:0000313|FlyBase:FBgn0102613}; OS Drosophila erecta (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7220 {ECO:0000313|Proteomes:UP000008711}; RN [1] {ECO:0000313|EMBL:EDV58886.1, ECO:0000313|Proteomes:UP000008711} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14021-0224.01 {ECO:0000313|Proteomes:UP000008711}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH954177; EDV58886.1; -; Genomic_DNA. DR RefSeq; XP_001969827.1; XM_001969791.1. DR EnsemblMetazoa; FBtr0130358; FBpp0128850; FBgn0102613. DR GeneID; 6541765; -. DR KEGG; der:Dere_GG10304; -. DR FlyBase; FBgn0102613; Dere\GG10304. DR OMA; HVAREMT; -. DR OrthoDB; EOG7VQJCX; -. DR PhylomeDB; B3N4H4; -. DR Proteomes; UP000008711; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008711}. SQ SEQUENCE 270 AA; 29682 MW; 296568BC3DE530AE CRC64; MAYNSRNNLG IMRLREDVDD ISHILRQQQV GCKGAQGSCK VSCAGGDPKG SCDHRDVSAY VDTLLKRKMG HLMDDVYNLK KQVMSADCSS KSGQAAPKPE AASLARPRIN YASEDLGARI INVKAQPIGG TNFIKWLLGL DFSANPPVNM IRAALSPGAC FGFNGSQATV TLQLAKTIVV EVISLTHVAR EMTPSLCVRS APKNFDGLRN DNSKKELLGQ WSYDNAANRR TQSYSVRSEF FFRNLAFLFN SNHGANSTCI YRVEVYGRLH // ID B3N961_DROER Unreviewed; 2724 AA. AC B3N961; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 48. DE SubName: Full=GG23957 {ECO:0000313|EMBL:EDV58496.1}; GN Name=Dere\GG23957 {ECO:0000313|EMBL:EDV58496.1}; GN ORFNames=Dere_GG23957 {ECO:0000313|EMBL:EDV58496.1}, GN GG23957 {ECO:0000313|FlyBase:FBgn0116096}; OS Drosophila erecta (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7220 {ECO:0000313|Proteomes:UP000008711}; RN [1] {ECO:0000313|EMBL:EDV58496.1, ECO:0000313|Proteomes:UP000008711} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14021-0224.01 {ECO:0000313|Proteomes:UP000008711}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH954177; EDV58496.1; -; Genomic_DNA. DR RefSeq; XP_001969437.1; XM_001969401.1. DR ProteinModelPortal; B3N961; -. DR EnsemblMetazoa; FBtr0144011; FBpp0142503; FBgn0116096. DR GeneID; 6541442; -. DR KEGG; der:Dere_GG23957; -. DR FlyBase; FBgn0116096; Dere\GG23957. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B3N961; -. DR Proteomes; UP000008711; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008711}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1301 1328 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2724 AA; 302002 MW; 1EA7123E0EF3A1C9 CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNHLVVAD LSSRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLSF IRDCGSQVHK DTLHSAMSVV SRLCTKVEPN TPCIQNCVES LSTLLQHEDS MVSDGALKCF ASVADRFTRK WVDPAPLAEY GLTTELLKRL KSVGGNTHSA LSAAGTQPTS SSQPAATTNS DAINENIAGT ATISNSTKVK SSDAAASPQS ISTTISLLST LCRGSPSITH DILRSQLADA LERALQGDER CVLDCMRFAD LLLLLLFEGR QALNRGSNNP NQGQLAPRPR RNNTNTDRTH RQLIDCIRSK DSEALREAIE SGGIDVNCMD DVGQTLLNWA SAFGTLEMVE YLCEKGADVN KGQRSSSLHY AACFGRPAIA KILLKFGAYP DLRDEDGKTP LDKARERLDD GHREVAAILQ SPGEWMSPDH SLLNKDGKKY TLMEPRGDPE MAPIYLKVLL PIFCRTFLGS MLGSVRRASL ALIKKIVQYA YPTVLQSLSE TSFSEDAAST SGQNGGNLLI EVVASVLDNE DDDDGHLIVL NIIEEIMCKT QEEFLDHFAR LGVFAKVQAL MDNDAEELYV QLPVTSEEPA AAQRSSTLAV TPRSTSDDPM EDAKEILQGK PYHWREWSIC RGRDCLYVWS DSVALELSNG SNGWFRFIID GKLATMYSSG SPENGNDSSE NRGEFLEKLM RARSCVIAGV VSQPILPTAS ALRLVVGNWV LQSQKTNQLQ IHNTEGHQVT VLQDDLPGFI FESNRGTKHT FSAETVLGPD FASGWSTAKK KRNKSKTEGQ KFQVRNLSRE IYNKYFKSAQ TIPRGAVAIL TDIVKQIEIS FEEQHMAPNG NWETTLSDAL MKLSQLIHED GVVSAYEMHS SGLVQALVAV LSVNHWEANS PRCKRNKMQK QRVSVFKKCI LEDNVESATN KPRTKSTASI LIQKLVSVLE STEKLPVYLY DTPCTGYSLQ ILQKRLRFRL ERAECESTLF DRSGRTLKME PLATIGQLSK YLLKMVAKQW YDLDRSTYFY LKKIREHRTG SVFTHCFDFD EEGLLFYIGS NAKTCDWVNP AQYGLVQVTS SEGKTLPYGK LEDILSRDSI SLNCHTKDNK KAWFAIDLGV YIIPTAYTLR HARGYGRSAL RNWLLQGSKD GLTWTTLSTH VDDKSLVEPG STATWPITCA TDDSVRYRHI RIQQNGRNAS GQTHYLSLSG FEIYGRVVGV ADDIGKSVKE AEAKIRRERR QIRAQLKHMT TGARVIRGVD WRWEDQDGCA EGTITGEIHN GWIDVKWDHG VRNSYRMGAE GKYDLKLADC EYLSAFDGNQ SMGNTGTAPK ASEKGNTLTS RKSSSTPSLP EATEKNQNSE GASNQTVSAD NLAWKQAVET IAENVFASAK TQIISNQLAM NTSSSREARA KHKESGTNQM HKDNISGPSP LSRELEHISD LSAINNSMPA INSSIVSDLA TISENLSLTE LSKENICSVL TPSYKPAESV TASQSSSHPD VQSSSPREND IKNISNIEEN NKMNANNSVN KISKDLLANL RTSNIAGCPP VTQLSTEALE MIDKMRDGVD MIRNNSNNIL STDTFPVPCT NVPVGVKKTP KAQALINPDN ANQKQIIVTT EEFPTKSSKK PSVTLKPAQQ PNAVLSIVDI KDPQISTENV SVPSQMSISV PNLTTTSASE VPSTSEVATH TGLLETFAAI ARRRTSQGTN IQDNQIMNTE ANVNEHGDQN ASGSFLGHSV TSLVKLALSS NFHSGLLSTA QSYPSLSSNN SENIAPSNPS NTSAGQQSAS TINHTLTMSL TSTSSDSEQV SLEDFLESCR APALLGDLDD EDDMDEDNDE EENEDEYEEV GNTLLQVMVS RNLLTFMDDE AMENRLVGVT KRKSWDDEFV LKRQFSALIP AFDPRPGRTN VNQTSDLEIS PLGAELPKPQ QSGPETIEQP LLGLKLRGPG IGGIPEVEID LSNTDWTIFR AVQELLQCSQ LNKLDKFRKI WEPTYTIVYR EVSPEAQEST CLESEEFPQT PDVSSKSGAS TLSPNSPMHI GFNVADNNLC SVDDVLELLT QINGLNQSEI DSDVKEHGVS VLSEDLFISK KITNKLQQQI QDPLVLASNA LPNWCENLNQ SCPFLFPFET RQLYFNCTSF GASRSIVCLQ SQRDVTVERQ RIPIMSPRRD DHEFRIGRLK HERVKVPRNE DLLMWAMQVM KTHCNRKSVL EVEFLDEEGT GLGPTLEFYA LVAAEIQRSD LCMWLCDDDL GEDTEISPQT AEGNSKPVGY YVNRREHGIF PAPLPQNTEI CEKVLKYFWF FGVFVAKVLQ DMRLVDIPLS TSFLQLLCHN KVLSRNLQKV ISDRRNGDLS VVSEESDIVE TCTKLLRTDS NKSNAFGGIL SLENLKEIDP TRYQFLQEMQ NLLMRKQSIE FDDTISAEKK QELTNELKLH TQNGLEVSLE DLSLTFTYLP SSSIYGYTQA ELLPNGSSVN VTIDNLEAYC ELLMNFILQD GIAQQMKAFS DGFNEVFPLK KLAAFTPSEA RMMICGEQFP HWSREDIISY TEPKLGYNKD SPGFQRFVNV LLSMSGDERK AFLQFTTGCS SLPPGGLANL HPRLTVVRKV DAGVGSYPSV NTCVHYLKLP DYPTEEIMKE RLLTATKEKG FHLN // ID B3NB12_DROER Unreviewed; 564 AA. AC B3NB12; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=GG23226 {ECO:0000313|EMBL:EDV59777.1}; GN Name=Dere\GG23226 {ECO:0000313|EMBL:EDV59777.1}; GN ORFNames=Dere_GG23226 {ECO:0000313|EMBL:EDV59777.1}, GN GG23226 {ECO:0000313|FlyBase:FBgn0115377}; OS Drosophila erecta (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7220 {ECO:0000313|Proteomes:UP000008711}; RN [1] {ECO:0000313|EMBL:EDV59777.1, ECO:0000313|Proteomes:UP000008711} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14021-0224.01 {ECO:0000313|Proteomes:UP000008711}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH954177; EDV59777.1; -; Genomic_DNA. DR RefSeq; XP_001970718.1; XM_001970682.1. DR EnsemblMetazoa; FBtr0143280; FBpp0141772; FBgn0115377. DR GeneID; 6541885; -. DR KEGG; der:Dere_GG23226; -. DR FlyBase; FBgn0115377; Dere\GG23226. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B3NB12; -. DR Proteomes; UP000008711; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008711}. FT COILED 183 210 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 564 AA; 63748 MW; F21DCAD5236509B9 CRC64; MELPTAISPQ QEEEAIKVNM ASIERNIQKA LTAEEYENIL NHVNSYVQQL VELKMQQHAK ELPPQQIQLI VQLMKENLQQ IAHKTQLSEK DLTDLVTKLK LELQGSGGWP DGAKLSRANL EEITRLVKSE LHLHESHYKI QLDRIDFPAL LEQILAAPAL TDFVDARIGL QVGELEQKES SGASDAEVQI ERLNREIAFI KLALSDKQAE NADLHLSISN LKLGHEDLLE RIQQHELAQD KRFHGLLAEI ESKLSALNDS QFALLNKQIK LSLVEILGFK QSTAGGAAGQ LDDFDLQTWV RSMFVAKDYL EQQLLELNKR TNNNIRDEIE RSSILLMSDI SERLKREILL VVEAKQNEST KALKGHIREE EVRQIVKTVL AIYDADKTGL VDFALESAGG QILSTRCTES YQTKSAQISV FGIPLWYPTN TPRVAISPNV QPGECWAFQG FPGFLGKSRL NSLVYVTGFT LEHIPKSLSP TGRIDSAPRN FTVWGLEQEK DPEPVLFGEY QFEDNGASLQ YFAVQNLDIK RPYEIVELRI ETNHGQPTYT CLYRFRVHGK PPAT // ID B3NLZ4_DROER Unreviewed; 1259 AA. AC B3NLZ4; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=GG21560 {ECO:0000313|EMBL:EDV54530.1}; DE Flags: Fragment; GN Name=Dere\GG21560 {ECO:0000313|EMBL:EDV54530.1}; GN ORFNames=Dere_GG21560 {ECO:0000313|EMBL:EDV54530.1}, GN GG21560 {ECO:0000313|FlyBase:FBgn0113738}; OS Drosophila erecta (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7220 {ECO:0000313|Proteomes:UP000008711}; RN [1] {ECO:0000313|EMBL:EDV54530.1, ECO:0000313|Proteomes:UP000008711} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14021-0224.01 {ECO:0000313|Proteomes:UP000008711}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH954179; EDV54530.1; -; Genomic_DNA. DR RefSeq; XP_001974130.1; XM_001974094.1. DR EnsemblMetazoa; FBtr0141614; FBpp0140106; FBgn0113738. DR GeneID; 6548888; -. DR KEGG; der:Dere_GG21560; -. DR FlyBase; FBgn0113738; Dere\GG21560. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B3NLZ4; -. DR Proteomes; UP000008711; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008711}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 850 871 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 188 215 {ECO:0000256|SAM:Coils}. FT COILED 786 834 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EDV54530.1}. SQ SEQUENCE 1259 AA; 137218 MW; 23DAABA7F440D1AA CRC64; NGRSYRRASM SKYLIDCERG TGSGSGTGAG IGIGIGEPLW GLKPFPTPFA TQTQLCPGQG THGTIHSAHQ NALENCLVGG SYENLVVRSC APLPRRWCHL ILILPSPGVW DRESQSHSLS SVLVILINDP GGGIEVEGMA PQEAAAVGET QESREELQPG SAAFNETGGT ANLTNASEEV PMPVFSEWAQ KQMEAEASRE QAMELEQQVV NKSAQRKNNT GSSSGKPPTL KLRSKNYASP DCGAKIIAHN SESKHTEAVL TQSTDEYMLS TCESRIWFVV ELCEAIQAQK VDVANYELFS SSPKNFTVAV SKRFPTRDWS NVGRFAAEDK RTIQTFELHP HLFGKFVRVE ITSHYANEHF CPLSLFRVFG TSEYEAFETE IRPSDDLDDF YDDYGAQEQK AAVGSGGNIF QSASDAVMQM VKKAAEVLVK PTKALKWSAE SVLCQTPAFE TYSCSNCNTT LVERINSLLS CQFQQLQALL SLSRLRSDLL HSRVCQEQFG ISLMGSDFAS KMGKEQSYFL SMLPAEHVGA MCKLVQAEQN VTDQRHTKAP TLKQHVSSPE AVQDNATATG VRQDCDNSKE RQPTKTATKE PQTPSLEVVV PEVSQEVPSL EDQSSTSSET VSTTNSTPAD VNIFNVPSES EEVEVKVELP PEPTLPKTLE PSDVESFTDA PSTNAPLTSS EASANADLGM EEGNPTNWEG IDSLLTTTVA SITAGGGAAA AAAAVVNGNG NIGGAGIVAA GGPASVSSVN MQQKLTNGAQ SESVFIRLSN RIKALERNMS LSGQYLEELS RRYKKQVEEL QQTLTQQTLT VRQLEDQSRR YVEQEQLYQQ HSAELAGEVR ALSYQVQACI LVIIIVGTCI FLMLVLGTVY YRKLRRQQQQ LLKKDQAGHP PVAAKPKLDR RKSYEQMPNQ TTPKQRRPSE EAMLILKECG DSNLQEQDPS HRQRKISVCY GSNNNIAANM VIANTNGAAS VRNSLHRRKG AKHSWHNSLD TTETSCGEQT DKFFDVDTLK STKQICGKPG KKKSHQQLKP LALKRQESAP ATYTPDMQAE EPATQSDFDE SLMLDDDDLA NFIPTSDLAY NEFMPEGPSG YQIVDTVDGK PGKEQGTKKS RRLSSPAFFK SPFSKSKNKG YSFNGVKNSH SVHEPTSWEW YRLKRSEKQQ QAKLASKSLP SASLDSSSLS EVNFPLSSSS ATQNSVRILG EAILSSGEGR ITPNGNGNAA SGVLASSSSG SGSGGSTTSS TTKKKQRALN NLFRKAFDF // ID B3RF50_SORAR Unreviewed; 290 AA. AC B3RF50; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 14-OCT-2015, entry version 11. DE SubName: Full=Sperm-associated antigen 4 protein (Predicted) {ECO:0000313|EMBL:ACE77676.1}; GN Name=SPAG4 {ECO:0000313|EMBL:ACE77676.1}; OS Sorex araneus (Eurasian common shrew) (European shrew). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Insectivora; Soricidae; Soricinae; OC Sorex. OX NCBI_TaxID=42254 {ECO:0000313|EMBL:ACE77676.1}; RN [1] {ECO:0000313|EMBL:ACE77676.1} RP NUCLEOTIDE SEQUENCE. RA Antonellis A., Benjamin B., Blakesley R.W., Bouffard G.G., RA Brinkley C., Brooks S., Chu G., Chub I., Coleman H., Fuksenko T., RA Gestole M., Gregory M., Guan X., Gupta J., Gurson N., Han E., Han J., RA Hansen N., Hargrove A., Hines-Harris K., Ho S.-L., Hu P., Hunter G., RA Hurle B., Idol J.R., Johnson T., Knight E., Kwong P., Lee-Lin S.-Q., RA Legaspi R., Madden M., Maduro Q.L., Maduro V.B., Margulies E.H., RA Masiello C., Maskeri B., McDowell J., Merkulov G., Montemayor C., RA Mullikin J.C., Park M., Prasad A., Ramsahoye C., Reddix-Dugue N., RA Riebow N., Schandler K., Schueler M.G., Sison C., Smith L., RA Stantripop S., Thomas J.W., Thomas P.J., Tsipouri V., Young A., RA Green E.D.; RT "NISC Comparative Sequencing Initiative."; RL Submitted (JUN-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DP000783; ACE77676.1; -; Genomic_DNA. DR HOVERGEN; HBG102068; -. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 54 88 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 290 AA; 32629 MW; 51485364A35B413A CRC64; MGRGIEGEAQ RCLRISEEGL GRVGETLQGR AARRGTSTLL ISLHQQEPKE ILTLSQYHER VRSQGQQLEQ LQAELDKLHK EVSSVRAANS ERVAELVFQR LHEDFVRKPD YALSSVGASI DLEKTSQDYV DTDTGYFWNH FNLWNYARPP TVILEPDVFP GNCWAFEGDQ GQVVIRLPGR VQLSDITLQH PPHSVAHTGG ADSAPRNFTV YGIEADDETE VFLGNFTFEV EKSEIQTFHL QNDPPAAFPK VKIQILSNWG HPRFTCLYRV RAHGLKTTEE TGDSAPGELH // ID B3RP41_TRIAD Unreviewed; 905 AA. AC B3RP41; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 27. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EDV28123.1}; GN ORFNames=TRIADDRAFT_53396 {ECO:0000313|EMBL:EDV28123.1}; OS Trichoplax adhaerens (Trichoplax reptans). OC Eukaryota; Metazoa; Placozoa; Trichoplax. OX NCBI_TaxID=10228 {ECO:0000313|Proteomes:UP000009022}; RN [1] {ECO:0000313|EMBL:EDV28123.1, ECO:0000313|Proteomes:UP000009022} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Grell-BS-1999 {ECO:0000313|EMBL:EDV28123.1, RC ECO:0000313|Proteomes:UP000009022}; RX PubMed=18719581; DOI=10.1038/nature07191; RA Srivastava M., Begovic E., Chapman J., Putnam N.H., Hellsten U., RA Kawashima T., Kuo A., Mitros T., Salamov A., Carpenter M.L., RA Signorovitch A.Y., Moreno M.A., Kamm K., Grimwood J., Schmutz J., RA Shapiro H., Grigoriev I.V., Buss L.W., Schierwater B., RA Dellaporta S.L., Rokhsar D.S.; RT "The Trichoplax genome and the nature of placozoans."; RL Nature 454:955-960(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS985242; EDV28123.1; -; Genomic_DNA. DR RefSeq; XP_002109957.1; XM_002109921.1. DR STRING; 10228.TriadP53396; -. DR EnsemblMetazoa; TriadT53396; TriadP53396; TriadG53396. DR GeneID; 6750619; -. DR KEGG; tad:TRIADDRAFT_53396; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B3RP41; -. DR OrthoDB; EOG7MPRDC; -. DR Proteomes; UP000009022; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009022}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009022}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 905 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002798378. FT TRANSMEM 871 894 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 832 866 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 905 AA; 102328 MW; ECD475FD1A955E31 CRC64; MDNMKFFFFY LLILVLRLHA GRCDNDNDMN ANAEQIGASR IQNNLQGYNS NSNDVSYDDG TTSSIGASDD NSGHSQSDEQ IIHNDDKQLD EDENGEASHQ NLNLDVKQSL DVELNEAVDD IKKDTDKLPV VPQNMVKNEQ TTEIDVDNMK AKITTEKEED TAMPSFEEWK KKQFEDSMKR GHHNSISASN NLPRKNRQPS QNNYASSSCG AKIVESNSEA KNAEGILIGD KDVYMNNPCS ANIWFVIELC DHLKIESIEI ANLELFSSRP ESFRVSISQR NPTREWKVID TFKAKDERKI QSFAMDIDDF ARFVKVEILS VFRDEHYCPL TFFRVLGTTW VDDFDDSETA DNDLDGQDTS IKVNSSQQID NSTNEKSDVV QESNKGNKST IMGLGLDAVI TIVKKVGEAF TKRSRNETTS DHAIGNDNHN LTGYFPNYDI CYIKNETLSL YWYLMVKYNR KFIEAIVNAT SKSSQKISNE NEKNKLGDSG KKITANRYVS NDKSIEDVSI RTFNFMPFTE YCQFLDLPFR AIFGSYVEEI ASIFYLHHCL SCCVIINDPL YNTSTIKKFD KLSNLIPNKP INEVSSIDLS PLAQLTYSMN SINQPSELEL KSDTVSTLEF DKSIELIMQT NLPMQESQPV INIQSSRTTV QEQSGSDYLL SPGSKASVVN KMISTSDGFE NIDHEFSTVS SILQPSVTVD SSEYKSKVAN VDSNKRRDAS ESTSNSKNET TPSSNLTSVG GSIQSTLEKL LNVDFDYDLD IDFSPSESAT VHKTGANRES VLTRLNNRIK TLEKNYKLTT LYLEDFSKKF VKRLDDVQKA SDKRMNLLTS LIEQNTKSVQ SLMNRLNHLE INLEDITNEI RSISEKSHHK GFGETSVCLL IEIMCVLIVL CFVFSSKQKR NNTRH // ID B3S288_TRIAD Unreviewed; 403 AA. AC B3S288; DT 02-SEP-2008, integrated into UniProtKB/TrEMBL. DT 02-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EDV23071.1}; GN ORFNames=TRIADDRAFT_57968 {ECO:0000313|EMBL:EDV23071.1}; OS Trichoplax adhaerens (Trichoplax reptans). OC Eukaryota; Metazoa; Placozoa; Trichoplax. OX NCBI_TaxID=10228 {ECO:0000313|Proteomes:UP000009022}; RN [1] {ECO:0000313|EMBL:EDV23071.1, ECO:0000313|Proteomes:UP000009022} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Grell-BS-1999 {ECO:0000313|EMBL:EDV23071.1, RC ECO:0000313|Proteomes:UP000009022}; RX PubMed=18719581; DOI=10.1038/nature07191; RA Srivastava M., Begovic E., Chapman J., Putnam N.H., Hellsten U., RA Kawashima T., Kuo A., Mitros T., Salamov A., Carpenter M.L., RA Signorovitch A.Y., Moreno M.A., Kamm K., Grimwood J., Schmutz J., RA Shapiro H., Grigoriev I.V., Buss L.W., Schierwater B., RA Dellaporta S.L., Rokhsar D.S.; RT "The Trichoplax genome and the nature of placozoans."; RL Nature 454:955-960(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS985247; EDV23071.1; -; Genomic_DNA. DR RefSeq; XP_002113981.1; XM_002113945.1. DR STRING; 10228.TriadP57968; -. DR EnsemblMetazoa; TriadT57968; TriadP57968; TriadG57968. DR GeneID; 6755514; -. DR KEGG; tad:TRIADDRAFT_57968; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B3S288; -. DR KO; K19347; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000009022; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009022}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009022}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 66 86 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 130 150 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 190 210 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 403 AA; 45341 MW; C927A68D733503EB CRC64; MALNNQTDFP QVSKIPDNEG LFAQGSLYCS MKRDSLGSVA GRKSVRSNNW YRKYGGMTYD HGPLRALIGL VRVFVIIIYT ISALIIYTDA VLIIKIKGII KTAFMLPLYP SLNIKTSHKT PTQTTLNASL SLAIIVVGLT CTIAVFIYPV TFQKIQSVSS TLLTKSSGDP VIKDTNNTNC LNGIGDRIRI RELEAVIMQL KSEIQEIKSY PRQKVILSQD ITFNIPQTLN RVYLMINYLE DNLKELKKSM ISKEIVEQIV NSALKLYDED KIGLADYALY PAGGRVISIG NTKPYLNSEG KPFHSPNIMI QPDLQPGNCW AFEGRMGEVT IQGLDDIYAH EKLLLGSFTF EDSNVMNLQR FTVQHFPNRP FNLIKIIFTS NHGSSYTCVY RFRVHGFLEK STL // ID B4DIU6_HUMAN Unreviewed; 706 AA. AC B4DIU6; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=cDNA FLJ59542, highly similar to Sad1/unc-84-like protein 2 {ECO:0000313|EMBL:BAG58608.1}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|EMBL:BAG58608.1}; RN [1] {ECO:0000313|EMBL:BAG58608.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Hippocampus {ECO:0000313|EMBL:BAG58608.1}; RA Wakamatsu A., Yamamoto J., Kimura K., Ishii S., Watanabe K., RA Sugiyama A., Murakawa K., Kaida T., Tsuchiya K., Fukuzumi Y., RA Kumagai A., Oishi Y., Yamamoto S., Ono Y., Komori Y., Yamazaki M., RA Kisu Y., Nishikawa T., Sugano S., Nomura N., Isogai T.; RT "NEDO human cDNA sequencing project focused on splicing variants."; RL Submitted (OCT-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK295787; BAG58608.1; -; mRNA. DR UniGene; Hs.517622; -. DR UniGene; Hs.744734; -. DR STRING; 9606.ENSP00000385616; -. DR PaxDb; B4DIU6; -. DR PRIDE; B4DIU6; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOVERGEN; HBG056957; -. DR NextBio; 35472359; -. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 169 190 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 202 223 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 262 282 {ECO:0000256|SAM:Coils}. FT COILED 341 361 {ECO:0000256|SAM:Coils}. FT COILED 363 390 {ECO:0000256|SAM:Coils}. FT COILED 393 420 {ECO:0000256|SAM:Coils}. FT COILED 467 487 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 706 AA; 79084 MW; DF8C8081F851C0B3 CRC64; MVSPPSAGQR LRGVPVWAAG AFRFSSGEES TSHLIMSRRS QRLTRYSQGD DDGSSSSGGS SVAGSQSTLF KDSPLRTLKR KSSNMKRLSP APQLGPSSDA HTSYYSESLV HESWFPPRSS LEELHGDANW GYSDVDQQSS SSRLRSAVSR AGSLLWMVAT SPGRLFRLLY WWAGTTWYRL TTAASLLDVF VLTRRFSSLK TFLWFLLPLL LLTCLTYGAW YFYPYGLQTF HPALVSWWAA KDSRRPDEGW EARDSSPHFQ AEQRVMSRVH SLERRLEALA AEFSSNWQKE AMRLERLELR QGAPGQGGGG GLSHEDTLAL LEGLVSRREA ALKEDFRRET AARIQEELSA LRAEHQQDSE DLFKKIVRAS QESEARIQQL KSEWQSMTQE SFQESSVKEL RRLEDQLAGL QQELAALALK QSSVAEEVGL LPQQIQAVRD DVESQFPAWI SQFLARGGGG RVGLLQREEM QAQLRELESK ILTHVAEMQG KSAREAAASL SLTLQKEGVI GVTEEQVHHI VKQALQRYSE DRIGLADYAL ESGGASVIST RCSETYETKT ALLSLFGIPL WYHSQSPRVI LQPDVHPGNC WAFQGPQGFA VVRLSARIRP TAVTLEHVPK ALSPNSTISS APKDFAIFGF DEDLRQEGAL LGKFTYDQDG EPIQTFHFQA PTMATYQVVE LRILTNWGHP EYTCIYRFRV HGEPAH // ID B4DYM4_HUMAN Unreviewed; 883 AA. AC B4DYM4; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 44. DE SubName: Full=SUN domain-containing ossification factor {ECO:0000313|Ensembl:ENSP00000476704}; DE SubName: Full=cDNA FLJ60972 {ECO:0000313|EMBL:BAG63786.1}; GN Name=SUCO {ECO:0000313|Ensembl:ENSP00000476704}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|EMBL:BAG63786.1}; RN [1] {ECO:0000313|Ensembl:ENSP00000476704, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16710414; DOI=10.1038/nature04727; RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., RA Dunham A., Scott C.E., Howe K.L., Woodfine K., Spencer C.C., RA Jones M.C., Gillson C., Searle S., Zhou Y., Kokocinski F., RA McDonald L., Evans R., Phillips K., Atkinson A., Cooper R., Jones C., RA Hall R.E., Andrews T.D., Lloyd C., Ainscough R., Almeida J.P., RA Ambrose K.D., Anderson F., Andrew R.W., Ashwell R.I., Aubin K., RA Babbage A.K., Bagguley C.L., Bailey J., Beasley H., Bethel G., RA Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J., Buckley D., RA Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y., Clarke G., RA Clee C., Cobley V., Collier R.E., Corby N., Coville G.J., Davies J., RA Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H., RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L., RA Ghori M.R., Gibson R., Gilby L.M., Gillett W., Glithero R.J., RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., RA Hammond S., Harrison E.S., Hart E., Haugen E., Heath P.D., Holmes S., RA Holt K., Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., RA James R., Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., RA Kibukawa M., Kimberley A.M., King A., Knights A.J., Lad H., Laird G., RA Lawlor S., Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., RA Lush M.J., Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., RA Matthews N.S., McLaren S., Milne S., Mistry S., Moore M.J., RA Nickerson T., O'Dell C.N., Oliver K., Palmeiri A., Palmer S.A., RA Parker A., Patel D., Pearce A.V., Peck A.I., Pelan S., Phelps K., RA Phillimore B.J., Plumb R., Rajan J., Raymond C., Rouse G., RA Saenphimmachak C., Sehra H.K., Sheridan E., Shownkeen R., Sims S., RA Skuce C.D., Smith M., Steward C., Subramanian S., Sycamore N., RA Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M., White S., RA Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H., Wilming L., RA Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E., Durbin R., RA Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G., Ross M.T., RA Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R., Banerjee R., RA Bryant S.P., Burford D.C., Burrill W.D., Clegg S.M., Dhami P., RA Dovey O., Faulkner L.M., Gribble S.M., Langford C.F., Pandian R.D., RA Porter K.M., Prigmore E.; RT "The DNA sequence and biological annotation of human chromosome 1."; RL Nature 441:315-321(2006). RN [2] {ECO:0000313|EMBL:BAG63786.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Testis {ECO:0000313|EMBL:BAG63786.1}; RA Wakamatsu A., Yamamoto J., Kimura K., Ishii S., Watanabe K., RA Sugiyama A., Murakawa K., Kaida T., Tsuchiya K., Fukuzumi Y., RA Kumagai A., Oishi Y., Yamamoto S., Ono Y., Komori Y., Yamazaki M., RA Kisu Y., Nishikawa T., Sugano S., Nomura N., Isogai T.; RT "NEDO human cDNA sequencing project focused on splicing variants."; RL Submitted (OCT-2007) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000213|PubMed:18669648} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=18669648; DOI=10.1073/pnas.0805139105; RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., RA Elledge S.J., Gygi S.P.; RT "A quantitative atlas of mitotic phosphorylation."; RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008). RN [4] {ECO:0000313|Ensembl:ENSP00000476704} RP IDENTIFICATION. RG Ensembl; RL Submitted (DEC-2013) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; KF455074; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; Z94054; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; Z96050; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AK302509; BAG63786.1; -; mRNA. DR RefSeq; NP_001269679.1; NM_001282750.1. DR UniGene; Hs.204559; -. DR ProteinModelPortal; B4DYM4; -. DR STRING; 9606.ENSP00000263688; -. DR PaxDb; B4DYM4; -. DR Ensembl; ENST00000610051; ENSP00000476704; ENSG00000094975. DR GeneID; 51430; -. DR KEGG; hsa:51430; -. DR UCSC; uc010pmn.2; human. DR CTD; 51430; -. DR HGNC; HGNC:1240; SUCO. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR HOGENOM; HOG000070169; -. DR HOVERGEN; HBG107549; -. DR GenomeRNAi; 51430; -. DR NextBio; 35476256; -. DR Proteomes; UP000005640; Chromosome 1. DR ExpressionAtlas; B4DYM4; baseline and differential. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Proteomics identification {ECO:0000213|MaxQB:B4DYM4, KW ECO:0000213|PeptideAtlas:B4DYM4}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 FT CHAIN 30 883 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002801100. FT COILED 615 635 {ECO:0000256|SAM:Coils}. FT COILED 821 841 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 883 AA; 98284 MW; 0059225E2BE16962 CRC64; MKKHRRALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFQKKN TESKKLSPPV VETLPTVDLH EESSNAVVDS ETVENISSSS TSEITPISKL DEIEKSGTIP IAKPSETEQS ETDCDVGEAL DASAPIEQPS FVSPPDSLVG QHIENVSSSH GKGKITKSEF ESKVSASEQG GGDPKSALNA SDNLKNESSD YTKPGDIDPT SVASPKDPED IPTFDEWKKK VMEVEKEKSQ SMHASSNGGS HATKKVQKNR NNYASVECGA KILAANPEAK STSAILIENM DLYMLNPCST KIWFVIELCE PIQVKQLDIA NYELFSSTPK DFLVSISDRY PTNKWIKLGT FHGRDERNVQ SFPLDEQMYA KYVKMFIKYI KVELLSHFGS EHFCPLSLIR VFGTSMVEEY EEIADSQYHS ERQELFDEDY DYPLDYNTGE DKSSKNLLGS ATNAILNMVN IAANILGAKT EDLTEGNKSI SENATATAAP KMPESTPVST PVPSPEYVTT EVHTHDMEPS TPDTPKESPI VQLVQEEEEE ASPSTVTLLG SGEQEDESSP WYRKQMEEMQ KAFNKTIVKL QNTSRIAEEQ DQRQTEAIQL LQAQLTNMTQ LVSNLSATVA ELKREVSDRQ SYLVISLVLC VVLGLMLCMQ RCRNTSQFDG DYISKLPKSN QYPSPKRCFS SYDDMNLKRR TSFPLMRSKS LQLTGKEVDP NDLYIVEPLK FSPEKKKKRC KYKIEKIETI KPEEPLHPIA NGDIKGRKPF TNQRDFSNMG EVYHSSYKGP PSEGSSETSS QSEESYFCGI SACTSLCNGQ SQKTKTEKRA LKRRRSKVQD QGKLIKTLIQ TKSGSLPSLH DIIKGNKEIT VGTFGVTAVS GHI // ID B4DZJ3_HUMAN Unreviewed; 1099 AA. AC B4DZJ3; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=cDNA FLJ60200 {ECO:0000313|EMBL:BAG64105.1}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|EMBL:BAG64105.1}; RN [1] {ECO:0000313|EMBL:BAG64105.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Testis {ECO:0000313|EMBL:BAG64105.1}; RA Wakamatsu A., Yamamoto J., Kimura K., Ishii S., Watanabe K., RA Sugiyama A., Murakawa K., Kaida T., Tsuchiya K., Fukuzumi Y., RA Kumagai A., Oishi Y., Yamamoto S., Ono Y., Komori Y., Yamazaki M., RA Kisu Y., Nishikawa T., Sugano S., Nomura N., Isogai T.; RT "NEDO human cDNA sequencing project focused on splicing variants."; RL Submitted (OCT-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK302949; BAG64105.1; -; mRNA. DR UniGene; Hs.204559; -. DR STRING; 9606.ENSP00000263688; -. DR PaxDb; B4DZJ3; -. DR PRIDE; B4DZJ3; -. DR UCSC; uc010pmm.1; human. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOVERGEN; HBG107549; -. DR NextBio; 35476495; -. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1099 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002803688. FT TRANSMEM 1012 1030 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 936 956 {ECO:0000256|SAM:Coils}. FT COILED 986 1006 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1099 AA; 122433 MW; B94E8704CF57E1EF CRC64; MKKHRRALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFQKKD EREGPINAES LGKSGSNLPI SPKEHKLKDD SIVDVQNTES KKLSPPVVET LPTVDLHEES SNAVVDSETV ENISSSSTSE ITPISKLDEI EKSGTIPIAK PSETEQSETD CDVGEALDAS APIEQPSFVS PPDSLVGQHI ENVSSSHGKG KITKSEFESK VSASEQGGGD PKSALNASDN LKNESSDYTK PGDIDPTSVA SPKDPEDIPT FDEWKKKVME VEKEKSQSMH ASSNGGSHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LLSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYHSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKSISEN ATATAAPKMP ESTPVSTPVP SPEYVTTEVH THDMEPSTPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWFE SETQIFCSEL TTICCISSFS EYIYKWCSVR VALYRQRSRT ALSKGKDYLV LAQPPLLLPA ESVDVSVLQP LSGELENTNI EREAETVVLG DLSSSMHQDD LVNHTVDAVE LEPSHSQTLS QSLLLDITPE INPLPKIEVS ESVEYEAGHI PSPVIPQESS VEIDNETEQK SESFSSIEKP SITYETNKVN ELMDNIIKED VNSMQIFTKL SETIVPPINT ATVPDNEDGE AKMNIADTAK QTLISVVDSS SLPEVKEEEQ SPEDALLRGL QRTATDFYAE LQNSTDLGYA NGNLVHGSNQ KESVFMRLNN RIKALEVNMS LSGRYLEELS QRYRKQMEEM QKAFNKTIVK LQNTSRIAEE QDQRQTEAIQ LLQAQLTNMT QLVSNLSATV AELKREVSDR QSYLVISLVL CVVLGLMLCM QRCRNTSQFD GDYISKLPKS NQYPSPKRCF SSYDDMNLKR RTSFPLMRSK SLQLTGKEGR FLWIYNMYI // ID B4E278_HUMAN Unreviewed; 583 AA. AC B4E278; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 32. DE SubName: Full=cDNA FLJ55571, highly similar to Sad1/unc-84 protein-like 1 {ECO:0000313|EMBL:BAG65040.1}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|EMBL:BAG65040.1}; RN [1] {ECO:0000313|EMBL:BAG65040.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Trachea {ECO:0000313|EMBL:BAG65040.1}; RA Wakamatsu A., Yamamoto J., Kimura K., Ishii S., Watanabe K., RA Sugiyama A., Murakawa K., Kaida T., Tsuchiya K., Fukuzumi Y., RA Kumagai A., Oishi Y., Yamamoto S., Ono Y., Komori Y., Yamazaki M., RA Kisu Y., Nishikawa T., Sugano S., Nomura N., Isogai T.; RT "NEDO human cDNA sequencing project focused on splicing variants."; RL Submitted (OCT-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK304150; BAG65040.1; -; mRNA. DR UniGene; Hs.438072; -. DR STRING; 9606.ENSP00000384015; -. DR PaxDb; B4E278; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG104132; -. DR NextBio; 35477220; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 68 91 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 98 117 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 192 212 {ECO:0000256|SAM:Coils}. FT COILED 226 260 {ECO:0000256|SAM:Coils}. FT COILED 273 293 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 583 AA; 66177 MW; 9BDDAA81893C2BC2 CRC64; MKLSYESENY KLKTHESKDC ESESYKSKSH ESKAHASYYG RMNVREVLRE DGHLSVNGEA LWKAASGVFW WLGIGWYQFV TLISWLNVFL LTRCLRNICK FLVLLIPLFL LLAGLSLRGQ GNFFSFLPVL NWASMHRTQR VDDPQDVFKP TTSRLKQPLQ GDSEAFPWHW MSGVEQQVAS LSGQCHHHGE NLRELTTLLQ KLQARVDQME GGAAGPSAST DFMAFHQEHE VRMSHLEDIL GKLREKSEAI QKELEQTKQK TISAVGEQLL PTVEHLQLEL DQLKSELSSW RHVKTGCETV DAVQERVDVQ VREMVKLLFS EDQQGGSLEQ LLQRFSSQFV SKGDLQTMLR DLQLQILRNV THHVSVTKQL PTSEAVVSAV SEAGASRITE AQARAIVNSA LKLYSQDKTW MVDFALESGG GSILSTRCSE TYETKTALMS LFGIPLWYFS QSPRVVIQPD IYPGNCWAFK GSQGYLVVRL SMMIHPAAFT LEHIPKTLSP TGNISSAPKD FAVYGLENEY QEEGQLLGQF TYDQDGESLQ MFQALKRPDD TAFQIVELRI FSNWGHPEYT CLYRFRVHGE PVK // ID B4E2A6_HUMAN Unreviewed; 752 AA. AC B4E2A6; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 37. DE SubName: Full=cDNA FLJ55508, highly similar to Sad1/unc-84-like protein 2 {ECO:0000313|EMBL:BAG65068.1}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|EMBL:BAG65068.1}; RN [1] {ECO:0000313|EMBL:BAG65068.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Trachea {ECO:0000313|EMBL:BAG65068.1}; RA Wakamatsu A., Yamamoto J., Kimura K., Ishii S., Watanabe K., RA Sugiyama A., Murakawa K., Kaida T., Tsuchiya K., Fukuzumi Y., RA Kumagai A., Oishi Y., Yamamoto S., Ono Y., Komori Y., Yamazaki M., RA Kisu Y., Nishikawa T., Sugano S., Nomura N., Isogai T.; RT "NEDO human cDNA sequencing project focused on splicing variants."; RL Submitted (OCT-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK304188; BAG65068.1; -; mRNA. DR RefSeq; NP_001186508.1; NM_001199579.1. DR RefSeq; NP_001186509.1; NM_001199580.1. DR RefSeq; NP_056189.1; NM_015374.2. DR UniGene; Hs.517622; -. DR UniGene; Hs.744734; -. DR STRING; 9606.ENSP00000385616; -. DR PaxDb; B4E2A6; -. DR PRIDE; B4E2A6; -. DR DNASU; 25777; -. DR GeneID; 25777; -. DR KEGG; hsa:25777; -. DR UCSC; uc011anz.2; human. DR CTD; 25777; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG056957; -. DR KO; K19347; -. DR GenomeRNAi; 25777; -. DR NextBio; 46920; -. DR Genevisible; B4E2A6; HS. DR GO; GO:0005635; C:nuclear envelope; IDA:LIFEdb. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 215 236 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 248 269 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 308 328 {ECO:0000256|SAM:Coils}. FT COILED 387 407 {ECO:0000256|SAM:Coils}. FT COILED 409 436 {ECO:0000256|SAM:Coils}. FT COILED 439 466 {ECO:0000256|SAM:Coils}. FT COILED 513 533 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 752 AA; 83965 MW; 10EEE401D14DA067 CRC64; MVSPPSAGQR LRGVPVWAAG AFRFSSGEES TSHLIMSRRS QRLTRYSQGD DDGSSSSGGS SVAGSQSTLF KDSPLRTLKR KSSNMKRLSP APQLGPSSDA HTSYYSESLV HESWFPPRSS LEELHGDANW GEDLRVRRRR GTGGSESSRA SGLVGRKATE DFLGSSSGYS SEDDYVGYSD VDQQSSSSRL RSAVSRAGSL LWMVATSPGR LFRLLYWWAG STWYRLTTAA SLLDVFVLTR RFSSLKTFLW FLLPLLLLTC LTYGAWYFYP YGLQTFHPAL VSWWAAKDSR RPDEGWEARD SSPHFQAEQR VMSRVHSLER RLEALAAEFS SNWQKEAMRL ERLELRQGAP GQGGGGGLSH EDTLALLEGL VSRREAALKE DFRRETAARI QEELSALRAE HQQDSEDLFK KIVRASQESE ARIQQLKSEW QSMTQESFQE SSVKELRRLE DQLAGLQQEL AALALKQSSV AEEVGLLPQQ IQAVRDDVES QFPAWISQFL ARGGGGRVGL LQREEMQAQL RELESKILTH VAEMQGKSAR EAAASLSLTL QKEGVIGVTE EQVHHIVKQA LQRYSEDRIG LADYALESGG ASVISTRCSE TYETKTALLS LFGIPLWYHS QSPRVILQPD VHPGNCWAFQ GPQGFAVVRL SARIRPTAVT LEHVPKALSP NSTISSAPKD FAIFGFDEDL QQEGTLLGKF TYDQDGEPIQ TFHFQAPTMA TYQVVELRIL TNWGHPEYTC IYRFRVHGEP AH // ID B4G749_DROPE Unreviewed; 1391 AA. AC B4G749; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=GL19600 {ECO:0000313|EMBL:EDW29248.1}; GN Name=Dper\GL19600 {ECO:0000313|EMBL:EDW29248.1}; GN ORFNames=Dper_GL19600 {ECO:0000313|EMBL:EDW29248.1}, GN GL19600 {ECO:0000313|FlyBase:FBgn0157198}; OS Drosophila persimilis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7234 {ECO:0000313|Proteomes:UP000008744}; RN [1] {ECO:0000313|EMBL:EDW29248.1, ECO:0000313|Proteomes:UP000008744} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MSH-3 / Tucson 14011-0111.49 RC {ECO:0000313|Proteomes:UP000008744}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH479180; EDW29248.1; -; Genomic_DNA. DR RefSeq; XP_002015252.1; XM_002015216.1. DR EnsemblMetazoa; FBtr0185215; FBpp0183707; FBgn0157198. DR GeneID; 6589368; -. DR KEGG; dpe:Dper_GL19600; -. DR FlyBase; FBgn0157198; Dper\GL19600. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B4G749; -. DR Proteomes; UP000008744; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008744}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008744}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1391 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002803074. FT TRANSMEM 969 990 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 175 195 {ECO:0000256|SAM:Coils}. FT COILED 295 322 {ECO:0000256|SAM:Coils}. FT COILED 905 953 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1391 AA; 149873 MW; 1AA961AC643893C1 CRC64; MHIRLHLVRF MYINLILSCC FWLYDNVAAA DADTKSGGGG GGVADADAAT AAAAAAEQAS AQHRHQPEAP PKPATVTPEP PPDRTLKKVA ATTADTHPAA ADREPLVESD YGNRPRTSEN PKKAKGLAAA TTTSGRDTFP EIITELPTVT VTELPLDRIK NRLESVILEE IQPPTSNKSE EKQQQQQQHQ QEQEEQPLLP LFEAVKEVPQ KDEQMPKIND PGGGVDLDGL LASAEAVGGP GDDGTGNVTS DEQQTGGAAA GAGAGEGEGT ANDTEAKANL TKANEEVPMP VFSEWAQKQM EAEASREQAM ELEQQVANNS AQRRNNTGSA SGKPSTLKLR SKNYASPDCG AKIIASNGDA TNTGAVLTHS SDEYMLSTCG SRIWFVVELC EAIQAQKVEL ANFELFSSSP KNFTVAVSKR FPTRDWSNVG RFAAEDKRTV QTFELHPHLF GKFVRVDIHS HYSKEHFCPV SLFRVFGTSE FEAFETEIRP SDELDDFDDD FGGGGQEQGS SHKATAGGGG GGIFQSASDA VIQMVKKAGE VLLKPTKALK WSPESLLCRT PALGAFSCSS CNSTLVERIN SLLSCQFQQL QGLLNHSQLR SDLLQSRVCL EEYGISLRGN PSASGLAKRQ SYFLSMLPAE HVGAMCKLLQ AEQNITVEQQ QMEAPQLKPP AEQEQENATA AGEASSQQEV IREELQEYPP SGEIVTPEAV ASQEMPSIRE KPDPSTTTTA STTNSTPADV NIFNVSDELE DLEVPVAAPQ PTVAAPVVPT LVESPSDWET STLAPSSSEM PLANAELAIE DGSPASWESL DNLLTTTVAS ITAGGSAAVA TAAAIAGNAN GNHLGGGGAG AGAGVGAGAG AGGIGSNVNL QQKLTNGAQS ESVFIRLSNR IKALERNMSL SGQYLEELSR RYKKQVEELQ QTLTQQTLTV RSLEDQSRRY IEQEQLYQQQ SAELAGEVRA LSYQVQACIL VIIIVGTCIF LMLVLGTVYY RKLRRQTQQL KEEQPSSHAK VPKPKLDRRK SYEQMLNQST PKQRRPSEEA MLILKDCGDS QLVGGCQDLS NRQRKISVCY GSNNNIAANM MTGNPNLRAS LHRRKGAKHS WHNSLDTAAT TLCAAEQLDT FFDADTLKSQ KQQPSSSAGS KAGRKKSLQQ QLALKRQESA PASMMQNELA EEEPASQSDF DESLMLDDDD LANFIPTSDL AYNEFMPEGP SGYLVLDAVD GAQKPPEQPQ KQATKKSSRR LSSPGFFKSP FSKSKNKGGN YNGFGGIKNS HSVHESTSWE WYRLKRNEKQ QQQPNHHSKQ TKSLPNSSLD SSSLSEVNFS LNSNSNSTQN SFRILGEAIR SSGESSITPN GNGNGSSCSA SGSGSNSGGS TTSSTAKKKQ RAFNNIFRKV F // ID B4G7I2_DROPE Unreviewed; 2719 AA. AC B4G7I2; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 50. DE SubName: Full=GL19165 {ECO:0000313|EMBL:EDW28382.1}; GN Name=Dper\GL19165 {ECO:0000313|EMBL:EDW28382.1}; GN ORFNames=Dper_GL19165 {ECO:0000313|EMBL:EDW28382.1}, GN GL19165 {ECO:0000313|FlyBase:FBgn0156764}; OS Drosophila persimilis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7234 {ECO:0000313|Proteomes:UP000008744}; RN [1] {ECO:0000313|EMBL:EDW28382.1, ECO:0000313|Proteomes:UP000008744} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MSH-3 / Tucson 14011-0111.49 RC {ECO:0000313|Proteomes:UP000008744}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH479180; EDW28382.1; -; Genomic_DNA. DR RefSeq; XP_002014386.1; XM_002014350.1. DR ProteinModelPortal; B4G7I2; -. DR EnsemblMetazoa; FBtr0184780; FBpp0183272; FBgn0156764. DR GeneID; 6589249; -. DR KEGG; dpe:Dper_GL19165; -. DR FlyBase; FBgn0156764; Dper\GL19165. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B4G7I2; -. DR Proteomes; UP000008744; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 7. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008744}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008744}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1307 1327 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2719 AA; 303193 MW; 0DD242F779B0D808 CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDEHAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICSHLVIAD ISSRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLSF IRDCGSLVHK DTLHSAMSVV SRLCTKVEPN SPCIQNCVQS LSTLLQHEDP MVSDGALKCF ASVADRFTRK WIDPAPLAEY GLVSELLKRL NSVGGNVNTH SSLSTGPQPL SITSITPNQD SMDDGNGSEE ATTISSSKVK ASDSGISPQS ISTTISLLST LCRGSPSITY DLLRSQLSDS IERALQGDER CVLDCMRFAD LLLLLLFEGR QALNRGSNHG NQGQLAPRPR RNNSNTDRTH RQLIDCIRSK DSEALREAIE TGGIDVNCMD DVGQTLLNWA SAFGTLEMVE FLCEKGADVN KGQRSSSLHY AACFGRPAIA KILLKYGAYP DLRDEDGKTP LDKARERSDD GHREVSAILQ SPGEWMSTDH SLLSKDGKKY TFLEPRGDPE MAPIYLKLLL PIFCRTFLGS MLGSVRRASL ALIKKIVQYA YPSVLQSLSE TINTEDTTST IAQNGGNLLI EVVASVLDNE DDDDGHLIVL NVIEEIMCKT NDEFLNQFAR LGVFAKVQTL MEHDVEDTSP HFSGNSDDLT LLRRSSINAE SQKSTSEDSM EDAKEILQGK PYHWRDWSLC RGRDCLYVWS DSIALELSNG SNGWFRFIID GKLATMYSSG SPENGNDSSE NRGEFLEKLL RARSCVVPGT VSQPILPTTS SLRLVVGNWV LQSNKTNQLQ IHNTEGQQVT ILQDDLLGFI FESNRGTKHT FTAETVLGPD FASGWSTAKK KRIKSKTDGQ RFQIRNLSRD IYNKYFKSAQ TIPRGAVGKL TDIVNKIELS FEEQNISPDG NWETILSTAL MELSQLIHED GVVSAYEMHS SGLVQALVAV LSVNHWESHS ARCKRSKMQK QRVSVFKKCI LEDNIESVPS KSRAKSTASI LIQKLVSVLE STEKLPVYLY DAPCTGYNLQ ILQKRLRFRL ERAQCENTLF DRTGRTLKME PLATIGQLSK YLLKMVAKQW YDLDRSTYFY LKRLRENQHG VLFTHSFDFD EEGLLYYIGS NAKTCDWVNP ALYGLVQVTS SEGKTLPYGK VEDILSRDSI SLNCHTKDNK KAYFAIDLGV FIVPTAYTLR HARGYGRSAL RNWLLQASKD GVCWTTLSTH IDDKSLLEPG STATWSINCA SDSVGYRHIR IQQNGRNASG QTHYLSLSGF EIYGRVIGVS EDIGKSVKEA EAKIRRERRQ VRAQLKHMTT GARVVRGIDW RWDDQDGCSE GTITGEIHNG WIDVKWDHGV RNSYRMGSEG KYDLKLADCE NYSMFEGIQS IKTINTETKV KDKTSTLSSR KSSSTPSLAE ATEKNQNAEG ASNQTVSADN LTWKQAVETI AENVFSSAKT QIISNQLSIN TSSRETRTNN KEPSSNKMPK DSINGGTPLS RELEHISDLS AINNSIPAIN SAILSDLATI SENLGEISKE NICSVIAPIN TQAETTSASP SSFLPSTLNS VPQGADIQNL SNIEENNKMN ANYSVNKMPK ELPVNLRATN ITGVVQYVTQ YSNDAVEIID KIDDGLDMMR NNSNNILSTE TCSCTNPGLD IKEHMITKVL IAENNQQRQI QFVTDKFSKK NFNNCNVTLK PSSAEPNTVM SIDDVKNSQI SPEIVSVASQ MSISVPNLTT TTVSEVPTIS EGDTHTGLLE TFAAIARRRT SQGTHMQDNQ LMRADVNINQ HTDQSVATSS FLGNSVTSLV KLALSSNFHS GLLSTAQSYP SLSSNNSERV PPANPSNCST GQQPSSSINH TLTMSLTSTS SDSEQVSLED FLESCRAPAL LGDVDDEEDM DEDNDEEENE DEYEEVGNTL LQVMVSRNLL TFMDDEALEN RLAGVSKRKS WDDEFVLKRQ FSALIPAFDP RPGRTNVNQT SDLEIPPLGL DLPSPQQSGY DSIEQPILGL KLRGPGIGSI PDVEIELNNS NWTIFRAVQE LLQHSQLNKV DKFRKVWEPT YTIIYREILP EGQENTYLES DEILQTPDMS SKSGASTLSP NSPMHIGFNK AENNLCSVDD VLELLIRINS LNQSELDFES KERSVPLLPD ELFVSKKITN KLQQQIQDPL VLSCNALPNW CENLNQSCPF LFPFETRQLY LNCTSFGASR SIVCLQSQRD LTAERQRIPI MSPRRDDHEF RIGRLKHERV KVPRNENLLK WAMQVMKTHC NRKSVLEVEF LDEEGTGLGP TLEFYALVAA EIQRSDLCMW LCDDELGEEN TQDISEILEG NSKPIGYYVN RREHGLFPAP LPQNSEKCES VLKYFWFFGV FIAKVLQDMR LVDIPLSTSF LQLLCHKKMS LRSLQKIISE RRPGEISLIS GDLLETCTDL LRTDYNKSNL FGGILHLENL KEVDPTRYQF LLELQDLLLR KQSIDFDCSI SLERKQELIN DLKLCTKNGL EVSLEDLALT FTYLPSSSIY GYTQAELLPN GSSINVTINN LEAYCELLLN FILQDGIAKQ MKAFSDGFNE VFPLKKLTAF SAAEARLMIC GEQYPHWNRE DIISYTEPKL GYSKDSPGFL RFVNVLLGMS GNERKAFLQF TTGCSSLPPG GLSNLHPRLT VVRKVDAGVG SYPSVNTCVH YLKLPDYPTE KIMKERLLTA TKEKGFHLN // ID B4G8L6_DROPE Unreviewed; 529 AA. AC B4G8L6; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=GL19315 {ECO:0000313|EMBL:EDW28696.1}; GN Name=Dper\GL19315 {ECO:0000313|EMBL:EDW28696.1}; GN ORFNames=Dper_GL19315 {ECO:0000313|EMBL:EDW28696.1}, GN GL19315 {ECO:0000313|FlyBase:FBgn0156914}; OS Drosophila persimilis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7234 {ECO:0000313|Proteomes:UP000008744}; RN [1] {ECO:0000313|EMBL:EDW28696.1, ECO:0000313|Proteomes:UP000008744} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MSH-3 / Tucson 14011-0111.49 RC {ECO:0000313|Proteomes:UP000008744}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH479180; EDW28696.1; -; Genomic_DNA. DR RefSeq; XP_002014700.1; XM_002014664.1. DR EnsemblMetazoa; FBtr0184930; FBpp0183422; FBgn0156914. DR GeneID; 6589131; -. DR KEGG; dpe:Dper_GL19315; -. DR FlyBase; FBgn0156914; Dper\GL19315. DR OMA; WTSELAR; -. DR OrthoDB; EOG7VQJCX; -. DR PhylomeDB; B4G8L6; -. DR Proteomes; UP000008744; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008744}; KW Reference proteome {ECO:0000313|Proteomes:UP000008744}. SQ SEQUENCE 529 AA; 55721 MW; D06A1017E253A0E5 CRC64; MSGRTSNVSD ISVLRDEVNQ LSKLVRGSPT AVGAFESSKD APGPPGSSHF VPKEPGAEVA KWTSELARKV DVLMDDSLPV GYNRLNFASD ELGASIVSVE ASPIGHSGIF KRLLGLEFSS NPPVNMLRPS LSPGACFGYR GVRAIAIIHL AKEIIVDTIT LSHPPKDMMP NLCENAPKDF KVIGIKPNYN EKEPLGQFTY HNHANRRTEI YRIDNKSTFR RLVLEFYSNH GGQFTCIYRV EVYGSLPAPD PQGNERGRGK DHGKGDLHAE GDDNGQGGDV SGQSDLSVPE AVRGPVDSRG TEDSTGPLYT STPRDSSRPE DVSGGDLRRG DDVRGQKEVC GRDGEICDKP GCKRCAPRDS SGPVDSSGPE TVRGPVDSSG PVGSSGPVDS SGPRYSSRLG DSSAPDESTG PLYTSTPRDS SRPGDSSGPE SVRGPIDSSG TVDSSGPVDS SGPVDSSGPR DSSRLGDSSA PDESTGPLYT STPRDSSRPG DSSGPEDVSG GDVRGQNEGC GRDGGSCKNP GCKNCRREL // ID B4GCA2_DROPE Unreviewed; 623 AA. AC B4GCA2; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GL11123 {ECO:0000313|EMBL:EDW31420.1}; GN Name=Dper\GL11123 {ECO:0000313|EMBL:EDW31420.1}; GN ORFNames=Dper_GL11123 {ECO:0000313|EMBL:EDW31420.1}, GN GL11123 {ECO:0000313|FlyBase:FBgn0148733}; OS Drosophila persimilis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7234 {ECO:0000313|Proteomes:UP000008744}; RN [1] {ECO:0000313|EMBL:EDW31420.1, ECO:0000313|Proteomes:UP000008744} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MSH-3 / Tucson 14011-0111.49 RC {ECO:0000313|Proteomes:UP000008744}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH479181; EDW31420.1; -; Genomic_DNA. DR RefSeq; XP_002015530.1; XM_002015494.1. DR EnsemblMetazoa; FBtr0176738; FBpp0175230; FBgn0148733. DR GeneID; 6590786; -. DR KEGG; dpe:Dper_GL11123; -. DR FlyBase; FBgn0148733; Dper\GL11123. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B4GCA2; -. DR Proteomes; UP000008744; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008744}; KW Reference proteome {ECO:0000313|Proteomes:UP000008744}. FT COILED 91 118 {ECO:0000256|SAM:Coils}. FT COILED 128 155 {ECO:0000256|SAM:Coils}. FT COILED 241 268 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 623 AA; 70693 MW; A23440A0CA79D271 CRC64; MRNAWSLLQE DQKLSYVQHV QALLPLPLTL LATWRGHLSS ATASLKSLLE LPLVPRAPEE AETIKSNMAG IEQSIRKALT AEEYENILNH VNSYVQQLVE LKLQQQQQQQ QYTQREQQLS PQQIHIIVQL MKQNLQDFTA KVELSEQDLN DLAAKLKLEL QRSGDWQPEA RLSTANLEEI NRLIKAEVNL QESHYTLLLE KIDWGALLER ILGSPKLADF VDGRINLALQ EEKVLKDGSG SHATEQEIDR LKKEIAFIKL ALSDNQAENT NLQQSISRLK IGQEDLLERM QQHELASDQR FSLLLAEIET KLAALNDSQF FLLNKQVKLS LVEILGFKQS TMGSGKDGAK LDDIDLQNWV RSVFVAKDYL EQQLLELNKR TNNNIRDEIE RSSIVLMSEI SERLKREILL AVEAKHNESN SSIEGEIGEE AVRQIVKAVL ATYDADKTGL VDFALESAGG QILSTRCTES YQTKTAQISV FGIPLWYPTN TPRVAISPNV QPGECWAFQG FPGFLVLKLN SLVYVTGFTL EHIPKSLSPT GRIDSAPRNF TVWGLEHEKD QDPVLFGEYE YQDNGASLQY FTLQNLEIQR PYEIVELRIE TNHGQPTYTC LYRFRVHGKP PAS // ID B4HQC8_DROSE Unreviewed; 569 AA. AC B4HQC8; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GM20901 {ECO:0000313|EMBL:EDW46667.1}; GN Name=Dsec\GM20901 {ECO:0000313|EMBL:EDW46667.1}; GN ORFNames=Dsec_GM20901 {ECO:0000313|EMBL:EDW46667.1}, GN GM20901 {ECO:0000313|FlyBase:FBgn0175782}; OS Drosophila sechellia (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7238 {ECO:0000313|Proteomes:UP000001292}; RN [1] {ECO:0000313|EMBL:EDW46667.1, ECO:0000313|Proteomes:UP000001292} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Rob3c / Tucson 14021-0248.25 RC {ECO:0000313|Proteomes:UP000001292}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH480816; EDW46667.1; -; Genomic_DNA. DR RefSeq; XP_002032654.1; XM_002032618.1. DR EnsemblMetazoa; FBtr0203886; FBpp0202378; FBgn0175782. DR GeneID; 6607900; -. DR KEGG; dse:Dsec_GM20901; -. DR FlyBase; FBgn0175782; Dsec\GM20901. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B4HQC8; -. DR Proteomes; UP000001292; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001292}; KW Reference proteome {ECO:0000313|Proteomes:UP000001292}. FT COILED 183 210 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 569 AA; 64615 MW; F69965D8BF7805BE CRC64; MEMATVRSSQ REDEAIKVNM ASIEQNIQKA LTAEEYENIL NHVNSYVQQL VELKMQQHSK ELAPQQVQLI VQLMKENLHQ IVHKTELSEK DLSDLAIKLK MELQSSGGWQ DGAKLSQANL EEITRLIKSE VHLHESHYTI QLDRIDFPSL LERILAARAL ADFVDARISL RVGELDPKES SGSSDAEIQI ERLNREIAFI KLALSDKQAE NADLHQSISN LKLGQEDLLE RIQQHELAQD RRFHGLLAEI ENKLSALNDS QFALLNKQIK LSLVEILGFK QSTAGGAAGQ LDDFDLQTWV RSVFVAKDYL EQQLLELNKR TNNNIRDEIE RSSILLMSDI SQRLKREILL VVEAKHNEST KALKGHIREE EVRQIVKTVL AIYDADKTGL VDFALESAGG QILSTRCTES YQTKSAQISV FGIPLWYPTN TPRVAISPNV QPGECWAFQG FPGFLGKSRV KMLKLNSLVY VTGFTLEHIP KSLSPTGRIE SAPRNFTIWG LEQEKDQEPV LFGEYQFEDN GASLQYFAVQ NLDIKRPYEI VELRIETNHG HPTYTCLYRF RVHGKPPAT // ID B4HWJ6_DROSE Unreviewed; 2725 AA. AC B4HWJ6; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 48. DE SubName: Full=GM11846 {ECO:0000313|EMBL:EDW52391.1}; GN Name=Dsec\GM11846 {ECO:0000313|EMBL:EDW52391.1}; GN ORFNames=Dsec_GM11846 {ECO:0000313|EMBL:EDW52391.1}, GN GM11846 {ECO:0000313|FlyBase:FBgn0166787}; OS Drosophila sechellia (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7238 {ECO:0000313|Proteomes:UP000001292}; RN [1] {ECO:0000313|EMBL:EDW52391.1, ECO:0000313|Proteomes:UP000001292} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Rob3c / Tucson 14021-0248.25 RC {ECO:0000313|Proteomes:UP000001292}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH480818; EDW52391.1; -; Genomic_DNA. DR RefSeq; XP_002036468.1; XM_002036432.1. DR ProteinModelPortal; B4HWJ6; -. DR EnsemblMetazoa; FBtr0194831; FBpp0193323; FBgn0166787. DR GeneID; 6611951; -. DR KEGG; dse:Dsec_GM11846; -. DR FlyBase; FBgn0166787; Dsec\GM11846. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B4HWJ6; -. DR Proteomes; UP000001292; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000001292}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001292}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2725 AA; 302037 MW; A9023265E8EC0BA3 CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNHLVVAD LSSRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLSF IRDCGSQVHK DTLHSSMSVV SRLCTKVEPN TPCIQNCVES LSTLLQHEDP MVSDGALKCF ASVADRFTRK WVDPAPLAEY GLTTELLKRL KSVGGNTHSS LTAAGTQPTS SSQPAATTNS DAINENVAGT ATISNSTKVK SSDAAASPQS ISTTISLLST LCRGSPSITH DILRSQLADA LERALQGDER CVLDCMRFAD LLLLLLFEGR QALNRGSNNP NQGQLAPRPR RNNTNTDRTH RQLIDCIRSK DSEALREAIE SGGIDVNCMD DVGQTLLNWA SAFGTLEMVE YLCEKGADVN KGQRSSSLHY AACFGRPAIA KILLKFGAYP DLRDEDGKTP LDKARERLDD GHREVAAILQ SPGEWMSPDH SLLNKDGKKY TLMEPRGDPE MAPVYLKVLL PIFCRTFLGS MLGSVRRASL ALIKKIVQYA YPTVLQSLSE TSFSEDAAST SGQNGGNLLI EVVASVLDNE DDDDGHLIVL NIIEEIMCKT QEEFLDHFAR LGVFAKVQAL MDTDAEELYV QLPGTVEEPA TAQRSSTSVV VAPRPTSDDP MEDAKEILQG KPYHWREWSI CRGRDCLYVW SDSVALELSN GSNGWFRFII DGKLATMYSS GSPENGNDSS ENRGEFLEKL MRARSCVIAG VVSQPILPTA SALRLVVGNW VLQSQKTNQL QIHNTEGHQV TVLQDDLPGF IFESNRGTKH TFSAETVLGP DFASGWSTAK KKRNKSKTEG QKFQVRNLSR EIYNKYFKSA QIIPRGAVAI LTDIVKQIEL SFEEQHMAPN GNWETTLSDA LMKLSQLIHE DGVVSAYEMH SSGLVQALVA VLSVNHWETN SPRCKRNKMQ KQRVAVFKKC ILEDNVESAT NKPRTKSTAS ILIQKLVSVL ESTEKLPVYL YDSPCTGYSL QILQKRLRFR LERAECESTL FDRSGRTLKM EPLATIGQLS KYLLKMVAKQ WYDLDRSTYF YLKKIREHRT GTVFAHSFDF DEEGLLFYIG SNAKTCDWVN PAQYGLVQVT SSEGKTLPYG KLEDILSRDS ISLNCHTKDN KKAWFAIDLG VYIIPTAYTL RHARGYGRSA LRNWLLQGSK DGSTWTTLST HVDDKSLVEP GSTATWPITC ATDDSVRYRH IRIQQNGRNA SGQTHYLSLS GFEIYGRVVG VADDIGKSVK EAEAKTRRER RQIRAQLKHM TTGARVIRGV DWRWEEQDGC AEGTITGEIH NGWIDVKWDH GVRNSYRMGA EGKYDLKLAD GEYLSAFDGN QSMSSASTAA KSNEKGNTLT SRKSSSTPSL PEATEKNQNS EGASNQTVSA DNLAWKQAVE TIAENVFASA KTQIISNQLA MNTSSSREAR AKHKESGTNQ MHKDNISGPS PLSRELEHIS DLSAINNSMP AINSSIVSDL ATISENLSLT ELSKENICSV LTPTYKPAES VTESQSSSHP DVQSSSPREN DIKNISNIEE NNKMNANNSV NKISKDLLAN LRTSNIAGCP PVTQLSTEAL EMIDKMRDGV DMIRNMSNNI LSTDTFPVPC TNVPVGGKKT PKAQALINPD NANQKQIIVT SEEFPTKSSK KPSVTLKPAQ QPNAVLSIVD IKEQPISNEN VSVPSQMSIS VPNLTTTSAS EVPSTSEVAT HTGLLETFAA IARRRTSQGT NIQDNQIMNA EANVNEHGDQ NASGSFLGHS VTSLVKLALS SNFHSGLLST AQSYPSLSSN NSENIAPSNP SNNSAGQQSA STINHTLTMS LTSTSSDSEQ VSLEDFLESC RAPALLGDLD DEDDMDEDND EEENEDEYEE VGNTLLQVMV SRNLLTFMDD EAMENRLVGV TKRKSWDDEF VLKRQFSALI PAFDPRPGRT NVNQTSDLEI SPLGAELPKP QQSGPETIEQ PLLGLKLRGP GIGGIPEVEI DLSNTDWTIF RAVQELLQCS QLNKLDKFRK IWEPTYTIVY REVSPEAQES TCLESEEFPQ TPDVSSKSGA STLSPNSPMH IGFNVADNNL CSVDDVLELL TQINGLNQSE IDSDVKEHGV SVLSEDLFIS KKITNKLQQQ IQDPLVLASN ALPNWCENLN QSCPFLFPFE TRQLYFNCTS FGASRSIVCL QSQRDVTVER QRIPIMSPRR DDHEFRIGRL KHERVKVPRN EDLLMWAMQV MKTHCNRKSV LEVEFLDEEG TGLGPTLEFY ALVAAEIQRS DLCMWLCDDD LGEDMENSPQ SAEGNSKPVG YYVNRREHGI FPAPLPQNSE ICENVLKYFW FFGVFVAKVL QDMRLVDIPL STSFLQLLCH NKVLSRNLQK VISDRRNGDL SVVSEESDIV ETCTKLLRTD SNKSNAFGGI LSLENLKEID PTRYQFLQEM QNLLLRKQSI EFDDTISAEK KHELINELKL QTQNGLEVSL EDLALTFTYL PSSSIYGYTQ AELLPNGSSV NVTIDNLEAY CELLMNFILQ DGIAQQMKAF SDGFNEVFPL KKLAAFTPSE ARMMICGEQF PHWSREDIIS YTEPKLGYNK DSPGFQRFVN VLLSMSGDER KAFLQFTTGC SSLPPGGLAN LHPRLTVVRK VDAGVGSYPS VNTCVHYLKL PDYPTEEIMK ERLLTATKEK GFHLN // ID B4HX67_DROSE Unreviewed; 289 AA. AC B4HX67; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=GM11032 {ECO:0000313|EMBL:EDW52612.1}; GN Name=Dsec\GM11032 {ECO:0000313|EMBL:EDW52612.1}; GN ORFNames=Dsec_GM11032 {ECO:0000313|EMBL:EDW52612.1}, GN GM11032 {ECO:0000313|FlyBase:FBgn0165977}; OS Drosophila sechellia (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7238 {ECO:0000313|Proteomes:UP000001292}; RN [1] {ECO:0000313|EMBL:EDW52612.1, ECO:0000313|Proteomes:UP000001292} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Rob3c / Tucson 14021-0248.25 RC {ECO:0000313|Proteomes:UP000001292}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH480818; EDW52612.1; -; Genomic_DNA. DR RefSeq; XP_002036689.1; XM_002036653.1. DR EnsemblMetazoa; FBtr0194017; FBpp0192509; FBgn0165977. DR GeneID; 6612172; -. DR KEGG; dse:Dsec_GM11032; -. DR FlyBase; FBgn0165977; Dsec\GM11032. DR OMA; HVAREMT; -. DR OrthoDB; EOG7VQJCX; -. DR PhylomeDB; B4HX67; -. DR Proteomes; UP000001292; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001292}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001292}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 12 30 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 289 AA; 32302 MW; 979F432FBE4EBFC8 CRC64; MDGCRRARKR VYVSYLLSFV LLACFFYYLM VHNSRNNLGI MRLREDVDDI SHILRQQQID SKVDQGSCKF NCLGGDPKGL GSSGRCSNRD VSAYVDTLFK RKIGHLMDDV YNLKKQVMSS GCSSKTAQST PKHESAALAK PRINYASEDL GARIINVKAK SLDGTNIIRS VLGLDFSSNP PVNMIRAGLS PGSCFGFNGS RATVTLHLAR TIIVEAITLT HVAREMTPDL CVKSAPKNFD WSYDNAANKR TQSYSVRSDY YFRNLDFSFN SNHGANTTCI YRVEVYGRL // ID B4IFE4_DROSE Unreviewed; 1635 AA. AC B4IFE4; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GM23316 {ECO:0000313|EMBL:EDW46366.1}; GN Name=Dsec\GM23316 {ECO:0000313|EMBL:EDW46366.1}; GN ORFNames=Dsec_GM23316 {ECO:0000313|EMBL:EDW46366.1}, GN GM23316 {ECO:0000313|FlyBase:FBgn0178183}; OS Drosophila sechellia (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7238 {ECO:0000313|Proteomes:UP000001292}; RN [1] {ECO:0000313|EMBL:EDW46366.1, ECO:0000313|Proteomes:UP000001292} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Rob3c / Tucson 14021-0248.25 RC {ECO:0000313|Proteomes:UP000001292}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH480833; EDW46366.1; -; Genomic_DNA. DR RefSeq; XP_002042454.1; XM_002042418.1. DR EnsemblMetazoa; FBtr0206301; FBpp0204793; FBgn0178183. DR GeneID; 6618179; -. DR KEGG; dse:Dsec_GM23316; -. DR FlyBase; FBgn0178183; Dsec\GM23316. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B4IFE4; -. DR Proteomes; UP000001292; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001292}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001292}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1635 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002810584. FT TRANSMEM 1224 1245 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 565 592 {ECO:0000256|SAM:Coils}. FT COILED 1160 1208 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1635 AA; 179394 MW; A503EF2A33547BFC CRC64; MQMRLHLVRF MFINLLLSCC FWLYDNVAAA DSQIPSSDGD SVKDAARKTN IEPEEPATPP RPEPPPPPLR DGAGVLLPSP QEAAPFFATS SSSSAPVPQT GNRPISGDTS VPVKELPALS DQQRANSHSE YLSKIDLAAD VGGSSTAQVP NNQLNYNKRS SAGKVHSFSE SQQLEQPKQL EQEGPATGQP QAPELQEPPQ PPQAEERQLA FMQMSRTYSS RGLYYIYCIY KTADPLSMIA QRSARSVLSH DEMKCNDRSY RLASMSKYLI DCERGTGIGG SEDRGVIVGA KAVSDTLCHP NAAVPRARNT RLHPFGPQKW FGKTAWCRKS IFPRCRLVGW RGATWATSLP AKCSRCFSPA SAGRSEMIAR CRSGCGCGFG FGFGCAEWCD RVHRYHTGGA TSSSSSRRWV FGFASRSRIH YLVSYLVIFP EIITELPTVT ITELPLDRVK NRLDSVILDG SPATAGNHSD EEQHQQQQPH EDQHMQVLEA DEEVPQKDEQ MPKINDPGGG IQVDGMITQE AASVGETQES SEELQPGSAA FNETEGTANL TNANEEVPMP VFSEWAQKQM EAEASREQAM ELEQQVVNKS AQRKNNTGSS SGKPPTLKLR SKNYASPDCG AKIIAHNSES KHTEAVLTQS TDEYMLSTCE SRIWFVVELC EAIQAQKVDV ANYELFSSSP KNFTVAVSKR FPTRDWSNVG RFAAEDKRTI QTFELHPHLF GKFVRVDITS HYSNEHFCPL SLFRVFGTSE YEAFETEIRP SDDLDDFYDD YGAQEQKAAV GSGGNIFQSA SDAVMQMVKK AAEVLVKPTK ALKWSEESEL CQTPAFESYS CSNCNATLVE RINSLLSCQF QQLQALLSLS HLRSDLLNSR VCHEEFGISL TGSEFASKMG KEQSYFLSML PAEHLGAMCK LIQAEQNVTD QNHTKAPSLK QHVSSPEPVQ DNATATGVRQ DCENSKERQS RKTPTKEPLT PSLEVVVPEV SQEVPSLEDQ SSTSSETVST KNSTPADVNI FNMPSESEEV VKVQLPPEPT LPTTLQPSDV ESFTDAPSTN ALRASSEANG DLGMEEGNPA NWDGIDNLLT TTVASITAGG GAAAAAAAVV NGNGNIGGAG VVGAGGPASV SSVNMQQKLT NGAQSESVFI RLSNRIKALE RNMSLSGQYL EELSRRYKKQ VEELQQTLTQ QTLTVRQLED QSRRYVEQEQ LYQQHSAELA GEVRALSYQV QACILVIIIV GTCIFLMLVL GTVYYRKLRR QQQQLLKKDQ ADHPPVAAKP KLDRRKSYEQ MPNQSTPKQR RPSEEAMLIL KECGDSNMQE LDPPSRQRKI SVCYGSNNNI AANMAIANTN GGASVRNSLH RRKGAKHSWH SSLDTTETTC GEQTDKFFDV DTLKSIKQSC GKPGKKKSHQ QLKPLSLKRQ ESAPATYTPD LQGEEPATQS DFDESLMLDD DDLANFIPTS DLAYNEFMPE GPSGYQIVDT VDGKPGKEPG TKKSRRLSSP AFFKSPFSKS KNKGYSFNGV QNSHSVHEPT SWEWYRLKRS EKHQQQQQAK LASKSLPSAS LDSSSLSEVN FPLNSSTAQN SFRILGEAIL SSGEGRITPN GNGNAMSGGL ASSSSGSGSG GSTTSSTTKK KQRALNNLFR KAFDF // ID B4J6T6_DROGR Unreviewed; 549 AA. AC B4J6T6; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 32. DE SubName: Full=GH21760 {ECO:0000313|EMBL:EDW02017.1}; GN Name=Dgri\GH21760 {ECO:0000313|EMBL:EDW02017.1}; GN ORFNames=Dgri_GH21760 {ECO:0000313|EMBL:EDW02017.1}, GN GH21760 {ECO:0000313|FlyBase:FBgn0129221}; OS Drosophila grimshawi (Fruit fly) (Idiomyia grimshawi). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila. OX NCBI_TaxID=7222 {ECO:0000313|Proteomes:UP000001070}; RN [1] {ECO:0000313|EMBL:EDW02017.1, ECO:0000313|Proteomes:UP000001070} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15287-2541.00 {ECO:0000313|Proteomes:UP000001070}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH916367; EDW02017.1; -; Genomic_DNA. DR RefSeq; XP_001987150.1; XM_001987114.1. DR STRING; 7222.FBpp0155666; -. DR EnsemblMetazoa; FBtr0157174; FBpp0155666; FBgn0129221. DR GeneID; 6561031; -. DR KEGG; dgr:Dgri_GH21760; -. DR FlyBase; FBgn0129221; Dgri\GH21760. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B4J6T6; -. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B4J6T6; -. DR Proteomes; UP000001070; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001070}; KW Reference proteome {ECO:0000313|Proteomes:UP000001070}. FT COILED 54 74 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 549 AA; 62604 MW; A50CB737CE78EF9B CRC64; MEQVQQNLQR TLTPEEYGNI LNHVNSYVQQ LVELKLQQQP QQQQLSPQQI QIIVKLMREN LEQFAEQKTQ FSEQQLADLA LRLKLQLQQS GEWQAANLRL SPEQLAEITR QIKAEFKLEE THYTLLLERI DVPQLLHRLL SAPGLATFVD ERIHLALLQQ QGQTRVREQA EEGSGQAAVD QLKREIAFIK LTLSDKHAEN ADLQQSISSL KLSQDDLLER MQQHELAQDQ RFSGLLAEIE AKLASLNDSQ FALLNKQVKL TLVEILGFKQ QAAGGQLNDV DLQSWVRNMF VAKEYLEQQL LELNKRTDNN LRAEIERSSL VLMSDISERL KREILLTVEA KYNGSSQVVK SELHEDEVRS IVKAVLAVYD ADKTGLVDFA LESAGGQILS TRCTESYQTK SAQISVFGIP LWYPTNTPRV AISPNVQPGE CWAFQGFPGF LVLKLNSLVY VTGFTLEHIS KSLSPTGRID SAPRNFTVWG LEHEKDQDPV LFGEYEYQDN KASLQYFAVQ NVDIKRPFEI VELRIESNHG QPDYTCLYRF RVHGKPPST // ID B4JAJ0_DROGR Unreviewed; 1229 AA. AC B4JAJ0; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GH10851 {ECO:0000313|EMBL:EDW02776.1}; GN Name=Dgri\GH10851 {ECO:0000313|EMBL:EDW02776.1}; GN ORFNames=Dgri_GH10851 {ECO:0000313|EMBL:EDW02776.1}, GN GH10851 {ECO:0000313|FlyBase:FBgn0118332}; OS Drosophila grimshawi (Fruit fly) (Idiomyia grimshawi). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila. OX NCBI_TaxID=7222 {ECO:0000313|Proteomes:UP000001070}; RN [1] {ECO:0000313|EMBL:EDW02776.1, ECO:0000313|Proteomes:UP000001070} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15287-2541.00 {ECO:0000313|Proteomes:UP000001070}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH916368; EDW02776.1; -; Genomic_DNA. DR RefSeq; XP_001987909.1; XM_001987873.1. DR STRING; 7222.FBpp0144757; -. DR EnsemblMetazoa; FBtr0146265; FBpp0144757; FBgn0118332. DR GeneID; 6562374; -. DR KEGG; dgr:Dgri_GH10851; -. DR FlyBase; FBgn0118332; Dgri\GH10851. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B4JAJ0; -. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B4JAJ0; -. DR Proteomes; UP000001070; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001070}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001070}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 1229 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002808695. FT TRANSMEM 839 860 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 66 86 {ECO:0000256|SAM:Coils}. FT COILED 138 165 {ECO:0000256|SAM:Coils}. FT COILED 775 823 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1229 AA; 133531 MW; 9BA5314CDCC58B8D CRC64; MKIRLHLVRF MYINLLLSLP KIIPELPAVT VTTELPVEVE QVKDRLEAVI LPNAPPAHDS SNQSESQQAS EVLEGAEKVV QNAEQKLNDP GGGLELGTGE LNATSDANAT GDSGGPIASS AASNASVEEV PMPVFSEWAQ KQMEAEASRE QAMELEQQVV NNSAQRRNAT GSGSGKAPSL KTRAKNYASP DCGAKIIASN GDATNTGAVL SHSSDEYMLS TCGSRIWFVV ELCEAVQAQK VELANFELFS SSPKNFTVAV SKRFPTRDWS NVGRFAAEDK RTVQTFELHP HLFGKFVRVD IHSHYSKEHF CPISLFRVFG TSEFEAFETE IRHSDELDDF DDDFAGQEQP HKQLQGNANI FQSASDAVIQ MVKKAAQVLV KPTKALRSPE SLHCFTPTAG NYSCQSCNST VVERINNLLS CQSEQLQQLL QLPQLRAQLL RTRICHIEYK ISFGSASIKE QQATCSGMDK RQSFYLSILP TEHVGAMCML LQAEYNSITA EQPVKLPQAA TKQQQQQLHF NVAEQLNENV TTADPAKESE AVKTTTPKEP LPAADTLSPE IVTAETSEAP KTTTTTEVNF EPVATIEAPP SDVNIFNVPT DTEKEVPTTT TGSAASIVRD EAGTPTPAGT PMDVKEMELA INPDSPVTTS EQSPTSATTV ESDGGDLVGS GSGLDDSNLA NWESIDSLLT TTVASITAGG GAAAAAAAVV NGNAHIGSTG TAAGAAGGVV SAGGAGGINL QPKLTNGAQS ESVFIRLSNR IKALERNMSL SGQYLEELSR RYKKQVEELQ QTLTQQTLTV RTLEEQSRRY IEQEQMYQQQ SAELAGDVRA LTYQVQACIF VIIIVGTCIF LMLVVGTVYY RKLRHQNQQL QPLVATATTG LANQKLSRRK SYEQMLQQSP GKQRRPSEEA MLILNGCGDV SVVDPSEVAN SNSRQRKISV CYGSNNNIAG NVFNTRTSLH KRKGAKHTFH ASLDSAQVVY SEHPDKFFDV DTLKNNNKLP HKAAKKKSQQ LYQQELKRQD SAPANCTQSS LAEEEPTQSD FDESLILDDE DLCNFIPNSD LAYNEFMPDG PSGYQIVDTV DGKSEKSTKK SRRVSSPAFF KSPFSKSKNK GSNLNGMGGV KGSQSAHEAT SWEWYRLKRN EKHAKQLSVP SVSLTPNASL DSSSLSEINF PFNSTQNSFR ILGEAIMSSG ETKATGKTEG SSSSSGASTT SSTTKKKQRA FNNIFRKVF // ID B4JBN2_DROGR Unreviewed; 200 AA. AC B4JBN2; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GH10752 {ECO:0000313|EMBL:EDW02967.1}; GN Name=Dgri\GH10752 {ECO:0000313|EMBL:EDW02967.1}; GN ORFNames=Dgri_GH10752 {ECO:0000313|EMBL:EDW02967.1}, GN GH10752 {ECO:0000313|FlyBase:FBgn0118233}; OS Drosophila grimshawi (Fruit fly) (Idiomyia grimshawi). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila. OX NCBI_TaxID=7222 {ECO:0000313|Proteomes:UP000001070}; RN [1] {ECO:0000313|EMBL:EDW02967.1, ECO:0000313|Proteomes:UP000001070} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15287-2541.00 {ECO:0000313|Proteomes:UP000001070}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH916368; EDW02967.1; -; Genomic_DNA. DR RefSeq; XP_001988100.1; XM_001988064.1. DR STRING; 7222.FBpp0144658; -. DR EnsemblMetazoa; FBtr0146166; FBpp0144658; FBgn0118233. DR GeneID; 6562054; -. DR KEGG; dgr:Dgri_GH10752; -. DR FlyBase; FBgn0118233; Dgri\GH10752. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B4JBN2; -. DR KO; K19347; -. DR OMA; CFGFRGS; -. DR OrthoDB; EOG7VQJCX; -. DR PhylomeDB; B4JBN2; -. DR Proteomes; UP000001070; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001070}; KW Reference proteome {ECO:0000313|Proteomes:UP000001070}. SQ SEQUENCE 200 AA; 22183 MW; 9148A5369205A3A0 CRC64; MDDVYNLKKQ VMTSGCAFQG NLPETKNEAM ALSTMRINYA SEELGASIIN VLAKPIGEVN FIRKLLGLEF MANPPVNMLR SNLLPGSCFG FRGSNATVFL HLAKTIIIEE FSLTHVPKET TPSRCVDNAP KDFEVYGLPP GSHKKELLGQ WTYENAPKKR AQSFLCKNTS SFQSLVITFN SNHGANSTCI YRIEVYGKLP // ID B4JPQ1_DROGR Unreviewed; 2746 AA. AC B4JPQ1; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 51. DE SubName: Full=GH13561 {ECO:0000313|EMBL:EDV98881.1}; GN Name=Dgri\GH13561 {ECO:0000313|EMBL:EDV98881.1}; GN ORFNames=Dgri_GH13561 {ECO:0000313|EMBL:EDV98881.1}, GN GH13561 {ECO:0000313|FlyBase:FBgn0121037}; OS Drosophila grimshawi (Fruit fly) (Idiomyia grimshawi). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Hawaiian Drosophila. OX NCBI_TaxID=7222 {ECO:0000313|Proteomes:UP000001070}; RN [1] {ECO:0000313|EMBL:EDV98881.1, ECO:0000313|Proteomes:UP000001070} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15287-2541.00 {ECO:0000313|Proteomes:UP000001070}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH916372; EDV98881.1; -; Genomic_DNA. DR RefSeq; XP_001992956.1; XM_001992920.1. DR STRING; 7222.FBpp0147467; -. DR EnsemblMetazoa; FBtr0148975; FBpp0147467; FBgn0121037. DR GeneID; 6567028; -. DR KEGG; dgr:Dgri_GH13561; -. DR FlyBase; FBgn0121037; Dgri\GH13561. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; B4JPQ1; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B4JPQ1; -. DR Proteomes; UP000001070; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 2. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001070}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001070}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1904 1924 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2746 AA; 302750 MW; 25016DE7B5400128 CRC64; MSDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNRLVVAD LSSRTSRDLA EQCIKVLELI CTREAGAVFE GCGLNCVLSF IRDCGSQVHK DTLHSAMSVV SRLCTKVEPN SPCINNCVES LSTLLQHEDA LVSDGALKCF ASVADRFTRK WIDPAPLAEY GLVTELLKRL ENAGGPLTSA TKLNAPQLPS PNLDRTENVS GSGAGSTTAG AVAATTTTAT TGKLPASESM RSPQSISTTI SLLSTLCRGS SSITRDILRS QLPEAIERAL KGDERCVLDC MRFADLLLLL LFEGRQALNR GSANSNQGQL VPRPKRTDST TDRTHRQLID CIRSKDTEAL QEAIETCGVD VNCMDDVGQT LLNWASAFGT LEMVEYLCEK GADVNKGQRS SSLHYAACFG RPGIAKILLK FGAYPDLRDE DGKTPLDKAR ERSDDGHREV AAILQSPGEW MSSGHLLAAN KDGQTCTLIE PRGDPEMAPV YLKLLLPIFC RTFQGSMLTS VRRASLGLIK KIVQYAHPTV LKSICKKCNE PSTSISAQSV GNLLTEVIAS VLDSEDDEDG HLIILNIIEE LMCKTQEEFL EHFARLGVFS KVQALMDHGS EDASASQLAG SSQDLSKAAA ALCMPHRSVQ DDLVEDAKEI LQGKPYHWRD WSICRGRDCL YVWSDSAALE LSNGSNGWFR FILDGKLATM YSSGTPENGN DSSENRGEFL EKLVRARSSV SPGTPSQPIL PTVCVLRLVV GNWVLQSHKP NQLQIHNTEG HQVTILQDDM QGFIFESNRG TKHSLTAETS LGADFASGWS TEKKKRIKSK TDVQKVQVVN VSREIYNKYF KTAQVIPRGA VAKLTAIVKQ INVALEEQRL GNSNWSTTLN TALTNLSQLI HEDGVVSAYE MHSSGLVQSL VAVLSNNYWE MHLSRCKTNK MLKERIDIFK KCIFGECNIE SSSSTPKNTA SILIQKLVAV LESTEKLPVF LYDAPGTGYG LQILQKRLRF RLERASCEST LFDRTGRTLK IEPLATVGQL AKYLLKMVAK QWYDLDRSTF LYLKKLREPK QPENITFTHN FDFDEEGLIY YIGSNGRTYE WVNPAQYGLV QVTSSEGKTL PYGKLEDILS RDGVSLNCHT KDNKKAWFAI DLGVYIVPTA YTLRHARGYG RSALRNWLLQ ASKDSVNWTT LISHVDDKSL VEPGSTATWP IICNADDTKG YRHIRIQQNG RNASGQTHYL SLSGFEIYGR VVGVCDDIGK TIKEAEAKTR RERRQIRAQL KHITSGARVV RGVDWRWDDQ DGCCEGTITG EIHNGWIDVK WDHGVRNSYR MGAEGKYDLK LANLESVSVF EGGVNSMLPI ASSGGAGVGC GKKNDKTNVL TSRKSSSTPS LPEATEINKN HHPEDASNQT VSADNLAWKQ TVETITENVF SSAKSHIANN QANAAAAAAA AADASAADHQ TSSSSSSTNV VASPLIRELQ HIADLSTINN SMPAINLSNA SDLATITESL SIVEISKEKS TDLKSSESAA TAAAANPVAS SKQHAAKRIP HIEENNKMNA NNVAKGLLIN LKQLNAGGGV GSGGNSQSQC GGSAETHEII DKMRDGVDMI RNNTNNILSS DELHIPRTAT AGLKGNTKVS VLIQPGNLQK NESVDADKSS TPPQDENPGI SIENIMNPQI SNDVVSAVSN QMSISVPNLT TSSSEVSSTS EVAVHTGLLE TFAAIARRRT SQDTNNEENQ SNNTNANKNE HSNQNVGGAS SFFARGPNSV TSLVKLALSS NFHSGLLSTA QSYPSLSSNN GENSSSNSTN KKLSQQHPTP SINPTLTMSL TSTSSDSEQV SLEDFLESCR APTLLGDMED EDDMEEENDE EENEDEYEEV GNTLLQVMVS RNLLTFMDDE SLENRLAGVS KRKSWDDEYV LKRQFSALIP AFDPRPGRTN VNQTTDLEIC PPDSSPETYQ QSGQINSEQT TLGLKLRGPG VGGVPDVEID LDNSEWTIFR AVQELLQNSQ LNKNDKFRKL WEPTYTIVYR ETFPTVLECG GSYIESDEGQ KTPGVSSRSG ASTLSPNSPV HGGLSITDNN LCSVEDVLDL LTQINALNQT DELDGDEQSA TSSTANPTCL PAELFMSKKI TNKLQQQIHD PLVLSSNALP NWCEDLNQSC PFLFPFETRQ LYFNCTAFGS SRSIVCLQSQ RDITNERQRI PIMSPRRDDQ HDFRVGRLKH ERVKVPRNKD LLRWAIQVMK AHCNRKSVLE VEFLDEEGTG LGPTLEFYAL VAAEIQRSDL CMWLCDDELS DDDQEATNIQ DAAPQFHNIA DATKPIGYYV NRRENGLFPA PLPQNSELCE QVSKYYWFFG VFIAKVLQDM RLVDMPLSKS FLQLLCHNKI LSRNDIKLRS NARFKDLIVS SVVSRDSEGL ASSDSKGLND DYSQSKWFSG VLNIENLKEI DATRYQFLIE LQDLLMRKQT IELDDSLSCD EKDELINNLK MCTKNDIEVS LEELALTFTY LPSSSVYGYT YAELIPNGAS IEVDINNLEA YCELLLNFML QDGIAMQMQA FHDGFCEVFP LNKLAAFNPT EARMMICGEQ YPQWTREDLM AYTEPKLGYS KDSPGFLRFV NVLLSFSGAE RKAFLQFTTG CSSLPPGGLA NLHPRLTVVR KVDAGTGSYP SVNTCVHYLK LPDYPSEDIM KDRLLTATKE KGFHLN // ID B4KFW3_DROMO Unreviewed; 188 AA. AC B4KFW3; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=GI16904 {ECO:0000313|EMBL:EDW12089.1}; GN Name=Dmoj\GI16904 {ECO:0000313|EMBL:EDW12089.1}; GN ORFNames=Dmoj_GI16904 {ECO:0000313|EMBL:EDW12089.1}, GN GI16904 {ECO:0000313|FlyBase:FBgn0139650}; OS Drosophila mojavensis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. OX NCBI_TaxID=7230 {ECO:0000313|Proteomes:UP000009192}; RN [1] {ECO:0000313|EMBL:EDW12089.1, ECO:0000313|Proteomes:UP000009192} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15081-1352.22 {ECO:0000313|Proteomes:UP000009192}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH933807; EDW12089.1; -; Genomic_DNA. DR RefSeq; XP_002002647.1; XM_002002611.1. DR EnsemblMetazoa; FBtr0167629; FBpp0166121; FBgn0139650. DR GeneID; 6576659; -. DR KEGG; dmo:Dmoj_GI16904; -. DR FlyBase; FBgn0139650; Dmoj\GI16904. DR InParanoid; B4KFW3; -. DR KO; K19347; -. DR OMA; CFGFRGS; -. DR OrthoDB; EOG7VQJCX; -. DR PhylomeDB; B4KFW3; -. DR Proteomes; UP000009192; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009192}; KW Reference proteome {ECO:0000313|Proteomes:UP000009192}. SQ SEQUENCE 188 AA; 21184 MW; FCDEE37917A0265F CRC64; MDDVYSIKKR IEQSACKESL ESAAEANKQE IYPPRINYAS EELGARIISA LANPIADTNL LKTLLGLEFS TNPPINMLRP SLMPGSCFGF RGSQATITLH LAKTIYVEQI SLTHVPKEMT PNKCVNNAPK DFEGVTSNKK KELLGNWRFK NEPNERTENY IVNNNCPFRI LLFKFNSNHG ANSTCIYR // ID B4KIZ6_DROMO Unreviewed; 1214 AA. AC B4KIZ6; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=GI19642 {ECO:0000313|EMBL:EDW13509.1}; GN Name=Dmoj\GI19642 {ECO:0000313|EMBL:EDW13509.1}; GN ORFNames=Dmoj_GI19642 {ECO:0000313|EMBL:EDW13509.1}, GN GI19642 {ECO:0000313|FlyBase:FBgn0142379}; OS Drosophila mojavensis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. OX NCBI_TaxID=7230 {ECO:0000313|Proteomes:UP000009192}; RN [1] {ECO:0000313|EMBL:EDW13509.1, ECO:0000313|Proteomes:UP000009192} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15081-1352.22 {ECO:0000313|Proteomes:UP000009192}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH933807; EDW13509.1; -; Genomic_DNA. DR RefSeq; XP_002004067.1; XM_002004031.1. DR EnsemblMetazoa; FBtr0170367; FBpp0168859; FBgn0142379. DR GeneID; 6578150; -. DR KEGG; dmo:Dmoj_GI19642; -. DR FlyBase; FBgn0142379; Dmoj\GI19642. DR InParanoid; B4KIZ6; -. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B4KIZ6; -. DR Proteomes; UP000009192; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009192}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009192}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 1214 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002811279. FT TRANSMEM 817 838 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 133 160 {ECO:0000256|SAM:Coils}. FT COILED 753 784 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1214 AA; 132564 MW; 88B988844D4EA93F CRC64; MKIRLHLVRF MYINLILSLP EIITDLPTVT VAELPVELEH VKERLTLVMP HDEHSSSNNT ENQPPSEAQV AQKAELNIND PGGGVKINSS TGSESNAVLE ANVTGDESAV DAVAIESRNA TVEEVPMPVF SEWAQKQIEA EASREQAMEL EQQVVNTSAQ RKDATGSGSG KPTSLKLRAK NYASPDCGAK IIASNADATN TGAVLSHSSD EYMLSTCGSR IWFVVELCEA VQAQKVELAN FELFSSSPKN FTVAVSKRFP TRDWSNVGRF AAEDKRTVQT FELHPHLFGK FVRVDIHSHY SKEHFCPISL FRVFGTSEFE AFETEIRHSE ELDDFDDDFG GQEQQHKPIT GTDGGGANIF QSASDAVIQM VKKAAQVLVK PTKAIRWSSD SSLCYTPTVG MFSCKSCNSS IVDRINNLLS CQSPQLELLL VLPQLRNHLL QTRICQSDYN ISLGLPRKAD SLTSGMDKRQ SFYLSVLSAE HVGAICKLLE ANLSSTKEKI VDLTEPAQQV DVTSILSENV SKSEPSKELE GDAKITIPNE SLPGSTPILK TVPVPEMPSC NEVPVVSLQQ NGPVTDDSAI ADPPSDVNIF NVPLANQKDP TPASAPLVPD SLPPSLTSPS SSAVDPNELQ PTKAPSITTT NHPPSTSGES DNGDLIGSGN GLDDGNLSNW ESIDSLLTTT VASITAGGGA AAAAAAVVNG NTNMGNSAGT GTAGSLNLQP KLTHGPQSES VFIRLSNRIK ALERNMSLSG QYLEELSRRY KKQVEELQQT LTQQALTVRT LEDQSRRYLE QEQLYQHQSA ELAGEVRALT YQVQACILVI IIVGTCIFLM LVVGTVFYRK LRRQTQQLMP TRAAMELTKS NICRRKSYEE MMQQSLGKQR RPSEEAMLIL NGCGDRSVEE QSEVGHGGRQ RKISVCYGSN NNITNHMINT RTSLHRRKGA KHSWQSNIDS APVSCVENVD KFFDVDTLKA QRMLTKPTKK KSLQFFPQEL KRQESAPANC TQEYRAEEHT QSDFDESLIL DDDDLCNFIP NTDLAYNEFM PDGPSGYQIV DTVDGKCDKG QATKKSRRVS SPAFFKSPFS KSKNKESSLN GDGSGNRIKA SQSAHEATSW EWYRLKRSDK QSNSKPLTTE SVASNSPNAS IDNSSLSEVS FPLNSTQNSF RILGEAILSS GEGRASGKSS IVAAGSSSSS GASTTSSTTK KKQRAFNNIF RKVF // ID B4KJ20_DROMO Unreviewed; 2647 AA. AC B4KJ20; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 50. DE SubName: Full=GI19489 {ECO:0000313|EMBL:EDW13533.1}; GN Name=Dmoj\GI19489 {ECO:0000313|EMBL:EDW13533.1}; GN ORFNames=Dmoj_GI19489 {ECO:0000313|EMBL:EDW13533.1}, GN GI19489 {ECO:0000313|FlyBase:FBgn0142227}; OS Drosophila mojavensis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. OX NCBI_TaxID=7230 {ECO:0000313|Proteomes:UP000009192}; RN [1] {ECO:0000313|EMBL:EDW13533.1, ECO:0000313|Proteomes:UP000009192} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15081-1352.22 {ECO:0000313|Proteomes:UP000009192}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH933807; EDW13533.1; -; Genomic_DNA. DR RefSeq; XP_002004091.1; XM_002004055.1. DR ProteinModelPortal; B4KJ20; -. DR EnsemblMetazoa; FBtr0170214; FBpp0168706; FBgn0142227. DR GeneID; 6578174; -. DR KEGG; dmo:Dmoj_GI19489; -. DR FlyBase; FBgn0142227; Dmoj\GI19489. DR InParanoid; B4KJ20; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B4KJ20; -. DR Proteomes; UP000009192; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009192}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000009192}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1822 1842 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2647 AA; 295076 MW; AA1298A4FFB62F3A CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNRLVVAD LSSRTSRDLA EQCIKVLELI CSREAGAVFE GSGLNCVLSF IRDCGSQVHK DTLHSAMSVV SRLCTKVEPN SPCIDNCVES LSTLLQHEDA LVADGALKCF ASVADRFTRK WVDPAPLAEY GLVAELLKRL ENAGGANSHS LSTSKLSGPV PANLNSDRLN ENVSGSSSNA AGKMQSNEFR SPQSISTTIS LLSTLCRGSP SITRDILRSP LPNAIETALK GDERCVLDCM RFADLLLLLL FEGRQALNRG SANSNQGQLA PRPKRNDSST DRTHRQLIDC IRSKDTEALQ EAIETCGVDV NCMDDVGQTL LNWASAFGTL EMVEYLCEKG ADVNKGQRSS SLHYAACFGR PGIAKILLKF GAYPDLRDED GKTPLDKARE RSDDGHREVA AILQSPGEWM SSGNILTNKE GQIYTSLEPR GDPEMAPVYL KLLLPILCRT FQGSMLPSVR RASLSLIKKI VQYAHPSVLK NICRKHDEPT TSIAIQSVGN LLTEVIASVL DSEDDEDGHV IILNIIEELM YDLVEDAKEI LQGKPYHWRD WSICRGRDCL YVWSDSAALE LSNGSNGWFR FILDGKLATM YSSGSPENGN DSSENRGEFL EKLVRARSSV SPGTPSQPIL PNVCVLRLVV GNWVLQSHKP NQLQIHNTEG HQVTILQDDI QGFIFESNRG TKHSFTAETT LGADFASGWS TEKKKRSKSK TDIQKVQVSN LSREIYNKYF KSAQVIPRGA VAKLTAIVKQ INLAIEEQRV ASNSKWSNTL VTALTSLSKL IHEDGVVSAY EMHSSGLVQA LVSVLSNNYW EFNLSRCKTN KMLKDRIDIF KKCIFGECDD SNIYNTKNTA SILIQKLVAV LESTEKFPLF LYDAPGSGYG LQILQKRLRF RLERAPSEST LFDRTGRTLK VEPLATVGQL AKYLLKMVAK QWYDLDRSTF LYLKQMREHK QGISFTYNFD FDEEGLIYYI GSNGKTCEWV NPAQYGLVQV TSSEGKTLPY GKLEDILSRD GVSLNCHTKD NKKAWFSIDL GVYILPTAYT LRHARGYGRS ALRNWHLQAS KDGINWTTLI NHVDDKSLSE PGSTATWPII CSSDDTKGYR HIKIQQNGRN ASNQTHYLSL SGFEIYGRVV GVCDDIGKTI KEAEAKTRRE RRQIRAQLKH ITSGARVVRG VDWRWEDQDG SGEGTVTGEI HNGWIDVKWD HGVRNSYRMG AEGKYDLKLA NLENASIFEG VNSMLPVSSG PKKMDKTNVL VSRKSSSTPS LPEATEINKN SEDTSNQTVS ADNLAWKQTV ETITENVFTS AKTHIATNQS STASVKETQA IFKEAICDQP SLVTSNIPSP LIRELQHIAD LSTINNSMPA INLNNVSDLA TISENLSIVE LSKESSSTNQ TEHKSSENNS IVSNDVKRKP YIEENNKMNV NNSVNNLAKG LLINLRQINS ASSHCLSQFP TEPRDIIDKM RDGVDMLRNN TNNILSADES HIPRSSVSAA PKENAKFSVL IQPETSQKSE ILDGVGNESV NSSTPNKNLS NSEAPQPSTF PGQIENIISP QVTNEVVSVP NQMSISVPNL TTSSEVSSTS EVAVHTGLLE TFTAIARRRT SQDTSNDANQ SNNAIANKNE HSNQNVGTGS FFARGPNSVT SLVKLALSSN FHSGLLSTAQ SYPSLSSNNG ENASSNPTNK KVGQQPTSSI NPTLTMSLTS TSSDSEQVSL EDFLESCRAP TLLGDMEDED DMEEENDEEE NEDEYEEVGN TLLQVMVSRN LLTFMDDESL ENRLAGVSKR KSWDDEFVLK RQFSALIPAF DPRPGRTNVN QTSDLEICPP DTNLETSQQS GQIIPEQTKL SLKLRGPGVG GIPDVEIDLD NSEWTIFRAV QELLQNSQIN KNDKFRKLWE PTYTIIYRET YPTVQECSYI ESEGQKTPGV SSRSGASTLS PNSPVHGGIT DNNLCSVEDV LELLTQINTL NQYEIESDDN TASLQKDSYL PAELFMSKKI TNKLQQQIHD PLVLSSNALP NWCEDLNQSC PFLFPFETRQ LYFNCTAFGA SRSIVCLQSQ RDMTTERQRV PIMSPRRDDQ HDFRVGRLKH ERVKVPRNKD LLKWAIQVMK AHCNRKSVLE VEFLDEEGTG LGPTLEFYAL VAAEIQRSDL CMWLCDDEYS ETHNEPQFHN IEDGSKPIGY YVNRREHGLF PAPLPQDSEL CEQVSKYFWF FGVFIAKVLQ DMRLVDMPLS NSFLQLLCHN KVLSRNDLKL KSKSRFQDLM ASSVVSKDAE LANIYAQFLN GDLTQCNWIN GILNIENLNE IDPTRYQFLI ELQDLLMRKQ AIDIDDTLNF EEKQKLINNL KLRTKNDIEV SLEDLGLTFT YLPSSSVYGY AYAELIPNGY LTEVNINNLE AYWELLVNFM LHDGIAKQMQ AFYDGFCEVF PLNKLAAFNP SEARMMICGE QCPQWSREDL MAYTEPKLGY SKDSPGFLRF VNVLLSLSGA ERKAFLQFTT GCSSLPPGGL ANLHPRLTVV RKVDAGQGSY PSVNTCVHYL KLPDYPSEEI MKDRLLTATK EKGFHLN // ID B4KNG3_DROMO Unreviewed; 573 AA. AC B4KNG3; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GI19306 {ECO:0000313|EMBL:EDW08922.1}; GN Name=Dmoj\GI19306 {ECO:0000313|EMBL:EDW08922.1}; GN ORFNames=Dmoj_GI19306 {ECO:0000313|EMBL:EDW08922.1}, GN GI19306 {ECO:0000313|FlyBase:FBgn0142044}; OS Drosophila mojavensis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. OX NCBI_TaxID=7230 {ECO:0000313|Proteomes:UP000009192}; RN [1] {ECO:0000313|EMBL:EDW08922.1, ECO:0000313|Proteomes:UP000009192} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15081-1352.22 {ECO:0000313|Proteomes:UP000009192}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH933808; EDW08922.1; -; Genomic_DNA. DR RefSeq; XP_002004987.1; XM_002004951.1. DR EnsemblMetazoa; FBtr0170031; FBpp0168523; FBgn0142044. DR GeneID; 6579090; -. DR KEGG; dmo:Dmoj_GI19306; -. DR FlyBase; FBgn0142044; Dmoj\GI19306. DR InParanoid; B4KNG3; -. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B4KNG3; -. DR Proteomes; UP000009192; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009192}; KW Reference proteome {ECO:0000313|Proteomes:UP000009192}. FT COILED 42 62 {ECO:0000256|SAM:Coils}. FT COILED 78 98 {ECO:0000256|SAM:Coils}. FT COILED 194 235 {ECO:0000256|SAM:Coils}. FT COILED 317 337 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 573 AA; 65261 MW; F5144E1FCB1FDCCE CRC64; MALWLPREQQ EAAAIRLNME QVQQSMQKSL TPEEYENILN HVNSYVQQLV ELKLQQQAKE QQQQQQQQLS PQQIQIIVKL MRENLQQFAD QRAQLSEQQL ADLVLRLKLE LQQSVEWQGG QIKLTSAQLE EITRLVKAEF NLKEEHYTLL LERIDLGALL ERLLGSPELA EFVDARINLG LQQQQQAKEE GSGQSNADQQ IEQLNREIAF IKLALSDKQM ENANLQQSIS NLKLTQDDLL ARMQQHELAQ DQRFSGLLAE IEAKLAALNE SQFALLNKQV KLTLVEILGF KQTAGGKLDD VDLQNWVRSM FVAKDYLEQQ LLELNEKTNN NIRAEIERSS LVLMSDISER LKREILLVVE AKQNASSQSL KALMHDDEVR NIVKSVLAVY DADKTGLVDF ALESAGGQIL STRCTESYQT KSAQISVFGI PLWYPTNTPR VAISPNVQPG ECWAFQGFPG FLGTMLKLNS MVHVTGFTLE HIPKSLSPTG RIDSAPRNFT VWGLEHEKDQ DPVLFGEYEY QDNGASLQFF PVQNMDIKRP FEIVELRIES NHGQPDYTCL YRFRVHGKPP PSS // ID B4LK93_DROVI Unreviewed; 547 AA. AC B4LK93; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=GJ22184 {ECO:0000313|EMBL:EDW61684.1}; GN Name=Dvir\GJ22184 {ECO:0000313|EMBL:EDW61684.1}; GN ORFNames=Dvir_GJ22184 {ECO:0000313|EMBL:EDW61684.1}, GN GJ22184 {ECO:0000313|FlyBase:FBgn0209297}; OS Drosophila virilis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. OX NCBI_TaxID=7244 {ECO:0000313|Proteomes:UP000008792}; RN [1] {ECO:0000313|EMBL:EDW61684.1, ECO:0000313|Proteomes:UP000008792} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15010-1051.87 {ECO:0000313|Proteomes:UP000008792}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH940648; EDW61684.1; -; Genomic_DNA. DR RefSeq; XP_002050491.1; XM_002050455.1. DR STRING; 7244.FBpp0236601; -. DR EnsemblMetazoa; FBtr0238109; FBpp0236601; FBgn0209297. DR GeneID; 6625721; -. DR KEGG; dvi:Dvir_GJ22184; -. DR FlyBase; FBgn0209297; Dvir\GJ22184. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B4LK93; -. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B4LK93; -. DR Proteomes; UP000008792; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008792}; KW Reference proteome {ECO:0000313|Proteomes:UP000008792}. FT COILED 24 44 {ECO:0000256|SAM:Coils}. FT COILED 56 76 {ECO:0000256|SAM:Coils}. FT COILED 170 197 {ECO:0000256|SAM:Coils}. FT COILED 293 313 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 547 AA; 62333 MW; E6F0F6BFCD716BC8 CRC64; MEQVRQGIQK SLTPEEYENI LNHVNSYVQQ LVELKLQHQE KQQQQQLSPQ QIQIIVQLMR ENLQQFAEQR TQLSEQQLAD LALRLKLQLQ QSGEWHAEQA KFSHAQLEEL TRLIKAEFKL EEKHYTLLLE RIDLTALLKR LLGAPELAEF VDARIDLALQ QQAPGEGSGQ AVAEQQIDQL NREIAFIKLA LSDKQMENAD LQQSISSLKL TQDDLLARMQ QHELAQDQRF SGLLAEIESK LAALNDSQFA LLNKQVKLTL VEILGFKQTA GGKLNDVDLQ NWVRSMFVAK DYLEQQLLQL NEKTNNNIRA EIERSSLVLM SDISERLKRE ILLVVEAKHN ESSQVAKAHM HEDEVRSIVK AVLAIYDADK TGLVDFALES AGGQILSTRC TESYQTKSAQ ISVFGIPLWY PSNTPRVAIS PNVQPGECWA FQGFPGFLVL KLNSLVYVTG FTLEHISKSL SPTGRIDSAP RNFTVWGLEH EKDQEPVLFG EYEYQDNNAS LQFFAVQNVD IKRPFEIVEL RIETNHGQPD YTCLYRFRVH GKPPPST // ID B4LQT4_DROVI Unreviewed; 268 AA. AC B4LQT4; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GJ12925 {ECO:0000313|EMBL:EDW63468.1}; GN Name=Dvir\GJ12925 {ECO:0000313|EMBL:EDW63468.1}; GN ORFNames=Dvir_GJ12925 {ECO:0000313|EMBL:EDW63468.1}, GN GJ12925 {ECO:0000313|FlyBase:FBgn0200159}; OS Drosophila virilis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. OX NCBI_TaxID=7244 {ECO:0000313|Proteomes:UP000008792}; RN [1] {ECO:0000313|EMBL:EDW63468.1, ECO:0000313|Proteomes:UP000008792} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15010-1051.87 {ECO:0000313|Proteomes:UP000008792}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH940649; EDW63468.1; -; Genomic_DNA. DR RefSeq; XP_002051313.1; XM_002051277.1. DR STRING; 7244.FBpp0227342; -. DR EnsemblMetazoa; FBtr0228850; FBpp0227342; FBgn0200159. DR GeneID; 6627803; -. DR KEGG; dvi:Dvir_GJ12925; -. DR FlyBase; FBgn0200159; Dvir\GJ12925. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B4LQT4; -. DR OMA; HVAREMT; -. DR OrthoDB; EOG7VQJCX; -. DR PhylomeDB; B4LQT4; -. DR Proteomes; UP000008792; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008792}; KW Reference proteome {ECO:0000313|Proteomes:UP000008792}. SQ SEQUENCE 268 AA; 29824 MW; 5CFEC2B4C0820D1E CRC64; MWNNRKSIVG LSRLQEDVNC ITQALKVDCA GQKYSNLPCG KTNCLARDSQ KSIGCNAREL NACVDTLVKR KLGNIMDDVY SLKKQVMERG CKAKCNKPPV PNTEALILAP TRINYASEEL GARIIYAIAK PISETSFIKN LLGLDFSANP PINMLRPSIL PGSCFGFRGT EATVSLHLAK VIFVDEISLS HVAKEMTPSA SVDNAPKDFE VYGLPPDSNK KELLGQWVYE NDLKKRTQNY VVKNRRIFCT LVFVFRSNHG ANSTCVYR // ID B4LR02_DROVI Unreviewed; 2710 AA. AC B4LR02; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 52. DE SubName: Full=GJ21961 {ECO:0000313|EMBL:EDW64541.1}; GN Name=Dvir\GJ21961 {ECO:0000313|EMBL:EDW64541.1}; GN ORFNames=Dvir_GJ21961 {ECO:0000313|EMBL:EDW64541.1}, GN GJ21961 {ECO:0000313|FlyBase:FBgn0209076}; OS Drosophila virilis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. OX NCBI_TaxID=7244 {ECO:0000313|Proteomes:UP000008792}; RN [1] {ECO:0000313|EMBL:EDW64541.1, ECO:0000313|Proteomes:UP000008792} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15010-1051.87 {ECO:0000313|Proteomes:UP000008792}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH940649; EDW64541.1; -; Genomic_DNA. DR RefSeq; XP_002052386.1; XM_002052350.1. DR ProteinModelPortal; B4LR02; -. DR STRING; 7244.FBpp0236378; -. DR EnsemblMetazoa; FBtr0237886; FBpp0236378; FBgn0209076. DR GeneID; 6629058; -. DR KEGG; dvi:Dvir_GJ21961; -. DR FlyBase; FBgn0209076; Dvir\GJ21961. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; B4LR02; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B4LR02; -. DR Proteomes; UP000008792; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008792}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008792}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1882 1902 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2710 AA; 301462 MW; D7CB4ADAFD47FF48 CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNRLVVAD LSSRTSRDLA EQCIKVLELI CSREAGAVFE GSGLNCVLSF IRDCGSQVHK DTLHSAMAVV SRLCTKVEPN SPCIENCVES LSTLLQHEDA LVSDGALKCF ASVADRFTRK WIDPAPLAEY GLVTELLKRL ESAGGSGNSH PLSSSKLSGP LHTNQNSDRL NENVSGSSVS AAGKIQASEL RSPQSISTTI SLLSTLCRGS PSITRDILRS QLPDAIERAL KGDERCVLDC MRFADLLLLL LFEGRQALNR GSGNSSQSQL APRPKRNDSS TDRTHRQLID CIRSKDTEAL QEAIETCGVD VNCMDDVGQT LLNWASAFGT LEMVEYLCEK GADVNKGQRS SSLHYAACFG RPGIAKILLK FGAYPDLRDE DGKTPLDKAR ERSDDGHREV AAILQSPGEW MSSGHILASK DGQLQTSIEP RGDPEMAPIY LKLLLPIFCQ TFQSSMLPSV RRASLGLIKK IVQYAHPSVL KSICKKCDEP TTSRATQSVG NLLTEVIASV LDSEDDEDGH VIILNIIEEL MCKTQEEFLE HFARLGVFSK VQALMDQGDN NSSPQFPSSS SDLCSTQKGQ AFCMAHRSAP DDLVEDAKEI FQGKPYHWRD WSICRGRDCL YVWSDSAALE LSNGSNGWFR FILDGKLATM YSSGSPENGN DSSENRGEFL EKLVRARSSV SPGTPSQPIL PTVSVLRLVV GNWVLQSHKP NQLQIHNTEG HQVTILQDDV QGFIFESNRG TKHTFTAETS LGADFASGWA TEKKKRTKSK TDIQKVQVSN VSREIYNKYF KTAQVTPRGA VAKLAAIVKQ INFAIEEQRS CCNSNWSKIL VSALTNLSQL IHEDGVVSAY EMHSSGLVQA LVAVLSNNYW EFNSSRCKTN KMLKERIDIF KKCIFGECDE SKIYNTKNTA SILIQKLVAV LESTEKLPLF LYDAPGTGYG LQILQKRLRF RLERASSEST LFDRTGRTLK VEPLATVGQL AKYLLKMVAK QWYDLDRSTF LYLKKLREQK QDISFTYNFD FDEEGLIYYI GSNGRTCEWV NPAQYGLVQV TSSEGKTLPY GKLEDILSRD AVSLNCHTKD NKKAWFSIDL GVCIVPTAYT LRHARGYGRS ALRNWLLQAS KDGINWTTLI SHVDDKSLAE PGSTATWPIV CTSDDTKGYR HIRIQQNGRN ASGQTHYLSL SGLEIYGRVV GVCDDIGKTI KEAEAKTRRE RRQIRAQLKH ITSGARVVRG VDWRWDDQDG SCQGTITGEI HNGWIDVKWD HGVRNSYRMG AEGKYDLKLA NLESVSIFEG VNSTLPVASG PGPKKTDKTN VLTSRKSSST PSLPEATEIN KNPEDASNQT VSADNLAWKQ TVETIAENVF ISAKTHITTN QANAALKKET QAQYDESSSE QTSLATTNIP SPLIRELQHI ADLSTINNSM QAINLNNVSD LAPISESLSI VEVSKEASSA KPSEGTSVVS KDIKRRPYIE ENNKMNANNS VNNLAKGLLI NLRQINSSNS HCVSQLPTET REIIDKMRDG VDMLRNNTNN ILSSDELHIP RSNVSSEPKG NAKVSVLIQH DSLQKSDTLD GASNEAVNSS TPRQNLDSSD STTQISNFPG QIDNIINPQI SNDVVSVSNQ MSISVPNLTT SSEVSSTSEV AVHTGLLETF TAIARRRTSQ DTNNDIGNQS NNTNVNKNEH SNQNVGASSF FARGPNSVTS LVKLALSSNF HSGLLSTAQS YPSLSSNNGD NTSSSNPTNK KVSQQATSSV NPTLTMSLTS TSSDSEQVSL EDFLESCRAP TLLGDMEDED DMEEENDEEE NEDEYEEVGN TLLQVMVSRN LLTFMDDESL ENRLAGVSKR KSWDDEFVLK RQFSALIPAF DPRPGRTNVN QTSDLEICAP DTNMENFQQT GQITADQTML GLKLRGPGVG GVPDVEIDLD NSEWTIFRAV QELLQNSQMN KNDKFRKLWE PTYTIVYRET FPTVKESSYM ESEEGQKTPG VSSRSGASTL SPNSPVHGGL SITDNTLCSV EDVLELLTQI NALNQCEIES GVQCTNSPKR SFLPAELFMS KKITNKLQQQ IHDPLVLSSN ALPNWCEDLN QSCPFLFPFE TRQLYFNCTA FGASRSIVCL QSQRDITTER QRIPIMSPRR DDQHDFRVGR LKHERVKVPR NKDLLKWAIQ VMKAHCNRKS VLEVEFLDEE GTGLGPTLEF YALVAAEIQR ADLCMWLCDD DLTEAQDTPQ FHNIEDTSKP IGYYVNRREH GLFPAPLPQN SELCEQVSKY FWFFGVFIAK VLQDMRLVDM PLSKSFLQLL CHNKVLSRND LKLKANTHFQ DLMASSVASK DSEFANTYSQ VLNEDFTQFN WFSGILNIEN LREIDPTRYQ FLIELQDLLM RKQAIDLDES LSMEEKSELV KNLKLRTKNN IEVSLDDLAL SFTYLPSSSV YGYTYAELIP NGYATEVNIN NLEAYCELLL NFMLHDGIAK QMQAFHDGFC EVFPLTKLAA FNPSEARMMI CGEQYPQWNR EDLMAYTEPK LGYSKDSPGF LRFVNVLLSL SGTERKAFLQ FTTGCSSLPP GGLANLHPRL TVVRKVDAGQ GSYPSVNTCV HYLKLPDYPS EEIMKDRLLT ATKEKGFHLN // ID B4LV97_DROVI Unreviewed; 1335 AA. AC B4LV97; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 32. DE SubName: Full=GJ17429 {ECO:0000313|EMBL:EDW64357.1}; GN Name=Dvir\GJ17429 {ECO:0000313|EMBL:EDW64357.1}; GN ORFNames=Dvir_GJ17429 {ECO:0000313|EMBL:EDW64357.1}, GN GJ17429 {ECO:0000313|FlyBase:FBgn0204601}; OS Drosophila virilis (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila. OX NCBI_TaxID=7244 {ECO:0000313|Proteomes:UP000008792}; RN [1] {ECO:0000313|EMBL:EDW64357.1, ECO:0000313|Proteomes:UP000008792} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 15010-1051.87 {ECO:0000313|Proteomes:UP000008792}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH940649; EDW64357.1; -; Genomic_DNA. DR RefSeq; XP_002052202.1; XM_002052166.1. DR STRING; 7244.FBpp0231846; -. DR EnsemblMetazoa; FBtr0233354; FBpp0231846; FBgn0204601. DR GeneID; 6627494; -. DR KEGG; dvi:Dvir_GJ17429; -. DR FlyBase; FBgn0204601; Dvir\GJ17429. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B4LV97; -. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B4LV97; -. DR Proteomes; UP000008792; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008792}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008792}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1335 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002814068. FT TRANSMEM 942 963 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 259 286 {ECO:0000256|SAM:Coils}. FT COILED 878 909 {ECO:0000256|SAM:Coils}. FT COILED 1101 1121 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1335 AA; 144747 MW; 9A3FA3FC71B82317 CRC64; MKIRLHLVRF MYINLILSCC FWLYDNAAAA DVSSANAATA ENANRAVHQA DARMQIPLAA LAPASENEGT FAMPLTKFKT VTNVAKEDAP PTATTTASTS TTTSSSPQQE EDNIVMKLGS APSSSAEAIS DAVIDTLPEI ITELPAVTVA ELPLEVEHVK DRLESVLPHE AAHSSGNKTE SQQPPSQAFE IAKEAAQKAE LNINDPGGGL ELHAFASSES NATVDGNVTS EGGQIANAAA AASSNETVEE VPMPVFSEWA QKQIEAEASR EQAMELEQQV VNNSAQRRNA TGSGSGKPPS LKLRAKNYAS PDCGAKIIAS NADATNTGAV LSHSSDEYML STCGSRIWFV VELCEAVQAQ KVELANFELF SSSPKNFTVA VSKRFPTRDW SNVGRFVAED KRTVQTFELH PHLFGKFVRV DIHSHYSKEH FCPISLFRVF GTSEFEAFET EIRHSDELDD FDDDFGGQEQ LHKTLSGTDG GGANIFQSAS DAVIQMVKKA AQVLVKPSKP LRWSTDSLLC FTPTAGLYSC QSCNSSAVDR INNLLSCQSH QLQQLLLLPQ LRTHLLQSRV CQADYNISLA SLLFREEERA TSGMDKRQSF YLSMLPAEHV GAMCKLLQAE HSSVVEYPVE LPEAERQRNL DVAQHLYENL TIADSSKEPK SEAKVTAPND SLPAVTPTLE VVSAEMSTAS EKSATDAPLD PQPTDSSTIA ATPSDVNIFN VPTDTQKNPT QGSVPIVPDT PTATPASQVD PNEVEASKTP TGPTTSHPPT SSSESDSADL LGSGNGLDDS NLSNWESIDS LLTTTVASIT AGGGAAAAAA AVVNGNANIG GTANAGNAGS LNLQPKLTHG PQSESVFIRL SNRIKALERN MSLSGQYLEE LSRRYKKQVE ELQQTLTQQS LTVRTLEDQS RRFIDQEQLY KQQSAELAGE VRALTYQVQA CILVIIIVGT CIFLMLVLGT VYYRKLRRQT QQLLPQATTD NSQQKLSRRK SYEQMTQQSP GKQRRPSEEA MLILNGCGDS SLTVQSDAGN GNGRQRKISV CYGSNNNIAN HMSNTRPSLH RRKGAKHSWH ASLDSAQVTC GENPDKFFDV DTLKSNKMQN KSAKRKSLQQ FQQELKRQES APASYTENSL TEEPTQSDFD ESLILDDEDL CNFIPNTDLA YNEFMPEGPS GYQILDTVDG KADKGQTVKK CRRVSSPAFF KSPFSKNKNK GGSLNGMGGV KGSQSAHEAT SWEWYRLRRS EKQSKQLSVP SVSISSTPNA SLDNSSLSEI NFPFNSTQNS FRILGEAILS SGESRPSGKD SAVAAGSSSS SGASTTSSTT KKKQRAFNNI FRKVF // ID B4MUP6_DROWI Unreviewed; 1063 AA. AC B4MUP6; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GK14764 {ECO:0000313|EMBL:EDW76241.1}; GN Name=Dwil\GK14764 {ECO:0000313|EMBL:EDW76241.1}; GN ORFNames=Dwil_GK14764 {ECO:0000313|EMBL:EDW76241.1}, GN GK14764 {ECO:0000313|FlyBase:FBgn0216769}; OS Drosophila willistoni (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7260 {ECO:0000313|Proteomes:UP000007798}; RN [1] {ECO:0000313|EMBL:EDW76241.1, ECO:0000313|Proteomes:UP000007798} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14030-0811.24 {ECO:0000313|Proteomes:UP000007798}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH963857; EDW76241.1; -; Genomic_DNA. DR RefSeq; XP_002065255.1; XM_002065219.1. DR EnsemblMetazoa; FBtr0245415; FBpp0243907; FBgn0216769. DR GeneID; 6641879; -. DR KEGG; dwi:Dwil_GK14764; -. DR FlyBase; FBgn0216769; Dwil\GK14764. DR eggNOG; ENOG410JH9E; Eukaryota. DR eggNOG; ENOG41113P8; LUCA. DR InParanoid; B4MUP6; -. DR OMA; INFACEA; -. DR OrthoDB; EOG7SJD4K; -. DR PhylomeDB; B4MUP6; -. DR Proteomes; UP000007798; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007798}; KW Reference proteome {ECO:0000313|Proteomes:UP000007798}. SQ SEQUENCE 1063 AA; 115724 MW; DBF608A58E13FBC7 CRC64; MTDNKANNVG INRLRDEFDD MSHTLRIQNK DSTEPGSSKF GCLAGESKPT SQWDSRDINT YVDSMIRRKI GNLMDDVYSL KKKIISTDCK SNFKHATSKS DQVAVIKNRI NYASEDLGAK VINVLALPIH VQNIWQILLG LDYNSNPPVN MLRPSLTTGA CFCYNGCKAT IAMRLAKKIV VDTITYTHVA KEMTPNMCVN SAPKNFDGVH PDTSKRELLG HWVYNNVPNR RTQSYSVNSK TFFQKLVFVF NSNHGANSTC LYYAVNTYNS SDVALDKQED QHHRLRHRQS DEAATATTTT TTMGTKQKQA ASAALGEEDD GRGGAGGQKA EAAGAACATM EAEAEGGAIA AAAAADKIGW PAGLPYFDLD RDGYGLAAAA AALTPTDIEE ALAGAGVGGG GLSSSRTSLN NSTENLSYAS DNYYGDDLIL LDDDDDVEAE EISLNSDDCV YAYRGDRGDF DISLEGALPS RGGIGMGMRS GMASGPGIGG NGRHHLDVII GRDDETEFLE MDFEPDPSSE LELDGGMAAM PQANLLLMQR DFSQMSESRQ LYSSPIDDYP PQEDVLQQQK QRLSKKFARI SLNLDQIKDH TGVDLDLDME LRSGANVDGV GLPEERHEEL VPESLTQTAH SLPSSFSRST SVPPNDTEPK LTGAKPKRLM SKSSSSSSKP RRSQNSSSFD ERCFSCTDFR LGSNLQAAAS SASSTQQLDP TVAPMLVAAA AVVAMASSNS NHSSSSMAQF DLTNEQETTC LDCLEKEFLA NTIGKALDLT SCAKCRRRGH AHKGSSLSLY NQTPRCRSGS PVFFDELSGG IYTIWPTAQA QVPPPSLPPP AAAPTYNQRF ISCDNNFGGL PLLKPSFSTE PPSASLMVVS ARLSTSSSQL CDEEHLVQAL DKLHISYDKT LLHLYYERAE RDTQTVASIR PSSFDGNLKE LLLHVSKQQC NHRKLKKLIE LVTQQQLNVQ FKRTNESKPA FVPVKVSDIL EAWSRHRDLS VLRQLDNRFH QANVMGKIGH IVRQAAKGSA TVTSSSSTAA SSSSTATVRR ALPEFIMIPQ YYACGELTLT RKC // ID B4MVJ7_DROWI Unreviewed; 1218 AA. AC B4MVJ7; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=GK15092 {ECO:0000313|EMBL:EDW75717.1}; GN Name=Dwil\GK15092 {ECO:0000313|EMBL:EDW75717.1}; GN ORFNames=Dwil_GK15092 {ECO:0000313|EMBL:EDW75717.1}, GN GK15092 {ECO:0000313|FlyBase:FBgn0217097}; OS Drosophila willistoni (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7260 {ECO:0000313|Proteomes:UP000007798}; RN [1] {ECO:0000313|EMBL:EDW75717.1, ECO:0000313|Proteomes:UP000007798} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14030-0811.24 {ECO:0000313|Proteomes:UP000007798}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH963857; EDW75717.1; -; Genomic_DNA. DR RefSeq; XP_002064731.1; XM_002064695.1. DR STRING; 7260.FBpp0244235; -. DR EnsemblMetazoa; FBtr0245743; FBpp0244235; FBgn0217097. DR GeneID; 6642416; -. DR KEGG; dwi:Dwil_GK15092; -. DR FlyBase; FBgn0217097; Dwil\GK15092. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B4MVJ7; -. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B4MVJ7; -. DR Proteomes; UP000007798; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007798}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007798}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 813 834 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 133 160 {ECO:0000256|SAM:Coils}. FT COILED 749 797 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1218 AA; 132756 MW; 7BE12BA518BE6AE3 CRC64; MYILTLFVCS FPEIITELPT VTVTELPLEH IKNRLDAVIL DKAEQAATAS NKSDPREQQE ENSKELPQQD EQILKINDPG GGVDTDESIL APVEGSINET LTASNDGVGQ ANDSDVITSN LTEEVPMPVF SEWAQKQMEA EASREQAMEL EQQVVNNSAQ RRNATSSGNG KQPTLKLRAK NYASPDCGAK IIASNADATN TGAVLSHSSD EYMLSTCGSR IWFVVELCEA IQAQKVELAN FELFSSSPKN FTVAVSKRFP TRDWSNVGRF AAEDKRTVQT FELHPHLFGK FVRVDIHSHY SKEHFCPVSL FRVFGTSEYE AFETEIRPND ELDDFYDDFG GQQDQAHKSS GGGAGGSGGN GAGGGGAGSG GGGNIFQSAS DAVMQIVKKA AEVLVKPTKA FKWSSESLLC RTASFDSFSC ATCNVTLVER VNTLISCQFH QLQLLLHLPQ LREDLLKSQI CEEEYGISLT SASLTFTWAK RKSYYVSMLP PEQVGALCKL LQVEHNITVE QQPATLEKPL DLPRKEEQIN LNAPEISQEI AQEEKPVEFL SQSSTPSLEL LNPQTINSQE LPVKETPSIS DQPIASKPSQ QLPPTPADVN IFNVPPVLEE AQAVPSPTQE PPALLGTEAS STEDSSTVAA STSSTTETAS SSSDLAIEET NPANWESIDN LLTTTVASIT AGGGAAAAAA AVVNGGGGAT VGGAIPAGSN GVNLQQKLTN GAQSESVFIR LSNRIKALER NMSLSGQYLE ELSRRYKKQV EELQQTLTQQ TLTVRSLEDQ SRRYIEQEQL YQQQSAELAG EVRALTYQVQ ACILVIIIVG TCIFLMLVMG TVYYRKLRRQ TQQLLPTAAA AATSPSKQKL KLSRRKSYEQ MTTTTTGESQ QSPAKQRRPS EEAMLILKEC GDSSSNAEDP SMRQRKISVC YGSNNNIAAN MLNASARASL HKRKGAKNSW HKEESVTDKF FDVDTLKSLK QQPPQIKANR KKSQPMGVGN GLKRQESAPA TYSPLDEHSE EPSQTDYDES LILDDDDLAN FIPTSDLAYN EFMPDGPSGY QILDTVDGKV DPVQENNASN NSANKKTRRL SSPGFFKSPF GKSKNKGGSF NGGIKNSHSA HEATSWEWYR LKRNEKHQQK LAARMNANSL DSASLSEVPF PLTSTENSFR ILGEAIISSG ESVNHNGNGL IQKIPLPTSS NSSSGGSTTS STTKKKQRAF NNLFRKVF // ID B4MY36_DROWI Unreviewed; 563 AA. AC B4MY36; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 32. DE SubName: Full=GK22143 {ECO:0000313|EMBL:EDW77025.1}; GN Name=Dwil\GK22143 {ECO:0000313|EMBL:EDW77025.1}; GN ORFNames=Dwil_GK22143 {ECO:0000313|EMBL:EDW77025.1}, GN GK22143 {ECO:0000313|FlyBase:FBgn0224128}; OS Drosophila willistoni (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7260 {ECO:0000313|Proteomes:UP000007798}; RN [1] {ECO:0000313|EMBL:EDW77025.1, ECO:0000313|Proteomes:UP000007798} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14030-0811.24 {ECO:0000313|Proteomes:UP000007798}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH963894; EDW77025.1; -; Genomic_DNA. DR RefSeq; XP_002066039.1; XM_002066003.1. DR STRING; 7260.FBpp0251286; -. DR EnsemblMetazoa; FBtr0252794; FBpp0251286; FBgn0224128. DR GeneID; 6643293; -. DR KEGG; dwi:Dwil_GK22143; -. DR FlyBase; FBgn0224128; Dwil\GK22143. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B4MY36; -. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B4MY36; -. DR Proteomes; UP000007798; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007798}; KW Reference proteome {ECO:0000313|Proteomes:UP000007798}. FT COILED 221 241 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 563 AA; 64133 MW; 83059FEF58A8E1DB CRC64; MGATLKNIIR EAPQLNEAEA IKLNLANVQN NMKKSLSPEE YENILNHVNG YVQQLMEIKL LKQREDHKEQ AKQLSPQQLA IIVGLIQEHL QGLSPERQEL SEDDLSQLID KLKLKWQPEW RLTPTNVEEI TKLIKSNVAI EENHYNLLLE RIDLTAVVEK ILLSPELASF VDQRIDGKLK DFKSSFPSND EQIEQLNTEI AFIKLALSDK HAENADLHQS ISNLKLSQED LLERMQQHEL AQDQRFSGLL AEIESKLASL NDSQFVALNK QVKLSLVEIL GFKQSDSAVK LDDIDLQNWV RSMFVAKDYL EEKLLELNKR TNNNIRDEIE RSSILLMSDI SERLKKEILL SVDVKHNASV KALKGQIREE EVRQIVKSVL AIYDADKTGL VDFALESAGG QILSTRCTES YQTKSAQISV FGIPLWYPTN TPRVAISPNV QPGECWAFQG FPGFLVLKLN SLVYVTGFTL EHIPKSLSPT GRIDSAPKNF TVWGLEHEKD QDPILFGEYE YQDNGASLQY FAIQNLDIKR PYEIVELRIE TNHGQPTYTC LYRFRVHGKP PAA // ID B4NQM7_DROWI Unreviewed; 2700 AA. AC B4NQM7; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 51. DE SubName: Full=GK23521 {ECO:0000313|EMBL:EDW86666.1}; GN Name=Dwil\GK23521 {ECO:0000313|EMBL:EDW86666.1}; GN ORFNames=Dwil_GK23521 {ECO:0000313|EMBL:EDW86666.1}, GN GK23521 {ECO:0000313|FlyBase:FBgn0225483}; OS Drosophila willistoni (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7260 {ECO:0000313|Proteomes:UP000007798}; RN [1] {ECO:0000313|EMBL:EDW86666.1, ECO:0000313|Proteomes:UP000007798} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tucson 14030-0811.24 {ECO:0000313|Proteomes:UP000007798}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH964295; EDW86666.1; -; Genomic_DNA. DR RefSeq; XP_002075680.1; XM_002075644.1. DR ProteinModelPortal; B4NQM7; -. DR STRING; 7260.FBpp0252664; -. DR EnsemblMetazoa; FBtr0254172; FBpp0252664; FBgn0225483. DR GeneID; 6653305; -. DR KEGG; dwi:Dwil_GK23521; -. DR FlyBase; FBgn0225483; Dwil\GK23521. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; B4NQM7; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B4NQM7; -. DR Proteomes; UP000007798; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007798}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007798}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1302 1329 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2700 AA; 300747 MW; C9CF082AB92BB00D CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNRLVVAN ISSRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLSF IRDCGSHVHK DTLHSAMSVV SRLCTKVEPS SPCIQNCVES LSTLLQHEDS MVSDGALKCF ASVADRFTRK WIDPAPLAEY GLVTELLRRL ENAGGATNVH TLNSSKSSRF QQPGLASILA NSDASQNTIS GIDTALNSQI QSVDSTRSPQ STSTTISLLS TLCRGSPSIT HSLLRSQLPK ALEQALNGDE RCVLDCMRFA DLLLLLLFDG RQALNRANET SNQGQLVPRP RRNDVNADRT HRQLIDCIRS KDTDALQEAI DGGGVDINCM DDVGQTLLNW ASAFGTLEMV EYLCEKGADV NKGQRSSSLH YAACFGRPAI AKILLKFGAY PDLRDEDGKT PLDKARERSD DGHREVATIL QSPGEWISSD NTLGNKDGKN YTLIEPRGDI EMAPIYLKLL LPIFCKTFQG TMLNSVRRGS LGLIKKIVQY ATPAVLKALK QNSHNEANTS ITFLSTSNLL IEVIATVLDN EDDDDGHIIV LNIIEELMSK TQDDFLEHFA RLGIFSKVQA LMDIEAPKNL PHITTTSMEE SSTKRSASLC KFIKYFTDES IEDAKEILQG KPYHWREWSI CRGRDCLYVW SESAALELSN GSNGWFRFIL DSKLATMYSS GSPENGNDSS ENRGEFLEKL LRARSSVSPG MISQPILPTA SVLRLVVGNW VLQSHKPNQL QIHNTEGHQV TILQDDLPGF IFESNRGTKH TFSAETVLGP DFSSGWTTCK KKRLKSKTDV QKLHVWNLSR EIYNKYFKSA QIITRGSVAK LSSIVGRVGL SMKEQKESST SNWSKTLSTA LNDLSQLLQE DGIVSAYEMH SSGLVQVLLS VLSKNDWHGN FSQWKLNEMH KQRIAVFKKC ILLGNNDSRP SHINSKSTAT ILLHKLIAVL ESTEKLPIYL YDAVGAGYGL QILQKRIRFR LERAACESTL FDRTGRTLKM EPLATVGQLT KYLLKMVAKQ WYDMDRSTYF YLKIMRESKS GLSFLHSFDF DEEGLLYYIG SNAKTYEWVN PAQYGLVQVT SSEGKVLPYG KLEDILSRDS VSLNCHTKDN KKAWFSIDLG VYIIPNAYSL RHARGYGRSA LRNWLLQSSR DGLTWITLIN HVDDKSLVEP GSTATWPIIC ASDETVGYRH IRIQQNGRNA SGQTHYLSLS GFEIYGKVVG VCEDIGRSVK EAEAKIRRER RQIRAQLKHL TAGARVVRGV DWRWEDQDGF SEGTITGEIH NGWIDVKWDH GVRNSYRMGA EGKYDLKLAN CENLSIFEGG NYAAPVGLSS NNKKTEKNST LTSRKSISTP SLPEATEENR NSDESSIQTT SADNLAWKHT VGTITENVFA SGKTQVTSNQ RTAKTTSTDI PIKLQESHCA NQASKSAVTC APPLIRELEH VSDLSAINNS MSAINSDLAT ISENITLAVI TKENICSVLS TTPRAIVSDA CSVSGETNIK PISHIEENNK MNANNTGNIL AKGLHMSMRS ASLRGDESVS QFSSEDAENI DKIHDGVDML RKNTNNILST EVHPFPKSQN LKLNTETKPQ SNCKSFPESQ IVVLMDELFT RNIDNSSTST SNQQSKIDGQ VGKIADTGML SEVAAVSNQM SISVPNLATT SASEVSSTIE VSAHTGLLET FGAIARRRTS QGANIQDNQI INVHNNTNEL SNQNVGTSSF FAKGPNSVTS LVKMALSSNF HSGLLSTAQS YPSLSSNNER GSAPPNGTSS NTSQQPVSSI NHTLTMSLTS TSSDSEQVSL EDFLESCRAP ALLGDLDDDD DMDEDNEEEE NEDEYEEVGN TLLQVMVSRN LLTFMDDESL ENRLVGVTKR KSWDDEFVLR RQFSALIPAF DPRPGRTNVN QTTDLEILKP GSKIENSNES GYIHVKQPNL SLKLRGPGVG GVPDVEIELD NADWTIFRYI QELLQNSQLN KVDKFRKVWE PTYTIVYKEV LPDTRENICF ESDQTSEISS KSGASTLSPN SPMHGGINTN EHSLCSVDEI LELLTQINSL NQNKLGCESK DCVLPPELFI SKKITNKLQQ QIQDPLVLSS NALPSWCEEL NQSCPFLFPF ETRQLYFNCT SFGASRSIVC LQSQRDNTIE RQRLPLLSNR RDDQPDFRVG RIKHERVKVP RNQDLLKWAM QVMKTHCDRK SVLEVEFLDE EGTGLGPTLE FYSLVAAELQ RADLCMWLCD NESINRPEST KFKENVNEES KPVGYYVNIR DHGLFPAPLP QNSEVCKNVC EYFWFFGVFI AKVLQDMRLV DMPLSTSFLQ LLCHKHLSSQ NCQKVLSEHS ELVDTCSKLI NASSENIEFN MFGGILDLEN LKEIDPTRYT FLMEVKNLLL RKQSIELDNS INFEEKEKII NDLKLCTMSG IEVSIEDLAL TFTYSPSSPI YEYGHIELLP NGASIDVNIH NLKVYCDLII SYILQDGIAN QLKAFKDGFN KVFPLKKLAA FCPSEARLML CGEQHPQWSR EDIMAYTEPK LGYSKDSPGF LRFVNVLLSL SGAERKSFVQ FITGCSSLPP GGFANLHPRL TVVRKVDAGE GSYPSVNTCV HYLKLPDYPT EEILKERLLT ATKEKGFHLN // ID B4NRY8_DROSI Unreviewed; 270 AA. AC B4NRY8; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=GD11975 {ECO:0000313|EMBL:EDX15366.1}; GN Name=Dsim\GD11975 {ECO:0000313|EMBL:EDX15366.1}; GN ORFNames=Dsim_GD11975 {ECO:0000313|EMBL:EDX15366.1}, GN GD11975 {ECO:0000313|FlyBase:FBgn0183714}; OS Drosophila simulans (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7240 {ECO:0000313|Proteomes:UP000000304}; RN [1] {ECO:0000313|EMBL:EDX15366.1, ECO:0000313|Proteomes:UP000000304} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Mixed {ECO:0000313|EMBL:EDX15366.1}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH981676; EDX15366.1; -; Genomic_DNA. DR RefSeq; XP_002076141.1; XM_002076105.1. DR UniGene; Dsi.6448; -. DR EnsemblMetazoa; FBtr0211885; FBpp0210377; FBgn0183714. DR GeneID; 6739231; -. DR KEGG; dsi:Dsim_GD11975; -. DR FlyBase; FBgn0183714; Dsim\GD11975. DR OMA; HVAREMT; -. DR OrthoDB; EOG7VQJCX; -. DR PhylomeDB; B4NRY8; -. DR Proteomes; UP000000304; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000304}; KW Reference proteome {ECO:0000313|Proteomes:UP000000304}. SQ SEQUENCE 270 AA; 29734 MW; 607C97F137A93D8F CRC64; MAHNSRNNLG IMRLREDVDD ISHILRQQQI DSKGAQGSSK FDCLGGEPKG LGSSGRCSNR DVSAYVDTLF KRKIGHLMDD VYNLKKQVKS ADCASKGAQS TPKPESAALA KPRINYASED LGARIINVKA KSLDGTNIIR SVLGLDFSSN PPVNMIRAGL SPGSCFGFNG TRATVTLHLA RTIIVEAITL THVAREMTPD LCVKSAPKTF DAYGLRIDNS KRELLGQWNY DNAANKRTQS YSVRSDYYLR DQDFSFTSNH GANTICIYRH // ID B4NZ85_DROYA Unreviewed; 2725 AA. AC B4NZ85; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 52. DE SubName: Full=GE26196 {ECO:0000313|EMBL:EDW88780.1}; GN Name=Dyak\GE26196 {ECO:0000313|EMBL:EDW88780.1}; GN ORFNames=Dyak_GE26196 {ECO:0000313|EMBL:EDW88780.1}, GN GE26196 {ECO:0000313|FlyBase:FBgn0243230}; OS Drosophila yakuba (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7245 {ECO:0000313|Proteomes:UP000002282}; RN [1] {ECO:0000313|EMBL:EDW88780.1, ECO:0000313|Proteomes:UP000002282} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tai18E2 / Tucson 14021-0261.01 RC {ECO:0000313|Proteomes:UP000002282}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). RN [2] {ECO:0000313|EMBL:EDW88780.1, ECO:0000313|Proteomes:UP000002282} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tai18E2 / Tucson 14021-0261.01 RC {ECO:0000313|Proteomes:UP000002282}; RX PubMed=17550304; DOI=10.1371/journal.pbio.0050152; RA Ranz J.M., Maurin D., Chan Y.S., von Grotthuss M., Hillier L.W., RA Roote J., Ashburner M., Bergman C.M.; RT "Principles of genome evolution in the Drosophila melanogaster species RT group."; RL PLoS Biol. 5:E152-E152(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000157; EDW88780.1; -; Genomic_DNA. DR RefSeq; XP_002089068.1; XM_002089032.1. DR ProteinModelPortal; B4NZ85; -. DR STRING; 7245.FBpp0271206; -. DR EnsemblMetazoa; FBtr0272714; FBpp0271206; FBgn0243230. DR GeneID; 6528001; -. DR KEGG; dya:Dyak_GE26196; -. DR FlyBase; FBgn0243230; Dyak\GE26196. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B4NZ85; -. DR Proteomes; UP000002282; Chromosome 2L. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002282}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1302 1329 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2725 AA; 302192 MW; 59CBA87E5A7A734B CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNHLVVAD LSSRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLSF IRDCGSQVHK DTLHSAMSVV SRLCTKVEPN TPCIQNCVES LSTLLQHEDS MVSDGALKCF ASVADRFTRK WVDPAPLAEY GLTTELLKRL KSVGGNNHSS LSAAGTQPTS SSQPVATTNS DAINENVAGT ATISNSTKVK SSDAAASPQS ISTTISLLST LCRGSPSITH DILRSQLADA LERALQGDER CVLDCMRFAD LLLLLLFEGR QALNRGSNNP NQGQLAPRPR RNNTNTDRTH RQLIDCIRSK DSEALREAIE SGGIDVNCMD DVGQTLLNWA SAFGTLEMVE YLCEKGADVN KGQRSSSLHY AACFGRPAIA KILLKFGAYP DLRDEDGKTP LDKARERLDD GHREVAAILQ SPGEWMSPDH SLLNKDGKKY TLMEPRGDPE MAPIYLKVLL PIFCRTFLGS MLGSVRRASL ALIKKIVQYA YPTVLQSLSE TSFSEDAAST SEQNGGNLLI EVVASVLDNE DDDEGHLIVL NIIEEIMCKT QEEFLDHFAR LGVFAKVQAL MDNDAEELYV QLPGSSEEPA PTQRSSTSMA VTPRSTSDDP MEDAKEILQG KPYHWREWSI CRGRDCLYVW SDSVALELSN GSNGWFRFII DGKLATMYSS GSPENGNDSS ENRGEFLEKL MRARSCVIAG VVSQPILPTA SALRLVVGNW VLQSQKTNQL QIHNTEGHQV TVLQDDLPGF IFESNRGTKH TFSAETVLGP DFASGWSTAK KKRNKSKTEG QKFQVRNLSR EIYNKYFKSA QTIPRGAVAI LTDIVKQIEL SFEEQHMAPN GNWETTLSDA LMKLSQLIHE DGVVSAYEMH SSGLVQALVA VLSVNHWETN SPRCKRNKMQ KQRVSVFKKC ILEDNVESAT NKPRTKSTAS ILIQKLVSVL ESTEKLPVYL YDTPCTGYSL QILQKRLRFR LERAECESTL FDRSGRTLKM EPLATIGQLS KYLLKMVAKQ WYDLDRSTYF YLKKIREHRT GTVFTHSFDF DEEGLLFYIG SNAKTCDWVN PAQYGLVQVT SSEGKTLPYG KLEDILSRDS ISLNCHTKDN KKAWFAIDLG VYIIPTAYTL RHARGYGRSA LRNWLLQGSK DGLTWTTLST HVDDKSLVEP GSTATWPITC ATDDSVRYRH IRIQQNGRNA SGQTHYLSLS GFEIYGRVVG VADDIGKSVK EAEAKIRRER RQIRAQLKHM TTGARVIRGV DWRWEEQDGC AEGTITGEIH NGWIDVKWDH GVRNSYRMGA EGKYDLKLAD CEFLSTFDGN QSIGTASAAP KASEKGNTLT SRKSSSTPSL PEATEKNQNT EGASNQTVSA DNLAWKQAVE TIAENVFASA KTQIISNQLA MNTSSSREAR AKHKESGTNQ VHKDNISGPS PLSRELEHIS DLSAINNSMP AINSSIVSDL ATISENLSLT ELSKENICSV LTPSYKPAES VTASQSSSHP DVQSSSPREN DIKNISNIEE NNKMNANNSV NKISKDLLAN LRTSNIAGCP PVTQLSTEAL EMIDKMRDGV DMIRNNSNNI LSTDTFPVPC TNVPVGVKKT PKAQALINPD NANQKQIIVT TEEFPTKSSK KPSVTLKPGQ QPNAVLSIVD IKDPQIATEN VSVPSQMSIS VPNLTTTSAS EVPSTSEVAT HTGLLETFAA IARRRTSQGT NIQDNQIMNA ESNVNEHGDQ NASGSFLGHS VTSLVKLALS SNFHSGLLST AQSYPSLSSN NSENIAPSNP TNTSAGQQSA STINHTLTMS LTSTSSDSEQ VSLEDFLESC RAPALLGDLD DEDDMDEDND EEENEDEYEE VGNTLLQVMV SRNLLTFMDD EAMENRLVGV TKRKSWDDEF VLKRQFSALI PAFDPRPGRT NVNQTSDLEI SPLGVELPKP QQSGPETIEQ PLLGLKLRGP GIGGIPEVEI DLSNTDWTIF RAVQELLQCS QLNKMDKFRK IWEPTYTIVY REVSPEAQET TCLDPEEFPQ TPDVSSKSGA STLSPNSPMH IGFNVADNNL CSVDDVLELL TQINGLNQSE IDSDVKEHGV SVLSEDLFIS KKITNKLQQQ IQDPLVLASN ALPNWCENLN QSCPFLFPFE TRQLYFNCTS FGASRSIVCL QSQRDVTVER QRIPIMSPRR DDHEFRIGRL KHERVKVPRN EDLLMWAMQV MKTHCNRKSV LEVEFLDEEG TGLGPTLEFY ALVAAEIQRS DLCMWLSDDE LGEDTESSPL SAEGNSKPVG YYVNRREHGI FPAPLPQNTE ICEKVLKYFW FFGVFVAKVL QDMRLVDIPL STSFLQLLCH NKVLSRNLQK VISDRRNGDL SVVSEESDIV ETCTKLLRTD SNKSNAFGGI LSLENLKEID PTRYQFLQEM QNLLMRKQSI EFDETISAEK KQELINELKL HTQNGLEVSL EDLALTFTYL PSSSIYGYTQ AELLPNGSSV NVTIDNLEAY CELLMNFILQ DGIAQQMKAF SDGFNEVFPL KKLAAFTPSE ARMMICGEQF PHWSREDIIS YTEPKLGYNK DSPGFQRFVN VLLSMSGDER KAFLQFTTGC SSLPPGGLAN LHPRLTVVRK VDAGVGSYPS VNTCVHYLKL PDYPTEEIMK ERLLTATKEK GFHLN // ID B4P194_DROYA Unreviewed; 563 AA. AC B4P194; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 36. DE SubName: Full=GE19078 {ECO:0000313|EMBL:EDW89096.1}; GN Name=Dyak\GE19078 {ECO:0000313|EMBL:EDW89096.1}; GN ORFNames=Dyak_GE19078 {ECO:0000313|EMBL:EDW89096.1}, GN GE19078 {ECO:0000313|FlyBase:FBgn0236439}; OS Drosophila yakuba (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7245 {ECO:0000313|Proteomes:UP000002282}; RN [1] {ECO:0000313|EMBL:EDW89096.1, ECO:0000313|Proteomes:UP000002282} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tai18E2 / Tucson 14021-0261.01 RC {ECO:0000313|Proteomes:UP000002282}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). RN [2] {ECO:0000313|EMBL:EDW89096.1, ECO:0000313|Proteomes:UP000002282} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tai18E2 / Tucson 14021-0261.01 RC {ECO:0000313|Proteomes:UP000002282}; RX PubMed=17550304; DOI=10.1371/journal.pbio.0050152; RA Ranz J.M., Maurin D., Chan Y.S., von Grotthuss M., Hillier L.W., RA Roote J., Ashburner M., Bergman C.M.; RT "Principles of genome evolution in the Drosophila melanogaster species RT group."; RL PLoS Biol. 5:E152-E152(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000157; EDW89096.1; -; Genomic_DNA. DR RefSeq; XP_002089384.1; XM_002089348.1. DR STRING; 7245.FBpp0264088; -. DR EnsemblMetazoa; FBtr0265596; FBpp0264088; FBgn0236439. DR GeneID; 6528331; -. DR KEGG; dya:Dyak_GE19078; -. DR FlyBase; FBgn0236439; Dyak\GE19078. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B4P194; -. DR Proteomes; UP000002282; Chromosome 2L. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002282}. FT COILED 190 210 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 563 AA; 63766 MW; 56B4FF328491228A CRC64; MEVPTVISPQ QEDEAIKVNM ASIERNIQKA LTAEEYENIL DHVNSYVQQL VELKMNQHSK ELPPQQIQLI AQMIKKNLQQ IAYKTELSEK DLADLAIKLK LELQSSGGWQ DGAKLSQANL EEITKLVKSE VQFHQSHNTI KLDTIDLPAL LKQILSMPAL ADFVDARISL QVRELEKKEG SGSTDAEVQI ERLNREIAFI KLALSDKQAE NADLHQSISN LRLGHEDLLE RIKQHELAQD RRFHGLLAEI ETKLSALNDS QFALLNKQIK LSLVEILGFK QSTAGGAAGQ LDDFDLQTWV RSMFVAKDYL EQQLLELNKR TNNNIRDEIE RSSILLMSDI SERLKREILL VVEAKHNEST RALKGHIREE EVRQIVKTVL AIYDADKTGL VDFALESAGG QILSTRCTES YQTKSAQISV FGIPLWYPTN TPRVAISPNV QPGECWAFQG FPGFLVLKLN SLVYVTGFTL EHIPKSLSPT GRIDSAPRNF TVWGLEQEKD PEPVLFGEYE FEDNGASLQY FAVQNLDIKR QYQIVELRIE TNHGQPTYTC LYRFRVHGKP AAT // ID B4P1I3_DROYA Unreviewed; 299 AA. AC B4P1I3; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 34. DE SubName: Full=GE12763 {ECO:0000313|EMBL:EDW88090.1}; GN Name=Dyak\GE12763 {ECO:0000313|EMBL:EDW88090.1}; GN ORFNames=Dyak_GE12763 {ECO:0000313|EMBL:EDW88090.1}, GN GE12763 {ECO:0000313|FlyBase:FBgn0230485}; OS Drosophila yakuba (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7245 {ECO:0000313|Proteomes:UP000002282}; RN [1] {ECO:0000313|EMBL:EDW88090.1, ECO:0000313|Proteomes:UP000002282} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tai18E2 / Tucson 14021-0261.01 RC {ECO:0000313|Proteomes:UP000002282}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). RN [2] {ECO:0000313|EMBL:EDW88090.1, ECO:0000313|Proteomes:UP000002282} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tai18E2 / Tucson 14021-0261.01 RC {ECO:0000313|Proteomes:UP000002282}; RX PubMed=17550304; DOI=10.1371/journal.pbio.0050152; RA Ranz J.M., Maurin D., Chan Y.S., von Grotthuss M., Hillier L.W., RA Roote J., Ashburner M., Bergman C.M.; RT "Principles of genome evolution in the Drosophila melanogaster species RT group."; RL PLoS Biol. 5:E152-E152(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000157; EDW88090.1; -; Genomic_DNA. DR RefSeq; XP_002088378.1; XM_002088342.1. DR STRING; 7245.FBpp0257773; -. DR EnsemblMetazoa; FBtr0259281; FBpp0257773; FBgn0230485. DR GeneID; 6527284; -. DR KEGG; dya:Dyak_GE12763; -. DR FlyBase; FBgn0230485; Dyak\GE12763. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR OMA; HVAREMT; -. DR OrthoDB; EOG7VQJCX; -. DR PhylomeDB; B4P1I3; -. DR Proteomes; UP000002282; Chromosome 2L. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002282}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 12 30 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 299 AA; 33302 MW; 0F4FA6ABFC9687BB CRC64; MDGCRRARKR VFLTYLLSFV LLSTFFYYLM AHNSRNNLGI MRLREDVDDI SHILRQQQTG SGKFNCLGGE PKGLGSGSCA NRDVSAYVDT LFKRKLGHLM DDVYNLKKQV MSADCSSKSA QSTPKPESAS LAKPRINYAS EDLGARIINV KAKPIGGTNF IKRLLGLDFS ANPPVNMIRA GLSPGACFGF NGHQANVTLH LAKTIIVEAI SLTHVAREMT PSLCVKSAPK NFDVYGLRAD NSKRELLGQW SYDNAANRRT QSYSVRSEYF YRNLAFSFNS NHGANSTCIY RVEVYGRLQ // ID B4P6L6_DROYA Unreviewed; 1449 AA. AC B4P6L6; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=GE13552 {ECO:0000313|EMBL:EDW90968.1}; GN Name=Dyak\GE13552 {ECO:0000313|EMBL:EDW90968.1}; GN ORFNames=Dyak_GE13552 {ECO:0000313|EMBL:EDW90968.1}, GN GE13552 {ECO:0000313|FlyBase:FBgn0231215}; OS Drosophila yakuba (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7245 {ECO:0000313|Proteomes:UP000002282}; RN [1] {ECO:0000313|EMBL:EDW90968.1, ECO:0000313|Proteomes:UP000002282} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). RN [2] {ECO:0000313|EMBL:EDW90968.1, ECO:0000313|Proteomes:UP000002282} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tai18E2 / Tucson 14021-0261.01 RC {ECO:0000313|Proteomes:UP000002282}; RX PubMed=17550304; DOI=10.1371/journal.pbio.0050152; RA Ranz J.M., Maurin D., Chan Y.S., von Grotthuss M., Hillier L.W., RA Roote J., Ashburner M., Bergman C.M.; RT "Principles of genome evolution in the Drosophila melanogaster species RT group."; RL PLoS Biol. 5:E152-E152(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000158; EDW90968.1; -; Genomic_DNA. DR RefSeq; XP_002091256.1; XM_002091220.1. DR STRING; 7245.FBpp0258562; -. DR EnsemblMetazoa; FBtr0260070; FBpp0258562; FBgn0231215. DR GeneID; 6530329; -. DR KEGG; dya:Dyak_GE13552; -. DR FlyBase; FBgn0231215; Dyak\GE13552. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B4P6L6; -. DR Proteomes; UP000002282; Chromosome 2R. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002282}; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1449 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002820948. FT TRANSMEM 1037 1058 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 377 404 {ECO:0000256|SAM:Coils}. FT COILED 973 1021 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1449 AA; 159185 MW; E937FF5E34E296FF CRC64; MHMRLHLVRF MFINLLLSCC FWLYDNVAAA DPQIQSSDGI SAKDAARKTH IDPEEPEAPP KSEPPPPPLY DGAGVLLPTP HEGAPIVPTS SSSAVPQTGN RPISGDTSVP VKELPELSDQ RRANSHSEYV SEIDIAADVG DASSSTAQVP NNQLNYNKRS SAGKVHSFSE SQQLEQPKQL EQEGPAPGQP QSPELQEQQP PQADESSHFD FQWCDRVHRY HAGGATSSSS SRRWVFGIAS RSRSRSRIHY LVSLLSFFPE IITELPTVTI TELPLDRVKN RLDSVILDAA LATAGNHSDE EQHHQQPPLQ DQPIQINDPG GGIEVERIAT QEATTVGVTQ ESSEELQPRS TAFNETAGTA NLTKASEEVP MPVFSEWAQK QMEAEASREQ AMELEQQVVN KSAQRKNNTG SSNGKLPTLK LRSKNYASPD CGAKIIAHNS ESKHTEAVLT QSTDEYMLST CESRIWFVVE LCEAIQAQKV DVANYELFSS SPKNFTVAVS KRFPTRDWSN VGRFAAEDKR TIQTFDLHPH LFGKFVRVEI TSHYANEHFC PLSLFRVFGT SEYEAFETEI RPSDDLDDFY DDYGAHEQKA AVGSGGNIFQ SASDAVMQMV KKAAEVLVKP TKALKWSAES VLCQTPAFEA YSCSNCNTTL VERINRLLSC QFQQLQALLS LSRLRSDLVR SRVCQEEFGI SLMGSDFASK MGKERSYFLS MLPAEHVGAM CKLIQAEQNV TEHHSKAPSL MQHVSTPESV QDNATATGIR QDCDSNKERQ PRTPIREPLT PSLEVVVPEI SQEVPSMEDL SPTSSEKAST TNSTPADVNI FNVPSESEEV LVKVQLPPEP TLPTTLELSD VESFTDAPST NAPPTSGEAS VNGDLGMEES NPANWEGIDN LLTTTVASIT AGGGAAAAAA AVVNGNGNIG GAGFVGAGGP AIASSVNMQQ KLTNGAQSES VFIRLSNRIK ALERNMSLSG QYLEELSRRY KKQVEELQQT LTQQTLTVRQ LEDQSRRYVE QEQLYQQHSA ELAGEVRALS YQVQACILVI IIVGTCIFLM LVLGTVYYRK LRRQQQQLLK KDQAGHPPVA AKPKLDRRKS YEQMPSQTTP KQRRPSEEAM LILKECGDTN MHEQDIPNRQ RKISVCYGSN NNIAANMLIA NTNGGANVRN SLHRRKGAKQ SWHNSLDTTE TSWGEQTDKF FDVETLKSSK QSSGKAGKKK SHQQLKPLGL KRQESAPATN TSDLQAEEPA TQSDFDESLM LDDDDLANFI PTSDLAYNEF MPEGPSGYQI VDNVDGKPPK EQGTKKSRRL SSPAFFKSPF SKSKNKGYSF NGVKNSHSVH EPTSWEWYRL KRSEKHQQQQ QAKLASKSLP SPSLDSSSLS EVNFPLSSST VTQNSFRILG EAILSSGEGR ITPNGNGNAT SGVLASSSSG SGSGGSTTSS TTKKKQRALN NLFRKAFDF // ID B4Q3G1_DROSI Unreviewed; 1414 AA. AC B4Q3G1; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=GD21690 {ECO:0000313|EMBL:EDX05642.1}; DE SubName: Full=Uncharacterized protein, isoform A {ECO:0000313|EMBL:KMY91187.1}; GN Name=Dsim\GD21690 {ECO:0000313|EMBL:EDX05642.1}; GN ORFNames=Dsim_GD21690 {ECO:0000313|EMBL:EDX05642.1}, GN Dsimw501_GD21690 {ECO:0000313|EMBL:KMY91187.1}, GN GD21690 {ECO:0000313|FlyBase:FBgn0193113}; OS Drosophila simulans (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7240 {ECO:0000313|EMBL:EDX05642.1, ECO:0000313|Proteomes:UP000000304}; RN [1] {ECO:0000313|EMBL:EDX05642.1, ECO:0000313|Proteomes:UP000000304} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Mixed {ECO:0000313|EMBL:EDX05642.1}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). RN [2] {ECO:0000313|EMBL:EDX05642.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=Mixed {ECO:0000313|EMBL:EDX05642.1}; RG FlyBase; RL Submitted (JUN-2008) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EMBL:KMY91187.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=W501 {ECO:0000313|EMBL:KMY91187.1}; RX PubMed=22936249; DOI=10.1101/gr.141689.112; RA Hu T.T., Eisen M.B., Thornton K.R., Andolfatto P.; RT "A second-generation assembly of the Drosophila simulans genome RT provides new insights into patterns of lineage-specific divergence."; RL Genome Res. 23:89-98(2013). RN [4] {ECO:0000313|EMBL:KMY91187.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=W501 {ECO:0000313|EMBL:KMY91187.1}; RA Hu T., Eisen M.B., Thornton K.R., Andolfatto P.; RL Submitted (JUN-2014) to the EMBL/GenBank/DDBJ databases. RN [5] {ECO:0000313|EMBL:KMY91187.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=W501 {ECO:0000313|EMBL:KMY91187.1}; RG FlyBase; RL Submitted (APR-2015) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000361; EDX05642.1; -; Genomic_DNA. DR EMBL; CM002910; KMY91187.1; -; Genomic_DNA. DR RefSeq; XP_002080057.1; XM_002080021.1. DR UniGene; Dsi.6484; -. DR EnsemblMetazoa; FBtr0221600; FBpp0220092; FBgn0193113. DR GeneID; 6732978; -. DR KEGG; dsi:Dsim_GD21690; -. DR FlyBase; FBgn0193113; Dsim\GD21690. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B4Q3G1; -. DR Proteomes; UP000000304; Chromosome 2L. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000304}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000304}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 29 FT CHAIN 30 1414 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002822996. FT TRANSMEM 1003 1024 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 344 371 {ECO:0000256|SAM:Coils}. FT COILED 939 987 SQ SEQUENCE 1414 AA; 154341 MW; 48BBBB6DD1F825D6 CRC64; MQMRLHLVRF MFINLLLSCC FWLYDNVAAA DSQIPSSDGG SAKDAARKTH IEPEEPAAPP RPEPPPPPLR DGAGVLLPSP QEAAPFVATS SSSSAPVPQT GNRPISGDTS VPVKELPALS DQQRANSHSE YVSEIDLAAD VGGSSTAQVP NNQLNYNKRS SAGKVHSFSE SQQLEQPKQL EQEGPATGQP QAPELQEPKQ PPQAEESFPE IITELPTVTI TELPLDRVKN RLDSVILDGS PATAGNHSDE EQHQQQQPHE DQHMQVLEAD EEVPQKDEQM PKINDPGGGI QVDGMVAQEA ATVGETQESS EELQPGSAAF NETEGTANLT NANEEVPMPV FSEWAQKQME AEASREQAME LEQQVVNKSA QRKNNTGSSS GKPPTLKLRS KNYASPDCGA KIIAHNSESK HTEAVLTQST DEYMLSTCES RIWFVVELCE AIQAQKVDVA NYELFSSSPK NFTVAVSKRF PTRDWSNVGR FAAEDKRTIQ TFELHPHLFG KFVRVDITSH YSNEHFCPLS LFRVFGTSEY EAFETEIRPS DDLDDFYDDY GAQEQKAAVG SGGNIFQSAS DAVMQMVKKA AEVLVKPTKA LKWSEESVLC QTPAFEAYSC SNCNATLVER INSLLSCQFQ QLQALLSLSR LRSDLLNSRV CHEEFGISLT GSEFASKMGK EQSYFLSILP AEHVGAMCKL IQAEQNVTDQ NHTKAPSLKQ HVSSPEPVQD NATATGVRQD CENSKERQPR KTPTKDPLTP SLEVVVPEVS QEVPSLEDQS STSSETVSTT NSTPADVNIF NMPSESEEVV KVQLSPEPTL PTTLQPSDVE SFTDAPSTNA LPASSEANGD LGMEEGNPAN WDGIDNLLTT TVASITAGGG AAAAAAAVVN GNGNIGGAGV VGAGGPASVS SVNMQQKLTN GAQSESVFIR LSNRIKALER NMSLSGQYLE ELSRRYKKQV EELQQTLTQQ TLTVRQLEDQ SRRYVEQEQL YQQHSAELAG EVRALSYQVQ ACILVIIIVG TCIVLMLVLG TVYYRKLRRQ QQQLLKKDQA DHPPVAAKPK LDRRKSYEQM PNQSTPKQRR PSEEAMLILK ECGDSNMQEL DPPSRQRKIS VCYGSNNNIA ANMAIANTNG GASVRNSLHR RKGAKHSWHN SLDTTETICG EQTDKFFDVD TLKSIKQSCG KPGKKKSHQQ LKPLGLKRQE SAPATYTPDL QAEEPATQSD FDESLMLDDD DLANFIPTSD LAYNEFMPEG PSGYQIVDTV DGKPGKEPGT KKSRRLSSPA FFKSPFSKSK NKGYSFNGVK NSHSVHEPTS WEWYRLKRSE KHQQQQQAKL ASKSLPSASL DSSSLSEVNF PLNSSTAQNS FRILGEAILS SGEGRITPNG NGNAMSGGLA SSSSGSGSGG STTSSTTKKK QRALNNLFRK AFDF // ID B4Q8S5_DROSI Unreviewed; 2404 AA. AC B4Q8S5; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 41. DE SubName: Full=GD22285 {ECO:0000313|EMBL:EDX04490.1}; GN Name=Dsim\GD22285 {ECO:0000313|EMBL:EDX04490.1}; GN ORFNames=Dsim_GD22285 {ECO:0000313|EMBL:EDX04490.1}, GN GD22285 {ECO:0000313|FlyBase:FBgn0193693}; OS Drosophila simulans (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7240 {ECO:0000313|EMBL:EDX04490.1, ECO:0000313|Proteomes:UP000000304}; RN [1] {ECO:0000313|EMBL:EDX04490.1, ECO:0000313|Proteomes:UP000000304} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Mixed {ECO:0000313|EMBL:EDX04490.1}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000361; EDX04490.1; -; Genomic_DNA. DR RefSeq; XP_002078905.1; XM_002078869.1. DR ProteinModelPortal; B4Q8S5; -. DR EnsemblMetazoa; FBtr0222195; FBpp0220687; FBgn0193693. DR GeneID; 6731770; -. DR KEGG; dsi:Dsim_GD22285; -. DR FlyBase; FBgn0193693; Dsim\GD22285. DR KO; K12231; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; B4Q8S5; -. DR Proteomes; UP000000304; Chromosome 2L. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000304}; KW Reference proteome {ECO:0000313|Proteomes:UP000000304}. FT COILED 1368 1369 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2404 AA; 266178 MW; 937C9EDC6D4DBF57 CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNHLVVAD LSSRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLSF IRDCGSQVHK DTLHSSMSVV SRLCTKVEPN TPCIQNCVES LSTLLQHEDP MVSDGALKCF ASVADRFTRK WVDPAPLAEY GLTTELLKRL KSVGGNTHSS LTAAGTQPTS SSQPAATTNS DAINENVAGT ATISNSTKVK SSDAAASPQS ISTTISLLST LCRGSPSITH DILRSQLADA LERALQGDER CVLDCMRFAD LLLLLLFEGR QALNRGSNNP NQGQLAPRPR RNNTNTDRTH RQLIDCIRSK DSEALREAIE SGGIDVNCMD DVGQTLLNWA SAFGTLEMVE YLCEKGADVN KGQRSSSLHY AACFGRPAIA KILLKFGAYP DLRDEDGKTP LDKARERLDD GHREVAAILQ SPGEWMSPDH SLLNKDGKKY TLMEPRGDPE MAPVYLKVLL PIFCRTFLGS MLGRIRRASL ALIKKIVQYA YPTVLQSLSE TSFSEDAAST SGQNGGNLLI EVIASVLDNE DDDDGHLIVL NIIEEIMCKT QEEFLDHFAR LGVFAKVQAL MDTDAEELYV QLPGTVEEPA AAQRSSTSVV VAPRPTSDDP MEDAKEILQG KPYHWREWSI CRGRDCLYVW SDSVALELSN GSNGWFRFII DGKLATMYSS GSPENGNDSS ENRGEFLEKL MRARSCVIAG VVSQPILPTA SALRLVVGNW VLQSQKTNQL QIHNTEGHQV TVLQDDLPGF IFESNRGTKH TFSAETVLGP DFASGWSTAK KKRNKSKTEG QKFQARNLSR EIYNKYFKSA QIIPRGAVAI LTDIVKQIEL SFEEQHMAPN GNWETTLSDA LMKLSQLIHE DGVDRYRVCA LLDFDEEGLL FYIGSNAKTC DWVNPAQYGL VQVTSSEGKT LPYGKLEDIL SRDSISLNCH TKDNKKAWFA IDLGVYIIPT AYTLRHARGY GRSALRNWLL QGSKDGSTWT TLSTHVDDKS LVEPGSTATW PITCATDDSV RYRHIRIQQN GRNASGQTHY LSLSGFEIYG RVVGVADDIG KSVKEAEAKT RRERRQIRAQ LKHMTTGARV IRGVDWRWEE QDGCAEGTIT GEIHNGWIDV KWDHGVRNSY RMGAEGKYDL KLADGEYLSA FDGNQSMSSA STAAKSSEKG NTLTSRKSSS TPSLPEATEK NQNTEGASNQ TVSADNLAWK QAVETIAENV FGSAKTQIIS NQIAMNTSSS REARAKHKES GTNQMHKDNI SGPSPLSREL EHISDLSAIN NSMPAINLSX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX FSVFLQTHHK PFALRVIIAL HSERKYYLNK GKLN // ID B4QDR2_DROSI Unreviewed; 569 AA. AC B4QDR2; DT 23-SEP-2008, integrated into UniProtKB/TrEMBL. DT 23-SEP-2008, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=GD10427 {ECO:0000313|EMBL:EDX05924.1}; GN Name=Dsim\GD10427 {ECO:0000313|EMBL:EDX05924.1}; GN ORFNames=Dsim_GD10427 {ECO:0000313|EMBL:EDX05924.1}, GN GD10427 {ECO:0000313|FlyBase:FBgn0182198}; OS Drosophila simulans (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7240 {ECO:0000313|EMBL:EDX05924.1, ECO:0000313|Proteomes:UP000000304}; RN [1] {ECO:0000313|EMBL:EDX05924.1, ECO:0000313|Proteomes:UP000000304} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Mixed {ECO:0000313|EMBL:EDX05924.1}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000362; EDX05924.1; -; Genomic_DNA. DR RefSeq; XP_002080339.1; XM_002080303.1. DR EnsemblMetazoa; FBtr0210337; FBpp0208829; FBgn0182198. DR GeneID; 6733276; -. DR KEGG; dsi:Dsim_GD10427; -. DR FlyBase; FBgn0182198; Dsim\GD10427. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B4QDR2; -. DR Proteomes; UP000000304; Chromosome 2R. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000304}; KW Reference proteome {ECO:0000313|Proteomes:UP000000304}. FT COILED 183 210 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 569 AA; 64523 MW; CE87E027A97C3BB4 CRC64; MEMATVRSSQ REDEAIKVNM ASIEQNIQKA LTAEEYENIL NHVNSYVQQL VELKMQQHSK ELAPQQIQLI VQLMKENLQQ IVHKTELSEK DLADLAIKLK MELQSSGGWQ DGAKLSQANL EEITRLIKSE VHLHESHYTI QLDRIDFASL LERILAAPAL ADFVDARISL RVGELEPKES SGSSDAEIQI ERLNREIAFI KLALSDKQAE NADLHQSISN LKLGQEDLLE RIQQHELAQD RRFHGLLAEI ENKLSALNDS QFALLNKQIK LSLVEILGFK QSTAGGAAGQ LDDFDLQTWV RSMFVAKDYL EQQLLELNKR TNNNIRDEIE RSSILLMSDI SQRLKREILL VVEAKHNEST KALKGHIREE EVRQIVKTVL AIYDADKTGL VDFALESAGG QILSTRCTES YQTKSAQISV FGIPLWYPTN TPRVAISPNV QPGECWAFQG FPGFLGKSRA KMLKLNSLVY VTGFTLEHIP KSLSPTGRIE SAPRNFTVWG LEQEKDQEPV LFGEYQFEDN GASLQYFAVQ NLDIKRPYEI VELRIETNHG HPTYTCLYRF RVHGKPPAT // ID B5DE01_XENTR Unreviewed; 2533 AA. AC B5DE01; DT 14-OCT-2008, integrated into UniProtKB/TrEMBL. DT 14-OCT-2008, sequence version 1. DT 11-NOV-2015, entry version 50. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AAI68474.1}; GN Name=hectd1 {ECO:0000313|Xenbase:XB-GENE-1010869}; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|EMBL:AAI68474.1}; RN [1] {ECO:0000313|EMBL:AAI68474.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Testes {ECO:0000313|EMBL:AAI68474.1}; RG NIH - Xenopus Gene Collection (XGC) project; RL Submitted (AUG-2008) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC168474; AAI68474.1; -; mRNA. DR RefSeq; NP_001131085.1; NM_001137613.1. DR UniGene; Str.6875; -. DR ProteinModelPortal; B5DE01; -. DR STRING; 8364.ENSXETP00000030896; -. DR PaxDb; B5DE01; -. DR GeneID; 100192372; -. DR KEGG; xtr:100192372; -. DR CTD; 25831; -. DR Xenbase; XB-GENE-1010869; hectd1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR HOVERGEN; HBG067533; -. DR KO; K12231; -. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 2: Evidence at transcript level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1268 1288 {ECO:0000256|SAM:Coils}. FT COILED 1650 1674 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2533 AA; 280328 MW; 779644221E4E13BC CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVEGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DASLETCVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAQH GLTEELLSRM AAAGGTVSGP SSACKTGRGT SGGPSTSGDS KISNQVSTIV SLLSTLCRGS PVVTHDLLRA ELLDSMESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDEKKKKDS NREEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCHSDAGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGDDL FLDQLARLGV ISKVSTLAGP TSDDENEEDS KPEKVRIHLF PLYSAHKGTQ AAASQLKEDE PQEDAKELQQ GRPYHWRDWS VIRGRDCLYI WSDAAALELS NGSNGWFRFI LDGKLATMYS SGSPEGGSDS SESRSEFLEK LQRARSQVKP STSSQPILST PGPGKLTVGN WSLTCLKDGE IAIHNSDGQQ ATILKEDLPG FVFESNRGTK HSFTAETSLG SEFVTGWTGK RGRKLKSKLE KTKQKVRTMA RDLYDDHFKA VESMPRGVVV TLRNIATQLE SAWELHTNRQ CIEGENTWRD LMKTALENLI VLLKDENTIS PYEMCSSGLV QALLTVLNNN EDCDIKQDCG QLVERLNVFK TAFSENEDDE SRPAVALVRK LIAVLESIER LPLHLYDTPG SSYNLQILTR RLRFRLERAP GETSLIDRTG RMLKMEPLAT VESLEQYLLK MVAKQWYDFD RSSFVFVRKL REGQSCVFRH QHDFDDNGIM YWIGTNAKTA YEWVNPAAYG LVVVTSSEGR NLPYGRLEDI LSRDSSALNC HTNDDKSAWF AIDLGLWVVP SAYTLRHARG YGRSALRNWV FQVSKDGQNW TTLYTHMDDC SLNEPGSTAT WPLDPAREEK QGWRHVRIKQ TGKNASGQTH YLSLSGFELY GNVTGVCEDQ LGKAAKEAEA NLRRQRRLVR SQVLKYMVPG ARVIRGIDWK WRDQDGSAQG EGTVTGELHN GTPPSWSSLV KNNCPDKAPP SSSSSCVVVG SVAGSGSRKG SSSSVCSVAS SSDVSLSCAK TERRAEEQVS DIHHDPILLL SSNQAASGSS TCPPGGETVG EGGDRKAGEA PAISMGMVSI SSPDVSSVSE LSNKEVAVPR PLGSSASNRL SVSSLLAAGA PMSSSASVPN LSSRETSSLE SFVRRVANIA RTNATNNMNL SRSSSDNNTN TLGRNAVSSA TSPLMGAQSF PNLTTTGTTS TVTMSTSSVT SSNVATATTG LSVGQSLSNT LTTSLTSTSS ESDTGQEAEY SLYDFLDSCR ASTLLAELDD DEDLPEPDEE DDENEDDNQE EQEYEEVMEE EEYETKGGRR RTWDDDYVLK RQFSALVPAF DPRPGRTNVQ QTTDLEIPAP GTPHSELLEE VECAPAPRLA LTLKVTGLGS GREVELPLNN FRSTIFFYVQ RLLQLSCNGA IKTDKLRRIW EPTYTIMYRE MKDSDKQKEC GRLGCWSVEH VEQSLGTDAL PKNDLITYLQ RNADPGFLRR WKLTGTNKSI RKNRNCSQLI AAYKDFCENG CKSLSMPAAL ATLQSADILS HSREQAQAKA GSSQNSCGVE DVLQLLRILF IVASDPYSAR TPQEDGEDML LFSVPPEEFT SKKITTKIVQ QIEEPLALAS GALPDWCEQL TSKCPFLIPF ETRQLYFTCT AFGASRAIVW LQNRREATVE RSRTASAVRR DDPGEFRVGR LKHERVKVPR GESLMEWAEN VMQIHADRKS VLEVEFLGEE GTGLGPTLEF YALVAAEFQK TDLGIWLCDD DFPDDESRQV DLGGGLKPPG YYVQRSCGLF IAPYPQDSEE LDRVTRLCHF LGVFLAKCIQ DNRLVDLPIS KPFFKLMCMG DIKSNMSKLL YASRGEESEH CTESQSEAST EDGHDALSVG SFEEDCKSEF ILDPPKPKPP AWFQGILTWE DFELINPHRA RFLRDIRELA VKRRQILGNR CLSEDEKNTQ LQELMLKNPS GSGPPVSIED LGLNFQFCPS SRVYGFSAVD LRPNGEDEMV TIDNAEEYVD LMFDFCMQTG VQKQMEAFRS GFNKVFPMEK LGSFSPEEVQ MILCGNQSPS WSAEDIINYT EPKLGYTRES PGFLRFVRVL CGMSSDERKA FLQFTTGCST LPPGGLANLH PRLTVVRKVD ATDASYPSVN TCVHYLKLPE YSSEEIMRDR LLAATMEKGF HLN // ID B5DIR5_DROPS Unreviewed; 1488 AA. AC B5DIR5; DT 14-OCT-2008, integrated into UniProtKB/TrEMBL. DT 14-OCT-2008, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=GA26023 {ECO:0000313|EMBL:EDY70207.1}; GN Name=Dpse\GA26023 {ECO:0000313|EMBL:EDY70207.1}; GN ORFNames=Dpse_GA26023 {ECO:0000313|EMBL:EDY70207.1}, GN GA26023 {ECO:0000313|FlyBase:FBgn0247400}; OS Drosophila pseudoobscura pseudoobscura (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=46245 {ECO:0000313|EMBL:EDY70207.1, ECO:0000313|Proteomes:UP000001819}; RN [1] {ECO:0000313|EMBL:EDY70207.1, ECO:0000313|Proteomes:UP000001819} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MV2-25 / Tucson 14011-0121.94 RC {ECO:0000313|Proteomes:UP000001819}; RX PubMed=15632085; DOI=10.1101/gr.3059305; RA Richards S., Liu Y., Bettencourt B.R., Hradecky P., Letovsky S., RA Nielsen R., Thornton K., Hubisz M.J., Chen R., Meisel R.P., RA Couronne O., Hua S., Smith M.A., Zhang P., Liu J., Bussemaker H.J., RA van Batenburg M.F., Howells S.L., Scherer S.E., Sodergren E., RA Matthews B.B., Crosby M.A., Schroeder A.J., Ortiz-Barrientos D., RA Rives C.M., Metzker M.L., Muzny D.M., Scott G., Steffen D., RA Wheeler D.A., Worley K.C., Havlak P., Durbin K.J., Egan A., Gill R., RA Hume J., Morgan M.B., Miner G., Hamilton C., Huang Y., Waldron L., RA Verduzco D., Clerc-Blankenburg K.P., Dubchak I., Noor M.A.F., RA Anderson W., White K.P., Clark A.G., Schaeffer S.W., Gelbart W.M., RA Weinstock G.M., Gibbs R.A.; RT "Comparative genome sequencing of Drosophila pseudoobscura: RT chromosomal, gene, and cis-element evolution."; RL Genome Res. 15:1-18(2005). RN [2] {ECO:0000313|EMBL:EDY70207.1, ECO:0000313|Proteomes:UP000001819} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MV2-25 {ECO:0000313|EMBL:EDY70207.1}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH379060; EDY70207.1; -; Genomic_DNA. DR RefSeq; XP_002132805.1; XM_002132769.1. DR EnsemblMetazoa; FBtr0282292; FBpp0280730; FBgn0247400. DR GeneID; 6903087; -. DR KEGG; dpo:Dpse_GA26023; -. DR FlyBase; FBgn0247400; Dpse\GA26023. DR InParanoid; B5DIR5; -. DR OMA; AYNEFMP; -. DR OrthoDB; EOG7MPRDC; -. DR Proteomes; UP000001819; Partially assembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001819}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001819}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1488 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002831286. FT TRANSMEM 1066 1087 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 275 295 {ECO:0000256|SAM:Coils}. FT COILED 394 421 {ECO:0000256|SAM:Coils}. FT COILED 1002 1050 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1488 AA; 160430 MW; 235371C2D2068934 CRC64; MHIRLHLVRF MYINLILSCC FWLYDNVAAA DADTKSGGGG GGVADADAAT AAAAEQASAQ HRHQPEAPPK PATVTPEPPP DRTLKKVAAT TADTHPAAAD REPLVESDYG NRPRTSENPK KAKGVAAATT SGRDTSVHGG KVLEAASAPP HSVNSHSEYV SAVPASELNL AANERHELPS EAAAQVPNNQ LNNNNYNNIN AGNPHSVSES EQLEPPKEKQ QQQPRQKQTQ KQQGRESFPE IITELPTVTV TELPLDRIKN RLESVILEEI QSPTSNKSEE QQQQQQQQQQ EQEEQPLLPL FEAAKEVPQK DEQMPKINDP GGGVDLDGLL ASAEAVGAPG DDGTGNVTSD EQQTGGAGAG AGAGEGEGTA NDTEAKANLT KANEEVPMPV FSEWAQKQME AEASREQAME LEQQVANNSA QRRNNTGSAS GKPSTLKLRS KNYASPDCGA KIIASNGDAT NTGAVLTHSS DEYMLSTCGS RIWFVVELCE AIQAQKVELA NFELFSSSPK NFTVAVSKRF PTRDWSNVGR FAAEDKRTVQ TFELHPHLFG KFVRVDIHSH YSKEHFCPVS LFRVFGTSEF EAFETEIRPS DELDDFDDDF GGGGQEQGSS HKATAGGGGG GIFQSASDAV IQMVKKAGEV LLKPTKALKW SPESLLCRTP ALGAFSCSSC NSTLVERINS LLSCQFQQLQ GLLNHSQLRS DLLQSRVCLE EYGISLRGNP SASGLAKRQS YFLSMLPAEH VGAMCKLLQA EQNITVEQQQ MEAPQLKPPP EQEQENATAA GEASSQQEVI REELQESPPS GEIVTPEAMD SQEMPSIREK PDPSTTSTAS TTNSTPADVN IFNVSDELED LEVPVAAPQP TVAAPVASNL VESPSDWETS TLAPSSSEMP LANAELAIED GSPASWESLD NLLTTTVASI TAGGSAAVAT AAAIAGNANG NNLGGGGAGA GVGAGAGAGG IGSNVNLQQK LTNGAQSESV FIRLSNRIKA LERNMSLSGQ YLEELSRRYK KQVEELQQTL TQQTLTVRSL EDQSRRYIEQ EQLYQQQSAE LAGEVRALSY QVQACILVII IVGTCIFLML VLGTVYYRKL RRQTQQLKEE QPSSHAKVPK PKLDRRKSYE QMLNQSTPKQ RRPSEEAMLI LKDCGDSQLV GGCQDLSSRQ RKISVCYGSN NNIAANMMTG NPNLRASLHR RKGAKHSWHN SLDTAATTSC AAEQLDTFFD ADTLKSQKQQ PSSSAGSKAG RKKSLQQQLA LKRQESAPAS MMQNELAEEE PASQSDFDES LMLDDDDLAN FIPTSDLAYN EFMPEGPSGY LVLDAVDGAQ KPPEQPQKQA TKKSSRRLSS PGFFKSPFSK SKNKGGNYNG FGGIKNSHSV HESTSWEWYR LKRNEKQQQQ PNHHSKQTKS LPNSSLDSSS LSEVNFSLNS NSNSTQNSFR ILGEAIRSSG ESSITPNGNG NGSSCSASGS GSNSGGSTTS STAKKKQRAF NNIFRKVF // ID B5FW54_OTOGA Unreviewed; 443 AA. AC B5FW54; DT 14-OCT-2008, integrated into UniProtKB/TrEMBL. DT 14-OCT-2008, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Sperm associated antigen 4 (Predicted) {ECO:0000313|EMBL:ACH53035.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOGAP00000000718}; GN Name=SPAG4 {ECO:0000313|EMBL:ACH53035.1, GN ECO:0000313|Ensembl:ENSOGAP00000000718}; OS Otolemur garnettii (Small-eared galago) (Garnett's greater bushbaby). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. OX NCBI_TaxID=30611; RN [1] {ECO:0000313|EMBL:ACH53035.1} RP NUCLEOTIDE SEQUENCE. RA Antonellis A., Benjamin B., Blakesley R.W., Bouffard G.G., RA Brinkley C., Brooks S., Chu G., Chub I., Coleman H., Fuksenko T., RA Gestole M., Gregory M., Guan X., Gupta J., Gurson N., Han E., Han J., RA Hansen N., Hargrove A., Hines-Harris K., Ho S.-L., Hu P., Hunter G., RA Hurle B., Idol J.R., Johnson T., Knight E., Kwong P., Lee-Lin S.-Q., RA Legaspi R., Madden M., Maduro Q.L., Maduro V.B., Margulies E.H., RA Masiello C., Maskeri B., McDowell J., Merkulov G., Montemayor C., RA Mullikin J.C., Park M., Prasad A., Ramsahoye C., Reddix-Dugue N., RA Riebow N., Schandler K., Schueler M.G., Sison C., Smith L., RA Stantripop S., Thomas J.W., Thomas P.J., Tsipouri V., Young A., RA Green E.D.; RT "NISC Comparative Sequencing Initiative."; RL Submitted (AUG-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSOGAP00000000718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Broad Institute Genome Sequencing Platform; RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K., Jaffe D.B., RA Gnerre S., MacCallum I., Przybylski D., Ribeiro F.J., Burton J.N., RA Walker B.J., Sharpe T., Hall G.; RT "Version 3 of the genome sequence of Otolemur garnettii (Bushbaby)."; RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|Ensembl:ENSOGAP00000000718} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAQR03054396; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; DP000893; ACH53035.1; -; Genomic_DNA. DR RefSeq; XP_003788181.1; XM_003788133.1. DR STRING; 30611.ENSOGAP00000000718; -. DR Ensembl; ENSOGAT00000000805; ENSOGAP00000000718; ENSOGAG00000000805. DR GeneID; 100965074; -. DR CTD; 6676; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOVERGEN; HBG079205; -. DR InParanoid; B5FW54; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005225; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 137 160 Helical. FT TRANSMEM 166 191 Helical. FT COILED 204 238 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 443 AA; 48279 MW; 406BE0EC77CD5558 CRC64; MRRSPRSGSA ASPRKHTPNF YSDNSNSSVS VTSEDSNGHR SAGPGPGEPE GRRARGSSCG EPALSAGVPG GTTWAGSSRQ KPASRSYDGQ TACGAATVRG GASEPAESPV VSEEQLDLLS TLDIRQEMPT PRVSKSFLSL LLQVLSVLLS LVGDVLVSVY REVCSIRFLL TAVSLLSLFL AALWLGLLYL VPPLENEPKE MLTLSEYHER VRSQGQQLQQ LQAELDKLHK EVSSVRAANS ERVAKLVFQR LNEDFVRKPD YALSSVGASI DLEKTSHDYE DTSTAYFWNR FSFWNYARPP TVILEPDVFP GNCWAFEGDQ GQVVIRLPGR VQLSDITLQH PPPSVAHTGG ANSAPRDFAV FGLQIDDETE VFLGKFTFDV EKSEIQTFHL QNDPPAAFPK VKIQILSNWG HPRFTCLYRV RAHGVRTSEG AGDSATGATG GPH // ID B5VS42_YEAS6 Unreviewed; 587 AA. AC B5VS42; DT 25-NOV-2008, integrated into UniProtKB/TrEMBL. DT 25-NOV-2008, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=YOR154Wp-like protein {ECO:0000313|EMBL:EDZ69247.1}; GN ORFNames=AWRI1631_153100 {ECO:0000313|EMBL:EDZ69247.1}; OS Saccharomyces cerevisiae (strain AWRI1631) (Baker's yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=545124 {ECO:0000313|EMBL:EDZ69247.1, ECO:0000313|Proteomes:UP000008988}; RN [1] {ECO:0000313|EMBL:EDZ69247.1, ECO:0000313|Proteomes:UP000008988} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AWRI1631 {ECO:0000313|EMBL:EDZ69247.1, RC ECO:0000313|Proteomes:UP000008988}; RX PubMed=18778279; DOI=10.1111/j.1567-1364.2008.00434.x; RA Borneman A.R., Forgan A.H., Pretorius I.S., Chambers P.J.; RT "Comparative genome analysis of a Saccharomyces cerevisiae wine RT strain."; RL FEMS Yeast Res. 8:1185-1195(2008). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EDZ69247.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABSV01002176; EDZ69247.1; -; Genomic_DNA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008988; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008988}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 583 587 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 587 AA; 67240 MW; 97291788024363FC CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGLHDTTV ITTGSTTNVQ KEHSSPLSTG SLRTHDFGQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGPERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNEWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWSILG EFEAGNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKQMISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IVQQAIR // ID B5Y466_PHATC Unreviewed; 693 AA. AC B5Y466; DT 25-NOV-2008, integrated into UniProtKB/TrEMBL. DT 25-NOV-2008, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:ACI65423.1}; GN ORFNames=PHATR_46912 {ECO:0000313|EMBL:ACI65423.1}; OS Phaeodactylum tricornutum (strain CCAP 1055/1). OC Eukaryota; Stramenopiles; Bacillariophyta; Bacillariophyceae; OC Bacillariophycidae; Naviculales; Phaeodactylaceae; Phaeodactylum. OX NCBI_TaxID=556484 {ECO:0000313|Proteomes:UP000000759}; RN [1] {ECO:0000313|EMBL:ACI65423.1, ECO:0000313|Proteomes:UP000000759} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCAP 1055/1 {ECO:0000313|EMBL:ACI65423.1, RC ECO:0000313|Proteomes:UP000000759}; RX PubMed=18923393; DOI=10.1038/nature07410; RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A., RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., RA Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., RA Katinka M., Mock T., Valentin K., Verret F., Berges J.A., Brownlee C., RA Cadoret J.P., Chiovitti A., Choi C.J., Coesel S., De Martino A., RA Detter J.C., Durkin C., Falciatore A., Fournet J., Haruta M., RA Huysman M.J., Jenkins B.D., Jiroutova K., Jorgensen R.E., Joubert Y., RA Kaplan A., Kroger N., Kroth P.G., La Roche J., Lindquist E., RA Lommer M., Martin-Jezequel V., Lopez P.J., Lucas S., Mangogna M., RA McGinnis K., Medlin L.K., Montsant A., Oudot-Le Secq M.P., Napoli C., RA Obornik M., Parker M.S., Petit J.L., Porcel B.M., Poulsen N., RA Robison M., Rychlewski L., Rynearson T.A., Schmutz J., Shapiro H., RA Siaut M., Stanley M., Sussman M.R., Taylor A.R., Vardi A., RA von Dassow P., Vyverman W., Willis A., Wyrwicz L.S., Rokhsar D.S., RA Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y., RA Grigoriev I.V.; RT "The Phaeodactylum genome reveals the evolutionary history of diatom RT genomes."; RL Nature 456:239-244(2008). RN [2] {ECO:0000313|EMBL:ACI65423.1, ECO:0000313|Proteomes:UP000000759} RP GENOME REANNOTATION. RC STRAIN=CCAP 1055/1 {ECO:0000313|EMBL:ACI65423.1, RC ECO:0000313|Proteomes:UP000000759}; RG Diatom Consortium; RA Grigoriev I., Grimwood J., Kuo A., Otillar R.P., Salamov A., RA Detter J.C., Lindquist E., Shapiro H., Lucas S., Glavina del Rio T., RA Pitluck S., Rokhsar D., Bowler C.; RL Submitted (AUG-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP001141; ACI65423.1; -; Genomic_DNA. DR RefSeq; XP_002185953.1; XM_002185917.1. DR UniGene; Ptc.8290; -. DR GeneID; 7204743; -. DR KEGG; pti:PHATR_46912; -. DR InParanoid; B5Y466; -. DR Proteomes; UP000000759; Chromosome 11. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000759}; KW Reference proteome {ECO:0000313|Proteomes:UP000000759}. FT COILED 428 448 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 693 AA; 77324 MW; C4CEFBE1D49684F5 CRC64; MSKRKGEDLP VSPDDFSPPG VKTRSIRKSG KRQRRIQVAE ESDAEPSSPS VAAFQNPPSS SSSIPTDRQS NNNNNNNNKK DDDDRDDSSD PEDPMEDSNR RRRTNPSKTH DTARRKAPPA ASGSPKEAGE NVPLSRRLDY RETDTRKESP TTLPTESVLR ERDVLENGHD KTTTDDVDAE EHRRNNDHDD VRPHVSVHIR VYQFVTELVA GPTSAPQVET EMDVPVNTDT ASAGSRTWMQ LSWVWVLLIL AWHIICFPLA INPGLVTVSS TGNYLTTVYR AQAKLAQAVQ GLQTVQHQQL ARLRTARIAL EQAENAFRSQ QLAAEENLAR LEQSWETEAS SVMERLEKEE ATARTLNHWI EQVLVEVPDD EEEEEYVEAV SVPPEIRNVL GPTQDALLDS SFITLWDVPE PVICETPDVS GLAGGLYKED VEQAISDLIT DIVQVDEEME EMVRKWVENY LDTKAGDAMT TTTTADANIP PLDGVVDADA LKKLRAFIDG RMEVERADQT GLIDYASLLN GARIIRVGDR STSMSLVDQL PVFNRLAALL SLRFYGHGPE AALLPTYPPN ALGQCWSFEP PASRRSGPFG VLTVQLSRPI HVQSVSIEHP PPELTDKSQT AIRSFRIEGF EDTQTHGKAH SLGSFEYDGQ KGLRQDFDVD RNVPRLQSIS LVVDTNWGEP YACLYRFRVH GQE // ID B6AIZ1_CRYMR Unreviewed; 1029 AA. AC B6AIZ1; DT 25-NOV-2008, integrated into UniProtKB/TrEMBL. DT 25-NOV-2008, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEA08182.1}; GN ORFNames=CMU_011310 {ECO:0000313|EMBL:EEA08182.1}; OS Cryptosporidium muris (strain RN66). OC Eukaryota; Alveolata; Apicomplexa; Conoidasida; Coccidia; OC Eucoccidiorida; Eimeriorina; Cryptosporidiidae; Cryptosporidium. OX NCBI_TaxID=441375 {ECO:0000313|Proteomes:UP000001460}; RN [1] {ECO:0000313|EMBL:EEA08182.1, ECO:0000313|Proteomes:UP000001460} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=RN66 {ECO:0000313|EMBL:EEA08182.1, RC ECO:0000313|Proteomes:UP000001460}; RA Lorenzi H., Inman J., Miller J., Schobel S., Amedeo P., Caler E.V., RA da Silva J.; RL Submitted (JUN-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS989737; EEA08182.1; -; Genomic_DNA. DR RefSeq; XP_002142531.1; XM_002142495.1. DR STRING; 441375.XP_002142531.1; -. DR EnsemblProtists; EEA08182; EEA08182; CMU_011310. DR GeneID; 6997606; -. DR EuPathDB; CryptoDB:CMU_011310; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000001460; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001460}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001460}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 1029 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002842483. FT TRANSMEM 901 921 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 1029 AA; 118965 MW; 4B1DEF4B9874D5BB CRC64; MGICKLLSLV FLIVSNFSIK CKCTDLGYSC NEIYREGFIL YTNNSCPVST SIRSNLLNTS GYTVSNNIRS NALDFSGYSK PTIEEKNNLM EDMEWKLENY TICNTEVYNA NIIDYLKKQI HNKDKIYFTN LSPYSCRVSF LKKYNKMMIS KRTLIIFQSY QRFVHLLDIL SSRRPCPFCN GCKDTHLNKL PSWIPPILHK QLRSLNESLD YKLCYTLKNS PKIYSEILAI LNNLKKLNSN VGNTSDIPIE LRDILEDKTN LRLDEYPLIY FGDILNLFYE ELRNNAVQSS VCRPFQYLYK CSWHPDYESK EILSNLRSVT KFMAICLDPS SKNRRLSSIN TNDPNSTTVE SIYISSNKIV PNDPIMFSTV NPTISISSNH TTLSDRSSDS SVTSINNNQS NIFKLNRNIL KYPKDSKIKT SFPSSCEPPF FDNLIIPKLT KFSGDKSNNY HSHHHDKTNY NLCGFNNQKK SKILSSEEDA VVIDTALTNR NVIQGTSYLP QLAALHYDYA SSASNSRLLD WSSDITHPKS IQSSDPDSYL LVPCYKPMWF IIGFPEDILV EYIALFSQEY FSSSYQDLEV LVSLVYPTKQ WQSLGILRRD SKLSKEMFDI KPLCIRDNTD VKYQYNRDIN VSNSNPCWIR YMQIRALSFY KEEGHYYCHL SRLQVFGNNV LSRLEAEING ERSNSIPLSG VESVKDVENR LRKYDINNFD HTANSEIFES TKSIVNFNSM KNDSNITKIE LRNWIPTNLE DSMVSYRDKI LGSHEVSSPL LYKITTADHP LLSFVNRVKY LEEKMSDLKL YIDTVYLNLN TSIYRIDKSL IELQKSIKYF EDTINSELNT TNVENMDILF IKQPLFTLSK IVKFIFYEDE EHNTRSLLDI ISTKIVYIKS YIFSTFNIIK VNSNLILLII AIIFILQIYM LQQLITLKKK LQNTLQFIKS YSLARTSHNF YDNLPLPTAC KLQSNPEKLP DMSHMTRGYS IIDSDPKTSG VFLQNLKYDS LTKDSRICNN KSQDKTPPHN NKESLKSGD // ID B6HLM0_PENCW Unreviewed; 647 AA. AC B6HLM0; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Pc21g08270 protein {ECO:0000313|EMBL:CAP95724.1}; GN ORFNames=Pc21g08270 {ECO:0000313|EMBL:CAP95724.1}, GN PCH_Pc21g08270 {ECO:0000313|EMBL:CAP95724.1}; OS Penicillium chrysogenum (strain ATCC 28089 / DSM 1075 / Wisconsin OS 54-1255) (Penicillium notatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Penicillium; OC Penicillium chrysogenum complex. OX NCBI_TaxID=500485 {ECO:0000313|EMBL:CAP95724.1, ECO:0000313|Proteomes:UP000000724}; RN [1] {ECO:0000313|EMBL:CAP95724.1, ECO:0000313|Proteomes:UP000000724} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 28089 / DSM 1075 / Wisconsin 54-1255 RC {ECO:0000313|Proteomes:UP000000724}; RX PubMed=18820685; DOI=10.1038/nbt.1498; RA van den Berg M.A., Albang R., Albermann K., Badger J.H., Daran J.-M., RA Driessen A.J.M., Garcia-Estrada C., Fedorova N.D., Harris D.M., RA Heijne W.H.M., Joardar V.S., Kiel J.A.K.W., Kovalchuk A., Martin J.F., RA Nierman W.C., Nijland J.G., Pronk J.T., Roubos J.A., RA van der Klei I.J., van Peij N.N.M.E., Veenhuis M., von Doehren H., RA Wagner C., Wortman J.R., Bovenberg R.A.L.; RT "Genome sequencing and analysis of the filamentous fungus Penicillium RT chrysogenum."; RL Nat. Biotechnol. 26:1161-1168(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM920436; CAP95724.1; -; Genomic_DNA. DR RefSeq; XP_002567866.1; XM_002567820.1. DR EnsemblFungi; CAP95724; CAP95724; PCH_Pc21g08270. DR GeneID; 8316970; -. DR KEGG; pcs:Pc21g08270; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR OMA; CWCSAPR; -. DR OrthoDB; EOG7P8PJ5; -. DR BioCyc; PCHR:PC21G08270-MONOMER; -. DR Proteomes; UP000000724; Contig Pc00c21. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000724}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000724}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 288 306 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 340 360 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 647 AA; 71544 MW; 7D18CDAA9546A471 CRC64; MPPKRVTRRA GATPQRPHQD PEFESILDRA QGPVLPGVPV QASFNYGAAT TTGLPQRMSM RPNIGIDQIA GTLNAQLDVA RRRTAATKKN QAPQARQSKR EPTPDQTQLQ QSLHNAADEH SDKTPSPPVP HSVSTDSSPD AQPPLQRHLS NSPLYPSPLQ RMGSPHIRSP LGSSSPFRHS SVDNASVASW NLERDINEDD LQRTRPSKHG RNITAPPRRI SGLANVLEED EEDVKFDPAI YNNFEPDVPK ESFLRRWLGA VRSRRQEPQS RHENEPPTEV RRKSWIRAAF YLLLFLSFIF VPLIAAKLRD YLDHGALDWD SSSNMTIIKP DVLHSIRSQV SKMDAQMSSL SNEVSSARSE LSLSNAHDST PTDASLHRKP VHKVNFLSVA LGAMIDPAKT SPTLGPKQGA SLRALLWASS FVSRRPIRSP QSPMSALTTW EEVGDCWCSA PRNGTSQLSV LLGRDIVAEE LIVEHIPVGA SLEPEAAPRT IEVWARFKVN PHKAPVKAKP TPEARPGRVG FMKLFGDATV TQEPPSTQAP SSRETGLGGF LIPGIGSLHA LVMDLLRRSN PFEPPSAYSD DPILGPNFYR IGKVEYDLHS PDYAQAFKLN TIVDVSTIRV DKVVFRVTSN WGADHTCIYR FKLHGHL // ID B6HMW5_PENCW Unreviewed; 868 AA. AC B6HMW5; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Pc21g17570 protein {ECO:0000313|EMBL:CAP96654.1}; GN ORFNames=Pc21g17570 {ECO:0000313|EMBL:CAP96654.1}, GN PCH_Pc21g17570 {ECO:0000313|EMBL:CAP96654.1}; OS Penicillium chrysogenum (strain ATCC 28089 / DSM 1075 / Wisconsin OS 54-1255) (Penicillium notatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Penicillium; OC Penicillium chrysogenum complex. OX NCBI_TaxID=500485 {ECO:0000313|EMBL:CAP96654.1, ECO:0000313|Proteomes:UP000000724}; RN [1] {ECO:0000313|EMBL:CAP96654.1, ECO:0000313|Proteomes:UP000000724} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 28089 / DSM 1075 / Wisconsin 54-1255 RC {ECO:0000313|Proteomes:UP000000724}; RX PubMed=18820685; DOI=10.1038/nbt.1498; RA van den Berg M.A., Albang R., Albermann K., Badger J.H., Daran J.-M., RA Driessen A.J.M., Garcia-Estrada C., Fedorova N.D., Harris D.M., RA Heijne W.H.M., Joardar V.S., Kiel J.A.K.W., Kovalchuk A., Martin J.F., RA Nierman W.C., Nijland J.G., Pronk J.T., Roubos J.A., RA van der Klei I.J., van Peij N.N.M.E., Veenhuis M., von Doehren H., RA Wagner C., Wortman J.R., Bovenberg R.A.L.; RT "Genome sequencing and analysis of the filamentous fungus Penicillium RT chrysogenum."; RL Nat. Biotechnol. 26:1161-1168(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM920436; CAP96654.1; -; Genomic_DNA. DR RefSeq; XP_002568754.1; XM_002568708.1. DR STRING; 500485.XP_002568754.1; -. DR EnsemblFungi; CAP96654; CAP96654; PCH_Pc21g17570. DR GeneID; 8316998; -. DR KEGG; pcs:Pc21g17570; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OMA; RNTREVQ; -. DR OrthoDB; EOG7SBNXT; -. DR BioCyc; PCHR:PC21G17570-MONOMER; -. DR Proteomes; UP000000724; Contig Pc00c21. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000724}; KW Reference proteome {ECO:0000313|Proteomes:UP000000724}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 31 {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 31 Potential. FT CHAIN 32 868 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002845802. FT CHAIN 32 868 Potential. FT /FTId=PRO_5000409970. FT COILED 409 429 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 868 AA; 94522 MW; 098DFB6D112315AF CRC64; MLTAGRSLIS RIYHASILFA LCHNALSNAT AQPPPNRAQG ALPTKPVTTC PAKSTWQTEA ELFHLPPCLE TRWGPGRDMY ITSASPDSSV AANDPVINSS GVSSGTSEET VKTTPDQNAE QDQDTDSLLD GASFLSFEDW KKQNLAKVGQ SAENIGGNRR AAAAGKEDRR QPTGINNALD SLGDDSEIEL DFGGFGAETP EASPTAWGSH IPSREAGQAG HVDDRGTVDV HAQGAFHRDA SRRKDAGTTC KERFNYASFD CAATVLKTNP ECKGSSSVLI ENKDSYMLNE CRAQNKFLIL ELCDDILVDT VVLANYEFFS SIFHTFRVSV SDRYPAKPDQ WKELGVYEAR NTREVQAFAV ENSLIWARYL RIEFLTHYGH EFFCPVSLIR VHGTTMMEEY KHDESSDRVE VEELEASEAN RLSEDLENKP VEVPVEKFIP AVVPSLAVGE VCPNPLLEAA SPFAKHGDED ICGINDGPPV MTPSTSATTD QTKPAPKQNS TAVVGGNPST TNAGPSAPSK AEDTRKQGEP AKNVGTPPDA SSVSSEPAQQ NATSEVAGKA TTNSKEEQNS PPPESTRPTN TQPPSANPTT QESFFKSVHK RLQMLESNST LSLLYIEEQS RILRDAFNKV EKRQLAKTST FLENLNVTVL NELKEFREQY DHVWKSVALE FEHQRMQYHQ EVHSLSGQLG VLADELVFQK RVTVIQSIMV LCCFALVLFS RGSGSNYMEF PAVQKMVARS YSLRSSSPIF ASPSGSPGLT RPTSSSYHEN SGHHRNLSDS SDQDSAASPT AAYSPPTPTS SSTERDADET DKVEEPISPE SMSVPTLATP QPRSQSTPPV LNGRPVDLDM ASSADVDVGA DNPRSREL // ID B6JW38_SCHJY Unreviewed; 651 AA. AC B6JW38; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-OCT-2013, sequence version 2. DT 11-NOV-2015, entry version 21. DE SubName: Full=Spindle pole body protein Sad1 {ECO:0000313|EMBL:EEB05589.2}; GN ORFNames=SJAG_05196 {ECO:0000313|EMBL:EEB05589.2}; OS Schizosaccharomyces japonicus (strain yFS275 / FY16936) (Fission OS yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Taphrinomycotina; OC Schizosaccharomycetes; Schizosaccharomycetales; OC Schizosaccharomycetaceae; Schizosaccharomyces. OX NCBI_TaxID=402676 {ECO:0000313|EMBL:EEB05589.2, ECO:0000313|Proteomes:UP000001744}; RN [1] {ECO:0000313|EMBL:EEB05589.2, ECO:0000313|Proteomes:UP000001744} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=yFS275 / FY16936 {ECO:0000313|Proteomes:UP000001744}; RX PubMed=21511999; DOI=10.1126/science.1203357; RA Rhind N., Chen Z., Yassour M., Thompson D.A., Haas B.J., Habib N., RA Wapinski I., Roy S., Lin M.F., Heiman D.I., Young S.K., Furuya K., RA Guo Y., Pidoux A., Chen H.M., Robbertse B., Goldberg J.M., Aoki K., RA Bayne E.H., Berlin A.M., Desjardins C.A., Dobbs E., Dukaj L., Fan L., RA FitzGerald M.G., French C., Gujja S., Hansen K., Keifenheim D., RA Levin J.Z., Mosher R.A., Mueller C.A., Pfiffner J., Priest M., RA Russ C., Smialowska A., Swoboda P., Sykes S.M., Vaughn M., RA Vengrova S., Yoder R., Zeng Q., Allshire R., Baulcombe D., RA Birren B.W., Brown W., Ekwall K., Kellis M., Leatherwood J., Levin H., RA Margalit H., Martienssen R., Nieduszynski C.A., Spatafora J.W., RA Friedman N., Dalgaard J.Z., Baumann P., Niki H., Regev A., Nusbaum C.; RT "Comparative functional genomics of the fission yeasts."; RL Science 332:930-936(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; KE651166; EEB05589.2; -; Genomic_DNA. DR RefSeq; XP_002171882.2; XM_002171846.2. DR STRING; 402676.XP_002171882.1; -. DR EnsemblFungi; EEB05589; EEB05589; SJAG_05196. DR GeneID; 7050648; -. DR EuPathDB; FungiDB:SJAG_05196; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001744; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001744}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001744}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 156 181 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 225 245 {ECO:0000256|SAM:Coils}. FT COILED 316 336 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 651 AA; 73733 MW; 671FEECFA5CD3E05 CRC64; MFTNTPVGGR RERQQHAGNL RGLAGQSSAR IHQNTAELAN QIHRARPGRL NAMPTRISVP QLRALERSPS PEHSFGLHYD TSYPEASDEE VFERIVKQSQ HSSDEDHSDS EYSYSDPDNH PRNRKGSPYS DVNETSDDEH AKELNSKSPR KRNKSWIMSP SFFLVVSVFI IIISSGVGLY LQHSRSVLPF ALSTDASSDI LQRLSKLELE MENASDIAKY LSQSKATLEK DHNHLAQNYE VLKDRINNLF AISKALNSST FNITEQGIDL SRRQEHLQDV MSTIQKKISS IQQSISRSVG TDPEYRESLT GSIERLQSLE KQQAELFQTV ANLKREALEL YSSDKNTVKG GDDGASAVSY NDGQLTVHPE FLALLSKYIS AQVEEMRKSS WQELLEDSEK IVEDIAKSTL DKSFLKRDEV SQLVAQQVQS AIEQVENQCL LHEYGFEDDE ALRTKLESFT RSIVQQMTLD TLARPNFALL STGARVIRHL TTPNYQKRPS SLFPRLMSYL VDDNIIEGNR PETALQSNNE VGMCWSFAGT FGQLGVSLSQ PIYINAVSIH HVHPSIALDI SSAPREMELW GQRYHFKKDR GYELLTTFEY QPGDNFIQTY PISTDTSKPP FKNVVLKIKS NWMNNEFTCL YQLRVHGDIP P // ID B6JZJ2_SCHJY Unreviewed; 632 AA. AC B6JZJ2; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Sad1-UNC-like carboxy terminal protein {ECO:0000313|EMBL:EEB06960.1}; GN ORFNames=SJAG_02032 {ECO:0000313|EMBL:EEB06960.1}; OS Schizosaccharomyces japonicus (strain yFS275 / FY16936) (Fission OS yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Taphrinomycotina; OC Schizosaccharomycetes; Schizosaccharomycetales; OC Schizosaccharomycetaceae; Schizosaccharomyces. OX NCBI_TaxID=402676 {ECO:0000313|EMBL:EEB06960.1, ECO:0000313|Proteomes:UP000001744}; RN [1] {ECO:0000313|EMBL:EEB06960.1, ECO:0000313|Proteomes:UP000001744} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=yFS275 / FY16936 {ECO:0000313|Proteomes:UP000001744}; RX PubMed=21511999; DOI=10.1126/science.1203357; RA Rhind N., Chen Z., Yassour M., Thompson D.A., Haas B.J., Habib N., RA Wapinski I., Roy S., Lin M.F., Heiman D.I., Young S.K., Furuya K., RA Guo Y., Pidoux A., Chen H.M., Robbertse B., Goldberg J.M., Aoki K., RA Bayne E.H., Berlin A.M., Desjardins C.A., Dobbs E., Dukaj L., Fan L., RA FitzGerald M.G., French C., Gujja S., Hansen K., Keifenheim D., RA Levin J.Z., Mosher R.A., Mueller C.A., Pfiffner J., Priest M., RA Russ C., Smialowska A., Swoboda P., Sykes S.M., Vaughn M., RA Vengrova S., Yoder R., Zeng Q., Allshire R., Baulcombe D., RA Birren B.W., Brown W., Ekwall K., Kellis M., Leatherwood J., Levin H., RA Margalit H., Martienssen R., Nieduszynski C.A., Spatafora J.W., RA Friedman N., Dalgaard J.Z., Baumann P., Niki H., Regev A., Nusbaum C.; RT "Comparative functional genomics of the fission yeasts."; RL Science 332:930-936(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; KE651168; EEB06960.1; -; Genomic_DNA. DR RefSeq; XP_002173253.1; XM_002173217.2. DR STRING; 402676.XP_002173253.1; -. DR EnsemblFungi; EEB06960; EEB06960; SJAG_02032. DR GeneID; 7047880; -. DR EuPathDB; FungiDB:SJAG_02032; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001744; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001744}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001744}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 512 530 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 297 317 {ECO:0000256|SAM:Coils}. FT COILED 424 454 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 632 AA; 69894 MW; 13FD182A4A013495 CRC64; MQLASTNFCL VRLQNSMLFC RKLWILAVFL TLHLCRSQTI CSKTKISPFN EQDDSISCEA IPYSSLQSIF QITATVNNST GTLASALTRE TRTGALLSEA VNESIQNTTL STTSPTLGES ETIGTHSTET VNSSSSSEKS TAASPVPTKE AKRFNFASTD CAAAILEANA EAKMPSAILS ENRDKYMLIK CSVEPKFMVI ELCDDISIDT IQLANYEFFS STFRDVKFSV SSTYPPKGSG WTDLGTFTAR NVRTLQSFQV DYPKIWAKYL KVEFLDHYGS EFYCPVSIIR VYGKTMLDEF REERKADDDV LDSTEKEQAE AGAAEDGSAV SSDASHPNAT MQTNGSKRAA STGTDMNSAA EGSESKNDGM NQSVGTPQSQ ISHVTNAHEE ALTATQSEHS SVLSGKPTLV NQNVHATNQQ ESIYRNMYKR LSVLEEKTNQ LKAQLANLEK LTVSHFQESN STLIRVRDEY ISLLEVSLSI IHAKQELYDA EGESLTKRVN MLAQDYLLQK RLLVLQSLLL ISIIVILSFW KSPFVDRFFR RVSRARHGNS STTWSPFSDR AAASVEPETP VDEIVGSSSP TFLKRFRNRK VESVLYLDTQ QNSLSSNARS SSTEQVSVSG PPKLHARSYT VA // ID B6Q7S7_PENMQ Unreviewed; 667 AA. AC B6Q7S7; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEA28812.1}; GN ORFNames=PMAA_036050 {ECO:0000313|EMBL:EEA28812.1}; OS Penicillium marneffei (strain ATCC 18224 / CBS 334.59 / QM 7333). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Trichocomaceae; Talaromyces. OX NCBI_TaxID=441960 {ECO:0000313|Proteomes:UP000001294}; RN [1] {ECO:0000313|Proteomes:UP000001294} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 18224 / CBS 334.59 / QM 7333 RC {ECO:0000313|Proteomes:UP000001294}; RX PubMed=25676766; DOI=10.1128/genomeA.01559-14; RA Nierman W.C., Fedorova-Abrams N.D., Andrianopoulos A.; RT "Genome sequence of the AIDS-associated pathogen Penicillium marneffei RT (ATCC18224) and its near taxonomic relative Talaromyces stipitatus RT (ATCC10500)."; RL Genome Announc. 3:E0155914-E0155914(2015). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS995899; EEA28812.1; -; Genomic_DNA. DR RefSeq; XP_002145327.1; XM_002145291.1. DR EnsemblFungi; EEA28812; EEA28812; PMAA_036050. DR GeneID; 7023053; -. DR EuPathDB; FungiDB:PMAA_036050; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001294; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001294}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001294}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 313 337 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 667 AA; 73887 MW; B0DED62BC23D83EE CRC64; MRRTSQQRDP AYSPSGRRYS LDPSSPVRRS ERLGNGSQSS TPSFVLRATT PDARQLHQAL RAASESPSKE RRDIRARSSI HSSVTPSPPV QRTISSTSTP AAPSLNPAIT TDNENLEQQT DRGFSFFRYP RLPSIGFGMK AASIVLSSPQ GEDSAFNDNA SVISWQLERE LHRDNLQRTK PEPEISSYGL GPREGRNIRK PPRRLSGLTW TNDTTHSVDN SNTTNHYDND VKDDEEEEEE EDKASELSAV RTAPARTVIS TNVVRDSADE SVANSSRPPT ADQPAPVDKP RRPLLFEEQQ QQHEKMKETR WPLFLVTMTL FVTIFAITTY FLTGYLGSND FFPQQPKSPY PTLNATEGNI VSQISKEMTQ LANQISVVSR DVHVLRYEYQ HGVGQDTIVK PTPDLEPRIN FLSPGLGTRV NRKLTSPSVG IRRTLPRRLY EGLTGKGSPQ PNPPETALEA WDDIGDCWCG APSKTGQGQL QLALELGQRA VLNEVVVEHI SASASPEPGV APREMELWAQ FKPFHGQQPA KATETETAIT NESITKKTGW FGLFHSSTSS SSSSSTSSKP AAPSSLSSIL GSILKTLHQA YPSDPETAYA NDRLLGPTYF RLGEWEYDRT GSTVQHFALD ALIDYPMLRV DKVVVRVKSN WGGNHTCLYR VKVHGHA // ID B6QG55_PENMQ Unreviewed; 945 AA. AC B6QG55; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Sad1/UNC domain protein {ECO:0000313|EMBL:EEA24440.1}; GN ORFNames=PMAA_084440 {ECO:0000313|EMBL:EEA24440.1}; OS Penicillium marneffei (strain ATCC 18224 / CBS 334.59 / QM 7333). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Trichocomaceae; Talaromyces. OX NCBI_TaxID=441960 {ECO:0000313|Proteomes:UP000001294}; RN [1] {ECO:0000313|Proteomes:UP000001294} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 18224 / CBS 334.59 / QM 7333 RC {ECO:0000313|Proteomes:UP000001294}; RX PubMed=25676766; DOI=10.1128/genomeA.01559-14; RA Nierman W.C., Fedorova-Abrams N.D., Andrianopoulos A.; RT "Genome sequence of the AIDS-associated pathogen Penicillium marneffei RT (ATCC18224) and its near taxonomic relative Talaromyces stipitatus RT (ATCC10500)."; RL Genome Announc. 3:E0155914-E0155914(2015). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS995901; EEA24440.1; -; Genomic_DNA. DR RefSeq; XP_002147951.1; XM_002147915.1. DR STRING; 441960.XP_002147951.1; -. DR EnsemblFungi; EEA24440; EEA24440; PMAA_084440. DR GeneID; 7025572; -. DR EuPathDB; FungiDB:PMAA_084440; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001294; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001294}; KW Reference proteome {ECO:0000313|Proteomes:UP000001294}. SQ SEQUENCE 945 AA; 103600 MW; 5C66DF47AB1DF969 CRC64; MFSSAQDFLL YWVRQRAVIA ALRSAFSNNN NNYTYSNILR GREALIIIVM TGACIVQVLR WMQVWVLLAS DFLAHATATV TQLQVRPDDT HALTCPVPTF YDLPFPICTI YDESDYRGSD YVSTLPAEPS ASSTSSTAVA AQSSSPPAAP DADVNDDSPL DNAKFLSFED WKKQNLAKIG QSVDNVGKRQ ASDVGQERRA QARPINNALD SLGDDAEIEL NFDGFGSESA HPTPWESGSP NEIRKAEGES SADNDSADGV SAVGRRKDAG TTCKERFNYA SFDCAATVLK TNPECSGSSS ILIENKDSYM LNECRAKNKF LILELCDDIL VDTIVLANYE FFSSIFRTFR VSVSDRYPVK ADKWKELGIF EAKNTRAVQA FAVENPLIWA RYLKIEFLTH YGNEFYCPLS LVRVHGTTML EEYKNEGDSS RSDEDIMETA EEVGKPVEAE RESAVQEQQE AFDNSTVPVV SPVLDISENI SPLNGSVLEI AALEFSILET ATCAANYSAL PANDMSHQMD IKTTVSASGS ANTTVVTPSD SKASPEREQT TGSETNMASR SESQHTSGRG SGTDSSATRT AGASGEDSTP TVEPTKVIPS TPPSPNPTTQ ESFFKSVNKR LQMLETNSTL SLLYIEEQSR MLRDAFSKVE KRQMAKIHTF LEELNTTVID EIRSLHLVYQ SLRTIVLDDF EHQQREVSTA ASQLAILTNE LMFQKRMTAL SSVLIMILFA LILLPRGSGI VGGIDFHSII PWSPHPRSST KISRIPSTGP SSPSLESETP TPPSVIPPMK KTHRRQCSNA LGHKSIKDLR ECTNPLAHSS KEDLQFTGYE DNGRFRSMSE FSDSDINKIP YTPFSSDIRH MLGYDSIHLG SQLNIAKDTT ASSPSASPLP NLAQDRPVSS PPVLNGPGLI PNGHHLNKDN LVDNESDGGS DSDSSTEPFP PFPTD // ID B6SKI2_MAIZE Unreviewed; 589 AA. AC B6SKI2; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 1. DT 11-NOV-2015, entry version 43. DE SubName: Full=Membrane protein-like; DE SubName: Full=SUN domain protein5; DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:AC194341.4_FGP003}; GN Name=AC194341.4_FG003 {ECO:0000313|EnsemblPlants:AC194341.4_FGP003}; GN ORFNames=ZEAMMB73_173025; OS Zea mays (Maize). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; OC PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Tripsacinae; OC Zea. OX NCBI_TaxID=4577; RN [1] RP NUCLEOTIDE SEQUENCE. RX PubMed=18937034; DOI=10.1007/s11103-008-9415-4; RA Alexandrov N.N., Brover V.V., Freidin S., Troukhan M.E., RA Tatarinova T.V., Zhang H., Swaller T.J., Lu Y.-P., Bouck J., RA Flavell R.B., Feldmann K.A.; RT "Insights into corn genes derived from large-scale cDNA sequencing."; RL Plant Mol. Biol. 69:179-194(2009). RN [2] {ECO:0000313|EnsemblPlants:AC194341.4_FGP003, ECO:0000313|Proteomes:UP000007305} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. B73 {ECO:0000313|EnsemblPlants:AC194341.4_FGP003, RC ECO:0000313|Proteomes:UP000007305}; RX PubMed=19965430; DOI=10.1126/science.1178534; RA Schnable P.S., Ware D., Fulton R.S., Stein J.C., Wei F., Pasternak S., RA Liang C., Zhang J., Fulton L., Graves T.A., Minx P., Reily A.D., RA Courtney L., Kruchowski S.S., Tomlinson C., Strong C., Delehaunty K., RA Fronick C., Courtney B., Rock S.M., Belter E., Du F., Kim K., RA Abbott R.M., Cotton M., Levy A., Marchetto P., Ochoa K., Jackson S.M., RA Gillam B., Chen W., Yan L., Higginbotham J., Cardenas M., RA Waligorski J., Applebaum E., Phelps L., Falcone J., Kanchi K., RA Thane T., Scimone A., Thane N., Henke J., Wang T., Ruppert J., RA Shah N., Rotter K., Hodges J., Ingenthron E., Cordes M., Kohlberg S., RA Sgro J., Delgado B., Mead K., Chinwalla A., Leonard S., Crouse K., RA Collura K., Kudrna D., Currie J., He R., Angelova A., Rajasekar S., RA Mueller T., Lomeli R., Scara G., Ko A., Delaney K., Wissotski M., RA Lopez G., Campos D., Braidotti M., Ashley E., Golser W., Kim H., RA Lee S., Lin J., Dujmic Z., Kim W., Talag J., Zuccolo A., Fan C., RA Sebastian A., Kramer M., Spiegel L., Nascimento L., Zutavern T., RA Miller B., Ambroise C., Muller S., Spooner W., Narechania A., Ren L., RA Wei S., Kumari S., Faga B., Levy M.J., McMahan L., Van Buren P., RA Vaughn M.W., Ying K., Yeh C.-T., Emrich S.J., Jia Y., Kalyanaraman A., RA Hsia A.-P., Barbazuk W.B., Baucom R.S., Brutnell T.P., Carpita N.C., RA Chaparro C., Chia J.-M., Deragon J.-M., Estill J.C., Fu Y., RA Jeddeloh J.A., Han Y., Lee H., Li P., Lisch D.R., Liu S., Liu Z., RA Nagel D.H., McCann M.C., SanMiguel P., Myers A.M., Nettleton D., RA Nguyen J., Penning B.W., Ponnala L., Schneider K.L., Schwartz D.C., RA Sharma A., Soderlund C., Springer N.M., Sun Q., Wang H., Waterman M., RA Westerman R., Wolfgruber T.K., Yang L., Yu Y., Zhang L., Zhou S., RA Zhu Q., Bennetzen J.L., Dawe R.K., Jiang J., Jiang N., Presting G.G., RA Wessler S.R., Aluru S., Martienssen R.A., Clifton S.W., McCombie W.R., RA Wing R.A., Wilson R.K.; RT "The B73 maize genome: complexity, diversity, and dynamics."; RL Science 326:1112-1115(2009). RN [3] RP NUCLEOTIDE SEQUENCE. RG Maize Genome Sequencing Project; RL Submitted (FEB-2013) to the EMBL/GenBank/DDBJ databases. RN [4] {ECO:0000313|EnsemblPlants:AC194341.4_FGP003} RP IDENTIFICATION. RC STRAIN=cv. B73 {ECO:0000313|EnsemblPlants:AC194341.4_FGP003}; RG EnsemblPlants; RL Submitted (FEB-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EU953247; ACG25365.1; -; mRNA. DR EMBL; CM000784; AFW83172.1; -; Genomic_DNA. DR RefSeq; NP_001147071.1; NM_001153599.1. DR UniGene; Zm.31400; -. DR STRING; 4577.AC194341.4_FGP003; -. DR EnsemblPlants; AC194341.4_FGT003; AC194341.4_FGP003; AC194341.4_FG003. DR GeneID; 100280680; -. DR KEGG; zma:100280680; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR OMA; IVKEQAN; -. DR Proteomes; UP000007305; Chromosome 8. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Complete proteome {ECO:0000313|Proteomes:UP000007305}; KW Reference proteome {ECO:0000313|Proteomes:UP000007305}. SQ SEQUENCE 589 AA; 63932 MW; 328F85B502CCFC5E CRC64; MSRKRREGGG GGRGAGPVDH HGGGKGSEAG AAATDAVSMD GGLREVSVPV VFSVWCLLFL LRSQFLHSQA DDDPSSEFYE EHGMRDSYCK VRPLEAYVLP YHNDSCQSSY SHSQPPQEAP SSSALASPQY NATTGGNASS PEAAFVGLDE FRSRMMQGKA ENDTGPPTDG GVAHRLEPNG AEYNYAAAAK GAKVLAHNKE AKGAANILGG DKDKYLRNPC SAADKFVVVE LSEETLVDTV ALANLEHYSS NFREFEVYGS TSYPTEAWEL LGRFTAENAK HAQRFVLPEP RWTRYLRLRL VSHYGSGFYC ILSYLEVYGV DAVERMLQDF IAGAGAGAGA EADASRDRAP IDFANRDADC NDTTAQQDGN GGAGRNDSTA GDGKSNSSRS GDAKLPPQVA ALASPTGRIH SDGVLKILMQ KMRSLELSLS TLEEYTRELN QRYGAKLPDL QNGLSQTAVA LEKMKADVHD LVDGKDSVAK DLDDLKAWKS TVSGKLDDLI KENQEMRWSV EEMRGVQETL QNKELAVLSI SLFFACLALF KLACDRVFCL FAGKGREEPD AEEHTRSSRA WMLVLASSSF TTLIVLLYN // ID B6THU8_MAIZE Unreviewed; 462 AA. AC B6THU8; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Sad1-unc84-like protein; DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:GRMZM2G109818_P01}; GN Name=cl9166_1 {ECO:0000313|EnsemblPlants:GRMZM2G109818_P01}; OS Zea mays (Maize). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; OC PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Tripsacinae; OC Zea. OX NCBI_TaxID=4577; RN [1] RP NUCLEOTIDE SEQUENCE. RX PubMed=18937034; DOI=10.1007/s11103-008-9415-4; RA Alexandrov N.N., Brover V.V., Freidin S., Troukhan M.E., RA Tatarinova T.V., Zhang H., Swaller T.J., Lu Y.-P., Bouck J., RA Flavell R.B., Feldmann K.A.; RT "Insights into corn genes derived from large-scale cDNA sequencing."; RL Plant Mol. Biol. 69:179-194(2009). RN [2] {ECO:0000313|EnsemblPlants:GRMZM2G109818_P01, ECO:0000313|Proteomes:UP000007305} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. B73 {ECO:0000313|EnsemblPlants:GRMZM2G109818_P01, RC ECO:0000313|Proteomes:UP000007305}; RX PubMed=19965430; DOI=10.1126/science.1178534; RA Schnable P.S., Ware D., Fulton R.S., Stein J.C., Wei F., Pasternak S., RA Liang C., Zhang J., Fulton L., Graves T.A., Minx P., Reily A.D., RA Courtney L., Kruchowski S.S., Tomlinson C., Strong C., Delehaunty K., RA Fronick C., Courtney B., Rock S.M., Belter E., Du F., Kim K., RA Abbott R.M., Cotton M., Levy A., Marchetto P., Ochoa K., Jackson S.M., RA Gillam B., Chen W., Yan L., Higginbotham J., Cardenas M., RA Waligorski J., Applebaum E., Phelps L., Falcone J., Kanchi K., RA Thane T., Scimone A., Thane N., Henke J., Wang T., Ruppert J., RA Shah N., Rotter K., Hodges J., Ingenthron E., Cordes M., Kohlberg S., RA Sgro J., Delgado B., Mead K., Chinwalla A., Leonard S., Crouse K., RA Collura K., Kudrna D., Currie J., He R., Angelova A., Rajasekar S., RA Mueller T., Lomeli R., Scara G., Ko A., Delaney K., Wissotski M., RA Lopez G., Campos D., Braidotti M., Ashley E., Golser W., Kim H., RA Lee S., Lin J., Dujmic Z., Kim W., Talag J., Zuccolo A., Fan C., RA Sebastian A., Kramer M., Spiegel L., Nascimento L., Zutavern T., RA Miller B., Ambroise C., Muller S., Spooner W., Narechania A., Ren L., RA Wei S., Kumari S., Faga B., Levy M.J., McMahan L., Van Buren P., RA Vaughn M.W., Ying K., Yeh C.-T., Emrich S.J., Jia Y., Kalyanaraman A., RA Hsia A.-P., Barbazuk W.B., Baucom R.S., Brutnell T.P., Carpita N.C., RA Chaparro C., Chia J.-M., Deragon J.-M., Estill J.C., Fu Y., RA Jeddeloh J.A., Han Y., Lee H., Li P., Lisch D.R., Liu S., Liu Z., RA Nagel D.H., McCann M.C., SanMiguel P., Myers A.M., Nettleton D., RA Nguyen J., Penning B.W., Ponnala L., Schneider K.L., Schwartz D.C., RA Sharma A., Soderlund C., Springer N.M., Sun Q., Wang H., Waterman M., RA Westerman R., Wolfgruber T.K., Yang L., Yu Y., Zhang L., Zhou S., RA Zhu Q., Bennetzen J.L., Dawe R.K., Jiang J., Jiang N., Presting G.G., RA Wessler S.R., Aluru S., Martienssen R.A., Clifton S.W., McCombie W.R., RA Wing R.A., Wilson R.K.; RT "The B73 maize genome: complexity, diversity, and dynamics."; RL Science 326:1112-1115(2009). RN [3] {ECO:0000313|EnsemblPlants:GRMZM2G109818_P01} RP IDENTIFICATION. RC STRAIN=cv. B73 {ECO:0000313|EnsemblPlants:GRMZM2G109818_P01}; RG EnsemblPlants; RL Submitted (FEB-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EU964563; ACG36681.1; -; mRNA. DR RefSeq; NP_001149754.1; NM_001156282.2. DR UniGene; Zm.94705; -. DR STRING; 4577.GRMZM2G109818_P01; -. DR EnsemblPlants; GRMZM2G109818_T01; GRMZM2G109818_P01; GRMZM2G109818. DR GeneID; 100283381; -. DR KEGG; zma:100283381; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000237750; -. DR KO; K19347; -. DR OMA; MEIARHS; -. DR Proteomes; UP000007305; Chromosome 5. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007305}; KW Reference proteome {ECO:0000313|Proteomes:UP000007305}. FT COILED 169 189 {ECO:0000256|SAM:Coils}. FT COILED 195 229 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 462 AA; 50961 MW; 85D22413BDF6A07B CRC64; MTMSASTAAI RTANTNGNHA VSSDSHSSQD ARQRTAGITK RKALPSILQK IPSNDLSHTI RGESVLQKSN YSSEGRKDVV ASATAERQNK NTTNVVASAA AVRQKKSPTK QENANWVTAL SVLVKLCLLI SATAWMGQVF WRWQSGELSF TTLDMENRLS RVEGFKKTAK MLQLQLDVLD KKLENEIDKA KGVIAKQFED KGNKIEKKMK ILEDKTDKLD KSLAELSDMG FLSKNEFEEI LSQLKKKKGF GGTDDEISLD DIRLYAKDVV EMEIARHSAD GLGMVDYALG SGGAKVVSHS EPFMNGKNYL PGRSNVHTTA QKMLEPSFGQ PGECFAVKGS SGFVKVKLRT AIIPEAVTLE HVDKSVAYDR SSAPKDFQVR GWYQGPHDDS EKDSNVMSTL GEFSYDLDKS NAQTFQLERT AQSRVVNMVQ LDISSNHGNL ELTCIYRFRV HGRKPVSPST GG // ID B6TRH2_MAIZE Unreviewed; 439 AA. AC B6TRH2; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Sad1/unc-84-like protein 2 {ECO:0000313|EMBL:ACG39705.1}; OS Zea mays (Maize). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; OC PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Tripsacinae; OC Zea. OX NCBI_TaxID=4577 {ECO:0000313|EMBL:ACG39705.1}; RN [1] {ECO:0000313|EMBL:ACG39705.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=18937034; DOI=10.1007/s11103-008-9415-4; RA Alexandrov N.N., Brover V.V., Freidin S., Troukhan M.E., RA Tatarinova T.V., Zhang H., Swaller T.J., Lu Y.-P., Bouck J., RA Flavell R.B., Feldmann K.A.; RT "Insights into corn genes derived from large-scale cDNA sequencing."; RL Plant Mol. Biol. 69:179-194(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EU967587; ACG39705.1; -; mRNA. DR UniGene; Zm.6043; -. DR STRING; 4577.GRMZM2G440614_P01; -. DR PaxDb; B6TRH2; -. DR PRIDE; B6TRH2; -. DR Gramene; B6TRH2; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000237750; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 168 188 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 439 AA; 47532 MW; 9A51FE1636631974 CRC64; MASPSFAAAA TSPASSPLTL DAIPLASRPA PGAAAAQRKR SVFLLNHRPH PVSPTPPLQP TAAASAQHAR RKKTSHPPRP RWQTVLSIAA KNAALLAALL YLGDLAWRWS HPPPPSPPPD RAALEGYAAR VDEVEASLTR TFRMIQVQLE AVDRKIDGEV GAARGDLLAL LEDKRLALER QLTRLDARAG ELGDALTGLK RMEFLRKDEF EKFWDEVKGS LGSSSESEVD LDQVRALARE IVMREIEKHA ADGIGRVDYA VASGGGRVVR HSEAYVPKRG FMVWMSGVDV GPKPEKMLQP SFGEPGQCFA LQGSNGFVEV KLKSGIIPEA VTLEHVSKDV AYDRSTAPKG CRVYGWYEET PGETQSGHAA KMALAEFAYD LEKNNVQTFD VTAPDVGVIN MVRLDFTSNH GSSQLTCIYR LRVHGHEPVS PGSSAGSQA // ID B6TY16_MAIZE Unreviewed; 439 AA. AC B6TY16; DT 16-DEC-2008, integrated into UniProtKB/TrEMBL. DT 16-DEC-2008, sequence version 1. DT 11-NOV-2015, entry version 36. DE SubName: Full=Sad1/unc-84-like protein 2; DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:GRMZM2G440614_P01}; GN Name=cl44325_1 {ECO:0000313|EnsemblPlants:GRMZM2G440614_P01}; OS Zea mays (Maize). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; OC PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Tripsacinae; OC Zea. OX NCBI_TaxID=4577; RN [1] RP NUCLEOTIDE SEQUENCE. RX PubMed=18937034; DOI=10.1007/s11103-008-9415-4; RA Alexandrov N.N., Brover V.V., Freidin S., Troukhan M.E., RA Tatarinova T.V., Zhang H., Swaller T.J., Lu Y.-P., Bouck J., RA Flavell R.B., Feldmann K.A.; RT "Insights into corn genes derived from large-scale cDNA sequencing."; RL Plant Mol. Biol. 69:179-194(2009). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=B73; RX PubMed=19936069; DOI=10.1371/journal.pgen.1000740; RA Soderlund C., Descour A., Kudrna D., Bomhoff M., Boyd L., Currie J., RA Angelova A., Collura K., Wissotski M., Ashley E., Morrow D., RA Fernandes J., Walbot V., Yu Y.; RT "Sequencing, mapping, and analysis of 27,455 maize full-length RT cDNAs."; RL PLoS Genet. 5:E1000740-E1000740(2009). RN [3] {ECO:0000313|EnsemblPlants:GRMZM2G440614_P01, ECO:0000313|Proteomes:UP000007305} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. B73 {ECO:0000313|EnsemblPlants:GRMZM2G440614_P01, RC ECO:0000313|Proteomes:UP000007305}; RX PubMed=19965430; DOI=10.1126/science.1178534; RA Schnable P.S., Ware D., Fulton R.S., Stein J.C., Wei F., Pasternak S., RA Liang C., Zhang J., Fulton L., Graves T.A., Minx P., Reily A.D., RA Courtney L., Kruchowski S.S., Tomlinson C., Strong C., Delehaunty K., RA Fronick C., Courtney B., Rock S.M., Belter E., Du F., Kim K., RA Abbott R.M., Cotton M., Levy A., Marchetto P., Ochoa K., Jackson S.M., RA Gillam B., Chen W., Yan L., Higginbotham J., Cardenas M., RA Waligorski J., Applebaum E., Phelps L., Falcone J., Kanchi K., RA Thane T., Scimone A., Thane N., Henke J., Wang T., Ruppert J., RA Shah N., Rotter K., Hodges J., Ingenthron E., Cordes M., Kohlberg S., RA Sgro J., Delgado B., Mead K., Chinwalla A., Leonard S., Crouse K., RA Collura K., Kudrna D., Currie J., He R., Angelova A., Rajasekar S., RA Mueller T., Lomeli R., Scara G., Ko A., Delaney K., Wissotski M., RA Lopez G., Campos D., Braidotti M., Ashley E., Golser W., Kim H., RA Lee S., Lin J., Dujmic Z., Kim W., Talag J., Zuccolo A., Fan C., RA Sebastian A., Kramer M., Spiegel L., Nascimento L., Zutavern T., RA Miller B., Ambroise C., Muller S., Spooner W., Narechania A., Ren L., RA Wei S., Kumari S., Faga B., Levy M.J., McMahan L., Van Buren P., RA Vaughn M.W., Ying K., Yeh C.-T., Emrich S.J., Jia Y., Kalyanaraman A., RA Hsia A.-P., Barbazuk W.B., Baucom R.S., Brutnell T.P., Carpita N.C., RA Chaparro C., Chia J.-M., Deragon J.-M., Estill J.C., Fu Y., RA Jeddeloh J.A., Han Y., Lee H., Li P., Lisch D.R., Liu S., Liu Z., RA Nagel D.H., McCann M.C., SanMiguel P., Myers A.M., Nettleton D., RA Nguyen J., Penning B.W., Ponnala L., Schneider K.L., Schwartz D.C., RA Sharma A., Soderlund C., Springer N.M., Sun Q., Wang H., Waterman M., RA Westerman R., Wolfgruber T.K., Yang L., Yu Y., Zhang L., Zhou S., RA Zhu Q., Bennetzen J.L., Dawe R.K., Jiang J., Jiang N., Presting G.G., RA Wessler S.R., Aluru S., Martienssen R.A., Clifton S.W., McCombie W.R., RA Wing R.A., Wilson R.K.; RT "The B73 maize genome: complexity, diversity, and dynamics."; RL Science 326:1112-1115(2009). RN [4] {ECO:0000313|EnsemblPlants:GRMZM2G440614_P01} RP IDENTIFICATION. RC STRAIN=cv. B73 {ECO:0000313|EnsemblPlants:GRMZM2G440614_P01}; RG EnsemblPlants; RL Submitted (FEB-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EU969881; ACG41999.1; -; mRNA. DR EMBL; BT055722; ACL54329.1; -; mRNA. DR RefSeq; NP_001146585.1; NM_001153113.1. DR UniGene; Zm.6043; -. DR STRING; 4577.GRMZM2G440614_P01; -. DR EnsemblPlants; GRMZM2G440614_T01; GRMZM2G440614_P01; GRMZM2G440614. DR GeneID; 100280181; -. DR KEGG; zma:100280181; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR KO; K19347; -. DR OMA; RVSGWYQ; -. DR Proteomes; UP000007305; Chromosome 3. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007305}; KW Reference proteome {ECO:0000313|Proteomes:UP000007305}. FT COILED 168 188 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 439 AA; 47502 MW; 9A51F8CC9CC9BC74 CRC64; MASPSFAAAA TSPASSPLTL DAIPLASRPA PGAAAAQRKR SVFLLNHRPH PVSPTPPLQP TAAASAQHAR RKKTSHPPRP RWQTVLSIAA KNAALLAALL YLGDLAWRWS HPPPPSPPPD RAALEGYAAR VDEVEASLTR TFRMIQVQLE AVDRKIDGEV GAARGDLLAL LEDKRLALER QLTRLDARAG ELGDALAGLK RMEFLRKDEF EKFWDEVKGS LGSSSESEVD LDQVRALARE IVMREIEKHA ADGIGRVDYA VASGGGRVVR HSEAYVPKRG FMVWMSGVDV GPKPEKMLQP SFGEPGQCFA LQGSNGFVEV KLKSGIIPEA VTLEHVSKDV AYDRSTAPKG CRVYGWYEET PGETQSGHAA KMALAEFAYD LEKNNVQTFD VTAPDVGVIN MVRLDFTSNH GSSQLTCIYR LRVHGHEPVS PGSSAGSQA // ID B7FQ97_PHATC Unreviewed; 450 AA. AC B7FQ97; DT 10-FEB-2009, integrated into UniProtKB/TrEMBL. DT 10-FEB-2009, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Sad1 and UNC84 domain containing {ECO:0000313|EMBL:EEC51303.1}; GN Name=UNC84/SAD1 {ECO:0000313|EMBL:EEC51303.1}; GN ORFNames=PHATRDRAFT_32120 {ECO:0000313|EMBL:EEC51303.1}; OS Phaeodactylum tricornutum (strain CCAP 1055/1). OC Eukaryota; Stramenopiles; Bacillariophyta; Bacillariophyceae; OC Bacillariophycidae; Naviculales; Phaeodactylaceae; Phaeodactylum. OX NCBI_TaxID=556484 {ECO:0000313|Proteomes:UP000000759}; RN [1] {ECO:0000313|EMBL:EEC51303.1, ECO:0000313|Proteomes:UP000000759} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCAP 1055/1 {ECO:0000313|EMBL:EEC51303.1, RC ECO:0000313|Proteomes:UP000000759}; RX PubMed=18923393; DOI=10.1038/nature07410; RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A., RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., RA Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., RA Katinka M., Mock T., Valentin K., Verret F., Berges J.A., Brownlee C., RA Cadoret J.P., Chiovitti A., Choi C.J., Coesel S., De Martino A., RA Detter J.C., Durkin C., Falciatore A., Fournet J., Haruta M., RA Huysman M.J., Jenkins B.D., Jiroutova K., Jorgensen R.E., Joubert Y., RA Kaplan A., Kroger N., Kroth P.G., La Roche J., Lindquist E., RA Lommer M., Martin-Jezequel V., Lopez P.J., Lucas S., Mangogna M., RA McGinnis K., Medlin L.K., Montsant A., Oudot-Le Secq M.P., Napoli C., RA Obornik M., Parker M.S., Petit J.L., Porcel B.M., Poulsen N., RA Robison M., Rychlewski L., Rynearson T.A., Schmutz J., Shapiro H., RA Siaut M., Stanley M., Sussman M.R., Taylor A.R., Vardi A., RA von Dassow P., Vyverman W., Willis A., Wyrwicz L.S., Rokhsar D.S., RA Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y., RA Grigoriev I.V.; RT "The Phaeodactylum genome reveals the evolutionary history of diatom RT genomes."; RL Nature 456:239-244(2008). RN [2] {ECO:0000313|EMBL:EEC51303.1, ECO:0000313|Proteomes:UP000000759} RP GENOME REANNOTATION. RC STRAIN=CCAP 1055/1 {ECO:0000313|EMBL:EEC51303.1, RC ECO:0000313|Proteomes:UP000000759}; RG Diatom Consortium; RA Grigoriev I., Grimwood J., Kuo A., Otillar R.P., Salamov A., RA Detter J.C., Lindquist E., Shapiro H., Lucas S., Glavina del Rio T., RA Pitluck S., Rokhsar D., Bowler C.; RL Submitted (AUG-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000605; EEC51303.1; -; Genomic_DNA. DR RefSeq; XP_002176840.1; XM_002176804.1. DR EnsemblProtists; Phatr3_J32120.t1; Phatr3_J32120.p1; Phatr3_J32120. DR GeneID; 7196808; -. DR KEGG; pti:PHATRDRAFT_32120; -. DR InParanoid; B7FQ97; -. DR KO; K19347; -. DR Proteomes; UP000000759; Chromosome 1. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000759}; KW Reference proteome {ECO:0000313|Proteomes:UP000000759}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 27 {ECO:0000256|SAM:SignalP}. FT CHAIN 28 450 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002855293. SQ SEQUENCE 450 AA; 49661 MW; 4C81101772A50385 CRC64; MRSKLSQLLL GVVALGTGCV LLVSSNADDT GLAGERHAAY QALASAASFQ SIVETWHTRA TQLIDRTVDL HSKYDAVEES VFEAEQAIRL FNAQQLKVVI EGFDAVHEAE LERRAIQQPQ DLEDVPMPLT KDEFRNAIPL DSIVDPSDVR MEHWVVDYID RVLNERTPPT QVQTTPSPTH SCVTPQKAVQ EVHAALVRHA TDGIGMEDHA RGARIVHEMT TTTYTPPPQS HQRLGNVWWR RFIPQDWESF LPSGWEEWDA RVPLFFSHTF GSKAPVSAKP ESILIPLTTP GACWPMDGSN GHVTLALAYP VAVSAITIDH VSKYLLNEPS EQLTSAPKDF RIVAYPPCIE HCHGLSFDVN DPFDLAQGTF ERDGTTVQTF ATQNLDPPLL GREHTMEEGS CSAAAATTCG VPDANQGLVA AVQVQILSNW GNEDYTCLYR IRVHGESADL // ID B7FRF0_PHATC Unreviewed; 146 AA. AC B7FRF0; DT 10-FEB-2009, integrated into UniProtKB/TrEMBL. DT 10-FEB-2009, sequence version 1. DT 04-MAR-2015, entry version 20. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EEC50996.1}; GN ORFNames=PHATRDRAFT_31469 {ECO:0000313|EMBL:EEC50996.1}; OS Phaeodactylum tricornutum (strain CCAP 1055/1). OC Eukaryota; Stramenopiles; Bacillariophyta; Bacillariophyceae; OC Bacillariophycidae; Naviculales; Phaeodactylaceae; Phaeodactylum. OX NCBI_TaxID=556484 {ECO:0000313|Proteomes:UP000000759}; RN [1] {ECO:0000313|EMBL:EEC50996.1, ECO:0000313|Proteomes:UP000000759} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCAP 1055/1 {ECO:0000313|EMBL:EEC50996.1, RC ECO:0000313|Proteomes:UP000000759}; RX PubMed=18923393; DOI=10.1038/nature07410; RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A., RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., RA Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., RA Katinka M., Mock T., Valentin K., Verret F., Berges J.A., Brownlee C., RA Cadoret J.P., Chiovitti A., Choi C.J., Coesel S., De Martino A., RA Detter J.C., Durkin C., Falciatore A., Fournet J., Haruta M., RA Huysman M.J., Jenkins B.D., Jiroutova K., Jorgensen R.E., Joubert Y., RA Kaplan A., Kroger N., Kroth P.G., La Roche J., Lindquist E., RA Lommer M., Martin-Jezequel V., Lopez P.J., Lucas S., Mangogna M., RA McGinnis K., Medlin L.K., Montsant A., Oudot-Le Secq M.P., Napoli C., RA Obornik M., Parker M.S., Petit J.L., Porcel B.M., Poulsen N., RA Robison M., Rychlewski L., Rynearson T.A., Schmutz J., Shapiro H., RA Siaut M., Stanley M., Sussman M.R., Taylor A.R., Vardi A., RA von Dassow P., Vyverman W., Willis A., Wyrwicz L.S., Rokhsar D.S., RA Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y., RA Grigoriev I.V.; RT "The Phaeodactylum genome reveals the evolutionary history of diatom RT genomes."; RL Nature 456:239-244(2008). RN [2] {ECO:0000313|EMBL:EEC50996.1, ECO:0000313|Proteomes:UP000000759} RP GENOME REANNOTATION. RC STRAIN=CCAP 1055/1 {ECO:0000313|EMBL:EEC50996.1, RC ECO:0000313|Proteomes:UP000000759}; RG Diatom Consortium; RA Grigoriev I., Grimwood J., Kuo A., Otillar R.P., Salamov A., RA Detter J.C., Lindquist E., Shapiro H., Lucas S., Glavina del Rio T., RA Pitluck S., Rokhsar D., Bowler C.; RL Submitted (AUG-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000605; EEC50996.1; -; Genomic_DNA. DR RefSeq; XP_002176533.1; XM_002176497.1. DR ProteinModelPortal; B7FRF0; -. DR GeneID; 7196042; -. DR KEGG; pti:PHATRDRAFT_31469; -. DR InParanoid; B7FRF0; -. DR Proteomes; UP000000759; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000759}; KW Reference proteome {ECO:0000313|Proteomes:UP000000759}. SQ SEQUENCE 146 AA; 16704 MW; 23BAC467044C140D CRC64; MLTSAFTSFP GRASSILQKN AKVYGPKHAL DIEKSSSCWN SDGTQENCQW FVVDFNRPVE PYQVNVQFQA GFSVETCTVA LKTSENDAWE PVDELEFHDV HEIQTKNLRE MKPCTALKLT LEDFTDFYGR VTIYRIEVWG KELVQG // ID B7FWZ9_PHATC Unreviewed; 743 AA. AC B7FWZ9; DT 10-FEB-2009, integrated into UniProtKB/TrEMBL. DT 10-FEB-2009, sequence version 1. DT 14-OCT-2015, entry version 25. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EEC49291.1}; GN ORFNames=PHATRDRAFT_45250 {ECO:0000313|EMBL:EEC49291.1}; OS Phaeodactylum tricornutum (strain CCAP 1055/1). OC Eukaryota; Stramenopiles; Bacillariophyta; Bacillariophyceae; OC Bacillariophycidae; Naviculales; Phaeodactylaceae; Phaeodactylum. OX NCBI_TaxID=556484 {ECO:0000313|Proteomes:UP000000759}; RN [1] {ECO:0000313|EMBL:EEC49291.1, ECO:0000313|Proteomes:UP000000759} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCAP 1055/1 {ECO:0000313|EMBL:EEC49291.1, RC ECO:0000313|Proteomes:UP000000759}; RX PubMed=18923393; DOI=10.1038/nature07410; RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A., RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., RA Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., RA Katinka M., Mock T., Valentin K., Verret F., Berges J.A., Brownlee C., RA Cadoret J.P., Chiovitti A., Choi C.J., Coesel S., De Martino A., RA Detter J.C., Durkin C., Falciatore A., Fournet J., Haruta M., RA Huysman M.J., Jenkins B.D., Jiroutova K., Jorgensen R.E., Joubert Y., RA Kaplan A., Kroger N., Kroth P.G., La Roche J., Lindquist E., RA Lommer M., Martin-Jezequel V., Lopez P.J., Lucas S., Mangogna M., RA McGinnis K., Medlin L.K., Montsant A., Oudot-Le Secq M.P., Napoli C., RA Obornik M., Parker M.S., Petit J.L., Porcel B.M., Poulsen N., RA Robison M., Rychlewski L., Rynearson T.A., Schmutz J., Shapiro H., RA Siaut M., Stanley M., Sussman M.R., Taylor A.R., Vardi A., RA von Dassow P., Vyverman W., Willis A., Wyrwicz L.S., Rokhsar D.S., RA Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y., RA Grigoriev I.V.; RT "The Phaeodactylum genome reveals the evolutionary history of diatom RT genomes."; RL Nature 456:239-244(2008). RN [2] {ECO:0000313|EMBL:EEC49291.1, ECO:0000313|Proteomes:UP000000759} RP GENOME REANNOTATION. RC STRAIN=CCAP 1055/1 {ECO:0000313|EMBL:EEC49291.1, RC ECO:0000313|Proteomes:UP000000759}; RG Diatom Consortium; RA Grigoriev I., Grimwood J., Kuo A., Otillar R.P., Salamov A., RA Detter J.C., Lindquist E., Shapiro H., Lucas S., Glavina del Rio T., RA Pitluck S., Rokhsar D., Bowler C.; RL Submitted (AUG-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000609; EEC49291.1; -; Genomic_DNA. DR RefSeq; XP_002179468.1; XM_002179432.1. DR UniGene; Ptc.1912; -. DR GeneID; 7200120; -. DR KEGG; pti:PHATRDRAFT_45250; -. DR InParanoid; B7FWZ9; -. DR Proteomes; UP000000759; Chromosome 6. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000759}; KW Reference proteome {ECO:0000313|Proteomes:UP000000759}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 743 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002855448. SQ SEQUENCE 743 AA; 82974 MW; F3663A5B9967FFFB CRC64; MKRLRGKRIT LLLASVIPLL AVDDVPMWQS TRDSDEEHPA DQEKELSCRE QTQIDRGTKQ FREPLHRPNG DRDSALLRQS LNPDILFEGS EFGQHYNIVV PFRMGYPHKL VGSYVENRYR PPENDTSLTS ISTRLRMDVV RDKFVSKDEK SAAEIGVVER SRETIKNEKV SAYIDDSRAG EDLPGPETMH ASSGDSKGDK FSEEDGLKRV LVDYASKSAG ALILEKSSSW NGISNVLNGD KDKYAIIPCE EPQKSVVIGL SEDILVKQIV LSYYERYSSH IGTFQVMGSP QTMGNWVDLG TYTSPRGNGK HAFDLHEPSW ARYLKFRFVS HYGDEHYCTV SQISVHGSTM LQGFHEQWAE TVEEQPNDKN ERDVDVSGSK IDPTFSATDQ ENGNDGSVQG TVSTIGQCYT RLDAVCQMDY SFERSAFLFA SGRSSTPDFD LLSALSSASF CQLGRQSART NYSHFVELGR RALTISPKRG RSTKSKFVAD LSDQALFHSL TESVVVKHIQ SLISRTTGID IHVERFGVLA TVDRTPDRIS VDDSNPPATV SSASGVIAGT KLVTSEVEAI ERITDEIESQ PLLQAIQQME EKIPFDTAFH ASGFSWSKIL EQLPSAACLE KLDFADFRSG KKLNLRNGGP GSHGNAQGGG GMEPIFKKFT DEIKALQTSV SIHDQFSKAL ASCYQQVFLE LLVEMDVKRI GEENAEWFIF FLCSISMDVS NYWWCCNDFK ATDIAFFPKP HNH // ID B7NZJ0_RABIT Unreviewed; 443 AA. AC B7NZJ0; DT 10-FEB-2009, integrated into UniProtKB/TrEMBL. DT 10-FEB-2009, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Sperm-associated antigen 4 protein (Predicted) {ECO:0000313|EMBL:ACJ76627.1}; GN Name=SPAG4 {ECO:0000313|EMBL:ACJ76627.1}; OS Oryctolagus cuniculus (Rabbit). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; OC Oryctolagus. OX NCBI_TaxID=9986; RN [1] {ECO:0000313|EMBL:ACJ76627.1} RP NUCLEOTIDE SEQUENCE. RA Antonellis A., Benjamin B., Blakesley R.W., Bouffard G.G., RA Brinkley C., Brooks S., Chu G., Chub I., Coleman H., Fuksenko T., RA Gestole M., Gregory M., Guan X., Gupta J., Gurson N., Han E., Han J., RA Hansen N., Hargrove A., Hines-Harris K., Ho S.-L., Hu P., Hunter G., RA Hurle B., Idol J.R., Johnson T., Knight E., Kwong P., Lee-Lin S.-Q., RA Legaspi R., Madden M., Maduro Q.L., Maduro V.B., Margulies E.H., RA Masiello C., Maskeri B., McDowell J., Merkulov G., Montemayor C., RA Mullikin J.C., Park M., Prasad A., Ramsahoye C., Reddix-Dugue N., RA Riebow N., Schandler K., Schueler M.G., Sison C., Smith L., RA Stantripop S., Thomas J.W., Thomas P.J., Tsipouri V., Young A., RA Green E.D.; RT "NISC Comparative Sequencing Initiative."; RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DP001043; ACJ76627.1; -; Genomic_DNA. DR RefSeq; NP_001164803.1; NM_001171332.1. DR UniGene; Ocu.7701; -. DR STRING; 9986.ENSOCUP00000016279; -. DR GeneID; 100328707; -. DR KEGG; ocu:100328707; -. DR CTD; 6676; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000246956; -. DR HOVERGEN; HBG079205; -. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 137 160 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 166 191 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 204 238 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 443 AA; 48326 MW; E15953F58AC2D4D1 CRC64; MRRSPRPGSA ASPHKHKPNF YSDNSNSSES AISGNSRGHR SAGSGPGELE GRRARGSSCG EPALSAGVPG GATWAGSSRP KPAPRSHNGQ TACGAATVRG GASEPAGAPV VPEEQLDLLS TLDLRQEMPP RPVSKSFLSL LFQVLSLLLS LTGDALVSVY REVCSIRFLL TAVSLLGFFL AVLWWGLLYL VPPLENEPKE MLTLSEYHER VRSQGQQLQQ LQAELDRLHK EVSSVRSANS ERVAKLVFQR LNEDFVRKPD YALSSVGASI DLEKTSQDYE DANTAYFWKR FSFWNYARPP TVILEPDVFP GNCWAFEGDQ GQVVIRLPGR VQLSDITLQH PPPSVAHTGG ANSAPRDFAV FGLQVDDETE VFLGKFTFDV KKSEIQTFHL QNEPPAAFPK VKIQILSNWG HPRFTCLYRV RAHGVRTSEG AGDSATGVTG GPH // ID B7P3M4_IXOSC Unreviewed; 1095 AA. AC B7P3M4; DT 10-FEB-2009, integrated into UniProtKB/TrEMBL. DT 10-FEB-2009, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEC01196.1}; GN ORFNames=IscW_ISCW000835 {ECO:0000313|EMBL:EEC01196.1}; OS Ixodes scapularis (Black-legged tick) (Deer tick). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; OC Acari; Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. OX NCBI_TaxID=6945 {ECO:0000313|Proteomes:UP000001555}; RN [1] {ECO:0000313|EMBL:EEC01196.1, ECO:0000313|Proteomes:UP000001555} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Wikel {ECO:0000313|Proteomes:UP000001555}; RG Ixodes scapularis Genome Project Consortium; RA Caler E., Hannick L.I., Bidwell S., Joardar V., Thiagarajan M., RA Amedeo P., Galinsky K.J., Schobel S., Inman J., Hostetler J., RA Miller J., Hammond M., Megy K., Lawson D., Kodira C., Sutton G., RA Meyer J., Hill C.A., Birren B., Nene V., Collins F., RA Alarcon-Chaidez F., Wikel S., Strausberg R.; RT "Annotation of Ixodes scapularis."; RL Submitted (MAR-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS629490; EEC01196.1; -; Genomic_DNA. DR RefSeq; XP_002404387.1; XM_002404343.1. DR STRING; 6945.ISCW000835-PA; -. DR EnsemblMetazoa; ISCW000835-RA; ISCW000835-PA; ISCW000835. DR GeneID; 8024114; -. DR KEGG; isc:IscW_ISCW000835; -. DR VectorBase; ISCW000835; Ixodes scapularis. DR CTD; 8024114; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B7P3M4; -. DR OMA; KIWFIIE; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; B7P3M4; -. DR Proteomes; UP000001555; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001555}; KW Reference proteome {ECO:0000313|Proteomes:UP000001555}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 1095 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002861359. FT COILED 765 785 {ECO:0000256|SAM:Coils}. FT COILED 808 835 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1095 AA; 119051 MW; 003084498FA7A627 CRC64; MKSLNLIGFW WALCAFELLS LKAYFECQTG NRCTCCCCED DVDLLPSVAE PQSYGGVIQS ILHSGSVTTQ VNATKDDGSG DHSDAGEAHK KVESEMAGSK PDGTPQHQPE GVPESLKDEA IVPVLKEEVT TKTSLDSTSG TTRKKGEDAS EEDRKESAEK MPTFVEWKQK MLAEHEKGGE QVPEVGGTLP KKKVSSPKSR RNYASYECGA KVLAANSEAD GAGRVLNEQV DEYMLNPCKA KIWFVVELCE MIQVSQVDLA NFELFSSMPK EFAVSVSDRY PTREWTSLGT FTALDQKAVQ SFKLQSEAYG KYIKVELLSK YGSEHYCPLS LVRVFGTSML DDYEQLVEKP QDPSQQQGAD DDEEDRQGAA EKQVSKNVVD RAKDAVMSIL KVLRRDQDGE TVNTTENPPC TDLNSNCSVL NVSRLLPHNQ TLEDQQRYQR RQQRCRLQSF GFYGSAFQAC TTCSHFVPNL GFPQTIPETP ACRFLRAAMG PEEDCPFYST VFKETKPAFT EDDSLENPSS VIAGDARSMN STESSTTLAS DEAQPSSSSA SLSDLKSSST DAAADAVSPM TSLDATPSTK LVSSLEPIQT SSVEESIQVQ SSDDPHKNAT AADFDVVLEV PKDEGTAKEE EPVPVVLLDS SSFATDAKPL STTVATPPIA QVPTSVKTTP ASSTILATAE KVAPTESAPS PAEATPASSS SVSAERGQCA YPSPVQAQLR EASTMRPPVL DVTSLTGSQK ESVFMRINNR IKALELNMSL SGQYLEELSR RYRRQMDDMQ RAFNRTVGAL NETAHAAAQR DLRQQAALVR LQQQLENLTQ VVDSLVAERQ TLSRQVFESH VCLILIEAIV LATVFSLCLR RSQQHPTQQA TQLVHPQHPL QADRVSSSEL RPQQAVNRVV LKRRSVSAGN DDQRIQEFVN VAEKKQKPSA ETLSRENSFL IVEPVVPIMM EKTPPKEKAK KYKAKKPSKK RNRALRRKTS LPALSAVPGA KTASADKVTS SAGVLFSNGA SQCADAAVTV LATQAAKGGT KPPPENTGCN GHGVGGGGGR GRTARGGRQA DCVRWNGTTN GYGCCDRDDR LGLVKFLRLP GKTPV // ID B7PLB4_IXOSC Unreviewed; 155 AA. AC B7PLB4; DT 10-FEB-2009, integrated into UniProtKB/TrEMBL. DT 10-FEB-2009, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=Sad1/Unc-84 domain-containing protein, putative {ECO:0000313|EMBL:EEC07386.1}; DE Flags: Fragment; GN ORFNames=IscW_ISCW018754 {ECO:0000313|EMBL:EEC07386.1}; OS Ixodes scapularis (Black-legged tick) (Deer tick). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; OC Acari; Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes. OX NCBI_TaxID=6945 {ECO:0000313|Proteomes:UP000001555}; RN [1] {ECO:0000313|EMBL:EEC07386.1, ECO:0000313|Proteomes:UP000001555} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Wikel {ECO:0000313|Proteomes:UP000001555}; RG Ixodes scapularis Genome Project Consortium; RA Caler E., Hannick L.I., Bidwell S., Joardar V., Thiagarajan M., RA Amedeo P., Galinsky K.J., Schobel S., Inman J., Hostetler J., RA Miller J., Hammond M., Megy K., Lawson D., Kodira C., Sutton G., RA Meyer J., Hill C.A., Birren B., Nene V., Collins F., RA Alarcon-Chaidez F., Wikel S., Strausberg R.; RT "Annotation of Ixodes scapularis."; RL Submitted (MAR-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS739614; EEC07386.1; -; Genomic_DNA. DR RefSeq; XP_002434562.1; XM_002434517.1. DR UniGene; Isc.13777; -. DR STRING; 6945.ISCW018754-PA; -. DR EnsemblMetazoa; ISCW018754-RA; ISCW018754-PA; ISCW018754. DR GeneID; 8050878; -. DR KEGG; isc:IscW_ISCW018754; -. DR VectorBase; ISCW018754; Ixodes scapularis. DR CTD; 8050878; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; B7PLB4; -. DR KO; K19347; -. DR OMA; YPLVELR; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; B7PLB4; -. DR Proteomes; UP000001555; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001555}; KW Reference proteome {ECO:0000313|Proteomes:UP000001555}. FT NON_TER 1 1 {ECO:0000313|EMBL:EEC07386.1}. SQ SEQUENCE 155 AA; 17530 MW; CA368737424464E3 CRC64; ETYSGGAIRY ELLGFSLWTF VRTARTAIQP QMHPGECWAF RGSQGHLVVQ LARRVRPTSF AVEHIPKELA ISGSLDSAPK DFHILGLDSE TDHVGKLLGK YTYDLDGEPL QYFLVQVRDP DPGSFRFVEM KILSNHGHLE YTCLYRLRVH GVPSD // ID B7XLA3_ENTBH Unreviewed; 344 AA. AC B7XLA3; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 14-OCT-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED43274.1}; DE Flags: Fragment; GN ORFNames=EBI_23771 {ECO:0000313|EMBL:EED43274.1}; OS Enterocytozoon bieneusi (strain H348) (Microsporidian parasite). OC Eukaryota; Fungi; Microsporidia; Enterocytozoonidae; Enterocytozoon. OX NCBI_TaxID=481877 {ECO:0000313|EMBL:EED43274.1, ECO:0000313|Proteomes:UP000001742}; RN [1] {ECO:0000313|EMBL:EED43274.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED43274.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=18060071; DOI=10.1371/journal.pone.0001277; RA Corradi N., Akiyoshi D.E., Morrison H.G., Feng X., Weiss L.M., RA Tzipori S., Keeling P.J.; RT "Patterns of genome evolution among the microsporidian parasites RT Encephalitozoon cuniculi, Antonospora locustae and Enterocytozoon RT bieneusi."; RL PLoS ONE 2:E1277-E1277(2007). RN [2] {ECO:0000313|EMBL:EED43274.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED43274.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=19132089; DOI=10.1371/journal.ppat.1000261; RA Akiyoshi D.E., Morrison H.G., Lei S., Feng X., Zhang Q., Corradi N., RA Mayanja H., Tumwine J.K., Keeling P.J., Weiss L.M., Tzipori S.; RT "Genomic survey of the non-cultivatable opportunistic human pathogen, RT Enterocytozoon bieneusi."; RL PLoS Pathog. 5:E1000261-E1000261(2009). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EED43274.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGB01000195; EED43274.1; -; Genomic_DNA. DR RefSeq; XP_002650781.1; XM_002650735.1. DR EnsemblFungi; EED43274; EED43274; EBI_23771. DR GeneID; 8642418; -. DR EuPathDB; MicrosporidiaDB:EBI_23771; -. DR InParanoid; B7XLA3; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000001742; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001742}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001742}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 168 186 Helical. {ECO:0000256|SAM:Phobius}. FT NON_TER 344 344 {ECO:0000313|EMBL:EED43274.1}. SQ SEQUENCE 344 AA; 39304 MW; 61679620184A0401 CRC64; MDKKEKITLG ANDPRGIRRS ISLVYGGEQK DPTPINLFKH SLYTTSKDQS TPKIETPNSN KGMSYYESDL DNNISIGKTS EVPTSEVYSQ GTSRPQKKYN NAITDRHNKT NEIQRSQNTT STTKPQSNSP RQIFIKKKYS DLNSEDKKNI CSRHESVQKS EKSTMYNTFK KIFIIIIAGL TIYYYLNASS ANPSPPEPIN LCTISQSEVC GISRLYRFGL FRRHTTNIDN VLMENSECVA LENRSNAYIQ IQFKKKCFVK KIGIYHPLQA NSKSAIREFA IRIVDSTEPR EIECAYTSAY GYQEYNVDAH LKAFMIRVKN NHGEKYISIY GIYAFGYCDQ ILEA // ID B7XME2_ENTBH Unreviewed; 276 AA. AC B7XME2; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 14-OCT-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED42885.1}; GN ORFNames=EBI_26663 {ECO:0000313|EMBL:EED42885.1}; OS Enterocytozoon bieneusi (strain H348) (Microsporidian parasite). OC Eukaryota; Fungi; Microsporidia; Enterocytozoonidae; Enterocytozoon. OX NCBI_TaxID=481877 {ECO:0000313|EMBL:EED42885.1, ECO:0000313|Proteomes:UP000001742}; RN [1] {ECO:0000313|EMBL:EED42885.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED42885.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=18060071; DOI=10.1371/journal.pone.0001277; RA Corradi N., Akiyoshi D.E., Morrison H.G., Feng X., Weiss L.M., RA Tzipori S., Keeling P.J.; RT "Patterns of genome evolution among the microsporidian parasites RT Encephalitozoon cuniculi, Antonospora locustae and Enterocytozoon RT bieneusi."; RL PLoS ONE 2:E1277-E1277(2007). RN [2] {ECO:0000313|EMBL:EED42885.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED42885.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=19132089; DOI=10.1371/journal.ppat.1000261; RA Akiyoshi D.E., Morrison H.G., Lei S., Feng X., Zhang Q., Corradi N., RA Mayanja H., Tumwine J.K., Keeling P.J., Weiss L.M., Tzipori S.; RT "Genomic survey of the non-cultivatable opportunistic human pathogen, RT Enterocytozoon bieneusi."; RL PLoS Pathog. 5:E1000261-E1000261(2009). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EED42885.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGB01000421; EED42885.1; -; Genomic_DNA. DR RefSeq; XP_002651170.1; XM_002651124.1. DR EnsemblFungi; EED42885; EED42885; EBI_26663. DR GeneID; 8642819; -. DR EuPathDB; MicrosporidiaDB:EBI_26663; -. DR InParanoid; B7XME2; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000001742; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001742}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001742}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 106 124 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 276 AA; 31790 MW; 0EC20B8AFF9A19DD CRC64; MSYYESDLDN NISIGKTSEV PTSEVYSQGT SRPQRKYNNA ITDRHNKTNE IQRSQNTTST TKPQSNSPRQ IFIKKKYSDL NSEDKKNICS RHESVQKSEK STMYNTFKKI FIIIIAGLII YYYLNASSAN PSPPEPINLC TISQSEVCGI SRLYRFGLFR RHTTNIDNVL MENSECVALE NRSNAYIQIQ FKKKCFVKKI GIYHPLQANS KSAIREFAIR IVDSTEPREI ECAYTSAYGY QEYNVDAHLK AFMIRVKNNH GEKYISIYGI YAFGYC // ID B7XNV8_ENTBH Unreviewed; 106 AA. AC B7XNV8; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED42371.1}; GN ORFNames=EBI_22230 {ECO:0000313|EMBL:EED42371.1}; OS Enterocytozoon bieneusi (strain H348) (Microsporidian parasite). OC Eukaryota; Fungi; Microsporidia; Enterocytozoonidae; Enterocytozoon. OX NCBI_TaxID=481877 {ECO:0000313|EMBL:EED42371.1, ECO:0000313|Proteomes:UP000001742}; RN [1] {ECO:0000313|EMBL:EED42371.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED42371.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=18060071; DOI=10.1371/journal.pone.0001277; RA Corradi N., Akiyoshi D.E., Morrison H.G., Feng X., Weiss L.M., RA Tzipori S., Keeling P.J.; RT "Patterns of genome evolution among the microsporidian parasites RT Encephalitozoon cuniculi, Antonospora locustae and Enterocytozoon RT bieneusi."; RL PLoS ONE 2:E1277-E1277(2007). RN [2] {ECO:0000313|EMBL:EED42371.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED42371.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=19132089; DOI=10.1371/journal.ppat.1000261; RA Akiyoshi D.E., Morrison H.G., Lei S., Feng X., Zhang Q., Corradi N., RA Mayanja H., Tumwine J.K., Keeling P.J., Weiss L.M., Tzipori S.; RT "Genomic survey of the non-cultivatable opportunistic human pathogen, RT Enterocytozoon bieneusi."; RL PLoS Pathog. 5:E1000261-E1000261(2009). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EED42371.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGB01000849; EED42371.1; -; Genomic_DNA. DR RefSeq; XP_002651686.1; XM_002651640.1. DR EnsemblFungi; EED42371; EED42371; EBI_22230. DR GeneID; 8643363; -. DR EuPathDB; MicrosporidiaDB:EBI_22230; -. DR InParanoid; B7XNV8; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000001742; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001742}; KW Reference proteome {ECO:0000313|Proteomes:UP000001742}. SQ SEQUENCE 106 AA; 12376 MW; DF9AE06BD5840731 CRC64; MKNSECVAWK NRSNAYIQIQ FKKKCFVKKI GIYHPLQANS KSAIREFAIR IVDSTEPREI ECAYTSAYGY QEYNVDAHLK AFMIRVKNNH GEKYISIYGI YAFGYC // ID B7XNZ6_ENTBH Unreviewed; 174 AA. AC B7XNZ6; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 14-OCT-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED42331.1}; GN ORFNames=EBI_25942 {ECO:0000313|EMBL:EED42331.1}; OS Enterocytozoon bieneusi (strain H348) (Microsporidian parasite). OC Eukaryota; Fungi; Microsporidia; Enterocytozoonidae; Enterocytozoon. OX NCBI_TaxID=481877 {ECO:0000313|EMBL:EED42331.1, ECO:0000313|Proteomes:UP000001742}; RN [1] {ECO:0000313|EMBL:EED42331.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED42331.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=18060071; DOI=10.1371/journal.pone.0001277; RA Corradi N., Akiyoshi D.E., Morrison H.G., Feng X., Weiss L.M., RA Tzipori S., Keeling P.J.; RT "Patterns of genome evolution among the microsporidian parasites RT Encephalitozoon cuniculi, Antonospora locustae and Enterocytozoon RT bieneusi."; RL PLoS ONE 2:E1277-E1277(2007). RN [2] {ECO:0000313|EMBL:EED42331.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED42331.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=19132089; DOI=10.1371/journal.ppat.1000261; RA Akiyoshi D.E., Morrison H.G., Lei S., Feng X., Zhang Q., Corradi N., RA Mayanja H., Tumwine J.K., Keeling P.J., Weiss L.M., Tzipori S.; RT "Genomic survey of the non-cultivatable opportunistic human pathogen, RT Enterocytozoon bieneusi."; RL PLoS Pathog. 5:E1000261-E1000261(2009). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EED42331.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGB01000884; EED42331.1; -; Genomic_DNA. DR RefSeq; XP_002651724.1; XM_002651678.1. DR EnsemblFungi; EED42331; EED42331; EBI_25942. DR GeneID; 8643406; -. DR EuPathDB; MicrosporidiaDB:EBI_25942; -. DR InParanoid; B7XNZ6; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000001742; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001742}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001742}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 7 22 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 174 AA; 20073 MW; 67258E5CAB891EA9 CRC64; MYNTFKKIFI IIIAGLIIYY YLNASSANPS PPEPINLCTI SQSEVCGISR LYRFGLFRRH TTNIDNVLME NSECVALENR SNAYIQIQFK KKCFVKKIGI YHPLQANSKS AIREFAIRIV DSTEPREIEC AYTSAYGYQE YNVDAHLKAF MIRVKNNHGE KYISIYGIYA FGYC // ID B7XQQ7_ENTBH Unreviewed; 174 AA. AC B7XQQ7; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 16-SEP-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED41722.1}; GN ORFNames=EBI_26759 {ECO:0000313|EMBL:EED41722.1}; OS Enterocytozoon bieneusi (strain H348) (Microsporidian parasite). OC Eukaryota; Fungi; Microsporidia; Enterocytozoonidae; Enterocytozoon. OX NCBI_TaxID=481877 {ECO:0000313|EMBL:EED41722.1, ECO:0000313|Proteomes:UP000001742}; RN [1] {ECO:0000313|EMBL:EED41722.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED41722.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=18060071; DOI=10.1371/journal.pone.0001277; RA Corradi N., Akiyoshi D.E., Morrison H.G., Feng X., Weiss L.M., RA Tzipori S., Keeling P.J.; RT "Patterns of genome evolution among the microsporidian parasites RT Encephalitozoon cuniculi, Antonospora locustae and Enterocytozoon RT bieneusi."; RL PLoS ONE 2:E1277-E1277(2007). RN [2] {ECO:0000313|EMBL:EED41722.1, ECO:0000313|Proteomes:UP000001742} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H348 {ECO:0000313|EMBL:EED41722.1, RC ECO:0000313|Proteomes:UP000001742}; RX PubMed=19132089; DOI=10.1371/journal.ppat.1000261; RA Akiyoshi D.E., Morrison H.G., Lei S., Feng X., Zhang Q., Corradi N., RA Mayanja H., Tumwine J.K., Keeling P.J., Weiss L.M., Tzipori S.; RT "Genomic survey of the non-cultivatable opportunistic human pathogen, RT Enterocytozoon bieneusi."; RL PLoS Pathog. 5:E1000261-E1000261(2009). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EED41722.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGB01001415; EED41722.1; -; Genomic_DNA. DR RefSeq; XP_002652335.1; XM_002652289.1. DR EnsemblFungi; EED41722; EED41722; EBI_26759. DR GeneID; 8644061; -. DR EuPathDB; MicrosporidiaDB:EBI_26759; -. DR InParanoid; B7XQQ7; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000001742; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001742}; KW Reference proteome {ECO:0000313|Proteomes:UP000001742}. SQ SEQUENCE 174 AA; 20061 MW; 502B7E4C458BD176 CRC64; MYNTFKKIFI IIIAGLTIYY YLNASSANPS PPEPINLCTI SQSEVCGISR LYRFGLFRRH TTNIDNVLME NSECVALENR SNAYIQIQFK KKCFVKKIGI YHPLQANSKS AIREFAIRIV DSTEPREIEC AYTSAYGYQE YNVDAHLKAF MIRVKNNHGE KYISIYGIYA FGYC // ID B7YZY0_DROME Unreviewed; 1252 AA. AC B7YZY0; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 43. DE SubName: Full=CG31678, isoform C {ECO:0000313|EMBL:ACL83050.1}; DE SubName: Full=CG31678, isoform E {ECO:0000313|EMBL:AHN54620.1}; GN ORFNames=CG31678 {ECO:0000313|EMBL:ACL83050.1, GN ECO:0000313|FlyBase:FBgn0051678}, GN Dmel_CG31678 {ECO:0000313|EMBL:ACL83050.1}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:ACL83050.1, ECO:0000313|Proteomes:UP000000803}; RN [1] {ECO:0000313|EMBL:ACL83050.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=10731132; DOI=10.1126/science.287.5461.2185; RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., RA Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., RA Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., RA Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; RT "The genome sequence of Drosophila melanogaster."; RL Science 287:2185-2195(2000). RN [2] {ECO:0000313|EMBL:ACL83050.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537568; RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A., RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A., RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R., RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J., RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C., RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.; RT "Finishing a whole-genome shotgun: release 3 of the Drosophila RT melanogaster euchromatic genome sequence."; RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002). RN [3] {ECO:0000313|EMBL:ACL83050.1, ECO:0000313|Proteomes:UP000000803} RP GENOME REANNOTATION. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083; RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., RA Lewis S.E.; RT "Annotation of the Drosophila melanogaster euchromatic genome: a RT systematic review."; RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). RN [4] {ECO:0000313|EMBL:ACL83050.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537573; RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., RA Ashburner M., Celniker S.E.; RT "The transposable elements of the Drosophila melanogaster euchromatin: RT a genomics perspective."; RL Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002). RN [5] {ECO:0000313|EMBL:ACL83050.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537574; RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., RA Karpen G.H.; RT "Heterochromatic sequences in a Drosophila whole-genome shotgun RT assembly."; RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002). RN [6] {ECO:0000313|EMBL:ACL83050.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022; RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., RA Ashburner M., Anxolabehere D.; RT "Combined evidence annotation of transposable elements in genome RT sequences."; RL PLoS Comput. Biol. 1:166-175(2005). RN [7] {ECO:0000313|EMBL:ACL83050.1} RP NUCLEOTIDE SEQUENCE. RA Celniker S., Carlson J., Wan K., Frise E., Hoskins R., Park S., RA Svirskas R., Rubin G.; RL Submitted (AUG-2006) to the EMBL/GenBank/DDBJ databases. RN [8] {ECO:0000313|EMBL:ACL83050.1} RP NUCLEOTIDE SEQUENCE. RG Berkeley Drosophila Genome Project; RA Celniker S., Carlson J., Wan K., Pfeiffer B., Frise E., George R., RA Hoskins R., Stapleton M., Pacleb J., Park S., Svirskas R., Smith E., RA Yu C., Rubin G.; RT "Drosophila melanogaster release 4 sequence."; RL Submitted (SEP-2006) to the EMBL/GenBank/DDBJ databases. RN [9] {ECO:0000313|EMBL:ACL83050.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569856; DOI=10.1126/science.1139815; RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.; RT "The Release 5.1 annotation of Drosophila melanogaster RT heterochromatin."; RL Science 316:1586-1591(2007). RN [10] {ECO:0000313|EMBL:ACL83050.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569867; DOI=10.1126/science.1139816; RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M., RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A., RA Dimitri P., Karpen G.H., Celniker S.E.; RT "Sequence finishing and mapping of Drosophila melanogaster RT heterochromatin."; RL Science 316:1625-1628(2007). RN [11] {ECO:0000313|EMBL:ACL83050.1} RP NUCLEOTIDE SEQUENCE. RG FlyBase; RL Submitted (JUL-2014) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE014134; ACL83050.1; -; Genomic_DNA. DR EMBL; AE014134; AHN54620.1; -; Genomic_DNA. DR RefSeq; NP_001137844.1; NM_001144372.2. DR RefSeq; NP_001286106.1; NM_001299177.1. DR UniGene; Dm.19853; -. DR STRING; 7227.FBpp0288498; -. DR GeneID; 35327; -. DR KEGG; dme:Dmel_CG31678; -. DR FlyBase; FBgn0051678; CG31678. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7MPRDC; -. DR GenomeRNAi; 35327; -. DR NextBio; 792976; -. DR Proteomes; UP000000803; Chromosome 2L. DR GO; GO:0046331; P:lateral inhibition; IMP:FlyBase. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000803}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:B7YZY0}; KW Reference proteome {ECO:0000313|Proteomes:UP000000803}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 21 46 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 839 860 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 183 210 {ECO:0000256|SAM:Coils}. FT COILED 775 823 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1252 AA; 137337 MW; 77405A6CDC2ED535 CRC64; MLLQFVMKFI CCELNKLKTN LTMFLIFFWI IWFSQIIAIS TVFMSFPEII TELPAVTITE LPLDRVMNRL DSVILDGSPA TAGNHSDEEQ HQQQQQPHED QPMQVSEADE EVPQKDEQMP KINDPGGGIQ VEGMVTPEAA TVGETQESSE ELQPGSAAFN GTEGTANLTN ANEEVPMPVF SEWAQKQMEA EASREQAMEL EQQVVNKSAQ RKNNTGSSSG KPPTLKLRSK NYASPDCGAK IIAHNSESKH TEAVLTQSTD EYMLSTCESR IWFVVELCEA IQAQKVDVAN YELFSSSPKN FTVAVSKRFP TRDWSNVGRF AAEDKRTIQT FELHPHLFGK FVRVDITSHY ANEHFCPLSL FRVFGTSEYE AFETEIRPSD DLDDFYDDYG AQEQKAAVGS GGNIFQSASD AVMQMVKKAA EVLVKPTKAL KWSEESVLCQ TPAFEAFSCI NCNTTLVERI NSLLSCQFQQ LQALLSLSHL RSDLLNSRVC QEEFGISLTG SEFASKMGKE QSYFLSMLPA EHVGAMCKLI QAEQNVTDHN HTKAPSLKQH VSSPEPVQDN ATATGVRQDC ENSKTPTKEP LTPSLEVVVP EVSQEVPSME DQSSTSSETV STTNSTPADV NIFNMPSEPE EVVVKVQLPP EPTLPTTLQP SDVESFTDAP STNALPGSSE AVANGDLGME EGNPANWDGI DNLLTTTVAS ITAGGGAAAA AAAVVNGNAN IGGAGIVGAG GPASLSSVNM QQKLTNGAQS ESVFIRLSNR IKALERNMSL SGQYLEELSR RYKKQVEELQ QTLTQQTLTV RQLEDQSRRY VEQEQLYQQH SAELAGEVRA LSYQVQACIL VIIIVGTCIF LMLVLGTVYY RKLRRQQQQL LKKDQAGHPP VAAKPKLDRR KSYEQTPNQS TPKQRRPSEE AMLILKECGD SNMQELDPPS RQRKISVCYG SNNNIAANMA IANTNGGASV RNSLHRRKGA KHSWHNSLDT TETSCGEQTD KFFDVDTLKS IKQSCGKPGK KKSLQQLKPL GLKRQESAPA TYTPDLQAEE PATQSDFDES LMLDDDDLAN FIPTSDLAYN EFMPEGPSGY QIVDTVDGKP GKEPGTKKSR RLSSPAFFKS PFSKSKNKGY SFNGVKNSHA VHEPTSWEWY RLKRSEKHQQ QQQAKLVSKS LPSASLDSSS LSEVNFPLSS STATTQNSFR ILGEAILSSG EGRITPNGNG NAMSGGLASS SSGSGSGGST TSSTTKKKQR ALNNLFRKAF DF // ID B8A716_ORYSI Unreviewed; 624 AA. AC B8A716; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEC71894.1}; GN ORFNames=OsI_04640 {ECO:0000313|EMBL:EEC71894.1}; OS Oryza sativa subsp. indica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39946 {ECO:0000313|EMBL:EEC71894.1, ECO:0000313|Proteomes:UP000007015}; RN [1] {ECO:0000313|EMBL:EEC71894.1, ECO:0000313|Proteomes:UP000007015} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. 93-11 {ECO:0000313|Proteomes:UP000007015}; RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038; RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S., RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., RA Cong L., Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., RA Wang J., Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., RA Wang J., Wang X., Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., RA Zhang Z., Bao J., Han Y., Dong L., Ji J., Chen P., Wu S., Liu J., RA Xiao Y., Bu D., Tan J., Yang L., Ye C., Zhang J., Xu J., Zhou Y., RA Yu Y., Zhang B., Zhuang S., Wei H., Liu B., Lei M., Yu H., Li Y., RA Xu H., Wei S., He X., Fang L., Zhang Z., Zhang Y., Huang X., Su Z., RA Tong W., Li J., Tong Z., Li S., Ye J., Wang L., Fang L., Lei T., RA Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F., Xu H., RA Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q., Li W., RA Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J., Gao L., RA Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M., McDermott J., RA Samudrala R., Wang J., Wong G.K.-S., Yang H.; RT "The genomes of Oryza sativa: a history of duplications."; RL PLoS Biol. 3:266-281(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000126; EEC71894.1; -; Genomic_DNA. DR ProteinModelPortal; B8A716; -. DR STRING; 39946.BGIOSGA000399-PA; -. DR EnsemblPlants; BGIOSGA000399-TA; BGIOSGA000399-PA; BGIOSGA000399. DR Gramene; B8A716; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OMA; YGSASYC; -. DR Proteomes; UP000007015; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR InterPro; IPR006311; TAT_signal. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. DR PROSITE; PS51318; TAT; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007015}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007015}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 41 59 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 565 585 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 606 623 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 502 522 {ECO:0000256|SAM:Coils}. FT COILED 544 564 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 624 AA; 69602 MW; D9D4F0DCB421A614 CRC64; MQRSRRALLK RKAAAAAAEE EEAGVGVATA AAAGRRRRRR LYGFSVSLVV ACWVVLLLLN PLVGHGNGQR DEGIFADEGS SDPSFDSVEP TLSEGSVDSV VQQENGENHA LPGDSCAKPD ENHVLSEETL LEKDQLCSND EAQGDGMDAL PKDNVDQGEN LPRTDDDSVV HPEGEVESEG VPRPARLSRV VPPGLDEFKT RAIAERGKGV PSGQPGNVIH RREPSGKLYN YASAAKGAKV LEFNKEAKGA SNILDKDKDK YLRNPCSAEG KFVIIELSEE TLVDTIAIAN FEHYSSNLKE FEMLSSLNYP TDSWETLGRF TVANAKIAQN FTFPEPKWAR YLKLNLLSHY GSEFYCTLSM LEVYGMDAVE KMLENLIPVE NKRLEPDDKM KEPVDQQTQL KEPTEGKESS HEPLDEDEFE LEDDKLNGDS SKNGAHDQVT ETRPIQAGRI PGDTVLKVLM QKVQSLDVSF SVLERYLEEL NSRYGQIFKD FDADIDTKDA LLEKIKLELK HLESSKDDFA KEIEGILSWK LVASSQLNQL LLDNVIIRSE LERFREKQAD LENRSFAVIF LSFVFGCLAI AKLSIGMIFN TCRLYNFEKF DRVKSGWLVL LFSSCIIASI LIIQ // ID B8AB67_ORYSI Unreviewed; 563 AA. AC B8AB67; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEC71017.1}; GN ORFNames=OsI_02710 {ECO:0000313|EMBL:EEC71017.1}; OS Oryza sativa subsp. indica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39946 {ECO:0000313|EMBL:EEC71017.1, ECO:0000313|Proteomes:UP000007015}; RN [1] {ECO:0000313|EMBL:EEC71017.1, ECO:0000313|Proteomes:UP000007015} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. 93-11 {ECO:0000313|Proteomes:UP000007015}; RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038; RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S., RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., RA Cong L., Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., RA Wang J., Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., RA Wang J., Wang X., Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., RA Zhang Z., Bao J., Han Y., Dong L., Ji J., Chen P., Wu S., Liu J., RA Xiao Y., Bu D., Tan J., Yang L., Ye C., Zhang J., Xu J., Zhou Y., RA Yu Y., Zhang B., Zhuang S., Wei H., Liu B., Lei M., Yu H., Li Y., RA Xu H., Wei S., He X., Fang L., Zhang Z., Zhang Y., Huang X., Su Z., RA Tong W., Li J., Tong Z., Li S., Ye J., Wang L., Fang L., Lei T., RA Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F., Xu H., RA Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q., Li W., RA Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J., Gao L., RA Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M., McDermott J., RA Samudrala R., Wang J., Wong G.K.-S., Yang H.; RT "The genomes of Oryza sativa: a history of duplications."; RL PLoS Biol. 3:266-281(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000126; EEC71017.1; -; Genomic_DNA. DR STRING; 39946.BGIOSGA003918-PA; -. DR EnsemblPlants; BGIOSGA003918-TA; BGIOSGA003918-PA; BGIOSGA003918. DR Gramene; B8AB67; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OMA; AVLKIMM; -. DR Proteomes; UP000007015; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007015}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007015}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 501 520 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 544 562 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 479 499 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 563 AA; 61063 MW; 80393960BAD7E145 CRC64; MSKKRREGGG GGNGGCDPPA VTDALSMDGG LREVSLSVVF SVWCLLFLLR SQFLHSQTDP SDFYDDVEDG MRENYCKVMP LEAYIFPTEY NASAAAPTCQ PSLHPPDQPQ QETDHRSLEP FNNTTGGKSS AEAAALDELD EFRSRILQGK AENGRVPDGA TPAAHRLEPS GAEYNYAAAS KGAKVLAHNR EAKGAANILG GDKDRYLRNP CSADDKFVDV ELSEETLVRT IGLANLEHYS SNFRDFELYG SPSYPAPAEE WELLGRFTAD NAKHAQRFVL PDPRWTRYLR LRLATHYGSG FYCILSYLEV YGIDAVEQML QEIISGSGAD TDASAAAKAE EGGDGGTLRN DIAQVNARLD GVGGGGGSAA GRNDSAGDGA GAKNNGSRMT VAGDGKPAAA GRFHGDAVLK IMMQKMRSLE LGLSTLEDYT KALNHRYGAK LPDLHTGLSQ TTMALDRMKA DVRDLVEWKG NVAKDLGELK EWRSNVEEMR SIQETMQNKE LAVLSISLFF ACLALFKLAC DRVLFLFTRK GAAAAERMCG ASKGWILVLA SSSFTTFLVL LYN // ID B8BXY4_THAPS Unreviewed; 925 AA. AC B8BXY4; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 27. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED94285.1}; GN ORFNames=THAPSDRAFT_3494 {ECO:0000313|EMBL:EED94285.1}; OS Thalassiosira pseudonana (Marine diatom) (Cyclotella nana). OC Eukaryota; Stramenopiles; Bacillariophyta; Coscinodiscophyceae; OC Thalassiosirophycidae; Thalassiosirales; Thalassiosiraceae; OC Thalassiosira. OX NCBI_TaxID=35128 {ECO:0000313|Proteomes:UP000001449}; RN [1] {ECO:0000313|EMBL:EED94285.1, ECO:0000313|Proteomes:UP000001449} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED94285.1, RC ECO:0000313|Proteomes:UP000001449}; RX PubMed=15459382; DOI=10.1126/science.1101156; RA Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D., RA Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., RA Brzezinski M.A., Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., RA Detter J.C., Glavina T., Goodstein D., Hadi M.Z., Hellsten U., RA Hildebrand M., Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., RA Lau W.W., Lane T.W., Larimer F.W., Lippmeier J.C., Lucas S., RA Medina M., Montsant A., Obornik M., Parker M.S., Palenik B., RA Pazour G.J., Richardson P.M., Rynearson T.A., Saito M.A., RA Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A., Wilkerson F.P., RA Rokhsar D.S.; RT "The genome of the diatom Thalassiosira pseudonana: ecology, RT evolution, and metabolism."; RL Science 306:79-86(2004). RN [2] {ECO:0000313|EMBL:EED94285.1, ECO:0000313|Proteomes:UP000001449} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED94285.1, RC ECO:0000313|Proteomes:UP000001449}; RX PubMed=18923393; DOI=10.1038/nature07410; RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A., RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., RA Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., RA Katinka M., Mock T., Valentin K., Verret F., Berges J.A., Brownlee C., RA Cadoret J.P., Chiovitti A., Choi C.J., Coesel S., De Martino A., RA Detter J.C., Durkin C., Falciatore A., Fournet J., Haruta M., RA Huysman M.J., Jenkins B.D., Jiroutova K., Jorgensen R.E., Joubert Y., RA Kaplan A., Kroger N., Kroth P.G., La Roche J., Lindquist E., RA Lommer M., Martin-Jezequel V., Lopez P.J., Lucas S., Mangogna M., RA McGinnis K., Medlin L.K., Montsant A., Oudot-Le Secq M.P., Napoli C., RA Obornik M., Parker M.S., Petit J.L., Porcel B.M., Poulsen N., RA Robison M., Rychlewski L., Rynearson T.A., Schmutz J., Shapiro H., RA Siaut M., Stanley M., Sussman M.R., Taylor A.R., Vardi A., RA von Dassow P., Vyverman W., Willis A., Wyrwicz L.S., Rokhsar D.S., RA Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y., RA Grigoriev I.V.; RT "The Phaeodactylum genome reveals the evolutionary history of diatom RT genomes."; RL Nature 456:239-244(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000640; EED94285.1; -; Genomic_DNA. DR RefSeq; XP_002288849.1; XM_002288813.1. DR EnsemblProtists; Thaps3494; Thaps3494; Thaps3494. DR GeneID; 7448776; -. DR KEGG; tps:THAPSDRAFT_3494; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR InParanoid; B8BXY4; -. DR Proteomes; UP000001449; Chromosome 3. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001449}; KW Reference proteome {ECO:0000313|Proteomes:UP000001449}. FT COILED 454 481 {ECO:0000256|SAM:Coils}. FT COILED 530 550 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 925 AA; 100065 MW; 37FD86B4A2A37EF5 CRC64; MAKSRSSASA SAANDDNGSA TSNQPTNGTA NNTAPVAEGP NPPGMVMRSG RKKRRGASSS AGTSDTDDNA SVASSTAASK QRRSSKRRKL DDDESSVGST ASDAESGTKA ITTTKRKSVN GGTSGRGRRK KEEEEKTELP AIEEKDVEME TVMDESGGVG GGKKNDVHVD TEVNDARKNE VVVAAAAANG ETASNTTSTA AAKQPSPAAP PLGADTLRRL VTHGLRLPSQ HTPKSAPGTN DTNNSVNNNE DGGEEDGISP AVPPGGRRLV FDTTAKKSSA VEGGGKRNKL LMQIHKERGG EGEHSTGEEG QEQQQQSTEE HQRPMSPPFP QRFPHLFIEQ ITAQYHHGTL KKEVFQCTLG MFIVFLSVFV VVNLTSVISG NALSSAWDVK MQSMQWKSWY GINNAANQTS YDVVVEDVVT TVAQGQPEQE VIEKIVEALD PKLLEEATVK LQAERLEEHK VQQLKRAIEK TEKDMQALDA AVIDWTKALP HLSDALSPSY RAKPEHSDVN KFNSQFEGLS SGIFEKYGSL SQWEAALNKA EEAMEQLTRG NGDVNAVNDA LDVVSKLSLV PAPANVLDVS TISIIGEGCE GKDYTPEDGV AENEEDEVVV VGGIDVEALD NTSDAPVRNE DAQNAYMSLM KFAQSSVAAL VGAQGPSALA QRWVQQLIKE ELKTEEDHES SVPDLPSITD ATKSSTASGD SYTARDAIID IDRRLEIEDA DRTGKFDYAS VIHGARVLRR GPRATSLSLY ETLPLLNRFM AYTKLRFYGH PAEVALRPTT PMHARGQCWS FQNEYYSRRV RQRGSELTND GYEGEYATLS VSLSSQISVS EVVMEHLPSS VASNANTAVK DFRLLGYEDV GAYGEPVELG TFQYDINGPF SMQTFTIPQT IDGMSVPKLK AVSLAVDTNW GADYTCLYRF RVHGY // ID B8CB42_THAPS Unreviewed; 1204 AA. AC B8CB42; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED89064.1}; GN ORFNames=THAPSDRAFT_24613 {ECO:0000313|EMBL:EED89064.1}; OS Thalassiosira pseudonana (Marine diatom) (Cyclotella nana). OC Eukaryota; Stramenopiles; Bacillariophyta; Coscinodiscophyceae; OC Thalassiosirophycidae; Thalassiosirales; Thalassiosiraceae; OC Thalassiosira. OX NCBI_TaxID=35128 {ECO:0000313|Proteomes:UP000001449}; RN [1] {ECO:0000313|EMBL:EED89064.1, ECO:0000313|Proteomes:UP000001449} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED89064.1, RC ECO:0000313|Proteomes:UP000001449}; RX PubMed=15459382; DOI=10.1126/science.1101156; RA Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D., RA Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., RA Brzezinski M.A., Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., RA Detter J.C., Glavina T., Goodstein D., Hadi M.Z., Hellsten U., RA Hildebrand M., Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., RA Lau W.W., Lane T.W., Larimer F.W., Lippmeier J.C., Lucas S., RA Medina M., Montsant A., Obornik M., Parker M.S., Palenik B., RA Pazour G.J., Richardson P.M., Rynearson T.A., Saito M.A., RA Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A., Wilkerson F.P., RA Rokhsar D.S.; RT "The genome of the diatom Thalassiosira pseudonana: ecology, RT evolution, and metabolism."; RL Science 306:79-86(2004). RN [2] {ECO:0000313|EMBL:EED89064.1, ECO:0000313|Proteomes:UP000001449} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED89064.1, RC ECO:0000313|Proteomes:UP000001449}; RX PubMed=18923393; DOI=10.1038/nature07410; RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A., RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., RA Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., RA Katinka M., Mock T., Valentin K., Verret F., Berges J.A., Brownlee C., RA Cadoret J.P., Chiovitti A., Choi C.J., Coesel S., De Martino A., RA Detter J.C., Durkin C., Falciatore A., Fournet J., Haruta M., RA Huysman M.J., Jenkins B.D., Jiroutova K., Jorgensen R.E., Joubert Y., RA Kaplan A., Kroger N., Kroth P.G., La Roche J., Lindquist E., RA Lommer M., Martin-Jezequel V., Lopez P.J., Lucas S., Mangogna M., RA McGinnis K., Medlin L.K., Montsant A., Oudot-Le Secq M.P., Napoli C., RA Obornik M., Parker M.S., Petit J.L., Porcel B.M., Poulsen N., RA Robison M., Rychlewski L., Rynearson T.A., Schmutz J., Shapiro H., RA Siaut M., Stanley M., Sussman M.R., Taylor A.R., Vardi A., RA von Dassow P., Vyverman W., Willis A., Wyrwicz L.S., Rokhsar D.S., RA Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y., RA Grigoriev I.V.; RT "The Phaeodactylum genome reveals the evolutionary history of diatom RT genomes."; RL Nature 456:239-244(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000648; EED89064.1; -; Genomic_DNA. DR RefSeq; XP_002293328.1; XM_002293292.1. DR STRING; 35128.Thaps24613; -. DR EnsemblProtists; Thaps24613; Thaps24613; Thaps24613. DR GeneID; 7445715; -. DR KEGG; tps:THAPSDRAFT_24613; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B8CB42; -. DR Proteomes; UP000001449; Chromosome 13. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001449}; KW Reference proteome {ECO:0000313|Proteomes:UP000001449}. SQ SEQUENCE 1204 AA; 129458 MW; 8E0AA16F8BDACE89 CRC64; MTVTTVDETV ILYSASTKEF DVSNGETRES ANEIEDNINV GSRQLDDVSD ADASSVDVSA TTVTVADVNG EHESDSTTPP NKIPSDTVGV ENSSGKDETS TTEAVPRDTH EEEVDTTKQA EIEATTIDES TASNTDSRTE PPKTIDNIEN GVIYYDIDDS PSIYVDERVS ASAGDTVGAL ESNDDDGSAD TSESTQLDSK EVDPSGLESD EQVSAEGAGE YSDDFSNEPG AEETQTTSPN ESGNDAASMV SDTGNPDDPN ESDPTNDDTN SDDANSIQSD EANSTDSESA DDIPDEGEDG PKEHVLVDYA SKLAGAQILE KSPSLKGTSN LLTGDIDKYA IAPCEEKKYV VIGLSEDILV KRIKLSNYER YSSHVREFQV LASQEYPAPS EYWNDLGTYT ALSKSGEQTF ELNEPAWARY LKFRFMSHYG KEHYCTLSQI KVHGSTMLQG FHEQWIESEK KDRELAEDGE VGGEQEGGES HDAETEMQGL QEYGDSVVDE IDGEESEEVF DEEDDAFSES AVMEEENGQT QDEEQVAQPT GDSKEISEDA SRDNESVNDE DITLHESVEK SGEVEPQVAS EDSDSLSTDV DIVVNTNQID DASSTEDEEK PVTANIEREQ PESPDVVDAV QRSDSTSQDD ASETTNASVD HAVSRMEEGS DGGGSASTPL SNHDADENAT NNTTDDTTTQ LDTEEAPIDS ESVGEELPPT DSIHVVTDAV KAAVADATVV IKHVKEVVQA TDTVSEIKKI IRTTIGISDE DNATAGDSQD AMPPSTESNA TIPINDTTVD AESPAKDTTT DPEVEENEDA SSEVMEQTVA KAASSVNETD STNVTQTKTK KAEGESKSST NTSAVAKAES KSKANTAVVT GQPKTIIPAN NNVIKKEAST DKLVAKLSIK YPCMKHLDFQ SFKAAKSLIA NASPGSGAMG GAKMEPIFAK ITHEIKSVQS MQHQYEQYIS SVKACYDKVL LDLVNDIDSI QTNFDGRLTI LERTLLETAR ETSPTAAALV DQGALMLSFI PFTSMPAVYP FQRISDCPEM VCMCVGIAVF VLLLRSSFKK KLKQIQQQQL GDQKKQTQMN VSAASVPKQK AARVSTPTQS ASIGKKKQTP SAISTNHNII PSEEDEIAPV SLGSNASVAP FGFEMKGSPR DVPHSVSCED SVYSIPTSAR SEPIQSGLHA SPSIAKRDSP LQRFKIKILR PGSA // ID B8CDR2_THAPS Unreviewed; 790 AA. AC B8CDR2; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 14-OCT-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED88659.1}; GN ORFNames=THAPSDRAFT_25077 {ECO:0000313|EMBL:EED88659.1}; OS Thalassiosira pseudonana (Marine diatom) (Cyclotella nana). OC Eukaryota; Stramenopiles; Bacillariophyta; Coscinodiscophyceae; OC Thalassiosirophycidae; Thalassiosirales; Thalassiosiraceae; OC Thalassiosira. OX NCBI_TaxID=35128 {ECO:0000313|Proteomes:UP000001449}; RN [1] {ECO:0000313|EMBL:EED88659.1, ECO:0000313|Proteomes:UP000001449} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED88659.1, RC ECO:0000313|Proteomes:UP000001449}; RX PubMed=15459382; DOI=10.1126/science.1101156; RA Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D., RA Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., RA Brzezinski M.A., Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., RA Detter J.C., Glavina T., Goodstein D., Hadi M.Z., Hellsten U., RA Hildebrand M., Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., RA Lau W.W., Lane T.W., Larimer F.W., Lippmeier J.C., Lucas S., RA Medina M., Montsant A., Obornik M., Parker M.S., Palenik B., RA Pazour G.J., Richardson P.M., Rynearson T.A., Saito M.A., RA Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A., Wilkerson F.P., RA Rokhsar D.S.; RT "The genome of the diatom Thalassiosira pseudonana: ecology, RT evolution, and metabolism."; RL Science 306:79-86(2004). RN [2] {ECO:0000313|EMBL:EED88659.1, ECO:0000313|Proteomes:UP000001449} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED88659.1, RC ECO:0000313|Proteomes:UP000001449}; RX PubMed=18923393; DOI=10.1038/nature07410; RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A., RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., RA Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., RA Katinka M., Mock T., Valentin K., Verret F., Berges J.A., Brownlee C., RA Cadoret J.P., Chiovitti A., Choi C.J., Coesel S., De Martino A., RA Detter J.C., Durkin C., Falciatore A., Fournet J., Haruta M., RA Huysman M.J., Jenkins B.D., Jiroutova K., Jorgensen R.E., Joubert Y., RA Kaplan A., Kroger N., Kroth P.G., La Roche J., Lindquist E., RA Lommer M., Martin-Jezequel V., Lopez P.J., Lucas S., Mangogna M., RA McGinnis K., Medlin L.K., Montsant A., Oudot-Le Secq M.P., Napoli C., RA Obornik M., Parker M.S., Petit J.L., Porcel B.M., Poulsen N., RA Robison M., Rychlewski L., Rynearson T.A., Schmutz J., Shapiro H., RA Siaut M., Stanley M., Sussman M.R., Taylor A.R., Vardi A., RA von Dassow P., Vyverman W., Willis A., Wyrwicz L.S., Rokhsar D.S., RA Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y., RA Grigoriev I.V.; RT "The Phaeodactylum genome reveals the evolutionary history of diatom RT genomes."; RL Nature 456:239-244(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000650; EED88659.1; -; Genomic_DNA. DR RefSeq; XP_002294304.1; XM_002294268.1. DR EnsemblProtists; Thaps25077; Thaps25077; Thaps25077. DR GeneID; 7443985; -. DR KEGG; tps:THAPSDRAFT_25077; -. DR Proteomes; UP000001449; Chromosome 15. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001449}; KW Reference proteome {ECO:0000313|Proteomes:UP000001449}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 790 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002869328. FT COILED 243 263 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 790 AA; 85931 MW; C840A48B04487EBB CRC64; MAARTVVLPA LLLSHTIVAS AAAAASVDNK SIYEATRRHH DQLQSQMETK LQGLDNVRTL LEGEVAALKR LLDDDDDGAS VEKLYQSMDV SSLKLMNLIS RTNALMEKVG SYNRLALSEG GVASAGASSS GVVGLVERLD SLYMEELDRR DRFGEWKDNL VADAADGEDG ASDATKSTSS PTTATESLYI TLSKLQSLLN PQTILQPSES TLQSSLLNYT KQSIISHTQS EYTKMNDHLS YIHSKYEDQI KAFQMELKHQ QLNSECIAIP RVVELVGQEL HRYYGGSVDE ELVMDYASFE NGGSVVYGLT SGVYRPCVRN DDVAVGGSSG GGRTEEGGID PRAIYERSKD KTIEAMYHRQ RDLMKSPAGE SGNLSWKDQV YQMMERIDVW EWYTSYKFGS LRQYLPEDWE RALDWMSDRL LQQSGSASGG GSWDEYTPRG TIDALIPDYV YHSLGISNSE LFGTVFGRTA SPEVAITSGT SKSGDNGSSS GNSSIKPLGN CYPLSMHSDD DPALAYISRH AHMEGGMQVM DDTSLLIGPK YTVRLSQPIY IDAVTLEHHS FPLPKAAVGE YEGKKGGESA PRYVRVVGFP PCLDTGTDGD DECSARGFEI ASPIDLGSFE YQRIAVSGRE DDYGGVGEDE GSNTAATLNR RRSIQSFAVN GGKWKPSAPI PDEASSVESS DEAGEEQGEC TLESLSCAAK PDTKPKPAED TRMGMGMGAG QCTPNYDDTE TIPSCGGDGN TESTPPSVES SRQTVAAVSF IIEENWGNDD YTCLYRVRVH GDAVEDDSLL // ID B8LM10_PICSI Unreviewed; 661 AA. AC B8LM10; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:ABR16690.1}; OS Picea sitchensis (Sitka spruce) (Pinus sitchensis). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Pinidae; Pinales; Pinaceae; Picea. OX NCBI_TaxID=3332 {ECO:0000313|EMBL:ABR16690.1}; RN [1] {ECO:0000313|EMBL:ABR16690.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Green portion of the leader tissue RC {ECO:0000313|EMBL:ABR16690.1}; RA Ralph S.G., Chun H.E., Liao N., Ali J., Reid K., Kolosova N., RA Cooper N., Cullis C., Jancsik S., Moore R., Mayo M., Wagner S., RA Holt R.A., Jones S.J.M., Marra M.A., Ritland C.E., Ritland K., RA Bohlmann J.; RT "Full length cDNA sequences from Sitka Spruce (Picea sitchensis)."; RL Submitted (JUN-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EF676808; ABR16690.1; -; mRNA. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 25 43 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 607 632 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 644 660 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 503 523 {ECO:0000256|SAM:Coils}. FT COILED 542 562 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 661 AA; 74725 MW; 60329C1933F45B3C CRC64; MQKSRRAQLK KKQALEQSFL GRKRLYEVSF TLVFVAWGLL FFLNSRTGRS DGFRDADGSR QPVYGGNSTS NGQHKNEYPK NEQLAFQNEG NKFDRVQYCT ALSSADDVEN TIAMPNAVNT YVIEGDSLKE LNQSNNEESQ VDFENNVDQE IPNPASDEPE IASNCTLTED NSSVTALFID EPSKVEQSVD SFPASEGPAQ ADTAVSQSIL TEYKETHPQT HRSSRATPVG LDEFKKKASN EKDRPTGNQF GIITHRREPG GSEYNYAAAS KGAKILAHNK EVKGVQSILD KDQDKYLRNP CSAEEKFVVI ELSEETLVDT VAIANFEHYS SNLKDFELFS SLVYPTDDWV LLGNFTAGNV KHVQRFTLQE PKWARYLKLR FLNHYGSEFF CTLSTVEVYG VDAIERMLED LIAVGKHGLR NIDLSGEPSS THAIGATPLP DEKGSNSFDE LHLLFDGKEP HGGLPEKEDA SKANSPDPTV EMIQQKGGRM PGDTVLKILM QKVRSLELNL SVLEKYLEEL TIRYGDLFSE LDKELDENTL YLHQIREELN HLQEHKKMME EEIGEYRSWK FTISNKLDEL AMDNNFLRLE VQNNHLRVQH MESKETVVFG VSFIFVCIAV MKISLDFIVT IFRLCKVENK RSSAWVFLLL SSSLVSFILS L // ID B8MBQ2_TALSN Unreviewed; 887 AA. AC B8MBQ2; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Sad1/UNC domain protein {ECO:0000313|EMBL:EED18185.1}; GN ORFNames=TSTA_119470 {ECO:0000313|EMBL:EED18185.1}; OS Talaromyces stipitatus (strain ATCC 10500 / CBS 375.48 / QM 6759 / OS NRRL 1006) (Penicillium stipitatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Trichocomaceae; Talaromyces. OX NCBI_TaxID=441959 {ECO:0000313|Proteomes:UP000001745}; RN [1] {ECO:0000313|Proteomes:UP000001745} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 10500 / CBS 375.48 / QM 6759 / NRRL 1006 RC {ECO:0000313|Proteomes:UP000001745}; RX PubMed=25676766; DOI=10.1128/genomeA.01559-14; RA Nierman W.C., Fedorova-Abrams N.D., Andrianopoulos A.; RT "Genome sequence of the AIDS-associated pathogen Penicillium marneffei RT (ATCC18224) and its near taxonomic relative Talaromyces stipitatus RT (ATCC10500)."; RL Genome Announc. 3:E0155914-E0155914(2015). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EQ962655; EED18185.1; -; Genomic_DNA. DR RefSeq; XP_002482177.1; XM_002482132.1. DR STRING; 441959.XP_002482177.1; -. DR EnsemblFungi; EED18185; EED18185; TSTA_119470. DR GeneID; 8105222; -. DR EuPathDB; FungiDB:TSTA_119470; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; B8MBQ2; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001745; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001745}; KW Reference proteome {ECO:0000313|Proteomes:UP000001745}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 887 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002877515. SQ SEQUENCE 887 AA; 97087 MW; B5C73F307EB27FF6 CRC64; MTGACIIQVL RWMQVWVLLA SDFLAHATAS QLQVQSNDNQ FLICHAPSFY DVIFPICSIY DESGRGSDYV SIVPTEPSAT TTSTTGAAQS SPPPVQDTDV NELDADSALD TADFLSFEDW KKRNLAKIGQ SADNVGGKRQ GGDIGQERRT QARAINNALD ALGDDAEIEL NFDGFGSESA QATPWESGSG NEKVNSDGET SVDNDSADGV SAVGRRKDAG TTCKERFNYA SFDCAATVLK TNPECSGSSS ILIENKDSYM LNECRAKNKF LILELCDDIL VDTIVLANYE FFSSIFRTFR VSVSDRYPVK ADKWKELGIF EAKNTRAVQA FAVENPLIWA RYVKIEFLTH YGNEFYCPLS LVRVHGTTML EEYKNEGDAS RSDEEVMETA EEVGRPVEDE QEISVQGQGA LDNSTVPISN PILDIWESTS PLNGSVLEMA ALEFSTLKTA TCAADYSAIE TPDTLNQTAV AASTGSANIT IVTPSDSKAS PEQTINGDMN MASRVGSQNA TTRGSAQDSS TVRTSVASGE DSTSAVEPTK VIPSSPPSPN PTTQESFFKS VNKRLQMLES NSTLSLLYIE EQSRMLRDAF SKVEKRQMAK MNTFLEDLNN TVIDEIRNLH MVYQSLRTIV LDDFEHQQRE VSTAASQLAI LTNELVFQKR MTALSSVLIM ILFALILFPR GSGIVGGIDF QSMITWSPRP KMSRSSRIPS TGPSSPSLES ETQTPPTVNP SKKKAHRRQC SNALRHESIK DLRECANSLA HSSEEDLQFS GYEDNGRFRS MSDFSDSDIS KIPYTPFSSD IQHMLGYDAV RMRPQINVVP DTTASSPSPS PLPSVPQDRP VSSPPVLKAA PSVDHPTNGD HVDDIQRDDS DAESDTSTEP FPPFPTD // ID B8MRH8_TALSN Unreviewed; 653 AA. AC B8MRH8; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EED13115.1}; GN ORFNames=TSTA_056140 {ECO:0000313|EMBL:EED13115.1}; OS Talaromyces stipitatus (strain ATCC 10500 / CBS 375.48 / QM 6759 / OS NRRL 1006) (Penicillium stipitatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Trichocomaceae; Talaromyces. OX NCBI_TaxID=441959 {ECO:0000313|Proteomes:UP000001745}; RN [1] {ECO:0000313|Proteomes:UP000001745} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 10500 / CBS 375.48 / QM 6759 / NRRL 1006 RC {ECO:0000313|Proteomes:UP000001745}; RX PubMed=25676766; DOI=10.1128/genomeA.01559-14; RA Nierman W.C., Fedorova-Abrams N.D., Andrianopoulos A.; RT "Genome sequence of the AIDS-associated pathogen Penicillium marneffei RT (ATCC18224) and its near taxonomic relative Talaromyces stipitatus RT (ATCC10500)."; RL Genome Announc. 3:E0155914-E0155914(2015). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EQ962659; EED13115.1; -; Genomic_DNA. DR RefSeq; XP_002487226.1; XM_002487181.1. DR EnsemblFungi; EED13115; EED13115; TSTA_056140. DR GeneID; 8100679; -. DR EuPathDB; FungiDB:TSTA_056140; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR InParanoid; B8MRH8; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001745; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001745}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001745}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 301 327 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 653 AA; 72616 MW; 626C5455C4D18B66 CRC64; MPRTSHQRDP AYSPSGRRYS LEPGSPVRRS KRLEHGSPSP TPFALRATTP DARQLHETLR AASESPSKER PDTRARSSAH SSVTPSPPVQ RTISSTSTPA PPSLNPAITA GNETLEPQTD RGFSFFRYPR LPSIGFGRKE ASIVLSSPEV EDDNASVVSW QLERELHRDN LQRTKPEPEP ESYSLVPREA RNIRKPPRRL SGLTWTNDTT HSADNTNTTS NYDDDGKDDD KEDEEDKKSD TSDIRTAAAR TVISSNNNRD SADDSVSGNS RPPTSDQSAL VDRPRRPPLF NQQDITKETH WSLFLLLMTL FITIFIITTY FLGGYLLSND LFPHQARNYP TLNTTEGNIV SQLSDELVRL NTQMTSVSTY VHHLSYEQKK IADQVTIVRP NPEFEPRINF LSPGLGTRVD PKLTSPSIGT RRTLPRRLYE SLSRKRIPQP NPPGTALEAW NDIGDCWCAA PSKTGQAQLA LDLGQRAIID EVVVEHIPAG ASPDPGVAPR EMELWARFRP FRGGQQQQQQ QQAAKATETA ITSESSNKRS GWFGLFRSST TSSSTSSSSS SLSSILDAII KTLQQAYPSD PETAYANDRL LGPSYFRLGQ WEYDRAGGAV QHFALDALID YPMVRVDKVV LRVKSNWGGN STCLYRVKVH GHV // ID B8NLW8_ASPFN Unreviewed; 612 AA. AC B8NLW8; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 27. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EED49343.1}; GN ORFNames=AFLA_094240 {ECO:0000313|EMBL:EED49343.1}; OS Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM OS 12722 / SRRC 167). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=332952 {ECO:0000313|EMBL:EED49343.1, ECO:0000313|Proteomes:UP000001875}; RN [1] {ECO:0000313|Proteomes:UP000001875} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM 12722 / SRRC 167 RC {ECO:0000313|Proteomes:UP000001875}; RA Payne W.G.A., Dean R.A., Nierman W.C., Amedeo P., Caler E.G.A., RA Fedorova N.D., Maiti R., Joardar V., Inman J., Galinsky K.J., Yu J., RA Bhatnagar D., Cleveland T.E.; RT "Genome sequence of Aspergillus flavus strain NRRL 3357."; RL Submitted (SEP-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EQ963480; EED49343.1; -; Genomic_DNA. DR RefSeq; XP_002381244.1; XM_002381203.1. DR EnsemblFungi; CADAFLAT00009109; CADAFLAP00009109; CADAFLAG00009109. DR GeneID; 7919516; -. DR KEGG; afv:AFLA_094240; -. DR EuPathDB; FungiDB:AFLA_094240; -. DR HOGENOM; HOG000176993; -. DR OMA; CWCSAPR; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001875; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001875}; KW Reference proteome {ECO:0000313|Proteomes:UP000001875}. SQ SEQUENCE 612 AA; 67179 MW; 0442348B5A3D06A6 CRC64; MPPKRASTRR AGAVTRTSER GTPSYIPNMS SPDARNPALP DIPTKQSFAY GSSTTPILPR ELSAKPRMNL AEMAANIDEG RRVAQDRDFD RPHMNTRSRR QSISASLSPV RRSRREPTPD QLQLLDSLRE ATMSPNPNGQ DHAEQSTPTP TPPIPHTLST ASSPATESLT NPKYPVLTTD QLYPSPLLRY GSPARNAISL SSPNFATSID NESVVSWNVE RDIHEDDLQR TRPNGTIIPS NPIREASFDE STHESTSPLR ERVKSNVRSV GNAAVGLQKG LPIKPVSLVV LAVVSILTAC FFGDQISSIS SSIGSRLPLY GSPFRDLNAT ALQAVHGLSN QVVRLGEEVS SLSKEVDVIK SEVEHIPAPS TIVQPIPAQE TPKTNFLSIG MGVLVDPYNT SPTSGRSAGF LQKLHSRFLP SSSQQQPEPP LAALTPWQDV GECWCSKPRS GMSQLAMHLG REIVPEEVVI EHIPKGASIR PEVAPRDMEL WAQFQIVDES NPDSPPSPNP SRTSGILSEE LSLHNHIIDT LRLAYKDEPE GAYSNDELLG PSFYRVGQWT YDLHASNHIQ KFELDAIIDV PAIRVNKVAF RVKSNWGGND TCLYRLKLYG HI // ID B8NRJ5_ASPFN Unreviewed; 870 AA. AC B8NRJ5; DT 03-MAR-2009, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=Sad1/UNC domain protein {ECO:0000313|EMBL:EED46847.1}; GN ORFNames=AFLA_049000 {ECO:0000313|EMBL:EED46847.1}; OS Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM OS 12722 / SRRC 167). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=332952 {ECO:0000313|EMBL:EED46847.1, ECO:0000313|Proteomes:UP000001875}; RN [1] {ECO:0000313|Proteomes:UP000001875} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 200026 / FGSC A1120 / NRRL 3357 / JCM 12722 / SRRC 167 RC {ECO:0000313|Proteomes:UP000001875}; RA Payne W.G.A., Dean R.A., Nierman W.C., Amedeo P., Caler E.G.A., RA Fedorova N.D., Maiti R., Joardar V., Inman J., Galinsky K.J., Yu J., RA Bhatnagar D., Cleveland T.E.; RT "Genome sequence of Aspergillus flavus strain NRRL 3357."; RL Submitted (SEP-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EQ963483; EED46847.1; -; Genomic_DNA. DR RefSeq; XP_002383027.1; XM_002382986.1. DR EnsemblFungi; CADAFLAT00010892; CADAFLAP00010892; CADAFLAG00010892. DR GeneID; 7920110; -. DR KEGG; afv:AFLA_049000; -. DR EuPathDB; FungiDB:AFLA_049000; -. DR HOGENOM; HOG000172520; -. DR OMA; RNTREVQ; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001875; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001875}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001875}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 28 {ECO:0000256|SAM:SignalP}. FT CHAIN 29 870 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002878381. FT TRANSMEM 694 716 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 870 AA; 94546 MW; E4A36723ECC09EC3 CRC64; MVIPAWAAAA WTSLTTALIL PGIPGAAAEN KQALCLARHW SEVEAEFIQW PICVESRWER TAPRITQDTT RSPDQTVSVT VSEGAPSTTA IPAPGGQPDH ELDTDSPLDN SNFLSFEDWK KQNLAKVGQS AENVRGNRHA AGKEDRRRPT GINNALDSLG EDTEIDLDFG GFGAEASDAA KPTSWGSSIP TAGITGTAAG ASAGDMEAAV SADLRKGASR GKDAGTTCKE RFNYASFDCA ATVLKTNPEC KGSSSVLVEN KDSYMLNECR AKNKFLILEL CDDILVDTVV LANYEFFSSI FHTFRVSVAD RYPAKTDQWR ELGVYEARNT REIQAFAVEN PLIWARYVKI EFLTHYGNEF YCPLSLVRIH GTTMLEEYKH DGETNRGDEE AAAEALEPSP HPVDVEVKDV AQQPLTTVAL PDEPTNGPTA TIEAQGSCSH HGMEVVRLLQ KGVPPPVDTC DISTAPTGAE NEAASQSSES RPKANEETTP SGEASAPVSQ VDPSDKGSVG GQKVTGPTGA SPDSASSTTL GTETVRQDAA HESEIKSVSS PKEESSIPSE SVRPSGTQPP SSNPTTQESF FKSVNKRLQM LESNSTLSLL YIEEQSRILR DAFSKVEKRQ LAKTSTFLEN LNVTVLNELR QFREQYDQVW KSVALEFEHQ RIQYHQEIHS ISAQLGVLAD ELVFQKRVSV IQSIMILFCF ALVLFSRVPL GTYIDIPRVQ NMMNRSYSLR SSSPIFFGSP SASPSSTRPA SSYRATGRHR RNMSEDSQEE PLSPTIAYSP PTPTSDPSSP DEADKRPAPS LATVDMPHLA PPHFRSHSSP PVLNPADEES QGEESPVSYE SRGSSYYDSP GSTESSEPIL ASDGSMRQEG // ID B9DR50_LEPMC Unreviewed; 783 AA. AC B9DR50; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 24-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=SAD1 protein {ECO:0000313|EMBL:CAQ03440.1}; GN Name=sad1 {ECO:0000313|EMBL:CAQ03440.1}; OS Leptosphaeria maculans (Blackleg fungus) (Phoma lingam). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Leptosphaeriaceae; Leptosphaeria; Leptosphaeria maculans complex. OX NCBI_TaxID=5022 {ECO:0000313|EMBL:CAQ03440.1}; RN [1] {ECO:0000313|EMBL:CAQ03440.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=19445597; DOI=10.1094/MPMI-22-6-0725; RA Remy E., Meyer M., Blaise F., Simon U.K., Kuhn D., Balesdent M.H., RA Rouxel T.; RT "A key enzyme of the Leloir pathway is involved in pathogenicity of RT Leptosphaeria maculans toward oilseed rape."; RL Mol. Plant Microbe Interact. 22:725-736(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AM941451; CAQ03440.1; -; Genomic_DNA. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; SQ SEQUENCE 783 AA; 86062 MW; 7FDA935181EE5DD6 CRC64; MSQLASSGAP TPRRSGRLSN KASSIAETAV TTVTKAGTRA RRTGPLIEVK SRKSNAYGAS GRVGTAEELP VAATGFAQAF QNQRGNALVR EGPVEESEDG TDSADELAAE TPRLSGARNG HFPASSPPRS PTPTAGTAAT SVPGFSFLQS EDTPASEEDD AESVGNTSKS FGPLHEAGMI GQQDRPHMPY SSTQATPEPT PLVQKTNLRR SLQSQTTRMG ASQIKVPLQE QGAPPLRPSY LQTPAPGREN GTLAASAAAS AAHKKAIDES VDALLAKEQA RLHRDGAPQS QPKYQGRRRH ANSPKTVNEQ PGEVESPQKF QIDWPLKKHL SWVLGVLAAI TLVGWLGHSM MSSVASASDA NTTNNKPGLL SAVNARASYT MGKVAEFIQP PRGPTVEEEV AAFRAGDDNI MWHRMYKMSD KFETRINGVH ATIEELRKEL PDMLIVRRHE DGRSEISDDF WQALQAKLRS EEENPEWVQY LTQVKQKLDD IFDHSVDRDD TKVRPQAVSR QEFLELIDQR FRELSTRVNE NIEEAFKSQT EKFQSLVTAE AKKAMIESVR LQSLAQTNLV ANYELHLKSP NYFSPSLGAV VVPHLTSATR LDRARWFTTI AQKLALLPQR NPPQAALTEW RQPGDCWCSA PNVLGGAQTQ LTVSLALPMT PQKVTIEHVP MSMVPARDVS NAPRDVEIWV QTEKPVKSYY RYSGGTCGEG LPGWACLGAF KYNIHASNHV QTFDLVSETS EPIKRAMLRV KSNWGADHTC LYQVRLHGED ARADYEYQVR LND // ID B9EXW8_ORYSJ Unreviewed; 563 AA. AC B9EXW8; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 24-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein; GN ORFNames=OsJ_02484; OS Oryza sativa subsp. japonica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39947; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038; RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S., RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., RA Cong L., Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., RA Wang J., Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., RA Wang J., Wang X., Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., RA Zhang Z., Bao J., Han Y., Dong L., Ji J., Chen P., Wu S., Liu J., RA Xiao Y., Bu D., Tan J., Yang L., Ye C., Zhang J., Xu J., Zhou Y., RA Yu Y., Zhang B., Zhuang S., Wei H., Liu B., Lei M., Yu H., Li Y., RA Xu H., Wei S., He X., Fang L., Zhang Z., Zhang Y., Huang X., Su Z., RA Tong W., Li J., Tong Z., Li S., Ye J., Wang L., Fang L., Lei T., RA Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F., Xu H., RA Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q., Li W., RA Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J., Gao L., RA Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M., McDermott J., RA Samudrala R., Wang J., Wong G.K.-S., Yang H.; RT "The genomes of Oryza sativa: a history of duplications."; RL PLoS Biol. 3:266-281(2005). RN [2] RP NUCLEOTIDE SEQUENCE. RA Wang J., Li R., Fan W., Huang Q., Zhang J., Zhou Y., Hu Y., Zi S., RA Li J., Ni P., Zheng H., Zhang Y., Zhao M., Hao Q., McDermott J., RA Samudrala R., Kristiansen K., Wong G.K.-S.; RT "Improved gene annotation of the rice (Oryza sativa) genomes."; RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000138; EEE54931.1; -; Genomic_DNA. DR STRING; 39947.LOC_Os01g41600.1; -. DR PaxDb; B9EXW8; -. DR Gramene; B9EXW8; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR ExpressionAtlas; B9EXW8; baseline. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 501 520 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 544 562 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 479 499 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 563 AA; 61050 MW; 2DDB18B869F9C660 CRC64; MSKKRREGGG GGNGGCDPPA VTDALSMDGG LREVSLSVVF SVWCLLFLLR SQFLHSQTDP SDFYDDVEDG MRENYCKVMP LEAYIFPTEY NASAAAPTCQ PSLHPPDQPQ QETDHRSLEP FNNTTGGKSS AEAAALDELD EFRSRILQGK AENGRVPDGA TPAAHRLEPS GAEYNYAAAS KGAKVLAHNR EAKGAANILG GDKDRYLRNP CSADDKFVDV ELSEETLVRT IGLANLEHYS SNFRDFELYG SPSYPAPAEE WELLGRFTAD NAKHAQRFVL PDPRWTRYLR LRLATHYGSG FYCILSYLEV YGIDAVEQML QEIISGSGAD TDASAAAKAE EGGDGGTLRN DTAQVNARLD GVGGGGGSAA GRNDSAGDGA GAKNNGSRMT VAGDGKPAAA GRFHGDAVLK IMMQKMRSLE LGLSTLEDYT KALNHRYGAK LPDLHTGLSQ TTMALDRMKA DVRDLVEWKG NVAKDLGELK EWRSNVEEMR SIQETMQNKE LAVLSISLFF ACLALFKLAC DRVLFLFTRK GAAAAERMCG ASKGWILVLA SSSFTTFLVL LYN // ID B9GL41_POPTR Unreviewed; 582 AA. AC B9GL41; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 11-DEC-2013, sequence version 2. DT 11-NOV-2015, entry version 36. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EEE84267.2}; GN ORFNames=POPTR_0001s10790g {ECO:0000313|EMBL:EEE84267.2}; OS Populus trichocarpa (Western balsam poplar) (Populus balsamifera OS subsp. trichocarpa). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; OC Populus. OX NCBI_TaxID=3694 {ECO:0000313|EMBL:EEE84267.2, ECO:0000313|Proteomes:UP000006729}; RN [1] {ECO:0000313|EMBL:EEE84267.2, ECO:0000313|Proteomes:UP000006729} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nisqually {ECO:0000313|Proteomes:UP000006729}; RX PubMed=16973872; DOI=10.1126/science.1128691; RA Tuskan G.A., Difazio S., Jansson S., Bohlmann J., Grigoriev I., RA Hellsten U., Putnam N., Ralph S., Rombauts S., Salamov A., Schein J., RA Sterck L., Aerts A., Bhalerao R.R., Bhalerao R.P., Blaudez D., RA Boerjan W., Brun A., Brunner A., Busov V., Campbell M., Carlson J., RA Chalot M., Chapman J., Chen G.-L., Cooper D., Coutinho P.M., RA Couturier J., Covert S., Cronk Q., Cunningham R., Davis J., RA Degroeve S., Dejardin A., dePamphilis C.W., Detter J., Dirks B., RA Dubchak I., Duplessis S., Ehlting J., Ellis B., Gendler K., RA Goodstein D., Gribskov M., Grimwood J., Groover A., Gunter L., RA Hamberger B., Heinze B., Helariutta Y., Henrissat B., Holligan D., RA Holt R., Huang W., Islam-Faridi N., Jones S., Jones-Rhoades M., RA Jorgensen R., Joshi C., Kangasjaervi J., Karlsson J., Kelleher C., RA Kirkpatrick R., Kirst M., Kohler A., Kalluri U., Larimer F., RA Leebens-Mack J., Leple J.-C., Locascio P., Lou Y., Lucas S., RA Martin F., Montanini B., Napoli C., Nelson D.R., Nelson C., RA Nieminen K., Nilsson O., Pereda V., Peter G., Philippe R., Pilate G., RA Poliakov A., Razumovskaya J., Richardson P., Rinaldi C., Ritland K., RA Rouze P., Ryaboy D., Schmutz J., Schrader J., Segerman B., Shin H., RA Siddiqui A., Sterky F., Terry A., Tsai C.-J., Uberbacher E., RA Unneberg P., Vahala J., Wall K., Wessler S., Yang G., Yin T., RA Douglas C., Marra M., Sandberg G., Van de Peer Y., Rokhsar D.S.; RT "The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)."; RL Science 313:1596-1604(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000337; EEE84267.2; -; Genomic_DNA. DR RefSeq; XP_002299462.2; XM_002299426.2. DR STRING; 3694.POPTR_0001s10790.1; -. DR EnsemblPlants; POPTR_0001s10790.1; POPTR_0001s10790.1; POPTR_0001s10790. DR GeneID; 7467076; -. DR KEGG; pop:POPTR_0001s10790g; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR InParanoid; B9GL41; -. DR OMA; KHAQSFK; -. DR Proteomes; UP000006729; Linkage group LGI. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006729}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006729}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 522 542 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 563 581 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 582 AA; 65323 MW; 26D1008E84931DEC CRC64; MKKPFSFLNN KNRSSNSRRR SLYELHLSLI LLLWGLLFSF CAGHENQGNL TPDNSSIPFS SDLKDTTLRG DTPSHDAITS NNDTNGILLE VNPSTSSKNA TVHTDLGNKK CPLPETNRLQ EIILSALGYG SSVYKMRNPE ELTTGKLKEL PSGRPQHLTY LNFDEFWNII RQEKGKVIPK QLANITHRLE PDGREYNYAS LTKGAKVLAY NKEAKGACNI LGKDHDKYLR NPCSVVEKFV VIELSEETLV DVVKIANFEH YSSNFKDFEL SGSLNYTTKS WIPLGNFVAA NVKHIQDFKL PEPKWVRYLK LNLRSHYGSG FYCTLSVVEV YGVDAIERML EDFFVPSEEP LPIELPKPSL TAAPHLKPEL NLTDKESSGK VRNGVDNAGM GAENLSDIQQ SHADGKKSPE SINIMAEPVT EVRQLPISRK PGDTLLKILM QKAKSLELSL TMLEGYIKET NQRKGDIMPK LEEELSGISL LVETTRTEIR DLMEWKENTV LIDFQNIQRV ANDQANLESK ELAVLAMSLF FMCFSTVMLI SAKVSKYLGA ASNSDKACRT SRGWMMILVS STMIIFITIL SS // ID B9GXJ9_POPTR Unreviewed; 618 AA. AC B9GXJ9; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 11-DEC-2013, sequence version 2. DT 11-NOV-2015, entry version 37. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EEE78636.2}; GN ORFNames=POPTR_0003s14140g {ECO:0000313|EMBL:EEE78636.2}; OS Populus trichocarpa (Western balsam poplar) (Populus balsamifera OS subsp. trichocarpa). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; OC Populus. OX NCBI_TaxID=3694 {ECO:0000313|EMBL:EEE78636.2, ECO:0000313|Proteomes:UP000006729}; RN [1] {ECO:0000313|EMBL:EEE78636.2, ECO:0000313|Proteomes:UP000006729} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nisqually {ECO:0000313|Proteomes:UP000006729}; RX PubMed=16973872; DOI=10.1126/science.1128691; RA Tuskan G.A., Difazio S., Jansson S., Bohlmann J., Grigoriev I., RA Hellsten U., Putnam N., Ralph S., Rombauts S., Salamov A., Schein J., RA Sterck L., Aerts A., Bhalerao R.R., Bhalerao R.P., Blaudez D., RA Boerjan W., Brun A., Brunner A., Busov V., Campbell M., Carlson J., RA Chalot M., Chapman J., Chen G.-L., Cooper D., Coutinho P.M., RA Couturier J., Covert S., Cronk Q., Cunningham R., Davis J., RA Degroeve S., Dejardin A., dePamphilis C.W., Detter J., Dirks B., RA Dubchak I., Duplessis S., Ehlting J., Ellis B., Gendler K., RA Goodstein D., Gribskov M., Grimwood J., Groover A., Gunter L., RA Hamberger B., Heinze B., Helariutta Y., Henrissat B., Holligan D., RA Holt R., Huang W., Islam-Faridi N., Jones S., Jones-Rhoades M., RA Jorgensen R., Joshi C., Kangasjaervi J., Karlsson J., Kelleher C., RA Kirkpatrick R., Kirst M., Kohler A., Kalluri U., Larimer F., RA Leebens-Mack J., Leple J.-C., Locascio P., Lou Y., Lucas S., RA Martin F., Montanini B., Napoli C., Nelson D.R., Nelson C., RA Nieminen K., Nilsson O., Pereda V., Peter G., Philippe R., Pilate G., RA Poliakov A., Razumovskaya J., Richardson P., Rinaldi C., Ritland K., RA Rouze P., Ryaboy D., Schmutz J., Schrader J., Segerman B., Shin H., RA Siddiqui A., Sterky F., Terry A., Tsai C.-J., Uberbacher E., RA Unneberg P., Vahala J., Wall K., Wessler S., Yang G., Yin T., RA Douglas C., Marra M., Sandberg G., Van de Peer Y., Rokhsar D.S.; RT "The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)."; RL Science 313:1596-1604(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000339; EEE78636.2; -; Genomic_DNA. DR RefSeq; XP_002303657.2; XM_002303621.2. DR STRING; 3694.POPTR_0003s14140.1; -. DR EnsemblPlants; POPTR_0003s14140.1; POPTR_0003s14140.1; POPTR_0003s14140. DR GeneID; 7465325; -. DR KEGG; pop:POPTR_0003s14140g; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR InParanoid; B9GXJ9; -. DR OMA; IVKEQAN; -. DR Proteomes; UP000006729; Linkage group LGIII. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006729}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006729}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 36 55 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 558 578 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 599 617 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 455 475 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 618 AA; 69569 MW; A1C0C08F1D55DAD7 CRC64; MKKTRQGFSV SNATVCQNNK IKNGHSRRRS LYELRFSLLF LLCYLLFLFC ARLSIGLGNQ GNLTLDDSSI PCSSDLKDTL CGDTYSRDAN RSNNCTNGIL LGVNLSTSNN NATVHTASRN QRCPLPETNR FEEVILSALG YGSSGLIMKN PEEVKAVKPK ELPSGRLQHL TYLNFDEYRN LTKQEKGKVM PKQLANITHR LEPDGKEYNY ASVTKGAKVL VYNKEAKGAC NILGKDHDKY LRNPCLTREK FVVIELSEET LVDVVKIANF EHYSSNFKDF ELSGSLTYPT RTWTQLGNFV AANVKHIQDF KLPEPKWVRY LKLNLLSHYG SEFYCTLSVV EVYGVDAIEQ MLEDFFVPSE EPLPNELPEP NSTAAPPSKP ELSLADKEDS GKVHNGSDNA GMETENIHGI QQSNPSVKKN PESINMIANP VTGVRQLLIS RKPGDTVLKI LMQKVKSLEL SLTMLEEYIK EMNQRKGDIL PKLDQELFRI SLLVEKSRTE IRDLMEWKEN TDKVLMEFES WKAGVSSSMD AMVRENTRLR LDVEKVANDQ ANLESKELAV LARSLVFVCF SIAMLVSAKV SKFLRAASCL GKACRTRRGW IMILVSSTMI IFVTLLSS // ID B9HVF3_POPTR Unreviewed; 457 AA. AC B9HVF3; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 11-DEC-2013, sequence version 2. DT 11-NOV-2015, entry version 37. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EEF01568.2}; GN ORFNames=POPTR_0010s25420g {ECO:0000313|EMBL:EEF01568.2}; OS Populus trichocarpa (Western balsam poplar) (Populus balsamifera OS subsp. trichocarpa). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; OC Populus. OX NCBI_TaxID=3694 {ECO:0000313|EMBL:EEF01568.2, ECO:0000313|Proteomes:UP000006729}; RN [1] {ECO:0000313|EMBL:EEF01568.2, ECO:0000313|Proteomes:UP000006729} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nisqually {ECO:0000313|Proteomes:UP000006729}; RX PubMed=16973872; DOI=10.1126/science.1128691; RA Tuskan G.A., Difazio S., Jansson S., Bohlmann J., Grigoriev I., RA Hellsten U., Putnam N., Ralph S., Rombauts S., Salamov A., Schein J., RA Sterck L., Aerts A., Bhalerao R.R., Bhalerao R.P., Blaudez D., RA Boerjan W., Brun A., Brunner A., Busov V., Campbell M., Carlson J., RA Chalot M., Chapman J., Chen G.-L., Cooper D., Coutinho P.M., RA Couturier J., Covert S., Cronk Q., Cunningham R., Davis J., RA Degroeve S., Dejardin A., dePamphilis C.W., Detter J., Dirks B., RA Dubchak I., Duplessis S., Ehlting J., Ellis B., Gendler K., RA Goodstein D., Gribskov M., Grimwood J., Groover A., Gunter L., RA Hamberger B., Heinze B., Helariutta Y., Henrissat B., Holligan D., RA Holt R., Huang W., Islam-Faridi N., Jones S., Jones-Rhoades M., RA Jorgensen R., Joshi C., Kangasjaervi J., Karlsson J., Kelleher C., RA Kirkpatrick R., Kirst M., Kohler A., Kalluri U., Larimer F., RA Leebens-Mack J., Leple J.-C., Locascio P., Lou Y., Lucas S., RA Martin F., Montanini B., Napoli C., Nelson D.R., Nelson C., RA Nieminen K., Nilsson O., Pereda V., Peter G., Philippe R., Pilate G., RA Poliakov A., Razumovskaya J., Richardson P., Rinaldi C., Ritland K., RA Rouze P., Ryaboy D., Schmutz J., Schrader J., Segerman B., Shin H., RA Siddiqui A., Sterky F., Terry A., Tsai C.-J., Uberbacher E., RA Unneberg P., Vahala J., Wall K., Wessler S., Yang G., Yin T., RA Douglas C., Marra M., Sandberg G., Van de Peer Y., Rokhsar D.S.; RT "The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)."; RL Science 313:1596-1604(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000346; EEF01568.2; -; Genomic_DNA. DR RefSeq; XP_002315397.2; XM_002315361.2. DR STRING; 3694.POPTR_0010s25420.1; -. DR EnsemblPlants; POPTR_0010s25420.1; POPTR_0010s25420.1; POPTR_0010s25420. DR GeneID; 7491581; -. DR KEGG; pop:POPTR_0010s25420g; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000237750; -. DR InParanoid; B9HVF3; -. DR KO; K19347; -. DR OMA; MEIARHS; -. DR Proteomes; UP000006729; Linkage group LGX. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006729}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006729}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 96 117 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 457 AA; 50176 MW; 12F00F7E4ED15807 CRC64; MSASTVSITA NPASARRRPV VVSDKKSPSN NIELVVPSEQ QINGGGGGGK AKVTAAASRD LSHHSILERT VKDLQVKKTS STISPRRARK VEKPRWMKVV SVFTKNFVLL LVLAGLVQMV RKLAVKSGGI ESASVGTQMG LSEFDGRIAE MESMVKTAVK MIQVQVEVVD KKIESEVGGL RREMSKKIDD KGVILEKELR KLVERSEGLE KKIGELKAGD WLSKEDFEKF YEQFKKAKGG EFDGSDVSLD DIMVYAREIV QKEIEKHAAD GLGRVDYALA TSGGMVVKHS DPYMAGRGVN WFLKGRGVHP NADEMLKPSF GEPGKCFALK GSSGFVQIKL RGAIVPEAVT LEHVAKSVAY DRSTAPKDCR VSGWLQNRDL HTADDEEKML LLTEFTYDLE KSNAQTFNVL DNTASGLVDT VRLDFTSNHG SPTLTCIYRL RVHGYEPDPS SMTAMQP // ID B9N1M8_POPTR Unreviewed; 587 AA. AC B9N1M8; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 24-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 39. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ERP49181.1}; GN ORFNames=POPTR_0019s09690g {ECO:0000313|EMBL:ERP49181.1}; OS Populus trichocarpa (Western balsam poplar) (Populus balsamifera OS subsp. trichocarpa). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; OC Populus. OX NCBI_TaxID=3694 {ECO:0000313|EMBL:ERP49181.1, ECO:0000313|Proteomes:UP000006729}; RN [1] {ECO:0000313|EMBL:ERP49181.1, ECO:0000313|Proteomes:UP000006729} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nisqually {ECO:0000313|Proteomes:UP000006729}; RX PubMed=16973872; DOI=10.1126/science.1128691; RA Tuskan G.A., Difazio S., Jansson S., Bohlmann J., Grigoriev I., RA Hellsten U., Putnam N., Ralph S., Rombauts S., Salamov A., Schein J., RA Sterck L., Aerts A., Bhalerao R.R., Bhalerao R.P., Blaudez D., RA Boerjan W., Brun A., Brunner A., Busov V., Campbell M., Carlson J., RA Chalot M., Chapman J., Chen G.-L., Cooper D., Coutinho P.M., RA Couturier J., Covert S., Cronk Q., Cunningham R., Davis J., RA Degroeve S., Dejardin A., dePamphilis C.W., Detter J., Dirks B., RA Dubchak I., Duplessis S., Ehlting J., Ellis B., Gendler K., RA Goodstein D., Gribskov M., Grimwood J., Groover A., Gunter L., RA Hamberger B., Heinze B., Helariutta Y., Henrissat B., Holligan D., RA Holt R., Huang W., Islam-Faridi N., Jones S., Jones-Rhoades M., RA Jorgensen R., Joshi C., Kangasjaervi J., Karlsson J., Kelleher C., RA Kirkpatrick R., Kirst M., Kohler A., Kalluri U., Larimer F., RA Leebens-Mack J., Leple J.-C., Locascio P., Lou Y., Lucas S., RA Martin F., Montanini B., Napoli C., Nelson D.R., Nelson C., RA Nieminen K., Nilsson O., Pereda V., Peter G., Philippe R., Pilate G., RA Poliakov A., Razumovskaya J., Richardson P., Rinaldi C., Ritland K., RA Rouze P., Ryaboy D., Schmutz J., Schrader J., Segerman B., Shin H., RA Siddiqui A., Sterky F., Terry A., Tsai C.-J., Uberbacher E., RA Unneberg P., Vahala J., Wall K., Wessler S., Yang G., Yin T., RA Douglas C., Marra M., Sandberg G., Van de Peer Y., Rokhsar D.S.; RT "The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)."; RL Science 313:1596-1604(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000355; ERP49181.1; -; Genomic_DNA. DR RefSeq; XP_006371384.1; XM_006371322.1. DR STRING; 3694.POPTR_0019s09690.1; -. DR EnsemblPlants; POPTR_0019s09690.1; POPTR_0019s09690.1; POPTR_0019s09690. DR GeneID; 18108413; -. DR KEGG; pop:POPTR_0019s09690g; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR InParanoid; B9N1M8; -. DR OMA; KTEASMA; -. DR Proteomes; UP000006729; Linkage group LGXIX. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006729}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006729}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 24 46 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 525 552 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 564 585 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 587 AA; 65526 MW; 02B560EB99D6E092 CRC64; MQRSRRAFLE RRALEKDIRG KNQFYKVSLS LVFVLWGLVF LLSIWISHGD GYTDGSGDLP VSISTWNEAT AEPSKCSVSV HKNQSKETCP VCSDESSCTD SAETRGSNDT LLISEGNTND AFAVEQSEVD SGSAVKSENN AQKTDRPSRV VPLGLDEFKS RAFSSKSKPG TGQVGGVIHR MEPGGKEYNY ASASKGAKVL AFNKEAKGAS NILVGDKDKY LRNPCSAEEK FVVIELSEET LVDTIEIANF EHYSSNLKHF ELLGSLVYPT GDWVKLGNFT AANVKHAQRF TLQVLIGVRY LRLNLLSHYG SEFYCTLSVI EIYGVDAVEQ MLEDMISDQD NLFGYEVGAG EQKPPSSHLE STQDDDTYTD LYSDMEDSSV ENSNAKNEVV KNKLPDPVEE VRHQQVGRMP GDSVLKILMQ KVRSLDLSLS ILERYLEEVN SKYGNIFKEI DKDLGEKDIL LEKMRSDVKS LHSSQDLIAK DVNDLISWKS LASTQLDGLL RDNLILRSKI ERVLEIQKSM ENKGIAVFLI CLIFGILAFV RLFVDLLLSV YMAFNVQGTE SRKFCWTGSS WHFLLLSCTV IILVISL // ID B9RCV2_RICCO Unreviewed; 584 AA. AC B9RCV2; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 24-MAR-2009, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEF51373.1}; GN ORFNames=RCOM_1692390 {ECO:0000313|EMBL:EEF51373.1}; OS Ricinus communis (Castor bean). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; OC Acalyphoideae; Acalypheae; Ricinus. OX NCBI_TaxID=3988 {ECO:0000313|Proteomes:UP000008311}; RN [1] {ECO:0000313|Proteomes:UP000008311} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Hale {ECO:0000313|Proteomes:UP000008311}; RX PubMed=20729833; DOI=10.1038/nbt.1674; RA Chan A.P., Crabtree J., Zhao Q., Lorenzi H., Orvis J., Puiu D., RA Melake-Berhan A., Jones K.M., Redman J., Chen G., Cahoon E.B., RA Gedil M., Stanke M., Haas B.J., Wortman J.R., Fraser-Liggett C.M., RA Ravel J., Rabinowicz P.D.; RT "Draft genome sequence of the oilseed species Ricinus communis."; RL Nat. Biotechnol. 28:951-956(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EQ973774; EEF51373.1; -; Genomic_DNA. DR RefSeq; XP_002509986.1; XM_002509940.1. DR GeneID; 8288036; -. DR KEGG; rcu:RCOM_1692390; -. DR InParanoid; B9RCV2; -. DR Proteomes; UP000008311; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008311}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008311}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 38 57 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 461 481 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 584 AA; 66060 MW; B26F1C613395573B CRC64; MKKPRNGSLI IDAIRCQKNN DNSSSSSNKN NKNNRRSFYE LSLSLMLLLW CLVLLFYTRL GLSHENEGNL ALYNRSVPCR STLKDKLSDD AQSYISNISH NNTNGVLLEL NLTSSCNKST VQINFANQKY SISETDRIEE VIWSFLGYRS LVCKTQNPEE WKIGRPEALP GERSHHSTYL NLDEFRNITR QEKGQQIPNQ LVNITHRLEP DGKEYNYASA MKGAKVVAHN KEAKGAGNIL GKDKDKYLRN PCSVGGKFVV IELSEETLVD VVKIANFEHY SSNFKGFNLS GSLNYPTETW ELLGNFNAAN VKHSQSFKLP EPKWVRYLKL DLLSHYGSEF YCTLSVVEVY GVDAVERMLE DLLVSPEETN PNKSPKLITT AGPPSKPELN PTDEKRNGKV QNGTDINAAM VTRNVSNAQQ TSTTKSPVTT SKIPDPATEV RQQPVSRIPG DTVLKILLQK VRSLEVNLSV LEEYIKEMNR RQGDILPDLE KELSRISLLL ENRKAELNAV MEWKENMDKG LMNFESWKDD VSSRMDALVR ENIMLRLDVE KLVNDQANLE SKELAVIAEL LNQIRYAGQV EVGL // ID B9SBM5_RICCO Unreviewed; 471 AA. AC B9SBM5; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 24-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEF38973.1}; GN ORFNames=RCOM_0342710 {ECO:0000313|EMBL:EEF38973.1}; OS Ricinus communis (Castor bean). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; OC Acalyphoideae; Acalypheae; Ricinus. OX NCBI_TaxID=3988 {ECO:0000313|Proteomes:UP000008311}; RN [1] {ECO:0000313|Proteomes:UP000008311} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Hale {ECO:0000313|Proteomes:UP000008311}; RX PubMed=20729833; DOI=10.1038/nbt.1674; RA Chan A.P., Crabtree J., Zhao Q., Lorenzi H., Orvis J., Puiu D., RA Melake-Berhan A., Jones K.M., Redman J., Chen G., Cahoon E.B., RA Gedil M., Stanke M., Haas B.J., Wortman J.R., Fraser-Liggett C.M., RA Ravel J., Rabinowicz P.D.; RT "Draft genome sequence of the oilseed species Ricinus communis."; RL Nat. Biotechnol. 28:951-956(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EQ973915; EEF38973.1; -; Genomic_DNA. DR RefSeq; XP_002523394.1; XM_002523348.1. DR GeneID; 8282152; -. DR KEGG; rcu:RCOM_0342710; -. DR InParanoid; B9SBM5; -. DR KO; K19347; -. DR Proteomes; UP000008311; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008311}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008311}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 104 128 Helical. FT COILED 469 471 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 471 AA; 51756 MW; 85358D99DA5F0674 CRC64; MSASTVSITA NPAARRRPVV AGEKKSNNNN SIELLGSEAQ FNGGGPANVI MGNEKLLSSQ SKDLSHHSIL ERKDVTTAQL KKSTISPHRR TRTKVVPEKS KPRWLTVASI FTKNFALLLV LIGLVQMVRR LALNSSSSGD VNYSSSQMAA FSSESEARIA EVESLLKTSL KMIQLQVEVV NDKVDNEVGG LRNEFDNKIH DKGLFLESEF KRLVARFDGL DRSLTELKSV DWLSREDFNK FVDDYLNKGK GGQTDNTGVS LDDIRAYAKE IVIKEIEKHA ADGLGMVDYA LASGGAIVVK HSEPFLPGKG TNWLLKSSRI GVHPDAVKML KPSFGEPGQC FPLKGSSGFV QIRLRTAIIP QAVTLEHVAK SVAYDRSSAP KDCRVSGWLQ GHDIDLAVDT EKMFLLTEFT YDLEKSNAQT FAVLNSVASG LVDTVRLDFA SNHGSSHHTC IYRLRVHGYE PDSLSMMTME S // ID B9SBU4_RICCO Unreviewed; 484 AA. AC B9SBU4; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 24-MAR-2009, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEF38922.1}; GN ORFNames=RCOM_1044140 {ECO:0000313|EMBL:EEF38922.1}; OS Ricinus communis (Castor bean). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; OC Acalyphoideae; Acalypheae; Ricinus. OX NCBI_TaxID=3988 {ECO:0000313|Proteomes:UP000008311}; RN [1] {ECO:0000313|Proteomes:UP000008311} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Hale {ECO:0000313|Proteomes:UP000008311}; RX PubMed=20729833; DOI=10.1038/nbt.1674; RA Chan A.P., Crabtree J., Zhao Q., Lorenzi H., Orvis J., Puiu D., RA Melake-Berhan A., Jones K.M., Redman J., Chen G., Cahoon E.B., RA Gedil M., Stanke M., Haas B.J., Wortman J.R., Fraser-Liggett C.M., RA Ravel J., Rabinowicz P.D.; RT "Draft genome sequence of the oilseed species Ricinus communis."; RL Nat. Biotechnol. 28:951-956(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EQ973917; EEF38922.1; -; Genomic_DNA. DR RefSeq; XP_002523463.1; XM_002523417.1. DR GeneID; 8282471; -. DR KEGG; rcu:RCOM_1044140; -. DR InParanoid; B9SBU4; -. DR Proteomes; UP000008311; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008311}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008311}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 460 480 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 484 AA; 53927 MW; DC0699491BE67894 CRC64; MSTSNEENVG FCQPSDSMEK NLFNDIGSVT SDESLCTEST ETGSSNDGLL GSEGNVNHAF ASEKPEAISG SDSGPKTDRD RLSHSVPLGL DEFKSRAFSS KSKLGTDQAG GVIHRVEPGG KEYNYASASK GAKVLDFNKE AKGASNILGK DKDKYLRNPC SAEEKFVIIE LSEETLVATI EIANFEHYSS NLKDFELLGS LVYPTDTWIR LGNFTAANVK LAQRFPLQEP QWVRYLKLNL LSHYGSEFYC TLSIVEVLGV DAVERMLEDL ISVQNNVFVP KEETGDQKQL SSQTESTQVD DCDQELCMEM GSSSSVENSN VKHEVPKNKV PDPVDEIRQQ QGGRMPGDSV LKILMQKVRS LDLSLSVLER YLEELNYRYG NIFKGFDKDL VEKDTLLEKV RSDIKNLYDS KELMAKDVED LLSWKSLVST QMDNLLKDNF ALRSMVEGVQ KNQISMENKG IAVFFICLIF GTLAFVRLLV DILL // ID B9W8Q6_CANDC Unreviewed; 556 AA. AC B9W8Q6; DT 24-MAR-2009, integrated into UniProtKB/TrEMBL. DT 24-MAR-2009, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAX45129.1}; GN ORFNames=CD36_08300 {ECO:0000313|EMBL:CAX45129.1}; OS Candida dubliniensis (strain CD36 / ATCC MYA-646 / CBS 7987 / NCPF OS 3949 / NRRL Y-17841) (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Candida/Lodderomyces clade; Candida. OX NCBI_TaxID=573826 {ECO:0000313|EMBL:CAX45129.1, ECO:0000313|Proteomes:UP000002605}; RN [1] {ECO:0000313|EMBL:CAX45129.1, ECO:0000313|Proteomes:UP000002605} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CD36 / ATCC MYA-646 / CBS 7987 / NCPF 3949 / NRRL Y-17841 RC {ECO:0000313|Proteomes:UP000002605}; RX PubMed=19745113; DOI=10.1101/gr.097501.109; RA Jackson A.P., Gamble J.A., Yeomans T., Moran G.P., Saunders D., RA Harris D., Aslett M., Barrell J.F., Butler G., Citiulo F., RA Coleman D.C., de Groot P.W.J., Goodwin T.J., Quail M.A., McQuillan J., RA Munro C.A., Pain A., Poulter R.T., Rajandream M.A., Renauld H., RA Spiering M.J., Tivey A., Gow N.A.R., Barrell B., Sullivan D.J., RA Berriman M.; RT "Comparative genomics of the fungal pathogens Candida dubliniensis and RT Candida albicans."; RL Genome Res. 19:2231-2244(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FM992688; CAX45129.1; -; Genomic_DNA. DR RefSeq; XP_002417476.1; XM_002417431.1. DR STRING; 573826.XP_002417476.1; -. DR EnsemblFungi; CAX45129; CAX45129; CD36_08300. DR GeneID; 8045021; -. DR KEGG; cdu:CD36_08300; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000093382; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002605; Chromosome 1. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002605}; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 556 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002893973. FT TRANSMEM 500 517 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 556 AA; 63449 MW; EDF7B2E1BCE7CE2F CRC64; MIYIWHALVY ISCLLSCVVS GKNNSDTSAS LVNITDLYTP RSNRTLDYSP IIQYIPVFFQ KNNSFIDTPS NDDFLVLQSP ATTSTDNNKN SQKNDSVLDE CHFMSFEEWK KQKIESNTTA SNNYSMNGSS ESKSITPSNH TSVLSTNVTL MEADGKVYKD KFNFASVDCA ATIMKTNAQA KGASAILKEN KDSYLLNECS VKHKYVIIEL CQDILVDSVV IGNFEFFSSI FKDIRISVSD RFPSQNWKEL GQFIASNIRD VQTFKIENPL IWARYLKLEI LSHYGNEFYC PISIVRVHGK TMMDEFKEEE EENQRMDTVN EGSPAPQSIE EDVLLINSTT LNECRVRLPH LQLNEFLKSF NNSNQEFCVP SDAESQITTT KAATAITTQE SIYKNIMKRL SLLESNATLS LLYIEEQSKL LSTAFSNLEK RQTTNFNTLI SSVNSTLMYQ LAVFKESYYE LHEQYGNLFK IQENSHKQML SETNKKVGLL SSELTFQKRV SIFNSIIIIC LLVYVILTRD VAIEYPEDEL NEKSPSPETK KLSSPFIPRR YKMSKK // ID C0NR10_AJECG Unreviewed; 849 AA. AC C0NR10; DT 05-MAY-2009, integrated into UniProtKB/TrEMBL. DT 05-MAY-2009, sequence version 1. DT 14-OCT-2015, entry version 20. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EEH06124.1}; GN ORFNames=HCBG_05440 {ECO:0000313|EMBL:EEH06124.1}; OS Ajellomyces capsulatus (strain G186AR / H82 / ATCC MYA-2454 / RMSCC OS 2432) (Darling's disease fungus) (Histoplasma capsulatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Histoplasma. OX NCBI_TaxID=447093 {ECO:0000313|Proteomes:UP000001631}; RN [1] {ECO:0000313|Proteomes:UP000001631} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=G186AR / H82 / ATCC MYA-2454 / RMSCC 2432 RC {ECO:0000313|Proteomes:UP000001631}; RA Champion M., Cuomo C.A., Ma L.-J., Henn M.R., Sil A., Goldman B., RA Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L., Berlin A., RA Borenstein D., Chen Z., Engels R., Freedman E., Gellesch M., RA Goldberg J., Griggs A., Gujja S., Heiman D., Hepburn T., Howarth C., RA Jen D., Larson L., Lewis B., Mehta T., Park D., Pearson M., RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., RA Walk T., White J., Yandava C., Klein B., McEwen J.G., Puccia R., RA Goldman G.H., Felipe M.S., Nino-Vega G., San-Blas G., Taylor J., RA Mendoza L., Galagan J.E., Nusbaum C., Birren B.W.; RT "The genome sequence of Ajellomyces capsulatus strain G186AR."; RL Submitted (FEB-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG663369; EEH06124.1; -; Genomic_DNA. DR EnsemblFungi; EEH06124; EEH06124; HCBG_05440. DR InParanoid; C0NR10; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001631; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001631}; KW Reference proteome {ECO:0000313|Proteomes:UP000001631}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 36 {ECO:0000256|SAM:SignalP}. FT CHAIN 37 849 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002899757. SQ SEQUENCE 849 AA; 94017 MW; 91BB070CBECCE02B CRC64; MAWHFPLFRR HVHMECRATD SLLYFWTIAL LAAVRAGGDM DSKHHNISPL SLDATCPPRA FSGIQHPVCL EPRWVGIGKI ENYTSNSSGE TDFYASITSA ASPSLSPTIT VTGSGSGSSS VDQELDTESP LDNANFLSFE EWKKQNLAKV GQSVENVKGD RQSAGSSGDG KRQRPTGIDN SLDSLGEDGE IALEFGGFGP EDSGPASWER KVGKDQPPDV DGAGSVTKGA EGETQIEATT RGGASRRKDA GTTCKERFNY ASFDCAATVL KTNPQCTGAS SVLIENKDSY MLNECRAKEK FLILELCDDI LIDTIVLANY EFFSSIFRTF RVSVSDRYPP KQPDMWKELG TYEAVNSREV QAFAVENPLI WARYVKIEFL THYGNEFYCP VSLIRVHGTT MLEEYKNDGE ANRLEDHNSH QIQGSRTPES GPDNSTTDPS KIVEDSEGPA EAGRFDMQPT RVQNLEDICL LKDAEVGGIL LRSVVRAEDR MCTVHETPRA YNRTDDAVQP DLVQSHGPAQ AVDNATPTTP SAEPSSNAAT PPTPVSTPTL TDTRAQKPTE NETSSNTHKT EYNGSSESPK PSTTVQYHQP NPTTQESFFK SVNKRLHMLE TNSSLSLQYI EEQSRILRDA FNKVEKRQLA KTTTFLENLN TSVLQELREF RHQYDQVWHS VAVEFEQQRL QYRQEVFAMS SQLGVLADEL VFQKRISIIQ SVFVLICFGL VLFSSSPIGS YLELPRVHNM VSRSQSFRSS THSFETPSAS PLSRPNSPYQ DNKRVSSSHT RTNSMESRED DLAVNPTICY SPPTPTSDSG GHELRRRLSE QTNSTSSSVV VAPQARFLRS ESSPPDLST // ID C0NT15_AJECG Unreviewed; 864 AA. AC C0NT15; DT 05-MAY-2009, integrated into UniProtKB/TrEMBL. DT 05-MAY-2009, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEH05176.1}; GN ORFNames=HCBG_06295 {ECO:0000313|EMBL:EEH05176.1}; OS Ajellomyces capsulatus (strain G186AR / H82 / ATCC MYA-2454 / RMSCC OS 2432) (Darling's disease fungus) (Histoplasma capsulatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Histoplasma. OX NCBI_TaxID=447093 {ECO:0000313|Proteomes:UP000001631}; RN [1] {ECO:0000313|Proteomes:UP000001631} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=G186AR / H82 / ATCC MYA-2454 / RMSCC 2432 RC {ECO:0000313|Proteomes:UP000001631}; RA Champion M., Cuomo C.A., Ma L.-J., Henn M.R., Sil A., Goldman B., RA Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L., Berlin A., RA Borenstein D., Chen Z., Engels R., Freedman E., Gellesch M., RA Goldberg J., Griggs A., Gujja S., Heiman D., Hepburn T., Howarth C., RA Jen D., Larson L., Lewis B., Mehta T., Park D., Pearson M., RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., RA Walk T., White J., Yandava C., Klein B., McEwen J.G., Puccia R., RA Goldman G.H., Felipe M.S., Nino-Vega G., San-Blas G., Taylor J., RA Mendoza L., Galagan J.E., Nusbaum C., Birren B.W.; RT "The genome sequence of Ajellomyces capsulatus strain G186AR."; RL Submitted (FEB-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG663371; EEH05176.1; -; Genomic_DNA. DR EnsemblFungi; EEH05176; EEH05176; HCBG_06295. DR InParanoid; C0NT15; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001631; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001631}; KW Reference proteome {ECO:0000313|Proteomes:UP000001631}. FT COILED 161 181 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 864 AA; 94062 MW; CC06962F8DE3FCA5 CRC64; MTGRKTASVR SGSRAQNTRG TRAAPTENPT VATEGNQSNP DLGNPSLPDV RTQQSFAYGS TKTPALPRQL EVDPSMGLSE MIDTLDDGLR QAQDRELARV DVEDPTHPVP ERRQTRSMSA SVRSSISPAP GPVSRRASSR NATTRSRVGP RRAASRQTTP EEQLLETLRE VSEETEGVKR EEDPSISVLH DTPSFNGSAS VSWTTERAIH GILPRETNAG TRPNYYLHDP YGSRPSSSQE PSGLRLPPTR RPIFEEAFRA NPPLPGPIDV PNVSTSAAAR RTLPPVPAFN QLRNKSASKS SASSASSASI HTPGSSTHSS PVLVAAAPAR VHVTSKQRLS GIAKTPSALL VTIGLILMTF LTYFCRDHAC MFPQSLQTTM SHYLCSPAST FAKDNSTSMY ADAFHKLSSR LDQRLSDMAK EVTILKNEWN RRLPHLKEAL SGSPAAAMDP LMPPKVNYAS IGMGAVVDPY LTSPTMATSA GLVSRIGQYL AKVPRGSPPV AALQPWDGVG ECWCAATRSN ASQLTILLGR AIVPEEVVIE HIPKGATLDP GSAPREMELW VQYMARPPTA AAAYPPGSGS SNPSPPPSSA SSPHAPSPFP PSSAPSQPLP PPPATPHLRT PAFSHLRPSY YPHHLLPSWL RDAILTTLRQ VYPNELTTAY SDDALLGPSF FRVGRWQYNI HGGHHVQRFD LDAVIDMPAA RVEKVVFRVK HTSAGSYGSW WKWCFLSPLT EGLLVLSRLG VWDLVFWVLD VAFRLGGGFG GFFRVWSFGL DIQQGSNCDH PEVILADSMH ERNLPRGPVV IGKWMMTRLD YPNANQYLVR SHMHSATEAQ ICWAHAHAHA AMPMNPQVQT KTLLTNAEDA KLKD // ID C0SG73_PARBP Unreviewed; 874 AA. AC C0SG73; DT 05-MAY-2009, integrated into UniProtKB/TrEMBL. DT 04-FEB-2015, sequence version 2. DT 16-SEP-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EEH16625.2}; GN ORFNames=PABG_06712 {ECO:0000313|EMBL:EEH16625.2}; OS Paracoccidioides brasiliensis (strain Pb03). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; mitosporic Onygenales; Paracoccidioides. OX NCBI_TaxID=482561 {ECO:0000313|EMBL:EEH16625.2, ECO:0000313|Proteomes:UP000002740}; RN [1] {ECO:0000313|EMBL:EEH16625.2, ECO:0000313|Proteomes:UP000002740} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Pb03 {ECO:0000313|EMBL:EEH16625.2, RC ECO:0000313|Proteomes:UP000002740}; RX PubMed=22046142; DOI=10.1371/journal.pgen.1002345; RA Desjardins C.A., Champion M.D., Holder J.W., Muszewska A., RA Goldberg J., Bailao A.M., Brigido M.M., Ferreira M.E., Garcia A.M., RA Grynberg M., Gujja S., Heiman D.I., Henn M.R., Kodira C.D., RA Leon-Narvaez H., Longo L.V.G., Ma L.-J., Malavazi I., Matsuo A.L., RA Morais F.V., Pereira M., Rodriguez-Brito S., Sakthikumar S., RA Salem-Izacc S.M., Sykes S.M., Teixeira M.M., Vallejo M.C., RA Walter M.E., Yandava C., Young S., Zeng Q., Zucker J., Felipe M.S., RA Goldman G.H., Haas B.J., McEwen J.G., Nino-Vega G., Puccia R., RA San-Blas G., Soares C.M., Birren B.W., Cuomo C.A.; RT "Comparative genomic analysis of human fungal pathogens causing RT paracoccidioidomycosis."; RL PLoS Genet. 7:E1002345-E1002345(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; KN305543; EEH16625.2; -; Genomic_DNA. DR EnsemblFungi; EEH16625; EEH16625; PABG_06712. DR InParanoid; C0SG73; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002740; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002740}. SQ SEQUENCE 874 AA; 96428 MW; 408237C02A5A8D1F CRC64; MACCYSLFPR PSNTNYRATG CLLYFLILAA LAAVRAGGNF DFRHRNVSSR ALDATCPTRN FTDIKLEYIQ YPVCVEPRWV SIGKAEDYYA PQPSGAADTY KSIASGTFPS PSATVTATAP VSAELDHELD TESPLDNVNF LSFEEWKKQN LAKVGQSAEL VDGHRHAAGS ENGRQRPTGI DNSLDSLGED GEIELDFGGF APENSGPASW ERKVGNEHPS RLKDSAGAAT GGAKGATQTN AAPRGTVSRR KDAGTTCKER FNYASFDCAA TILKTNPQCT GASSVLIENK DSYMLNECRA KDKFLILELC DDILIDTIVL ANYEFFSSIF RTFKVSVSDR YPPKQPDMWK DLGTYEAVNT REVQAFAVEN PLIWARYVKI EFLSHYGNEF YCPVSLIRVH GTTMLEEYKN EGEAGRLEEN TAQIQADAAS ERARDNSIVG NQSNIADAEG TKSEMVDELD VRPTRVQKLQ DICTLRKTSI ERFSLQAKMC AIKEGPRAMN RSVNSVQPDS VKPAGPIKMT GNGLPANRTA EVSSNNLNST VTATPTETDT RAQNPTQESQ NEASSSAPNK AEHDPSTESL KPSTTVQPPP SNPTTQESFF KSVNKRLHML ETNSSLSLQY IEEQSRILRD AFNKVEKRQL AKTSTFLENL NTTVLHELQE FRHQYDQVWH SVATEFEQQR RQYHQEVFAL STQLGILADE LVFQKRISII QSVFVVICFG LVLFSRSNAL GSVASYLELP RVHSIVSRSP SFRFSSSSFD VPSRNPDSRV TPPYGVKGAR GHRRTDFTES REEMEELTAN PNIAYSPPTP TSDSEGPGSA HRQRSERSNS TSSSNFTVSP PAPFLRSESS PPDLGGHDEG PDVDSFSGAQ RVCS // ID C1E249_MICSR Unreviewed; 526 AA. AC C1E249; DT 26-MAY-2009, integrated into UniProtKB/TrEMBL. DT 26-MAY-2009, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ACO61857.1}; GN ORFNames=MICPUN_57110 {ECO:0000313|EMBL:ACO61857.1}; OS Micromonas sp. (strain RCC299 / NOUM17) (Picoplanktonic green alga). OC Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; OC Mamiellales; Mamiellaceae; Micromonas. OX NCBI_TaxID=296587 {ECO:0000313|EMBL:ACO61857.1, ECO:0000313|Proteomes:UP000002009}; RN [1] {ECO:0000313|EMBL:ACO61857.1, ECO:0000313|Proteomes:UP000002009} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=RCC299 / NOUM17 {ECO:0000313|Proteomes:UP000002009}; RX PubMed=19359590; DOI=10.1126/science.1167222; RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L., RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E., RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M., RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H., RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., RA Gready J.E., John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., RA Moreau H., Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., RA Piegu B., Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., RA Zelensky A., Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., RA Van de Peer Y., Grigoriev I.V.; RT "Green evolution and dynamic adaptations revealed by genomes of the RT marine picoeukaryotes Micromonas."; RL Science 324:268-272(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP001324; ACO61857.1; -; Genomic_DNA. DR RefSeq; XP_002500599.1; XM_002500553.1. DR GeneID; 8242209; -. DR KEGG; mis:MICPUN_57110; -. DR InParanoid; C1E249; -. DR KO; K19347; -. DR Proteomes; UP000002009; Chromosome 3. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002009}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002009}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 72 92 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 191 211 {ECO:0000256|SAM:Coils}. FT COILED 287 314 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 526 AA; 55498 MW; 0A3FEBC16DE1731D CRC64; MSPSVNARAP GPARTNPANP LGGGGMPTVQ QPSRVIASQA PFTGGYTGNR LGRDERSPSA RTPKSPRRPK KAGISLGAVA LWALVAVVVA MSTSVRTVTI VRGDGVRASE KAAKLLAERV SVVGDSVDAM AAALDARGPS TAKRDRETAA LRKDLDTLVK QVKAQDATIA KVKKQRGGGG AERAAAAPAE VVDHGKDIAA LRREVEKLAK QKAPTPAPAP AEVGKAELLE LRRELLKEIA GVASQAAAAQ VNNGAPEVPE RVTAELDELR KRLDALASIP YPVPDESKAD KTDIEDLRRE IGKLANEAKG TRRHSKASKE VADEVRAQME LFRADRTGLV DYAMFSGGGK VVGHSALASA VAKGDGPLTN ALKGLRGGVH PRADEWVISA SSEAAGECLA LEGTRGWVDL RLREAIVVKA VTVEHVHRDV AYDITSAPKS VKILGWNNTK SPGAGARVLG SIRYQLLDGQ GGSAMQTFEL GGAPGTAVDH VRFEVESNYG NKDWTCLYRL RVHGKPSVPP SEPIWD // ID C1FFP9_MICSR Unreviewed; 1182 AA. AC C1FFP9; DT 26-MAY-2009, integrated into UniProtKB/TrEMBL. DT 26-MAY-2009, sequence version 1. DT 14-OCT-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ACO69120.1}; GN ORFNames=MICPUN_60591 {ECO:0000313|EMBL:ACO69120.1}; OS Micromonas sp. (strain RCC299 / NOUM17) (Picoplanktonic green alga). OC Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; OC Mamiellales; Mamiellaceae; Micromonas. OX NCBI_TaxID=296587 {ECO:0000313|EMBL:ACO69120.1, ECO:0000313|Proteomes:UP000002009}; RN [1] {ECO:0000313|EMBL:ACO69120.1, ECO:0000313|Proteomes:UP000002009} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=RCC299 / NOUM17 {ECO:0000313|Proteomes:UP000002009}; RX PubMed=19359590; DOI=10.1126/science.1167222; RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L., RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E., RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M., RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H., RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., RA Gready J.E., John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., RA Moreau H., Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., RA Piegu B., Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., RA Zelensky A., Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., RA Van de Peer Y., Grigoriev I.V.; RT "Green evolution and dynamic adaptations revealed by genomes of the RT marine picoeukaryotes Micromonas."; RL Science 324:268-272(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP001575; ACO69120.1; -; Genomic_DNA. DR RefSeq; XP_002507862.1; XM_002507816.1. DR GeneID; 8245786; -. DR KEGG; mis:MICPUN_60591; -. DR InParanoid; C1FFP9; -. DR Proteomes; UP000002009; Chromosome 8. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002009}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002009}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 1182 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002909286. FT TRANSMEM 1085 1114 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 1126 1154 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 1160 1178 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 214 236 {ECO:0000256|SAM:Coils}. FT COILED 664 694 {ECO:0000256|SAM:Coils}. FT COILED 807 835 {ECO:0000256|SAM:Coils}. FT COILED 1011 1038 {ECO:0000256|SAM:Coils}. FT COILED 1062 1082 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1182 AA; 120908 MW; 2B8895106DDB4C18 CRC64; MRAARGSPRL AAGLTWLALW AAWTVASFPG PRTHPLGPVV GSSWWNGPSS LLPVAAAADA TPGDAGAGGL GTTIPSHPGA PGGAGVEDQT KAGSAAAPTS PSSAPRSGAQ TEDPVATPTA GEPTGAPPAA SSDDHQATVT MVSTSDPAAS KPVREADPIA ELAASCAIEN SFSHDVLRQS NVTLAALGGD AGAGGGRERM VCTTDAEKGS AAFAAAEDAR LRREKEARKA KEREAKEKAK IPSLKDFKSD IAAKVSAAKA AKEEPKEHKE PQKDKDKEHK EGHATSIEPN VSDQNATGTD RADAAGEPPA PSPGTDADVN AEGNANRTGS DVAGSISGDA NSTATTEEER ERRAGAPGTG APKGAGSAKG AASAAEENRG ERRGGGDEKA GGEPASEPER DDAAPDADAV AGAAAGARSS AEPAHPGAAE EDEGGGGGGK IEARSEPPAA PATSATSAPS APSAPSATPE DQPAAAPAAA PAAGATDGDP AVASAPAAAP AAAAAEEEDP WIMPASAAEG EAFIRQYNYA SYANGARVVS SNPESKSAGA ALKEDMDSYY LTPCAAKNGR WLTVELSEEA AVTAVTLANY EFHSSGVREF EVWASAAGAH DREEDWRRLG RCRARDGRDV QTFVLPRGHW SKYVRVSMTS HYGRHHFCTL SLLRVHGKDA KQTLEEEMEA INREVAEVEE ILNAGDDYEP AGEAVQSGGG SGDDDKPAAE DEVQVADTDS PSEPGAAPEP PIEPSIGTPD GTPTAPPTEL PADAGPGATS EAKPPGESDS AGADASAAAD DKKEGFFKSW FGKEKEAASK EEAAKEAASK EEAAKEEAAK EARDSPPRPD HALTDPAKEA GSKSRSDEEK EDDGSGSGGE DTVGVEPPAS AETAPGSSER SLSSEEAKTA TADAREGAEP AAETPTTPPA AEAAAQPIQP TAEEAAAAAA AAQQQQQQPP TVGKDDKPLP AVPKPAVSEQ RPSSPENVFS LMAKKIKSLE LNQSMFDRYI ESLSASYGGR FEDIAQEVND LEELVANQTS ALVELDKALD KATRGAKSDL SVAIERVGRD WAKNARETAA DVRDKLASVE RRWELYLGLT MFAVATVLGV AAFASGAQAA LVLWREGAEP SERARLAAKV TAVACVVAGA LSLGCYFAAA WLVFGRMATS VWAAGVQAAG AAFGYLRMNR RT // ID C1G276_PARBD Unreviewed; 874 AA. AC C1G276; DT 26-MAY-2009, integrated into UniProtKB/TrEMBL. DT 26-MAY-2009, sequence version 1. DT 16-SEP-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EEH46092.1}; GN ORFNames=PADG_02242 {ECO:0000313|EMBL:EEH46092.1}; OS Paracoccidioides brasiliensis (strain Pb18). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; mitosporic Onygenales; Paracoccidioides. OX NCBI_TaxID=502780 {ECO:0000313|EMBL:EEH46092.1, ECO:0000313|Proteomes:UP000001628}; RN [1] {ECO:0000313|EMBL:EEH46092.1, ECO:0000313|Proteomes:UP000001628} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Pb18 {ECO:0000313|EMBL:EEH46092.1, RC ECO:0000313|Proteomes:UP000001628}; RX PubMed=22046142; DOI=10.1371/journal.pgen.1002345; RA Desjardins C.A., Champion M.D., Holder J.W., Muszewska A., RA Goldberg J., Bailao A.M., Brigido M.M., Ferreira M.E., Garcia A.M., RA Grynberg M., Gujja S., Heiman D.I., Henn M.R., Kodira C.D., RA Leon-Narvaez H., Longo L.V.G., Ma L.-J., Malavazi I., Matsuo A.L., RA Morais F.V., Pereira M., Rodriguez-Brito S., Sakthikumar S., RA Salem-Izacc S.M., Sykes S.M., Teixeira M.M., Vallejo M.C., RA Walter M.E., Yandava C., Young S., Zeng Q., Zucker J., Felipe M.S., RA Goldman G.H., Haas B.J., McEwen J.G., Nino-Vega G., Puccia R., RA San-Blas G., Soares C.M., Birren B.W., Cuomo C.A.; RT "Comparative genomic analysis of human fungal pathogens causing RT paracoccidioidomycosis."; RL PLoS Genet. 7:E1002345-E1002345(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; KN275958; EEH46092.1; -; Genomic_DNA. DR RefSeq; XP_010757746.1; XM_010759444.1. DR EnsemblFungi; EEH46092; EEH46092; PADG_02242. DR GeneID; 22581755; -. DR KEGG; pbn:PADG_02242; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001628; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001628}; KW Reference proteome {ECO:0000313|Proteomes:UP000001628}. SQ SEQUENCE 874 AA; 96284 MW; D0E3D6B9854A7B0A CRC64; MACCYSLFPR PSNTNYRATG CLLYFLILAA LAAVRAGGNF DFRHSNVSSR ALDATCPTRN FTDIKLEYIQ YPVCVEPRWV SIGKAEDYYA PQPSGAADTY KSITSGTFPS PSATVTATAS VSAELDHELD TESPLDNVNF LSFEEWKKQN LAKVGQSAEL VDGHRHAAGS ENGRQRPTGI DNSLDSFGED GEIELDFGGF APENSGPASW ERKVGNEHPS RLKDSAGAAT GGAKGATQTN AAPRGTVSRR KDAGTTCKER FNYASFDCAA TILKTNPQCT GASSVLIENK DSYMLNECRA KDKFLILELC DDILIDTIVL ANYEFFSSIF RTFKVSVSDR YPPKQPDMWK DLGTYEAVNT REVQAFAVEN PLIWARYVKI EFLSHYGNEF YCPVSLIRVH GTTMLEEYKN EGEAGRLEEN TAQIQADAAS ERARDNSIVG NQSNIADAEG TKSEMVDELD VRPTRVQKLQ DICTLRKTSI ERFSLQAKMC AIKEGPRAMN RSVNSVQPDS VKPAGPIKMT GNGLPANLTA EVSSNNVNST VTATPTETDT RAQNPTQESQ NEASSSAPNK AGHDPSTESL KPSTTVQPPP SNPTTQESFF KSVNKRLHML ETNSSLSLQY IEEQSRILRD AFNKVEKRQL AKTSTFLENL NTTVLHELQE FRHQYDQVWH SVATEFEQQR RQYHQEVFAL STQLGILADE LVFQKRISII QSVFVVICFG LVLFSRSNAL GSVASYLELP RVHSIVSRSP SFRFSSSSFD VPSRNPDSRV TPPYGVKGAR GHRRTDFTES REEMEELTAN PNIAYSPPTP TSDSEGPGSA HRQRSERSNS TSSSNFTVSP PAPFLRSESS PPDLGGHDEG PDVDSFSGAQ RVCS // ID C1GPJ3_PARBA Unreviewed; 874 AA. AC C1GPJ3; DT 26-MAY-2009, integrated into UniProtKB/TrEMBL. DT 26-MAY-2009, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EEH36115.1}; GN ORFNames=PAAG_00438 {ECO:0000313|EMBL:EEH36115.1}; OS Paracoccidioides lutzii (strain ATCC MYA-826 / Pb01) (Paracoccidioides OS brasiliensis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; mitosporic Onygenales; Paracoccidioides. OX NCBI_TaxID=502779 {ECO:0000313|EMBL:EEH36115.1, ECO:0000313|Proteomes:UP000002059}; RN [1] {ECO:0000313|EMBL:EEH36115.1, ECO:0000313|Proteomes:UP000002059} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-826 / Pb01 {ECO:0000313|Proteomes:UP000002059}; RX PubMed=22046142; DOI=10.1371/journal.pgen.1002345; RA Desjardins C.A., Champion M.D., Holder J.W., Muszewska A., RA Goldberg J., Bailao A.M., Brigido M.M., Ferreira M.E., Garcia A.M., RA Grynberg M., Gujja S., Heiman D.I., Henn M.R., Kodira C.D., RA Leon-Narvaez H., Longo L.V.G., Ma L.-J., Malavazi I., Matsuo A.L., RA Morais F.V., Pereira M., Rodriguez-Brito S., Sakthikumar S., RA Salem-Izacc S.M., Sykes S.M., Teixeira M.M., Vallejo M.C., RA Walter M.E., Yandava C., Young S., Zeng Q., Zucker J., Felipe M.S., RA Goldman G.H., Haas B.J., McEwen J.G., Nino-Vega G., Puccia R., RA San-Blas G., Soares C.M., Birren B.W., Cuomo C.A.; RT "Comparative genomic analysis of human fungal pathogens causing RT paracoccidioidomycosis."; RL PLoS Genet. 7:E1002345-E1002345(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; KN293992; EEH36115.1; -; Genomic_DNA. DR RefSeq; XP_002797899.1; XM_002797853.1. DR STRING; 502779.XP_002797899.1; -. DR EnsemblFungi; EEH36115; EEH36115; PAAG_00438. DR GeneID; 9101350; -. DR KEGG; pbl:PAAG_00438; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002059; Partially assembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002059}; KW Reference proteome {ECO:0000313|Proteomes:UP000002059}. SQ SEQUENCE 874 AA; 96169 MW; 0493D3800B6B3699 CRC64; MACCYSLFPR PSNTNYRATG CLLYFLVLAA LAAVRAGGNV DFRHRNVSSR ALDATCPTRN FTDIKLEYIQ YPVCVEPRWV SIGKAEDYYA PQPSGAADTH KLITSGTLPS PPATVTATAS VSAELDHELD TESPLDNVNF LSFEEWKKQN LAKVGQSAEL VDGHRHAAGS ENGRQRPTGI DNSLDSLGED GEIELDFGGF APENSGPASW ERKVGNEHPS PLKDSAGPVT GEAKGATQTN AAPRGTVSPR KDAGTTCKER FNYASFDCAA TVLKTNPQCT GASSVLIENK DSYMLNECRA KDKFLIIELC DDILIDTIVL ANYEFFSSIF RTFKVSVSDR YPPKQPDMWK DLGTYEAVNT REVQAFAVEN PLIWARYVKI EFLSHYGNEF YCPVSLIRVH GTTMLEEYKN EGEAGRLEEN TAQIQTDAAS EPARDNSIVG NQSNIVDAEG TKSEMVDELD VRPTRVQKLQ DICSLRKTSI ERFSLQAKMC AFKEGPRAIN RSVNSVQLDC VKPAGPIKMT GNGLRANRTA EASSNNVNST VTATPTETDT RAQNQTQESQ NEASSSAPNK AEHDSSTESL KTSTTVQPPP PNPTTQESFF KSVNKRLHML ETNSSLSLQY IEEQSRILRD AFNKVEKRQL AKTSTFLENL NTTVLHELQE FRHQYDQVWH SVATEFEQQR RQYHQEVFAL STQLGILADE LVFQKRISII QSVFVVICFG LVLFSRSNAL GTVASYLELP RVHSIVSRSP SFRFSSASFE APSGSPDSRV IPPYGVKGAR GHRRTDSTES REEIDELTAN PNIAYSPPTP TSDSEGLGSA HRQRSERSNS TSSSNFTVSP PAPFLRSESS PPDLGGHDEG SDVGSFSGTQ QVCS // ID C1MM01_MICPC Unreviewed; 683 AA. AC C1MM01; DT 26-MAY-2009, integrated into UniProtKB/TrEMBL. DT 26-MAY-2009, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EEH58953.1}; GN ORFNames=MICPUCDRAFT_62344 {ECO:0000313|EMBL:EEH58953.1}; OS Micromonas pusilla (strain CCMP1545) (Picoplanktonic green alga). OC Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; OC Mamiellales; Mamiellaceae; Micromonas. OX NCBI_TaxID=564608 {ECO:0000313|Proteomes:UP000001876}; RN [1] {ECO:0000313|EMBL:EEH58953.1, ECO:0000313|Proteomes:UP000001876} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP1545 {ECO:0000313|EMBL:EEH58953.1, RC ECO:0000313|Proteomes:UP000001876}; RX PubMed=19359590; DOI=10.1126/science.1167222; RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L., RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E., RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M., RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H., RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., RA Gready J.E., John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., RA Moreau H., Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., RA Piegu B., Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., RA Zelensky A., Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., RA Van de Peer Y., Grigoriev I.V.; RT "Green evolution and dynamic adaptations revealed by genomes of the RT marine picoeukaryotes Micromonas."; RL Science 324:268-272(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG663737; EEH58953.1; -; Genomic_DNA. DR RefSeq; XP_003057308.1; XM_003057262.1. DR GeneID; 9682293; -. DR KEGG; mpp:MICPUCDRAFT_62344; -. DR KO; K19347; -. DR Proteomes; UP000001876; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001876}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001876}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 62 80 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 197 247 {ECO:0000256|SAM:Coils}. FT COILED 273 300 {ECO:0000256|SAM:Coils}. FT COILED 311 338 {ECO:0000256|SAM:Coils}. FT COILED 361 381 {ECO:0000256|SAM:Coils}. FT COILED 407 427 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 683 AA; 70852 MW; 00E665143BDA9428 CRC64; MSARKSILAS TTRSRRSRRI ANLEDDDDAT RADARTDAER PPPGDDDEDL SRRSSSFPRG HVSLGATFLW TLVAVAIALM TSVNSVTVVR GSAAYASDAD ARALAGRMRR VGDSVDALAR AMTLHAGKRT PERDLNAVKR TLSTLTTKLK AQDAAIAKAA KSGPAPPPTP PPGPTRGDYD AMTAGNEKEF DAIRRALAED DARDASAKRE LEALREELRE ASAKRELEEL RKALAAVEKT AREAAAAAAA AAAAAKPAPA PARDADTNEW PKNAAAAKDV DALRKEMGAL AKEVAAAKKK AAPAAPAPLP KKELDALRRE LKDEIAKANA KAKATADAAA KAAAKAASRA GSSGPAAASD AAAAAAAAAA AAKEVDALRA EVSAQLAALS ASAADAAKAS SSGPAAASDA AAAAAGAEKD IEALRAEVRA ELVALSKRAS TADVAVAADV DKKLDKKLES WKGKPMVTRT HVADEVRAQL ERFKSDKTGR VDYALFSGGG KVVGHSVLSP LVSAGDGFGT KALKSLRGGT HPKADEWVIT ATTQAAGECL ALRGNRGHVD IRLREAVHVD AVTIEHVPRG VAYDITSAPK DVSVLGWNRT RTAPAKNSDA LAALGSFRYA VDASAGSQVG STQTFALRGG GGGAVDHVRF EVRSNYGNDR WTCLYRLRVH GTPVAKPREP VLD // ID C1N1V3_MICPC Unreviewed; 1135 AA. AC C1N1V3; DT 26-MAY-2009, integrated into UniProtKB/TrEMBL. DT 26-MAY-2009, sequence version 1. DT 14-OCT-2015, entry version 19. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EEH53895.1}; GN ORFNames=MICPUCDRAFT_51746 {ECO:0000313|EMBL:EEH53895.1}; OS Micromonas pusilla (strain CCMP1545) (Picoplanktonic green alga). OC Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; OC Mamiellales; Mamiellaceae; Micromonas. OX NCBI_TaxID=564608 {ECO:0000313|Proteomes:UP000001876}; RN [1] {ECO:0000313|EMBL:EEH53895.1, ECO:0000313|Proteomes:UP000001876} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP1545 {ECO:0000313|EMBL:EEH53895.1, RC ECO:0000313|Proteomes:UP000001876}; RX PubMed=19359590; DOI=10.1126/science.1167222; RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L., RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E., RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M., RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H., RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., RA Gready J.E., John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., RA Moreau H., Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., RA Piegu B., Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., RA Zelensky A., Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., RA Van de Peer Y., Grigoriev I.V.; RT "Green evolution and dynamic adaptations revealed by genomes of the RT marine picoeukaryotes Micromonas."; RL Science 324:268-272(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG663745; EEH53895.1; -; Genomic_DNA. DR RefSeq; XP_003062183.1; XM_003062137.1. DR GeneID; 9687622; -. DR KEGG; mpp:MICPUCDRAFT_51746; -. DR Proteomes; UP000001876; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001876}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001876}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 22 {ECO:0000256|SAM:SignalP}. FT CHAIN 23 1135 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002910717. FT TRANSMEM 1040 1066 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 1078 1106 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 182 215 {ECO:0000256|SAM:Coils}. FT COILED 239 266 {ECO:0000256|SAM:Coils}. FT COILED 609 646 {ECO:0000256|SAM:Coils}. FT COILED 964 991 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1135 AA; 111692 MW; F97EAA6AD143978B CRC64; MTTMIALATI ALLTSGAPRV AAASATGTAS AAMDASSSGG DSSVATPTPT TATAAETTAS SPAASSVSED ADGAAANESA AAATTTTTTT TTTTTTTTTT AQPPSTIGVL VPPKNAEPSS SRGGRGPGRA PSVLGVGDSS TTCRRPLDRF SRDDLDAFTR GATTTSDGGG DDDVPLVCVS DVVEGARNAA KAEEEAAAAE RAARVAAERE RAKREKLPSL KAFKHDLEAK VIHAKEEQQQ KAIKEAAAAA AAAAEAERKR KEAEEAASGS EGGAGGGGEN AKTPADADED ATGGGETVAA AAPGEKGESE SGGEEFFAGG SREGENAEVL PGAAAAATDA NANGDDATAT TKASAAAPAA ASPPPPEPAP SSAEDPDANV GTPPSTPPPP SDAAGGDAPP SDASAADAAD LADAADDDDA NATGAPNDEE AAADLAAADA PDDDASDPLV LPAPAEVAAA YATQYNYAAA SNGAKVVSSN PESKSPASAL SEHMDSYYIT PCGAKSGKWL TVELSEEAAV TALTLANYEF HSSGVREFEV WGTAGGHDED DGWRRLARGK ASGHRDAQTF VLPGAGAWSK FLQLRMTGHY GAYHYCTVSL LRVHGKDAQQ TLKEEMEAIN MEVREVEEIL RDADEAEAEA EAEAEVRVDE GSNVGDDDDD DRGGGGGDDD AAADADEVDA GADDTPAAVT TSKTASEDAG AATPDSGSSA SAADADAAAA GGGGGGGDGD DKNPTAASGK GWLNWFGGDT PPETEKAEDA ATDDTAVSSS ATPGPGAAAT KDDATRVDAT ADASASAPST TGGAAAAAAA AEDPVDASRA TPSAPERAAG ATSGASAETS SDAASASASA ANEREKERPA DVGRDEKKEK SSTSTSASGS PSDAGKDEEK PQLTSTSTST STPTQTQTPP TPSKPSSSSS SSSAGGGSEN VFSLMAKKIK ALELNQSMFD RYVEALNARY ADRFDDVTRD VAALEAKLEN VTRAAVTAAE ARAEAASGKC DDAAKAAAAK AAESAAAHDA SLSRARREMS DVAASWRTRF GVLLALVAAC FAFVSVGVAG LAAHAAGSGG GGGGGGGALA RLLAAGAGVA AVVAGAAGAA AAWTLFGGGL AGAVKFAAGA VERGARAVVA RARGQ // ID C3PT64_DASNO Unreviewed; 441 AA. AC C3PT64; DT 16-JUN-2009, integrated into UniProtKB/TrEMBL. DT 16-JUN-2009, sequence version 1. DT 14-OCT-2015, entry version 6. DE SubName: Full=Sperm associated antigen 4 (Predicted) {ECO:0000313|EMBL:ACQ63008.1}; GN Name=SPAG4 {ECO:0000313|EMBL:ACQ63008.1}; OS Dasypus novemcinctus (Nine-banded armadillo). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Xenarthra; Cingulata; Dasypodidae; Dasypus. OX NCBI_TaxID=9361 {ECO:0000313|EMBL:ACQ63008.1}; RN [1] {ECO:0000313|EMBL:ACQ63008.1} RP NUCLEOTIDE SEQUENCE. RA Antonellis A., Benjamin B., Blakesley R.W., Bouffard G.G., RA Brinkley C., Brooks S., Chu G., Chub I., Coleman H., Fuksenko T., RA Gestole M., Gregory M., Guan X., Gupta J., Gurson N., Han E., Han J., RA Hansen N., Hargrove A., Hines-Harris K., Ho S.-L., Hu P., Hunter G., RA Hurle B., Idol J.R., Johnson T., Knight E., Kwong P., Lee-Lin S.-Q., RA Legaspi R., Madden M., Maduro Q.L., Maduro V.B., Margulies E.H., RA Masiello C., Maskeri B., McDowell J., Merkulov G., Montemayor C., RA Mullikin J.C., Park M., Prasad A., Ramsahoye C., Reddix-Dugue N., RA Riebow N., Schandler K., Schueler M.G., Sison C., Smith L., RA Stantripop S., Thomas J.W., Thomas P.J., Tsipouri V., Young A., RA Green E.D.; RT "NISC Comparative Sequencing Initiative."; RL Submitted (MAY-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DP001108; ACQ63008.1; -; Genomic_DNA. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 441 AA; 47953 MW; BCF3BC69FFD384EF CRC64; MRRGPRPGST VPPHKHTPNF YSDNSSVSAT SADSSGHRSA GMGPVEPEGR RARGSSCGEP ALSAGVSGGT TWAGSSRQKP ASRSHKGPTA GGAATVRGGA SEPPGSPVVS EEQLDLLSTL DLRQEMPPPR ASKTFLSQLF QVLSVLSSLV GDALVSAYRE VCSIRFLLTA VSLLSLFVAA LWWGLLYLVP PLENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKQV ASVRATNSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL EKTSHDYEDT DTAYFWNRFS FWNYARPPTV ILEPDVFPGN CWAFEGDQGQ VVIRLPGRVQ LSDITLQHPP PSVAHAGGAS SAPRDFAVYG LQADDETEVF LGKFTFDVEK SEIQTFRLQN DPPAAFPKVK IQILSNWGHP RFTCLYRIRA HGVRTSEGAG DSTTGATGGP H // ID C3Y953_BRAFL Unreviewed; 1111 AA. AC C3Y953; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEN63109.1}; GN ORFNames=BRAFLDRAFT_68107 {ECO:0000313|EMBL:EEN63109.1}; OS Branchiostoma floridae (Florida lancelet) (Amphioxus). OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. OX NCBI_TaxID=7739 {ECO:0000313|Proteomes:UP000001554}; RN [1] {ECO:0000313|EMBL:EEN63109.1, ECO:0000313|Proteomes:UP000001554} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=S238N-H82 {ECO:0000313|EMBL:EEN63109.1, RC ECO:0000313|Proteomes:UP000001554}; RC TISSUE=Testes {ECO:0000313|EMBL:EEN63109.1}; RX PubMed=18563158; DOI=10.1038/nature06967; RG US DOE Joint Genome Institute (JGI-PGF); RA Putnam N.H., Butts T., Ferrier D.E.K., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E., Terry A., Yu J.-K., RA Benito-Gutierrez E.L., Dubchak I., Garcia-Fernandez J., RA Gibson-Brown J.J., Grigoriev I.V., Horton A.C., de Jong P.J., RA Jurka J., Kapitonov V.V., Kohara Y., Kuroki Y., Lindquist E., RA Lucas S., Osoegawa K., Pennacchio L.A., Salamov A.A., Satou Y., RA Sauka-Spengler T., Schmutz J., Shin-I T., Toyoda A., RA Bronner-Fraser M., Fujiyama A., Holland L.Z., Holland P.W.H., RA Satoh N., Rokhsar D.S.; RT "The amphioxus genome and the evolution of the chordate karyotype."; RL Nature 453:1064-1071(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG666492; EEN63109.1; -; Genomic_DNA. DR RefSeq; XP_002607099.1; XM_002607053.1. DR STRING; 7739.JGI68107; -. DR GeneID; 7223494; -. DR KEGG; bfo:BRAFLDRAFT_68107; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; C3Y953; -. DR KO; K19347; -. DR Proteomes; UP000001554; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001554}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001554}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 621 641 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 825 845 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1111 AA; 119110 MW; 7006B5668E4991A9 CRC64; MASESVVETT GRSRTRSGVI YSPGRTRQRT SQGSSGYGST SSSKNTSLNT SYEDQQNVKN TSSSYDASFT QSSSQGEVVI LGDVTARSGS NRSTRSGGSG SRKSAGLSLQ GGRAASKESV SVVKTSTTVI TTSTMTSESQ NWQSSTVAGS SGIVNGFSSS SSAIESSASQ ATVGGTLTSA ALSTRSGSGR SSGSNRSGRS SALLVGGTVS SALTSGTSTQ SVVTQSESST REGSASFTQG SGSRSSAASS IRGSSGRRSA IQTSSHGSSA SERAVVKGVT SETALTQSAT TLSSGGLVQG GSGRSVGLLV VGSSGSRSSQ KRGSSRGSSS SRQLASTESV VTQTSTSSQH GNTSSLGLTG GILASTPKKA AAPSSFVTKE TTVTTATVNK EEGGHTSVEQ SGSYRFPIYV LRKVHSVLGW LEKQEVEEVQ DLDTLKEAYD TPKMMRRGLR SSSKSKFDDS IIEDSFAGDS YSYSESKTKE SRSTNRARRR HSGKDITAGS IYGTTDKSMA SEKTLSKLYE SESVTQDRLS YESDGEYGDY SDSAYSYSEA AGGGLTIWQM ITMPFIGLFW WLWWLVGTPV YWFVTTVMML DAWILSRVTI LKNTIMNRKG AVFAGAGAFP LRLLLLPLLL LLLLLGLYYL WPLSFLTGIV SSVASTPGAL LAWLPWSSST GAGAPVLNDF RAEQIRSEMR TEIQRLSATV ADVVKKWEQS SQVPEPAVGT QQSASEAALA GGMSKEEIIA LITAIVKENM EGLRKDVSVQ DQAILADLDK VRTERENQLA MLEIKMQTIN THSQDLTKQL VEAKQKINVE SSGGVGTSSD MLDHIHRLES ELASLKMELG RVQQQSDMYH QELKKYSMNI SLIEQQLSIT VQQALQEDSS ALVSWLKSRF VGQSDFDSKV SDIVSRVTEN VSQKLKVETS RIRAIVNKHI GIYDADKTGM VDHALESAGG SVLSLRCTET YESKSAQLSI LGIPLWYKTN SPRTVIQPDV HPGNCWAFRG SEGYMVIQLS GVVRPTSFSL EHIPKSLSPT GQIDSAPKDF TVYGLEDESQ PEGDVLGNYT YDDNAEPLQY FPVQATDVKP YPIVELRILS NHGNPDYTCI YRFRVHGILQ K // ID C3ZHZ1_BRAFL Unreviewed; 1269 AA. AC C3ZHZ1; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEN47846.1}; GN ORFNames=BRAFLDRAFT_88779 {ECO:0000313|EMBL:EEN47846.1}; OS Branchiostoma floridae (Florida lancelet) (Amphioxus). OC Eukaryota; Metazoa; Chordata; Cephalochordata; Branchiostomidae; OC Branchiostoma. OX NCBI_TaxID=7739 {ECO:0000313|Proteomes:UP000001554}; RN [1] {ECO:0000313|EMBL:EEN47846.1, ECO:0000313|Proteomes:UP000001554} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=S238N-H82 {ECO:0000313|EMBL:EEN47846.1, RC ECO:0000313|Proteomes:UP000001554}; RC TISSUE=Testes {ECO:0000313|EMBL:EEN47846.1}; RX PubMed=18563158; DOI=10.1038/nature06967; RG US DOE Joint Genome Institute (JGI-PGF); RA Putnam N.H., Butts T., Ferrier D.E.K., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E., Terry A., Yu J.-K., RA Benito-Gutierrez E.L., Dubchak I., Garcia-Fernandez J., RA Gibson-Brown J.J., Grigoriev I.V., Horton A.C., de Jong P.J., RA Jurka J., Kapitonov V.V., Kohara Y., Kuroki Y., Lindquist E., RA Lucas S., Osoegawa K., Pennacchio L.A., Salamov A.A., Satou Y., RA Sauka-Spengler T., Schmutz J., Shin-I T., Toyoda A., RA Bronner-Fraser M., Fujiyama A., Holland L.Z., Holland P.W.H., RA Satoh N., Rokhsar D.S.; RT "The amphioxus genome and the evolution of the chordate karyotype."; RL Nature 453:1064-1071(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG666625; EEN47846.1; -; Genomic_DNA. DR RefSeq; XP_002591835.1; XM_002591789.1. DR STRING; 7739.JGI88779; -. DR GeneID; 7246643; -. DR KEGG; bfo:BRAFLDRAFT_88779; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; C3ZHZ1; -. DR OMA; VENTFLE; -. DR Proteomes; UP000001554; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001554}; KW Reference proteome {ECO:0000313|Proteomes:UP000001554}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 1269 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002934648. FT COILED 974 1009 {ECO:0000256|SAM:Coils}. FT COILED 1017 1044 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1269 AA; 138117 MW; F41C66AB330BA42C CRC64; MWNGLLAVLL LTLFLNSLST SVQCSSPAEV TGDDQKQQVK ETSQDCPVYL VDMTTGGEPG ISYPTTKSSS SNEGTEKTQK HQEPEDSPRA LTEDEGKVST DQPVDSQKVE ESAESALPSP AVEKDLPKDQ QAEGVPQHTD KPVPTTRLEQ TGGSVKPDIS PLKDSQTQDA VESNEEPAVM VQPLTVTEIS ADADRTKSKT IGDDVIPKLD GEMAGDEILD IQEQETAVTG TGDSVEEQKT AVTGTEDSAA KEGTAVTGTE DNADIEETAE TDIENSAVKQ GIAVTGTEDS VEEEKTAVTG AEDSAAKEGT AVTGTGDIAD KQGTAVTDTE DSIEEQKTAM TDTEDSADKQ ETAVTGTEDI ADKQKTAETG TKDSAEEEDL VAHKEDDDMP SFDEWKKRML EQEEREKKQN SNVLVENKDM YMLNPCSAKI WFIVELCEPI QLKQIDIANF ELFSSVPESF KVSTSERYPA REWQLLGTFH MANERSIQSF PLDEKLFNKY LKVEMLSHYG SEHYCPLSLF RVFGTSMEEE IEETEQHPES VVDPEDELFP DESIPLSPDS NLFGSAKDAV LNIVKQAAKV FTGPSDGMAS TEPHEAKGQV SVENTFLEPT LEYPEPCLDI NRTVNVSVPD HDGSGKESRN IQPTPTSTAH STVKHELETV QKCQGNPRCL FHQLILQTSC EEEQRITTPP ADKLVIATDV QGDEGRSDDV ELGKTDKDED INVQSKQRVK IDPPATAGTD QVLSADRDDK GLLPVVSDND TVADTSTDMS ATPPTLQHGE DSSPAGSVDG EEKLKPDQLP TSSTVAPLEG SERTSAVLEG VGGPDIATST TDGAETPPAD SASSPTDSET SGSVSAAGSN QNGGKKQVHD KASSAQVEGN AQVKDDIITS ASPSQDPQDS SSPEPTAVVQ NESTDGLDKQ AKNSSDFYAE KNENGTSSQH ISHGGHTGKE SVIMRLNNRI KALELNMSLS SRYLEELSQR YRKQMDEMQK AFNKTISKLT NNSRKAEERD IRQQEIIANL AASITNLTAD IQSLNTDRDN LHRKVIERHI FLMVVEVLCL AVVFMVCIHT RPGHTPQLIH SQGEQFQEHL RDKMEGLRYR TRTEDLPPRR SDGEEDKLVI VEPQCNRQGI DGTKKKRSKK KHRHQGLRNT SSTPNLLEQL TGHEETAAGV NDVNAAGLLF SSSTATDRQP FSSSRDSKAA QMPPGTTYQR GVTHSTLQTT QRCQPPSGLT QETTEHGKNC HQPALDKLYG DVQEIGARVK ALSEDPKLS // ID C4JNE0_UNCRE Unreviewed; 849 AA. AC C4JNE0; DT 07-JUL-2009, integrated into UniProtKB/TrEMBL. DT 07-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEP79500.1}; GN ORFNames=UREG_04346 {ECO:0000313|EMBL:EEP79500.1}; OS Uncinocarpus reesii (strain UAMH 1704). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Onygenaceae; Uncinocarpus. OX NCBI_TaxID=336963 {ECO:0000313|EMBL:EEP79500.1, ECO:0000313|Proteomes:UP000002058}; RN [1] {ECO:0000313|Proteomes:UP000002058} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=UAMH 1704 {ECO:0000313|Proteomes:UP000002058}; RX PubMed=19717792; DOI=10.1101/gr.087551.108; RA Sharpton T.J., Stajich J.E., Rounsley S.D., Gardner M.J., RA Wortman J.R., Jordar V.S., Maiti R., Kodira C.D., Neafsey D.E., RA Zeng Q., Hung C.-Y., McMahan C., Muszewska A., Grynberg M., RA Mandel M.A., Kellner E.M., Barker B.M., Galgiani J.N., Orbach M.J., RA Kirkland T.N., Cole G.T., Henn M.R., Birren B.W., Taylor J.W.; RT "Comparative genomic analyses of the human fungal pathogens RT Coccidioides and their relatives."; RL Genome Res. 19:1722-1731(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH476616; EEP79500.1; -; Genomic_DNA. DR RefSeq; XP_002544829.1; XM_002544783.1. DR STRING; 336963.XP_002544829.1; -. DR EnsemblFungi; EEP79500; EEP79500; UREG_04346. DR GeneID; 8438332; -. DR KEGG; ure:UREG_04346; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; C4JNE0; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002058; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002058}; KW Reference proteome {ECO:0000313|Proteomes:UP000002058}. SQ SEQUENCE 849 AA; 94558 MW; 9042A2C71C39588F CRC64; MRGRWPFLRF DTDIGVLNGL ILLFLLPLFV AEDGDFQHPH PQHRQGLDEA RWANGQNNGI CPVRGVVEIQ AEYLRHPMCS VTTGGQDPQG DGFLLNTTET TITTTEKGLE TPASTARLES ELDTESPLDN EKFLSFEEWK KKNLAKVGQS ADNVRGNRPG SGSTEMRRRP YPGKISNALD SLGEEGEIEL GFGGFDPDDP NIPPLGKKDA RTAQTTSGDE SPVTKGTEGE IQSDGVPRRG VARRKDAGTT CKERFNYASF DCAATVLKTN KECTGSSSIL IENKDSYMLN ECRAKDKFII LELCDDILVD TVVLANYEFF SSIFRTFRVS VSDRYPAKPD KWKELGTYEA ANTREIQAFA VENPLIWTRY LKIEFLSHYG NEFYCPVSLV RVHGTTMMEE YKNYGDTARA EEETVQAVQP TQETPGAVST VDQSNQTQKE NCKRNATASV TKTGAESLPD EEALGALCFP ELDEIEKLLL GYNVNNMSSI YDLVSEHEYQ YDSHDLAESE SVTAKATGST ASEDIPQSDT PSTTNGGDKS QKPASEARVA STSSSTEAEN DTVVESQRTP IASQPPPPNP TTQESFFKSV HKRLQMLETN STLSLLYIEE QSRILRDAFN KVEKRQLAKT SSFLENLNAT VLHELRDFRQ QYDHIWRSVV LEFEQQRQQY HHELFAVTTQ LAILADEVVF QKRVSIIQSV FVLLCLGLVL FSRSAVGSYL EFPKVQNMVS RTHSFRSASP SYETPSASPN TVRQRPSYSK ANLHRRNVSE DQTDSELCSP TFAYHSPPLS EGPSPSEEEE KGLNEVQSDY ARPMSSSPVP VENLATLKRQ KSSPAELGEA TNNAELRPP // ID C4JQB8_UNCRE Unreviewed; 648 AA. AC C4JQB8; DT 07-JUL-2009, integrated into UniProtKB/TrEMBL. DT 07-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EEP79826.1}; GN ORFNames=UREG_04672 {ECO:0000313|EMBL:EEP79826.1}; OS Uncinocarpus reesii (strain UAMH 1704). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Onygenaceae; Uncinocarpus. OX NCBI_TaxID=336963 {ECO:0000313|EMBL:EEP79826.1, ECO:0000313|Proteomes:UP000002058}; RN [1] {ECO:0000313|Proteomes:UP000002058} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=UAMH 1704 {ECO:0000313|Proteomes:UP000002058}; RX PubMed=19717792; DOI=10.1101/gr.087551.108; RA Sharpton T.J., Stajich J.E., Rounsley S.D., Gardner M.J., RA Wortman J.R., Jordar V.S., Maiti R., Kodira C.D., Neafsey D.E., RA Zeng Q., Hung C.-Y., McMahan C., Muszewska A., Grynberg M., RA Mandel M.A., Kellner E.M., Barker B.M., Galgiani J.N., Orbach M.J., RA Kirkland T.N., Cole G.T., Henn M.R., Birren B.W., Taylor J.W.; RT "Comparative genomic analyses of the human fungal pathogens RT Coccidioides and their relatives."; RL Genome Res. 19:1722-1731(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH476616; EEP79826.1; -; Genomic_DNA. DR RefSeq; XP_002545155.1; XM_002545109.1. DR EnsemblFungi; EEP79826; EEP79826; UREG_04672. DR GeneID; 8440058; -. DR KEGG; ure:UREG_04672; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR InParanoid; C4JQB8; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000002058; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002058}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002058}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 299 316 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 648 AA; 70767 MW; C120FE68DFF3A47F CRC64; MVETRRGRSG RSVSQEPSGQ NQYTRQGTAP LGSQNDATAI PASSFDNPSL PAIQTQQSFA YGATGSPAFP RQLRASPPMA AQFGAKIERP SLNTEANDFE RIQEQARANP GTGRATRLRT NRQSASPTRR TPGRRTRTRE PTPDDQLLGS LREASEEAEG SKEVVLPSIE DSSVSWNTER HVIADPRHGS VANSADNASG AGSFQVQPQW RQVHPMAGPP LRPSARSQPA ANFQVPQEAR VGPSVVASSQ SAPQLGAIPV LAPSQGEDYG DATPISKKAP SSAFHSAPPT NRSQQGVRFA LMTVFFMLFA FGGMLIKVED VRGVLPKDIG KGLNLPPSFC GGQPATSQYA DALNKLSTGV DQRLADMARD VAALKDEWNK RLPHLRQAIW PEREDPLLPR RINWFSTGMG AFVDPYLTTK FRPGLLKGGT ERAVGVKRTN PPAAALTRWD EHGDCWCVND HTSEIQLAVL LGRPLVPEEV VIEHIHKEAT LDPESAPREM ELWVEYSSRS ASAAPSTVPP GARATVAPGS SEWLTQRPEL LESSAEAREA YSGPLSPSQR EDIISTLRMA YPDEPETAYS QDTKLGSTYY RVGKFQYDIN GKHNIQRFLL DAVIDLPNIR TKKAVLRVKS NWGSVNTCIY RARLHGHM // ID C4LWG1_ENTHI Unreviewed; 413 AA. AC C4LWG1; DT 07-JUL-2009, integrated into UniProtKB/TrEMBL. DT 07-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 27. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAL51544.2}; GN ORFNames=EHI_096480 {ECO:0000313|EMBL:EAL51544.2}; OS Entamoeba histolytica. OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. OX NCBI_TaxID=5759 {ECO:0000313|Proteomes:UP000001926}; RN [1] {ECO:0000313|Proteomes:UP000001926} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 30459 / HM-1:IMSS {ECO:0000313|Proteomes:UP000001926}; RX PubMed=15729342; DOI=10.1038/nature03291; RA Loftus B.J., Anderson I., Davies R., Alsmark U.C., Samuelson J., RA Amedeo P., Roncaglia P., Berriman M., Hirt R.P., Mann B.J., Nozaki T., RA Suh B., Pop M., Duchene M., Ackers J., Tannich E., Leippe M., RA Hofer M., Bruchhaus I., Willhoeft U., Bhattacharya A., RA Chillingworth T., Churcher C.M., Hance Z., Harris B., Harris D., RA Jagels K., Moule S., Mungall K.L., Ormond D., Squares R., RA Whitehead S., Quail M.A., Rabbinowitsch E., Norbertczak H., Price C., RA Wang Z., Guillen N., Gilchrist C., Stroup S.E., Bhattacharya S., RA Lohia A., Foster P.G., Sicheritz-Ponten T., Weber C., Singh U., RA Mukherjee C., El-Sayed N.M.A., Petri W.A., Clark C.G., Embley T.M., RA Barrell B.G., Fraser C.M., Hall N.; RT "The genome of the protist parasite Entamoeba histolytica."; RL Nature 433:865-868(2005). RN [2] {ECO:0000313|Proteomes:UP000001926} RP GENOME REANNOTATION. RC STRAIN=ATCC 30459 / HM-1:IMSS {ECO:0000313|Proteomes:UP000001926}; RA Lorenzi H., Amedeo P., Inman J., Schobel S., Caler E.; RL Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS571163; EAL51544.2; -; Genomic_DNA. DR RefSeq; XP_656924.2; XM_651832.2. DR STRING; 5759.rna_EHI_096480-1; -. DR EnsemblProtists; rna_EHI_096480-1; rna_EHI_096480-1; EHI_096480. DR GeneID; 3411241; -. DR KEGG; ehi:EHI_096480; -. DR EuPathDB; AmoebaDB:EHI_096480; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; C4LWG1; -. DR Proteomes; UP000001926; Partially assembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001926}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001926}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 330 351 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 303 323 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 413 AA; 47733 MW; EEC298DC76883F87 CRC64; MNGQLVDRKN NYASDDCGAK VIATNSKACG SNNLLNSNKD EYYLSPCQDN IHFVVELCQT IQLHQVGIGN FELFSNQLQN LTISCSVDGT YWRVLGEFRL PNQKILHSIS IPHPFWCKYI KIHQTSWYGK EYYCSINKFV AYGISSLDEL VDDMVETEDV KNNSVNLLDN EHTFAEPSIT PIKWSEEEFI NESNKPFINL TTKIINSSLN NITIENTTRT FLYKQKMRIH RVELELEVFK GHIDELQKHF VQHILSLNET ENVVNKLLTE IKKDISLSFE ETKTMNEQLT IFEKENTNQI LENQRLSKEL LALKNSYHEL KLKINTKTQA MFYILLIIAC IFFLFIIYWS LMLSRKCTNT LSKTQKLITS KITNISFPPS VSLTKDGTNV NKTLLVQLQK ENKIPLIVSS KHP // ID C4M1N7_ENTHI Unreviewed; 350 AA. AC C4M1N7; DT 07-JUL-2009, integrated into UniProtKB/TrEMBL. DT 07-JUL-2009, sequence version 1. DT 14-OCT-2015, entry version 27. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAL45762.2}; GN ORFNames=EHI_062510 {ECO:0000313|EMBL:EAL45762.2}; OS Entamoeba histolytica. OC Eukaryota; Amoebozoa; Archamoebae; Entamoebidae; Entamoeba. OX NCBI_TaxID=5759 {ECO:0000313|Proteomes:UP000001926}; RN [1] {ECO:0000313|Proteomes:UP000001926} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 30459 / HM-1:IMSS {ECO:0000313|Proteomes:UP000001926}; RX PubMed=15729342; DOI=10.1038/nature03291; RA Loftus B.J., Anderson I., Davies R., Alsmark U.C., Samuelson J., RA Amedeo P., Roncaglia P., Berriman M., Hirt R.P., Mann B.J., Nozaki T., RA Suh B., Pop M., Duchene M., Ackers J., Tannich E., Leippe M., RA Hofer M., Bruchhaus I., Willhoeft U., Bhattacharya A., RA Chillingworth T., Churcher C.M., Hance Z., Harris B., Harris D., RA Jagels K., Moule S., Mungall K.L., Ormond D., Squares R., RA Whitehead S., Quail M.A., Rabbinowitsch E., Norbertczak H., Price C., RA Wang Z., Guillen N., Gilchrist C., Stroup S.E., Bhattacharya S., RA Lohia A., Foster P.G., Sicheritz-Ponten T., Weber C., Singh U., RA Mukherjee C., El-Sayed N.M.A., Petri W.A., Clark C.G., Embley T.M., RA Barrell B.G., Fraser C.M., Hall N.; RT "The genome of the protist parasite Entamoeba histolytica."; RL Nature 433:865-868(2005). RN [2] {ECO:0000313|Proteomes:UP000001926} RP GENOME REANNOTATION. RC STRAIN=ATCC 30459 / HM-1:IMSS {ECO:0000313|Proteomes:UP000001926}; RA Lorenzi H., Amedeo P., Inman J., Schobel S., Caler E.; RL Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS571213; EAL45762.2; -; Genomic_DNA. DR RefSeq; XP_651149.2; XM_646057.2. DR EnsemblProtists; rna_EHI_062510-1; rna_EHI_062510-1; EHI_062510. DR GeneID; 3405450; -. DR KEGG; ehi:EHI_062510; -. DR EuPathDB; AmoebaDB:EHI_062510; -. DR InParanoid; C4M1N7; -. DR Proteomes; UP000001926; Partially assembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001926}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001926}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 315 336 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 177 197 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 350 AA; 40504 MW; C965BECEA99D24B7 CRC64; MFSTSSIIGG GNVLTDDPNT YLIAPCSSNI TFIIRLCETT QVLHFEFINT ELFSGNIKDF SLKCSMNGID FIPILNGTTT NTFKTQLFDI NAIQCRDILI EKINTHKKDR YCTISQVKVK GLSVLNEVIE DSALVTQNNL SMSHLSRNTD TKLLLDAQQK KEQNVLNTYL QFSSQNINNL TSLYDEYQIQ IERLSDSLET PFVSLNFLKF KSQQLLVEVF NIKNYLIQAS DTLRKWVRDR WSATQLNKKR QHDMEFDYKK VMEKVTLINS SMRGITFQNQ IMKSNSKRLT LEMKGLNESF TTFFETTSSK ISKYIILYFV IVLTMSVLYL TWLCVLSNKL DEYQIKLNKE // ID C4QYG7_PICPG Unreviewed; 660 AA. AC C4QYG7; DT 07-JUL-2009, integrated into UniProtKB/TrEMBL. DT 07-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAY68290.1}; GN OrderedLocusNames=PAS_chr1-4_0439 {ECO:0000313|EMBL:CAY68290.1}; OS Komagataella pastoris (strain GS115 / ATCC 20864) (Yeast) (Pichia OS pastoris). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Phaffomycetaceae; Komagataella. OX NCBI_TaxID=644223 {ECO:0000313|EMBL:CAY68290.1, ECO:0000313|Proteomes:UP000000314}; RN [1] {ECO:0000313|EMBL:CAY68290.1, ECO:0000313|Proteomes:UP000000314} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=GS115 / ATCC 20864 {ECO:0000313|Proteomes:UP000000314}; RX PubMed=19465926; DOI=10.1038/nbt.1544; RA De Schutter K., Lin Y.-C., Tiels P., Van Hecke A., Glinka S., RA Weber-Lehmann J., Rouze P., Van de Peer Y., Callewaert N.; RT "Genome sequence of the recombinant protein production host Pichia RT pastoris."; RL Nat. Biotechnol. 27:561-566(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN392319; CAY68290.1; -; Genomic_DNA. DR RefSeq; XP_002490571.1; XM_002490526.1. DR STRING; 644223.XP_002490571.1; -. DR EnsemblFungi; CAY68290; CAY68290; PAS_chr1-4_0439. DR GeneID; 8197027; -. DR KEGG; ppa:PAS_chr1-4_0439; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000093382; -. DR InParanoid; C4QYG7; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000314; Chromosome 1. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000314}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000314}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 16 {ECO:0000256|SAM:SignalP}. FT CHAIN 17 660 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002942273. FT TRANSMEM 494 511 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 660 AA; 75286 MW; EE620F8E2F98E729 CRC64; MLVAWFLLLL VSSCICNDDK LNDSNPLKEA EQEEDSRFMS FEEWKKKKID NTRNDKAIQD DQKSHNVYRE PERSYRNNEL NGVLGEDMEI DLEMFTGVDR DDEIGKVYQQ RFNYASFDCA ATIVKTNTEA KGASSILNEN KDSYLLNKCD VANQYAVVEL CQDILVDTVV LANYEFFSSG FKTVRFSVSD RFPVPANGWK VLGDFDASNT RSIQTFNIES PLIWARYLKI EILSHYGNEY YCPLSLVRVH GKTMMEKFKL EEVEEEEKAN TENGQNTNIA TQSKAIAANQ TNVQNKFISP NGKNITVLCK NSSKTNCDPE SAQVKPLKDE DENEEDCPVS FKHFSLDEFL TEHSKEICLE KDEQSSDHIS IEPPLSSSEP KTQESIYKNI IKRISLLESN ATLSLLYIEE QSRLLSNAFS KLEERQSLRF EAMLDSVNSS IQNQIQLIDD LKLYFKVEFE TLLAGSKTKH ERALVENIEL ISSISDDLVF QKKLIFFTIF AGLCLFAFVI FNRETYIEST YEDDHLEYIK DRNRYRDRSG DESSTLGSPE PFQPISRSST PDSSSPLTPL PGPQLAPIPI HLTNRKKATS FERKNKNHTD LIYKRTATNS YEVGSKTERP TESEPEANLE PNTGEYTDNE ADDVFSNPSK HEPSEATSSI // ID C4V6Q2_NOSCE Unreviewed; 279 AA. AC C4V6Q2; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEQ83098.1}; GN ORFNames=NCER_100094 {ECO:0000313|EMBL:EEQ83098.1}; OS Nosema ceranae (strain BRL01) (Microsporidian parasite). OC Eukaryota; Fungi; Microsporidia; Nosematidae; Nosema. OX NCBI_TaxID=578460 {ECO:0000313|Proteomes:UP000009082}; RN [1] {ECO:0000313|Proteomes:UP000009082} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=BRL01 {ECO:0000313|Proteomes:UP000009082}; RX PubMed=19503607; DOI=10.1371/journal.ppat.1000466; RA Cornman R.S., Chen Y.P., Schatz M.C., Street C., Zhao Y., Desany B., RA Egholm M., Hutchison S., Pettis J.S., Lipkin W.I., Evans J.D.; RT "Genomic analyses of the microsporidian Nosema ceranae, an emergent RT pathogen of honey bees."; RL PLoS Pathog. 5:E1000466-E1000466(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACOL01000003; EEQ83098.1; -; Genomic_DNA. DR RefSeq; XP_002996769.1; XM_002996723.1. DR EnsemblFungi; EEQ83098; EEQ83098; NCER_100094. DR GeneID; 9422351; -. DR KEGG; nce:NCER_100094; -. DR EuPathDB; MicrosporidiaDB:NCER_100094; -. DR InParanoid; C4V6Q2; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000009082; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009082}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009082}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 70 92 Helical. FT COILED 97 124 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 279 AA; 32725 MW; D43C076DFB951715 CRC64; MNRKERLKIF NSPERNRNSK NIYQKFPDVS ENSTINNDID DDYNAECVDD SLFTPPDNLL RKIFDNTKRI FSYLKYVIPY GIVFYFTLYY FIFDKRSTNL DTELSDLAKE VDSLKKKNSE MTISAVETPF NLCRIEHCTT LQVNKDDLYK YGFIGFRKSI DPYIIFGENA APGDCTAFLP KKITFTVKFS KNVKLNNFHI FHPETENKQS AIKDFLLSGF YNEEEFEIGG FTYDINKNNY QSFLFPEIIT DRLKITVLNN NGCKKYTTVY KIYAFGVFN // ID C4Y4T1_CLAL4 Unreviewed; 611 AA. AC C4Y4T1; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEQ39941.1}; GN ORFNames=CLUG_04069 {ECO:0000313|EMBL:EEQ39941.1}; OS Clavispora lusitaniae (strain ATCC 42720) (Yeast) (Candida OS lusitaniae). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Metschnikowiaceae; Clavispora. OX NCBI_TaxID=306902 {ECO:0000313|EMBL:EEQ39941.1, ECO:0000313|Proteomes:UP000007703}; RN [1] {ECO:0000313|Proteomes:UP000007703} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 42720 {ECO:0000313|Proteomes:UP000007703}; RX PubMed=19465905; DOI=10.1038/nature08064; RA Butler G., Rasmussen M.D., Lin M.F., Santos M.A.S., Sakthikumar S., RA Munro C.A., Rheinbay E., Grabherr M., Forche A., Reedy J.L., RA Agrafioti I., Arnaud M.B., Bates S., Brown A.J.P., Brunke S., RA Costanzo M.C., Fitzpatrick D.A., de Groot P.W.J., Harris D., RA Hoyer L.L., Hube B., Klis F.M., Kodira C., Lennard N., Logue M.E., RA Martin R., Neiman A.M., Nikolaou E., Quail M.A., Quinn J., RA Santos M.C., Schmitzberger F.F., Sherlock G., Shah P., RA Silverstein K.A.T., Skrzypek M.S., Soll D., Staggs R., Stansfield I., RA Stumpf M.P.H., Sudbery P.E., Srikantha T., Zeng Q., Berman J., RA Berriman M., Heitman J., Gow N.A.R., Lorenz M.C., Birren B.W., RA Kellis M., Cuomo C.A.; RT "Evolution of pathogenicity and sexual reproduction in eight Candida RT genomes."; RL Nature 459:657-662(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH408079; EEQ39941.1; -; Genomic_DNA. DR RefSeq; XP_002616828.1; XM_002616782.1. DR STRING; 306902.XP_002616828.1; -. DR EnsemblFungi; EEQ39941; EEQ39941; CLUG_04069. DR GeneID; 8496588; -. DR KEGG; clu:CLUG_04069; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; C4Y4T1; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007703; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007703}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007703}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 33 {ECO:0000256|SAM:SignalP}. FT CHAIN 34 611 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002946231. FT TRANSMEM 548 565 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 611 AA; 69318 MW; CC14AC43B640C01D CRC64; MIVHVQSARP IFLDRGMFLL LVLASLALRL TEAIDTIKSE TIHLSNEHET TFSTSNFIGI NSETDGSDGT ENGTYVVTLE SNRISMDTGI ATGTISIGES TSNPLISSLQ DSTANVEEGI QKMSYDNDTA GFPEDMPENS TDECRFMSFE EWKKQKEEEA SYLELSQSVT ETRSSMPNDL ELRTQTEKAE ESKAEEVDQR KIYKDKFNYA SVDCAATIVE TNRDAKGASA ILTEVKDSYL LNKCSTSNKF VVIELCQDIL VTSVVMGNFE LFSSMFKSLR FSVSDRFPVT SGWRELGEFE AQNVRDVQVF PIENPLIWAR YLKIEIQSHY GDEFYCPISI VRVHGTTMME EFKDTEQSSE KDEKQHLAPK KTIVAENFLN YTTDLDTMDD CRVVLPYLAL NEFLKDQNST DGLCEAPLYS NEESTSVAEA SATKTTQESL FRNIVKRLTL LESNASLSLL YVEEQSKLLS DAFTTMERRQ SSRLENVLSR LNQSFVAQVH YLHENLAQIK QESILTIENS QTWINTAIDG LDVRTSQFTR ELRFQRKIII IDTLIILLLV AYVIMSRELF FTEDANLIDS TGSKSTPRLK QFPVHGQKRK HRSKKRSSKF S // ID C4YDF2_CANAW Unreviewed; 455 AA. AC C4YDF2; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEQ42338.1}; GN ORFNames=CAWG_00546 {ECO:0000313|EMBL:EEQ42338.1}; OS Candida albicans (strain WO-1) (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Candida/Lodderomyces clade; Candida. OX NCBI_TaxID=294748 {ECO:0000313|EMBL:EEQ42338.1, ECO:0000313|Proteomes:UP000001429}; RN [1] {ECO:0000313|Proteomes:UP000001429} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=WO-1 {ECO:0000313|Proteomes:UP000001429}; RX PubMed=19465905; DOI=10.1038/nature08064; RA Butler G., Rasmussen M.D., Lin M.F., Santos M.A.S., Sakthikumar S., RA Munro C.A., Rheinbay E., Grabherr M., Forche A., Reedy J.L., RA Agrafioti I., Arnaud M.B., Bates S., Brown A.J.P., Brunke S., RA Costanzo M.C., Fitzpatrick D.A., de Groot P.W.J., Harris D., RA Hoyer L.L., Hube B., Klis F.M., Kodira C., Lennard N., Logue M.E., RA Martin R., Neiman A.M., Nikolaou E., Quail M.A., Quinn J., RA Santos M.C., Schmitzberger F.F., Sherlock G., Shah P., RA Silverstein K.A.T., Skrzypek M.S., Soll D., Staggs R., Stansfield I., RA Stumpf M.P.H., Sudbery P.E., Srikantha T., Zeng Q., Berman J., RA Berriman M., Heitman J., Gow N.A.R., Lorenz M.C., Birren B.W., RA Kellis M., Cuomo C.A.; RT "Evolution of pathogenicity and sexual reproduction in eight Candida RT genomes."; RL Nature 459:657-662(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH672346; EEQ42338.1; -; Genomic_DNA. DR EnsemblFungi; EEQ42338; EEQ42338; CAWG_00546. DR HOGENOM; HOG000093382; -. DR OMA; IDECHFM; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001429; Chromosome 1, Supercontig 1.1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001429}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 398 415 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 455 AA; 52115 MW; 14531CF9CB757AD1 CRC64; MSFEEWKKQK IESNTTTSNN YSMNGSSESK SITPSNHSSV ISTNVTLMEA DGKVYKDKFN FASVDCAATI MKTNAQAKGA SAILKENKDS YLLNECSVKH KYVIIELCQD ILVDSVVIGN FEFFSSIFKD IRISVSDRFP SQNWKELGQF TASNIRDVQT FKIENPLIWA RYLKLEILSH YGNEFYCPIS IVRVHGKTMM DEFKEDEEGN QHMGAIKEEE PPTPQTIEED VLLINQTTLN ECRVRLPHLQ LNEFLKSFNS SNQEFCVPSD AEPQVTTAKT TTAITTQESI YKNIMKRLSL LESNATLSLL YIEEQSKLLS TAFSNLEKRQ TTNFNTLISS VNSTLMNQLM VFKESYYELY EQYGNLFKMQ ENSHRQLLAE TNKKVGLLSS ELTFQKRVSI FNSIIIICLL VYVILTRDVA IEYPEDELNE KSPSPQSKKL SSPFIPIRYK KSKKR // ID C5DCD8_LACTC Unreviewed; 628 AA. AC C5DCD8; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=KLTH0B02244p {ECO:0000313|EMBL:CAR21449.1}; GN OrderedLocusNames=KLTH0B02244g {ECO:0000313|EMBL:CAR21449.1}; OS Lachancea thermotolerans (strain ATCC 56472 / CBS 6340 / NRRL Y-8284) OS (Yeast) (Kluyveromyces thermotolerans). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Lachancea. OX NCBI_TaxID=559295 {ECO:0000313|EMBL:CAR21449.1, ECO:0000313|Proteomes:UP000002036}; RN [1] {ECO:0000313|EMBL:CAR21449.1, ECO:0000313|Proteomes:UP000002036} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 56472 / CBS 6340 / NRRL Y-8284 RC {ECO:0000313|Proteomes:UP000002036}; RX PubMed=19525356; DOI=10.1101/gr.091546.109; RG The Genolevures Consortium; RA Souciet J.-L., Dujon B., Gaillardin C., Johnston M., Baret P.V., RA Cliften P., Sherman D.J., Weissenbach J., Westhof E., Wincker P., RA Jubin C., Poulain J., Barbe V., Segurens B., Artiguenave F., RA Anthouard V., Vacherie B., Val M.-E., Fulton R.S., Minx P., Wilson R., RA Durrens P., Jean G., Marck C., Martin T., Nikolski M., Rolland T., RA Seret M.-L., Casaregola S., Despons L., Fairhead C., Fischer G., RA Lafontaine I., Leh V., Lemaire M., de Montigny J., Neuveglise C., RA Thierry A., Blanc-Lenfle I., Bleykasten C., Diffels J., Fritsch E., RA Frangeul L., Goeffon A., Jauniaux N., Kachouri-Lafond R., Payen C., RA Potier S., Pribylova L., Ozanne C., Richard G.-F., Sacerdot C., RA Straub M.-L., Talla E.; RT "Comparative genomics of protoploid Saccharomycetaceae."; RL Genome Res. 19:1696-1709(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CU928166; CAR21449.1; -; Genomic_DNA. DR RefSeq; XP_002551887.1; XM_002551841.1. DR EnsemblFungi; CAR21449; CAR21449; KLTH0B02244g. DR GeneID; 8290717; -. DR KEGG; lth:KLTH0B02244g; -. DR HOGENOM; HOG000113639; -. DR InParanoid; C5DCD8; -. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000002036; Chromosome B. DR GO; GO:0005825; C:half bridge of spindle pole body; IEA:EnsemblFungi. DR GO; GO:0016021; C:integral component of membrane; IEA:EnsemblFungi. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:EnsemblFungi. DR GO; GO:0005635; C:nuclear envelope; IEA:EnsemblFungi. DR GO; GO:0034399; C:nuclear periphery; IEA:EnsemblFungi. DR GO; GO:0006348; P:chromatin silencing at telomere; IEA:EnsemblFungi. DR GO; GO:0034087; P:establishment of mitotic sister chromatid cohesion; IEA:EnsemblFungi. DR GO; GO:0000741; P:karyogamy; IEA:EnsemblFungi. DR GO; GO:0045141; P:meiotic telomere clustering; IEA:EnsemblFungi. DR GO; GO:0000743; P:nuclear migration involved in conjugation with cellular fusion; IEA:EnsemblFungi. DR GO; GO:0030474; P:spindle pole body duplication; IEA:EnsemblFungi. DR GO; GO:0007129; P:synapsis; IEA:EnsemblFungi. DR GO; GO:0034398; P:telomere tethering at nuclear periphery; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002036}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002036}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 128 148 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 164 195 {ECO:0000256|SAM:Coils}. FT COILED 331 351 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 628 AA; 71294 MW; 83332BA868303A55 CRC64; MDTSTHSRGF YNSKTEDAYN QMLADRRSER SAQISDSDDS VSADFSNEML DFSNSSSHSD LGINEDYEHF KTSLLQDDWG EPLEDDDDED YTEEADQSFI MESDAAEALS RGPDYHIKSF STWIRGSLVP LVASLVVVVL AIMVLAPGGS PQAPEIQNAV PGTNAQLRTQ LNSLYREFQE EKKAAKKDLD NAIKLIILQV EKHMKQLLPR DLGSVNSQLR RLDTQVQDLS QSLSLNNVTE WQESLMHELE RLLPDQIPVV LDNSTNALMV VPELHRYLAE VIPQAVNRTL PRGLAAPFNY DAGQYVREIL RDEYEYVDKS DFLRELDSAL RVNKEDILRE MEARISTLEN VPQQYSNVLQ RKLIHKIYNA NQHQWQDDVD FATVAQGTRI LNHLCSSTFK GHQGIPPNGV SPLDLLADTP AATSTYWLCK DKRATGSCSW AMHFAQPLYL TRISYLHGRF TNNLHLMNSA PKSIAVYVKL QSPPTAEFQR IAAKHGQGAV WDRDNSFVEI GSWDYDVSDA RIRQYFLLPP WFVQCKPQVR SLALVVRSNH GNTHYTALRK FVINAVTAQD LQLSSTYAQQ RQFEIPEYAS PFEDRERSRA SQIAAWQRRG PSDADPAVPS FGQDEYDR // ID C5DK70_LACTC Unreviewed; 701 AA. AC C5DK70; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 32. DE SubName: Full=KLTH0F02244p {ECO:0000313|EMBL:CAR23871.1}; GN OrderedLocusNames=KLTH0F02244g {ECO:0000313|EMBL:CAR23871.1}; OS Lachancea thermotolerans (strain ATCC 56472 / CBS 6340 / NRRL Y-8284) OS (Yeast) (Kluyveromyces thermotolerans). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Lachancea. OX NCBI_TaxID=559295 {ECO:0000313|EMBL:CAR23871.1, ECO:0000313|Proteomes:UP000002036}; RN [1] {ECO:0000313|EMBL:CAR23871.1, ECO:0000313|Proteomes:UP000002036} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 56472 / CBS 6340 / NRRL Y-8284 RC {ECO:0000313|Proteomes:UP000002036}; RX PubMed=19525356; DOI=10.1101/gr.091546.109; RG The Genolevures Consortium; RA Souciet J.-L., Dujon B., Gaillardin C., Johnston M., Baret P.V., RA Cliften P., Sherman D.J., Weissenbach J., Westhof E., Wincker P., RA Jubin C., Poulain J., Barbe V., Segurens B., Artiguenave F., RA Anthouard V., Vacherie B., Val M.-E., Fulton R.S., Minx P., Wilson R., RA Durrens P., Jean G., Marck C., Martin T., Nikolski M., Rolland T., RA Seret M.-L., Casaregola S., Despons L., Fairhead C., Fischer G., RA Lafontaine I., Leh V., Lemaire M., de Montigny J., Neuveglise C., RA Thierry A., Blanc-Lenfle I., Bleykasten C., Diffels J., Fritsch E., RA Frangeul L., Goeffon A., Jauniaux N., Kachouri-Lafond R., Payen C., RA Potier S., Pribylova L., Ozanne C., Richard G.-F., Sacerdot C., RA Straub M.-L., Talla E.; RT "Comparative genomics of protoploid Saccharomycetaceae."; RL Genome Res. 19:1696-1709(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CU928170; CAR23871.1; -; Genomic_DNA. DR RefSeq; XP_002554308.1; XM_002554262.1. DR EnsemblFungi; CAR23871; CAR23871; KLTH0F02244g. DR GeneID; 8292500; -. DR KEGG; lth:KLTH0F02244g; -. DR HOGENOM; HOG000093382; -. DR InParanoid; C5DK70; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002036; Chromosome F. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002036}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002036}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 701 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002950513. FT TRANSMEM 572 589 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 701 AA; 78792 MW; 2D1ED9B00A5386E8 CRC64; MVVIGRILHL LVLFRASACF TSEPQEISTS SSAGQSELSS FLSKESRGTS TTQQSSAEVP QTQYDQHSVS TVSVTSSGNF TQTSSSLDKD TYIENSAYSP PDESGSASRS LQGTKSSLVA IAEAQESDFN ETDTTFLSFD EWKKEKLGEE SLQKAKPPAR VNRPVDSSVY KGEAMGDDFE IDVGLFTSSK QDLNEEPEGK LYKDKFNYAS LDCAATIVKT NSEASGANAV LHENKDKYLL TPCSASNKFV VIELCQDILV EEIVMANYEF FSSTFSKVRF SVSNSYPPKN GWKVLGEFDA ANTRNLQKFG ISNPLIWARY LRVEVLAHHG NEFYCPISVI RVHGKTMMDD FKLDESNSLY SEDAVEQASP EGQLKECRQE KLAPHNLSES MLRECQFPQF PQADNVSILS KLDFLSTQCP AVLPHLKFDQ FLKDINQSVC DTKIHQPQLD ISTSAPSSST EESIFKTIMK RLTLLESNST LSLRYIEEQS MLLSKAFASL ERNQAKKFQS LVQAFNQTIV SNLGDINFFT QQLRESSIKL LEEQKLANDQ FTSETFHRLE SMKKDAIFQR RLSYTMLFAF VILLVYVLLT KEAYIDEYME DDGWYLNSPP LKKFKDNFLR RAGKSTDDRR SLVFCTSRDY SENDEKSDVS ASTSSASLYD EFATKVAHEM DGNVLSYRKP TTSLEEEDID IDEALSEGSS R // ID C5DYY8_ZYGRC Unreviewed; 596 AA. AC C5DYY8; DT 22-SEP-2009, integrated into UniProtKB/TrEMBL. DT 22-SEP-2009, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=ZYRO0F16808p {ECO:0000313|EMBL:CAR28999.1}; GN OrderedLocusNames=ZYRO0F16808g {ECO:0000313|EMBL:CAR28999.1}; OS Zygosaccharomyces rouxii (strain ATCC 2623 / CBS 732 / NBRC 1130 / OS NCYC 568 / NRRL Y-229) (Candida mogii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Zygosaccharomyces. OX NCBI_TaxID=559307 {ECO:0000313|Proteomes:UP000008536}; RN [1] {ECO:0000313|EMBL:CAR28999.1, ECO:0000313|Proteomes:UP000008536} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 2623 / CBS 732 / NBRC 1130 / NCYC 568 / NRRL Y-229 RC {ECO:0000313|Proteomes:UP000008536}; RX PubMed=19525356; DOI=10.1101/gr.091546.109; RG The Genolevures Consortium; RA Souciet J.-L., Dujon B., Gaillardin C., Johnston M., Baret P.V., RA Cliften P., Sherman D.J., Weissenbach J., Westhof E., Wincker P., RA Jubin C., Poulain J., Barbe V., Segurens B., Artiguenave F., RA Anthouard V., Vacherie B., Val M.-E., Fulton R.S., Minx P., Wilson R., RA Durrens P., Jean G., Marck C., Martin T., Nikolski M., Rolland T., RA Seret M.-L., Casaregola S., Despons L., Fairhead C., Fischer G., RA Lafontaine I., Leh V., Lemaire M., de Montigny J., Neuveglise C., RA Thierry A., Blanc-Lenfle I., Bleykasten C., Diffels J., Fritsch E., RA Frangeul L., Goeffon A., Jauniaux N., Kachouri-Lafond R., Payen C., RA Potier S., Pribylova L., Ozanne C., Richard G.-F., Sacerdot C., RA Straub M.-L., Talla E.; RT "Comparative genomics of protoploid Saccharomycetaceae."; RL Genome Res. 19:1696-1709(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CU928178; CAR28999.1; -; Genomic_DNA. DR RefSeq; XP_002497932.1; XM_002497887.1. DR EnsemblFungi; CAR28999; CAR28999; ZYRO0F16808g. DR GeneID; 8205704; -. DR KEGG; zro:ZYRO0F16808g; -. DR InParanoid; C5DYY8; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008536; Chromosome F. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008536}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008536}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 596 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002950668. FT TRANSMEM 549 566 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 134 154 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 596 AA; 67999 MW; 12A3E9B4FCC1C38D CRC64; MWLLWNFLVI LAYSGILICC EDNYSASSSV IVNETQSSGS SCQSPKSLNE DFSTLSSKIG WETPSTSKNV TLSSSSRGGS KSFSSNNDRT SMLMQQQAVK SFGPNAADLK QGTDTSNQTF LSFNEWRLAK INQEVIQEQQ RSKAKEQMES LESEPLGDDM EIELSVFSTT DAIKDKQESE PEGKVYNHKF NFASLDCAAT IVKTNSEASG ATSILTENKD KYLLNPCSAP NKFIIIELCQ DILVEEVALA NFEFFSSTFS RIRLSVSDLY PVAKNGWRVL GEFDAENSRN LQSFPIQNPQ IWARYLRIEI LTHHDKEFYC PVSLVRVHGK TMMDEFKMEN TQELPSNQEN SQEVEEPEDD TSEQCINEII EKCNSWPSID PDNITYLPDL PETFSNCQSK LVPLKFEEFL KELNRSHCLP KNKNNSSTFS PSPAFSTEES IFKNIMKRLT TLESNANLTV LYIEEQSKLL AESFEQMERT QFFNFDNLVS IFNQTIMENL NVLRVFANQL KDQSIRILEE QKLNNDQFTT QNTIKLANLE KELRIQQRFA YTITTGLIAV MVYFIFHRES YLDNHKKSIS TDTQAVEENK EIVDSI // ID C5E0F5_ZYGRC Unreviewed; 669 AA. AC C5E0F5; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=ZYRO0G12298p {ECO:0000313|EMBL:CAR29589.1}; GN OrderedLocusNames=ZYRO0G12298g {ECO:0000313|EMBL:CAR29589.1}; OS Zygosaccharomyces rouxii (strain ATCC 2623 / CBS 732 / NBRC 1130 / OS NCYC 568 / NRRL Y-229) (Candida mogii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Zygosaccharomyces. OX NCBI_TaxID=559307 {ECO:0000313|Proteomes:UP000008536}; RN [1] {ECO:0000313|EMBL:CAR29589.1, ECO:0000313|Proteomes:UP000008536} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 2623 / CBS 732 / NBRC 1130 / NCYC 568 / NRRL Y-229 RC {ECO:0000313|Proteomes:UP000008536}; RX PubMed=19525356; DOI=10.1101/gr.091546.109; RG The Genolevures Consortium; RA Souciet J.-L., Dujon B., Gaillardin C., Johnston M., Baret P.V., RA Cliften P., Sherman D.J., Weissenbach J., Westhof E., Wincker P., RA Jubin C., Poulain J., Barbe V., Segurens B., Artiguenave F., RA Anthouard V., Vacherie B., Val M.-E., Fulton R.S., Minx P., Wilson R., RA Durrens P., Jean G., Marck C., Martin T., Nikolski M., Rolland T., RA Seret M.-L., Casaregola S., Despons L., Fairhead C., Fischer G., RA Lafontaine I., Leh V., Lemaire M., de Montigny J., Neuveglise C., RA Thierry A., Blanc-Lenfle I., Bleykasten C., Diffels J., Fritsch E., RA Frangeul L., Goeffon A., Jauniaux N., Kachouri-Lafond R., Payen C., RA Potier S., Pribylova L., Ozanne C., Richard G.-F., Sacerdot C., RA Straub M.-L., Talla E.; RT "Comparative genomics of protoploid Saccharomycetaceae."; RL Genome Res. 19:1696-1709(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CU928179; CAR29589.1; -; Genomic_DNA. DR RefSeq; XP_002498522.1; XM_002498477.1. DR EnsemblFungi; CAR29589; CAR29589; ZYRO0G12298g. DR GeneID; 8206335; -. DR KEGG; zro:ZYRO0G12298g; -. DR HOGENOM; HOG000113639; -. DR InParanoid; C5E0F5; -. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000008536; Chromosome G. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008536}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008536}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 166 187 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 669 AA; 77101 MW; 73C57270DFB6FDAB CRC64; MDKDYSEDYL YNKSLSSAYR DLLMKQMGRQ NNSMAVEGDG SEGDLEDMSN YEEYDVGLDQ RNKDDNDPED DSEYYRKFKK SLVLDDDEGV SEMDDAWLDD NSTVDGDYTD EADRSFYDGD HQTEEGVDND DDDDDDEDDE DLEDDEEEYL EDEEEHMANR YSIRRLICWI TLGMVLFFAS PLASSGLKLL GSSTTPIAGT PTGTTSLQRQ INHLYNELDH RDDRYKSDFD KTIKVVISQF EKNIKKLLPS NFNYLQTQLE SLTNRVNNIS IALSRSVYPQ FSMDNVTEWQ QQLVRELEAQ LPQEIPVVVD NTSSVLVIPE LHNYLTRLTS SLIENSHPLE QPLEYDLSLY VKEILANQFQ YVDKDFFIKE LNRKLQLNKQ EIWQEMAGKF DQWKMENNPN NGVPQQYSTI LLKKLINQIY NSNQHQWEDD LDFATFSQGT KLLNHLTSST LKQGNGVNPM ELLQESKYGP STYWQCASPK GCSWAIRFKE PVYLTRLFYS HGRFRNNLQM MNSAPKTISV YVRLATGDAS KKLLGLASSF KMGTRFAGDN QHILIGQYNY RLTDNRIRQP LPLPTWFIQL KPLVRSVVFQ VDENHGNKQF TSLRKFIVNA VTQQDLQIME SNAFPLISND PEYASSSTSA PQIEDKPRVA IAHDNDQGIP SFGQDELDE // ID C5FK80_ARTOC Unreviewed; 598 AA. AC C5FK80; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEQ30102.1}; GN ORFNames=MCYG_02921 {ECO:0000313|EMBL:EEQ30102.1}; OS Arthroderma otae (strain ATCC MYA-4605 / CBS 113480) (Microsporum OS canis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Arthroderma. OX NCBI_TaxID=554155 {ECO:0000313|Proteomes:UP000002035}; RN [1] {ECO:0000313|Proteomes:UP000002035} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4605 / CBS 113480 {ECO:0000313|Proteomes:UP000002035}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS995703; EEQ30102.1; -; Genomic_DNA. DR RefSeq; XP_002847415.1; XM_002847369.1. DR EnsemblFungi; EEQ30102; EEQ30102; MCYG_02921. DR GeneID; 9224861; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000002035; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002035}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002035}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 332 352 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 598 AA; 65008 MW; BED8E1A50C81A7E9 CRC64; MAPARRARRA VTPTTANEAE NPYLPSIETQ QSFSYGSSTP VLPRQLGTIA SANAEDVAAT LDAAVRPRVT ESAGFSQIED EARKSPEKQR FTRGRRRAES VLSARESLSP VRETARRLTP DNQLMSTLRE ASGEPEDQFG AVDPLADAME GSSISWNTER HLLATEQPHV SRHPASSHSQ LQGAAKPSLG WPRPARRVDP PPVPASPSRA SSTSVRWPQP SPGAVEFSPR RNSSSQNQIQ GPAQRSRATA ERIERGIAIG APVGVLPSSS SSRPEVAAPA LRRPTATTRP TFRDDTPDTP QSGHTPASSR HASPSPTMPA GVSLLSRPTG SISHVILLLL LTLIVAFNAY LVRHEIRAVA QSVVQSPISK RPATPIHNYT EAMDKILSTV DRRLTSMTHE IAVLKEEASN RPPSPPPPTD PLVPRRVNFF ALGTGALWDD VGDCWCAATT NGKAQLAVQL GRPIVPEEVI IEHIPREATL DPGSAPQTME LWVEYTLLAA GRETTMSAVR RSMLETLAMV YPGEHPSAYS DDLALGPSFF RIGTWRYDIN AGHHVQRFPL EAVVDLPGAR VSRAVVRATS NWGNQYTCLY RVRLYGRL // ID C5FW10_ARTOC Unreviewed; 896 AA. AC C5FW10; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EEQ34094.1}; GN ORFNames=MCYG_06913 {ECO:0000313|EMBL:EEQ34094.1}; OS Arthroderma otae (strain ATCC MYA-4605 / CBS 113480) (Microsporum OS canis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Arthroderma. OX NCBI_TaxID=554155 {ECO:0000313|Proteomes:UP000002035}; RN [1] {ECO:0000313|Proteomes:UP000002035} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4605 / CBS 113480 {ECO:0000313|Proteomes:UP000002035}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS995706; EEQ34094.1; -; Genomic_DNA. DR RefSeq; XP_002844949.1; XM_002844903.1. DR STRING; 554155.XP_002844949.1; -. DR EnsemblFungi; EEQ34094; EEQ34094; MCYG_06913. DR GeneID; 9228174; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002035; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002035}; KW Reference proteome {ECO:0000313|Proteomes:UP000002035}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 32 {ECO:0000256|SAM:SignalP}. FT CHAIN 33 896 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002951607. SQ SEQUENCE 896 AA; 99567 MW; F942CB31438A5ADC CRC64; MNWYRPSPLW RKRPRTDRIT SAFLAFLTVC SAETVADSHT APIESRLMAV DKSNTVCQAR MTGDLETRWA VGGAMRHATG VDTTATIGSA GIYLDAREMT VTVPGYTGSS MPEEGGGGGG DSGRIKGKED DADIESPLDN SHFLSFEEWK NQNLARAGQS AEHMRRHRPG SGRSGEHGGQ VRRRPTRPSQ MHDPLDGLGE ESEIDLEFGG FSTDESGVAS WDRKESGIPP DMGSTTTGGD DQGKKLSQPA FELDGQDAEN IPRKGIGRRK HAGTTCKERF NYASFDCAAT VLKTNPQCTG SSAVLNENKD SYMLSECRAK EKFLIMELCD DILVDTVVLA NYEFFSSIFR SFRVSVSDRY PIKADKWRVL GTYEAANARQ VQAFAVENPL IWARYLKIEF LSHYGNEFYC PVSLVRVHGT TMMEEYKNDG EATRADEEED ANAQLEEPQQ QPQREQVEAV VHENVSIPES TVDAQLVPLS NLSNHELTEL RCFVERNETE SILLGLVSGK MCAIQERAAR MERQPVIATH VRDDTAAPAS GSISSANTLE QIRSIPSTRV WTASDREETR RSSTGSAVTA DGSHTEPTRM NSAANPPPPS SPPPNPSTQE SFFKSVNKRL QMLESNSTLS LLYIEEQSRI LRDAFNKVEK RQLAKTSAFL ENLNSTVLQE LKEFRQQYDH LWHSVFIEFE QQRQQYHREV FSVAAQLGVL ADELVFQKRV AVIQSIFVLV CFGLVLFSRS SGAPYFEFPR NIVTRTRSFR SSSPTYDSPA PSASPSPPPM SRMGSSILSR DETVRHHHQS HSRHHRSPSE QTDYEVENPT FAYSPPTPTS RTTTPDRSGK HHFSLEPHEG LAASATSGIA SLSDPDLRLR QRAAKDIAEK HESESDDGQA EGHSFS // ID C5GAW0_AJEDR Unreviewed; 878 AA. AC C5GAW0; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 16-SEP-2015, entry version 16. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EEQ87114.1}; GN ORFNames=BDCG_02234 {ECO:0000313|EMBL:EEQ87114.1}; OS Ajellomyces dermatitidis (strain ER-3 / ATCC MYA-2586) (Blastomyces OS dermatitidis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Blastomyces. OX NCBI_TaxID=559297 {ECO:0000313|Proteomes:UP000002039}; RN [1] {ECO:0000313|Proteomes:UP000002039} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ER-3 / ATCC MYA-2586 {ECO:0000313|Proteomes:UP000002039}; RA Champion M., Cuomo C.A., Ma L.-J., Henn M.R., Klein B., Goldman B., RA Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L., RA Berlin A.M., Heiman D.I., Hepburn T.A., Saif S., Shea T.D., Shenoy N., RA Sykes S., Galagan J.E., Nusbaum C., Birren B.W.; RT "The genome sequence of Blastomyces dermatitidis strain ER-3."; RL Submitted (FEB-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EQ999974; EEQ87114.1; -; Genomic_DNA. DR EnsemblFungi; EEQ87114; EEQ87114; BDCG_02234. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002039; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002039}; KW Reference proteome {ECO:0000313|Proteomes:UP000002039}. SQ SEQUENCE 878 AA; 95886 MW; 0928F0BE7D665DDD CRC64; MAWHYTLFPR GAHTKCRATD RLLYFWAIAL LAAVHAGGNL GFEQPNISSR ALDATCPNRD LSDIQLEYMQ NPVCLEPKWA GIGHVENYTS NSSSGGADID ASIPPGASPS PSSTVTATTS GSSGLDQDLD TESPLDNANF LSFEEWKKQN LAKVGQSVEH VRGDRQGAGS EASGRRQRPM GIDNSLDSLG EDGEIALEFG GFGPENSGPA SWERKVGKGQ APHADGAESA TRGAEGETQI ETTTRGGVSK RKDAGTTCKE RFNYASFDCA ATVLKTNPQC SGASSVLTEN KDNYMLNECR ARDKFLIVEL CDDILIDTIV LANYEFFSSI FRTFRVSVSD RYPPKQPDMW RELGTYEAVN SREVQAFAVE NPLIWARYVK IEFLTHYGNE FYCPVSLIRV HGTTMLEEYK NDGEASRLED HNSAQIQGNV ASESGPDNPT ANQSKVVGKE SDGSTGAGGF DVQPTRVQKP EDICLPKADI GAILSRSLAG EEDGVCLIKE APRAHNQSMD AVQSASVQAH GPAKVAEDAT PITPSAESSS NVASPTQKTT PTVTDSRAQN PANESQHATS TSTHKTEYGG SSESSKPSTT VQQHQPNPTT QESFFKSVNK RLHMLETNSS LSLQYIEEQS RILRDAFSKV EKRQLAKTTT FLENLNTSVL QELREFRHQY DQVWHSVAVE FEQQRLHYHQ EVFAMSSQLG ILADELLFQK RISIIQSVFV LICFGLVLFS RSSIANYLEL PRVHTMVSRS QSYRSSTHSF ESPSASPSSR PNSSYRDSSK DASNTSHRRT HSVESVEDDL AVNPTIAYSP PTPTSDSDDH GLHRQGSQRS DSTASSIMVT QPPQLLRSES SPPDLRGPSE GSEGSEGRSL LEAPQVSS // ID C5GIY0_AJEDR Unreviewed; 717 AA. AC C5GIY0; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEQ89481.1}; GN ORFNames=BDCG_04601 {ECO:0000313|EMBL:EEQ89481.1}; OS Ajellomyces dermatitidis (strain ER-3 / ATCC MYA-2586) (Blastomyces OS dermatitidis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Blastomyces. OX NCBI_TaxID=559297 {ECO:0000313|Proteomes:UP000002039}; RN [1] {ECO:0000313|Proteomes:UP000002039} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ER-3 / ATCC MYA-2586 {ECO:0000313|Proteomes:UP000002039}; RA Champion M., Cuomo C.A., Ma L.-J., Henn M.R., Klein B., Goldman B., RA Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L., RA Berlin A.M., Heiman D.I., Hepburn T.A., Saif S., Shea T.D., Shenoy N., RA Sykes S., Galagan J.E., Nusbaum C., Birren B.W.; RT "The genome sequence of Blastomyces dermatitidis strain ER-3."; RL Submitted (FEB-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EQ999976; EEQ89481.1; -; Genomic_DNA. DR EnsemblFungi; EEQ89481; EEQ89481; BDCG_04601. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000002039; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002039}; KW Reference proteome {ECO:0000313|Proteomes:UP000002039}. FT COILED 163 184 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 717 AA; 77866 MW; F77FF43B457E9E9B CRC64; MTGRRATSLR SGSRAQSTRP TRAAATANAA TATQADQSNP DLGNPSLPDV RTQQSFAYGS SKTPALPRQL EVDPSMGLSE MVDTLDDGLR QAQDRELARV EDPRNPSPER RQTRSMSLSM RSSMSPAPEP ASRRTPSRRT TATRGRAGSR RAASRQPTPE GQLLESLREV SEETENVKQE EEEAYTSTLP DTPSFNDSAS ISWTTERAIH GTLPREVNTG TRPNYYLRDP YGSRPSSSQG PSGLSFPPTR RPIFEESFPA NPHLSGPVDA SRAAAPTAVR RTLPPVPAFN QLRDEPRSKS TTSSTSSASN HTPSSSTHSS PVFVAATPAA ANVTSSQKRL AGIAKTPSAI LVIIGLFLTT FLAYFCRNHA CAFPHSLQTT MSHYLCRPTS SFTMDNSTSM YADAFHKLSS HVDQRLLDMA KDVATLKNEW NRRLPHLKEA LSRSPAAATD PLAPPKVNYA SVGMGAVVDP YLTSPTMSTS AGLVSRIGQY LAKVPRGSPP VAALQPWDGV GECWCAATRS NVSQLTILLG RPIVPEEVVV EHIPKGATLD PGSAPREMEL WAQYTARQPA AAAAAYPPGS SSSNPPSSPS SAPGRPPPPP YTPPYLRSPP IAHLHPSHSL HYLLPSRLRD AILTTLRQVY PDEPTTAYSE DALLGPSFFR VGRWQYNIHG DHHIQRFELD AVIDMPAARV EKVVFRVKSN WGAAHTCLYR VRLHGHL // ID C5JG04_AJEDS Unreviewed; 717 AA. AC C5JG04; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEQ73355.1}; GN ORFNames=BDBG_01585 {ECO:0000313|EMBL:EEQ73355.1}; OS Ajellomyces dermatitidis (strain SLH14081) (Blastomyces dermatitidis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Blastomyces. OX NCBI_TaxID=559298 {ECO:0000313|Proteomes:UP000002038}; RN [1] {ECO:0000313|Proteomes:UP000002038} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=SLH14081 {ECO:0000313|Proteomes:UP000002038}; RA Champion M., Cuomo C.A., Ma L.-J., Henn M.R., Klein B., Goldman B., RA Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L., RA Berlin A.M., Heiman D.I., Hepburn T.A., Saif S., Shea T.D., Shenoy N., RA Sykes S., Galagan J.E., Nusbaum C., Birren B.W.; RT "The genome sequence of Blastomyces dermatitidis strain SLH14081."; RL Submitted (FEB-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG657449; EEQ73355.1; -; Genomic_DNA. DR RefSeq; XP_002628677.1; XM_002628631.1. DR EnsemblFungi; EEQ73355; EEQ73355; BDBG_01585. DR GeneID; 8507162; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000002038; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002038}. FT COILED 163 184 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 717 AA; 77866 MW; 4EC00F508FCA47B5 CRC64; MTGRRATSLR SGSRAQSTRP TRAAATANAA TATQADQSNP DLGNPSLPDV RTQQSFAYGS TKTPALPRQL EVDPSMGLSE MVDTLDDGLR QAQDRELARV EDPRNPSPER RQTRSMSLSM RSSMSPAPEP ASRRTPSRRT TATRGRAGSR RAASRQPTPE GQLLESLREV SEETENVKQE EEEAYTSTLP DTPSFNDSAS ISWTTERAIH GTLPREVNTG TRPNYYLRDP YGSRPSSSQG PSGLSFPPTR RPIFEESFPA NPHLSGPVDA SRAAAPTAVR RTLPPVPAFN QLRDEPRSKS TTSSTSSASN HTPSSSTHSS PVFVAATPAA ANVTSSQKRL AGIAKTPSAI LVIIGLFLTT FLAYFCRNHA CAFPHSLQTT MSHYLCRPTS SFIMDNSTSM YADAFHKLSS HVDQRLSDMA KDVATLKNEW NRRLPHLKEA LSRSPAAATD PLAPPKVNYA SVGMGAVVDP YLTSPTMSTS AGLVSRIGQY LAKVPRGSPP VAALQPWDGV GECWCAATRS NVSQLTILLG RPIVPEEVVV EHIPKGATLD PGSAPREMEL WAQYTARQPA AAAAAYPPGS SSSNPPSSPS SAPGRPPPPP YTPPYLRSPP IAHLHPSHSL HYLLPSRLRD AILTTLRQVY PDEPTTAYSE DALLGPSFFR VGRWQYNIHG DHHIQRFELD AVIDMPAARV EKVVFRVKSN WGAAHTCLYR VRLHGHL // ID C5JM66_AJEDS Unreviewed; 878 AA. AC C5JM66; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EEQ77334.1}; GN ORFNames=BDBG_03660 {ECO:0000313|EMBL:EEQ77334.1}; OS Ajellomyces dermatitidis (strain SLH14081) (Blastomyces dermatitidis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Blastomyces. OX NCBI_TaxID=559298 {ECO:0000313|Proteomes:UP000002038}; RN [1] {ECO:0000313|Proteomes:UP000002038} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=SLH14081 {ECO:0000313|Proteomes:UP000002038}; RA Champion M., Cuomo C.A., Ma L.-J., Henn M.R., Klein B., Goldman B., RA Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L., RA Berlin A.M., Heiman D.I., Hepburn T.A., Saif S., Shea T.D., Shenoy N., RA Sykes S., Galagan J.E., Nusbaum C., Birren B.W.; RT "The genome sequence of Blastomyces dermatitidis strain SLH14081."; RL Submitted (FEB-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG657453; EEQ77334.1; -; Genomic_DNA. DR RefSeq; XP_002625601.1; XM_002625555.1. DR STRING; 559298.XP_002625601.1; -. DR EnsemblFungi; EEQ77334; EEQ77334; BDBG_03660. DR GeneID; 8505094; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002038; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002038}. SQ SEQUENCE 878 AA; 95891 MW; 0EDC884DF28B7A9A CRC64; MAWHYTLFPR GAHTKCRATD RLLYFWAIAL LAAVHAGGNL GFEQPNISSR ALDATCPNRD LSDIQLEYMQ YPVCLEPKWA GIGHVENYTS NSSSGGADID ASIPPGASPS PSSTVTATTS GSSGLDQDLD TESPLDNANF LSFEEWKKQN LAKVGQSVDH VRGDRQGAGS EASGRRQRPM GIDNSLDSLG EDGEIALEFG GFGPENSGPA SWERKVGKGQ APHADGAESA TRGAEGETQI ETTTRGGVSK RKDAGTTCKE RFNYASFDCA ATVLKTNPQC SGASSVLTEN KDNYMLNECR ARDKFLIVEL CDDILIDTIV LANYEFFSSI FRTFRVSVSD RYPPKQPDMW RELGTYEAVN SREVQAFAVE NPLIWARYVK IEFLTHYGNE FYCPVSLIRV HGTTMLEEYK NDGEASRLED HNSAQIQGNV ASESGPDNPA ANQSKVVGKE SDGSTGAGGF DVQPTRVQKP EDICLPKADI GAILSRSLAG EEDGVCLIKE APRAHNQSMD AVQSASVQAH GPAKVAEDAT PITPSAESSS NVASPTQKTT PTVTDSRAQN PANESQHATS TSTHKTEYGG SSESSKPSTT VQQHQPNPTT QESFFKSVNK RLHMLETNSS LSLQYIEEQS RILRDAFSKV EKRQLAKTTT FLENLNTSVL QELREFRHQY DQVWHSVAVE FEQQRLHYHQ EVFAMSSQLG ILADELLFQK RISIIQSVFV LICFGLVLFS RSSIANYLEL PRVHTMVSRS QSYRSSTHSF ESPSASPSSR PNSSYRDSSK DASNTSHRRT HSVESVEDDL AVNPTIAYSP PTPTSDSDDH GLHRQGSQRS DSTASSIMVT QPPQLLRSES SPPDLRGPSE GSEGSEGRSL LEAPQVSS // ID C5KLX0_PERM5 Unreviewed; 408 AA. AC C5KLX0; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EER14485.1}; GN ORFNames=Pmar_PMAR024372 {ECO:0000313|EMBL:EER14485.1}; OS Perkinsus marinus (strain ATCC 50983 / TXsc). OC Eukaryota; Alveolata; Perkinsea; Perkinsida; Perkinsidae; Perkinsus. OX NCBI_TaxID=423536 {ECO:0000313|Proteomes:UP000007800}; RN [1] {ECO:0000313|EMBL:EER14485.1, ECO:0000313|Proteomes:UP000007800} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 50983 / TXsc {ECO:0000313|Proteomes:UP000007800}; RA El-Sayed N., Caler E., Inman J., Amedeo P., Hass B., Wortman J.; RL Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG674222; EER14485.1; -; Genomic_DNA. DR RefSeq; XP_002782690.1; XM_002782644.1. DR STRING; 423536.XP_002782690.1; -. DR EnsemblProtists; EER14485; EER14485; Pmar_PMAR024372. DR GeneID; 9045016; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; C5KLX0; -. DR Proteomes; UP000007800; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007800}; KW Reference proteome {ECO:0000313|Proteomes:UP000007800}. FT COILED 274 301 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 408 AA; 45703 MW; 8535B8061788745B CRC64; MQRKDRLFIR GAPYTNSQTR TLVDHASVNG GAKLLGAADG LSHPSDVLNG DDGKYMMCQC DLRKKWITFA LDDDTYVEKI ALDTKEYFSS TFRHLQILGS RKYPTDTWRV LGEIETDPTE TQQWFDLSHT SRCAKCYVKY IKIRVLTSHT MEGYAMCTLT RVQIFGSTMI QSIGKLQKRY EVQKHPAAFI SAEKMELATE QSLKSLTDCS ALNPNGTSIS SQYDNFRFPG AKASNVEQSG STSKPWKVGN SSYNDGGGRD HNDTVENVAE GPPLLRFIEE MTELEANYHN LATNVNTLLA DLKYHEQDIS EMKRRTGHGH SEFSDDGLLP DLFTSLGSDT IIKLLLYMVV LLSLSQMYLA YRVFAGGPVV AYKAGNVSPN GKPRKHRRRN RKRIIIPSPI PSDGFASA // ID C5KLX2_PERM5 Unreviewed; 410 AA. AC C5KLX2; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EER14487.1}; GN ORFNames=Pmar_PMAR024374 {ECO:0000313|EMBL:EER14487.1}; OS Perkinsus marinus (strain ATCC 50983 / TXsc). OC Eukaryota; Alveolata; Perkinsea; Perkinsida; Perkinsidae; Perkinsus. OX NCBI_TaxID=423536 {ECO:0000313|Proteomes:UP000007800}; RN [1] {ECO:0000313|EMBL:EER14487.1, ECO:0000313|Proteomes:UP000007800} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 50983 / TXsc {ECO:0000313|Proteomes:UP000007800}; RA El-Sayed N., Caler E., Inman J., Amedeo P., Hass B., Wortman J.; RL Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG674222; EER14487.1; -; Genomic_DNA. DR RefSeq; XP_002782692.1; XM_002782646.1. DR STRING; 423536.XP_002782692.1; -. DR EnsemblProtists; EER14487; EER14487; Pmar_PMAR024374. DR GeneID; 9045019; -. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; C5KLX2; -. DR Proteomes; UP000007800; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007800}; KW Reference proteome {ECO:0000313|Proteomes:UP000007800}. FT COILED 238 258 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 410 AA; 45813 MW; 1FAFC2F42FDA0A63 CRC64; MCQCDLRKKW ITFALDDDTY VEKIALDTKE YFSSTFRHLQ ILGSRKYPTD TWRVLGEIET DPTETQQWFD LSHTSRCAKC YVKYIKIRVL TSHTMEGYGI CTLTRLQLFG GTLLQNLHRL QRKYEAPKHA AASIDASALI KSTSAMLESL SMSTSRPVDA NPTEEQTLSA VETCPEKRDS HDELLKSPTV EVGKPSQGED DESQSFNNRD APSPPPLEAG IEAAISEGPP LLRFVEEMSA LETNYHQLSA KVEELIGKIR ERDKASLHNT QGVLDVGAHP YKTAAPHPHA SPLYVSWASR WINDNGSFLR DVILAVAVIS TLYSMYRINV SASRHNSSPS RWTPIGMREE VSDPGKRRWK SLSSDFIEIL ETASSSKLNE AIIEDEASRV PTRTVYLRSC SVCADTAPGG // ID C5MAZ0_CANTT Unreviewed; 587 AA. AC C5MAZ0; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EER32807.1}; GN ORFNames=CTRG_03232 {ECO:0000313|EMBL:EER32807.1}; OS Candida tropicalis (strain ATCC MYA-3404 / T1) (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Candida/Lodderomyces clade; Candida. OX NCBI_TaxID=294747 {ECO:0000313|Proteomes:UP000002037}; RN [1] {ECO:0000313|Proteomes:UP000002037} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-3404 / T1 {ECO:0000313|Proteomes:UP000002037}; RX PubMed=19465905; DOI=10.1038/nature08064; RA Butler G., Rasmussen M.D., Lin M.F., Santos M.A.S., Sakthikumar S., RA Munro C.A., Rheinbay E., Grabherr M., Forche A., Reedy J.L., RA Agrafioti I., Arnaud M.B., Bates S., Brown A.J.P., Brunke S., RA Costanzo M.C., Fitzpatrick D.A., de Groot P.W.J., Harris D., RA Hoyer L.L., Hube B., Klis F.M., Kodira C., Lennard N., Logue M.E., RA Martin R., Neiman A.M., Nikolaou E., Quail M.A., Quinn J., RA Santos M.C., Schmitzberger F.F., Sherlock G., Shah P., RA Silverstein K.A.T., Skrzypek M.S., Soll D., Staggs R., Stansfield I., RA Stumpf M.P.H., Sudbery P.E., Srikantha T., Zeng Q., Berman J., RA Berriman M., Heitman J., Gow N.A.R., Lorenz M.C., Birren B.W., RA Kellis M., Cuomo C.A.; RT "Evolution of pathogenicity and sexual reproduction in eight Candida RT genomes."; RL Nature 459:657-662(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG692398; EER32807.1; -; Genomic_DNA. DR RefSeq; XP_002548935.1; XM_002548889.1. DR STRING; 294747.XP_002548935.1; -. DR EnsemblFungi; EER32807; EER32807; CTRG_03232. DR GeneID; 8295954; -. DR KEGG; ctp:CTRG_03232; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002037; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002037}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002037}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 509 526 Helical. FT COILED 302 326 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 587 AA; 66679 MW; EC2FD1F0A5804EB9 CRC64; MRCSWYRIVD ILGLVTLVYL PILVKASENA SFSEVVVELV QPVDFSPVIP VFVLSNDTIP ASNELQLLLS QSPATSYPSG TSQEQPRNDS VLDDCHFMSF EEWKKQKIIE SNNTPPSSHS NQSINNTDIK LVNTTIHKNS TASSTNITLV EADGKVYKDK FNFASVDCAA TIVKTNAKAK GASAILKENK DSYLLNECSV PNKYVIIELC QDILVDSVVI GNFEFFSSMF KDIRVSVSDR FPSQAWKELG QFTAENIRDV QSFKIQNPLI WARYLKLEIL SHYGNEYYCP ISIVRVHGKT MMDEFKEDEE KSKENLERKE VEGEEEVATP QTIENEDILL INQSTLNECR VIMPHLQLNE FLKDLNTTET ESCVVVTSDS CDPQASTTQV SSATVATQES IYKNIMKRLA LLESNATLSL LYIEEQSKLL STAFANLEKR QSSNFNSLIS SVNVTLINQL NSFKESYNNL HDQYNNLFKI QQNSHRQIML GTNKKINQLV NDLTFQKRVS IFNTIIIICL LVYVILTRDV DIEIQEDHDI EEDEEVLVTD EKEMDYPEET PSKQETVSLS SPFNPQKKKS KKQRQKS // ID C5P6C9_COCP7 Unreviewed; 663 AA. AC C5P6C9; DT 01-SEP-2009, integrated into UniProtKB/TrEMBL. DT 01-SEP-2009, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EER26979.1}; GN ORFNames=CPC735_023150 {ECO:0000313|EMBL:EER26979.1}; OS Coccidioides posadasii (strain C735) (Valley fever fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; mitosporic Onygenales; Coccidioides. OX NCBI_TaxID=222929 {ECO:0000313|EMBL:EER26979.1, ECO:0000313|Proteomes:UP000009084}; RN [1] {ECO:0000313|EMBL:EER26979.1, ECO:0000313|Proteomes:UP000009084} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C735 {ECO:0000313|Proteomes:UP000009084}; RX PubMed=19717792; DOI=10.1101/gr.087551.108; RA Sharpton T.J., Stajich J.E., Rounsley S.D., Gardner M.J., RA Wortman J.R., Jordar V.S., Maiti R., Kodira C.D., Neafsey D.E., RA Zeng Q., Hung C.-Y., McMahan C., Muszewska A., Grynberg M., RA Mandel M.A., Kellner E.M., Barker B.M., Galgiani J.N., Orbach M.J., RA Kirkland T.N., Cole G.T., Henn M.R., Birren B.W., Taylor J.W.; RT "Comparative genomic analyses of the human fungal pathogens RT Coccidioides and their relatives."; RL Genome Res. 19:1722-1731(2009). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EER26979.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFW01000025; EER26979.1; -; Genomic_DNA. DR RefSeq; XP_003069124.1; XM_003069078.1. DR UniGene; Cpo.1785; -. DR EnsemblFungi; EER26979; EER26979; CPC735_023150. DR GeneID; 9694619; -. DR KEGG; cpw:CPC735_023150; -. DR EuPathDB; FungiDB:CPC735_023150; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000009084; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009084}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 298 320 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 148 168 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 663 AA; 72633 MW; 0E99F8942630407E CRC64; MTGARRTRSG RSISQEPTGQ GHRTRKTPGP VDSGDSAAIP TTSFGSPSLQ ALQAQHSFAY GATGSPALPR QLRMCPPTGA TEMAANIEGR YLETHANDFE RIEEEARANP GTRRSTRSGA NTGSARQSVS PVRRTPGQRA RNREPTPDDQ LLESLREASE EAEETKETIL PSIEDSSVSW NTERHILGDP RSVPTATSSG GSLSQSQRQR EAAHPMAGPP LRPYTRSQPR TTFQASQAVR SGSSAASSAQ PAPRLGAAPV LSPPQAEDNA YATPNPRRQT RSVSQQTMSS TASQRQRGFS VISLGLMITI FMLAVAGMLF RFDDIEMIGK NILQNGIGKE FSLPSSFCGA QPPTSQYIEA FDKLSAGVDR RLADMARDVA TLKDEWNRRL PHLKQAIWPE MEDPLLPRKI NWFSVGMGAF VDPYLTTKHR SGLLHRGAER AAGMRKTNPP VAALSRWEEH GDCWCVNDHT SEIQLAVLLG RPLVPEEVVI EHIQKEATLD PESAPREMEL WVEYVARSHA AAPSTTLPGF RATGAPGRSD QTATSSPVST RRPELLESSA EARAAFAGPL SPSQHEDIIS TLRMAYPDEP ETAYSHDTML GSSFYRIGKF QYDINGKHNI QKFHLDAVID LPNIRTKKAV LRVKSNWGSV NTCVYRVRLH GHM // ID C5P767_COCP7 Unreviewed; 856 AA. AC C5P767; DT 01-SEP-2009, integrated into UniProtKB/TrEMBL. DT 01-SEP-2009, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EER27267.1}; GN ORFNames=CPC735_026030 {ECO:0000313|EMBL:EER27267.1}; OS Coccidioides posadasii (strain C735) (Valley fever fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; mitosporic Onygenales; Coccidioides. OX NCBI_TaxID=222929 {ECO:0000313|EMBL:EER27267.1, ECO:0000313|Proteomes:UP000009084}; RN [1] {ECO:0000313|EMBL:EER27267.1, ECO:0000313|Proteomes:UP000009084} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C735 {ECO:0000313|Proteomes:UP000009084}; RX PubMed=19717792; DOI=10.1101/gr.087551.108; RA Sharpton T.J., Stajich J.E., Rounsley S.D., Gardner M.J., RA Wortman J.R., Jordar V.S., Maiti R., Kodira C.D., Neafsey D.E., RA Zeng Q., Hung C.-Y., McMahan C., Muszewska A., Grynberg M., RA Mandel M.A., Kellner E.M., Barker B.M., Galgiani J.N., Orbach M.J., RA Kirkland T.N., Cole G.T., Henn M.R., Birren B.W., Taylor J.W.; RT "Comparative genomic analyses of the human fungal pathogens RT Coccidioides and their relatives."; RL Genome Res. 19:1722-1731(2009). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EER27267.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFW01000025; EER27267.1; -; Genomic_DNA. DR RefSeq; XP_003069412.1; XM_003069366.1. DR UniGene; Cpo.584; -. DR STRING; 222929.XP_003069412.1; -. DR EnsemblFungi; EER27267; EER27267; CPC735_026030. DR GeneID; 9694907; -. DR KEGG; cpw:CPC735_026030; -. DR EuPathDB; FungiDB:CPC735_026030; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000009084; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009084}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 31 {ECO:0000256|SAM:SignalP}. FT CHAIN 32 856 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5002953767. SQ SEQUENCE 856 AA; 95287 MW; 4BDDA7E915FC89D3 CRC64; MRGRWSFVCF DIDFSVLNVL ILLFLLPLLV AENGDFQGRR HQPQPGGDVW SMDHGYAVSC PVRDFVEVQA EYVRYPVCLG SRRASSAPDG MTESASVATE RTSEAPTASS AETVSTPKVE SELDTESPLD NEKFLSFEEW KKKNLAKIGQ SVDNVRGNRQ AVGSTEMRKR SRPGEISNAL DSLGEEGEIE LGFGGFGPGD SDIPPVEKKD AQSASSSVNG EKHVTKGTEG ESQSDGVPRR GIARRKDAGV TCKERFNYAS FDCAATVLKT NRECTGSSSI LIENKDSYML NECRAKDKFI ILELCDDILV DTLVLANYEF FSSIFRTFRV SVADRYPAKP DKWKELGTYE AANTREIQAF AVENPLIWAR YLKIEFFSHY GNEFYCPLSL VRVHGTTMME EYKNYGDSAR AEEEAVEAVV QAQQNPDSVP TMKNSNQTQR EIRDQNVNIS ITQTGSGTLP DEEALGASCF PQINEIERLL LGMSSDNMSS IYDMALDPDY QSEAHESAES ETWASNATGS IGLEDTSVSD TPPTMVGGSD HQRATPGSRM VSTSGSSRSE NETSADNQRT PVVSQPPPPN PTTQESFFKS VHKRLQMLET NSTLSLLYIE EQSRILRDAF NKVEKRQLAK TSSFLENLNS TVLHELREFR QQYDHIWHSV VIEFEQQRQQ YHHELFAVTS QLAILADEVV FQKRVSIIQS VFVLLSFGLV LFSRSAVGSY LEFPKMQSMV SRSHSFRSAS PPYETPSPSP NSPMQSPTYQ EGNLHRRNPS DDQTDCEICN HTFPYSPPPS SDTLSPSEEE EKGLHDVHLE YSRSTASNLV PEENPAGIKR QRSSPADLCG HDEGDSAEFK LPQAPS // ID C5XGK7_SORBI Unreviewed; 126 AA. AC C5XGK7; DT 01-SEP-2009, integrated into UniProtKB/TrEMBL. DT 01-SEP-2009, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=Putative uncharacterized protein Sb03g010590 {ECO:0000313|EMBL:EES02744.1}; GN Name=Sb03g010590 {ECO:0000313|EMBL:EES02744.1}; GN ORFNames=SORBIDRAFT_03g010590 {ECO:0000313|EMBL:EES02744.1}; OS Sorghum bicolor (Sorghum) (Sorghum vulgare). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; OC PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; OC Sorghum. OX NCBI_TaxID=4558 {ECO:0000313|Proteomes:UP000000768}; RN [1] {ECO:0000313|EMBL:EES02744.1, ECO:0000313|Proteomes:UP000000768} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768}; RX PubMed=19189423; DOI=10.1038/nature07723; RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J., RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., RA Schmutz J., Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., RA Chapman J., Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., RA Maher C.A., Martis M., Narechania A., Otillar R.P., Penning B.W., RA Salamov A.A., Wang Y., Zhang L., Carpita N.C., Freeling M., RA Gingle A.R., Hash C.T., Keller B., Klein P., Kresovich S., RA McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman M., Ware D., RA Westhoff P., Mayer K.F.X., Messing J., Rokhsar D.S.; RT "The Sorghum bicolor genome and the diversification of grasses."; RL Nature 457:551-556(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000762; EES02744.1; -; Genomic_DNA. DR RefSeq; XP_002457624.1; XM_002457579.1. DR UniGene; Sbi.605; -. DR STRING; 4558.Sb03g010590.1; -. DR EnsemblPlants; Sb03g010590.1; Sb03g010590.1; Sb03g010590. DR GeneID; 8074513; -. DR KEGG; sbi:SORBI_03g010590; -. DR Gramene; C5XGK7; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; C5XGK7; -. DR KO; K19347; -. DR OMA; MICAYYG; -. DR Proteomes; UP000000768; Chromosome 3. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000768}; KW Reference proteome {ECO:0000313|Proteomes:UP000000768}. SQ SEQUENCE 126 AA; 14093 MW; 3474AE8B801E51DD CRC64; MNNPQENRFD LILPYERPHH VSEFSLDVAY DRSTAPKDCL VSGWYEETPG ETQLGHAAKM ALVEFTYDLE KNNVQTFDVS APDVGVINMI RLDFNSNHGS SQLTCIYRLR VHGHEPVSPG TAGSQA // ID C5XQ75_SORBI Unreviewed; 626 AA. AC C5XQ75; DT 01-SEP-2009, integrated into UniProtKB/TrEMBL. DT 01-SEP-2009, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein Sb03g026980 {ECO:0000313|EMBL:EES01018.1}; GN Name=Sb03g026980 {ECO:0000313|EMBL:EES01018.1}; GN ORFNames=SORBIDRAFT_03g026980 {ECO:0000313|EMBL:EES01018.1}; OS Sorghum bicolor (Sorghum) (Sorghum vulgare). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; OC PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; OC Sorghum. OX NCBI_TaxID=4558 {ECO:0000313|Proteomes:UP000000768}; RN [1] {ECO:0000313|EMBL:EES01018.1, ECO:0000313|Proteomes:UP000000768} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768}; RX PubMed=19189423; DOI=10.1038/nature07723; RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J., RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., RA Schmutz J., Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., RA Chapman J., Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., RA Maher C.A., Martis M., Narechania A., Otillar R.P., Penning B.W., RA Salamov A.A., Wang Y., Zhang L., Carpita N.C., Freeling M., RA Gingle A.R., Hash C.T., Keller B., Klein P., Kresovich S., RA McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman M., Ware D., RA Westhoff P., Mayer K.F.X., Messing J., Rokhsar D.S.; RT "The Sorghum bicolor genome and the diversification of grasses."; RL Nature 457:551-556(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000762; EES01018.1; -; Genomic_DNA. DR RefSeq; XP_002455898.1; XM_002455853.1. DR ProteinModelPortal; C5XQ75; -. DR STRING; 4558.Sb03g026980.1; -. DR EnsemblPlants; Sb03g026980.1; Sb03g026980.1; Sb03g026980. DR GeneID; 8082200; -. DR KEGG; sbi:SORBI_03g026980; -. DR Gramene; C5XQ75; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR InParanoid; C5XQ75; -. DR Proteomes; UP000000768; Chromosome 3. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000768}; KW Reference proteome {ECO:0000313|Proteomes:UP000000768}. SQ SEQUENCE 626 AA; 67647 MW; 665D4575C07B87E8 CRC64; MSRKRREGGG GGRGAGTGTG DHHGGGSGKG SGAGADAVSM DGGLREVSVS VVFSVWCILF LLRSQFLHSQ TDDDPSSEFY EDHHGRRDSY CKVRPLEAYV LPYHNDSSTT TCQSSYSQPQ PPQESPSASA PAPPELPPQY NATTGGGNNA SSPEAAAFVG LDEFRSRIMQ GKAENDTGRP RPTDGGAAHR LEPNGAEYNY AAASKGAKVL AHNKEAKGAG NILGGDKDKY LRNPCSADDK FVVVELSEET LVDTVALANL EHYSSNFRDF EVYGSMSYPT EAWELLGRFT AENAKHAQRF VLPEPRWTRY LRLRLVSHYG SGFYCILSYL EVYGVDAVER MLQDFIAGNG AGAGAEADAS RDRASIDLAS RDVDSNDTTA QQARQVHAKL DGNGGAGTGR NDSSSAGDAK NNGSRSGDAK LPPPLGKEAK PPQVAAAPGS STGRIHSDGV LKILMQKMRS LELSLSTLEE YTREVNQRYG AKVPDLQNGL SQTAVALEKM KADVHVLVDW KDSVAKDVDE LKAWKSTVSG KLDDLIKENQ EMRWSVEEMR GVQETLQNKE LAVLSISLFF ACLALFKLAC DRLLCLFAGK GSREEADAEE HTRSSRAWML VLASSSFTTL IVLLYN // ID C5XRC9_SORBI Unreviewed; 615 AA. AC C5XRC9; DT 01-SEP-2009, integrated into UniProtKB/TrEMBL. DT 01-SEP-2009, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Putative uncharacterized protein Sb03g041510 {ECO:0000313|EMBL:EES03974.1}; GN Name=Sb03g041510 {ECO:0000313|EMBL:EES03974.1}; GN ORFNames=SORBIDRAFT_03g041510 {ECO:0000313|EMBL:EES03974.1}; OS Sorghum bicolor (Sorghum) (Sorghum vulgare). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; OC PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; OC Sorghum. OX NCBI_TaxID=4558 {ECO:0000313|Proteomes:UP000000768}; RN [1] {ECO:0000313|EMBL:EES03974.1, ECO:0000313|Proteomes:UP000000768} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768}; RX PubMed=19189423; DOI=10.1038/nature07723; RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J., RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., RA Schmutz J., Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., RA Chapman J., Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., RA Maher C.A., Martis M., Narechania A., Otillar R.P., Penning B.W., RA Salamov A.A., Wang Y., Zhang L., Carpita N.C., Freeling M., RA Gingle A.R., Hash C.T., Keller B., Klein P., Kresovich S., RA McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman M., Ware D., RA Westhoff P., Mayer K.F.X., Messing J., Rokhsar D.S.; RT "The Sorghum bicolor genome and the diversification of grasses."; RL Nature 457:551-556(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000762; EES03974.1; -; Genomic_DNA. DR RefSeq; XP_002458854.1; XM_002458809.1. DR UniGene; Sbi.10912; -. DR STRING; 4558.Sb03g041510.1; -. DR EnsemblPlants; Sb03g041510.1; Sb03g041510.1; Sb03g041510. DR GeneID; 8061007; -. DR KEGG; sbi:SORBI_03g041510; -. DR Gramene; C5XRC9; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR InParanoid; C5XRC9; -. DR OMA; YGSASYC; -. DR Proteomes; UP000000768; Chromosome 3. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000768}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000768}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 35 57 Helical. FT TRANSMEM 556 576 Helical. FT TRANSMEM 597 614 Helical. FT COILED 486 513 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 615 AA; 68179 MW; 86946EA9FA4C2818 CRC64; MQRSRRALLR RTAAAQEQSA VAEAEAAANG RKRRLYGFSA SLVVASWVAV LLLHSLVGHG DSQRDGGGYA VDLTVVEPAL NVGPFNPVVQ VEHGENLAVP GDPCVNSVEN AVLSEDTLVQ ADQLCSNDEV LSENTEALTK DSQVELSGDQ GGYLPQSDVD SGVQPGEKVE SEDLPRPPRL SRVAPPDLDE FKTRAIAERG PGVSSQPGHV IHRREPSGKL YNYAAASKGA KVLDFNKEAK GASNILDKDK DKYLRNPCSA EGKFVIIELS EETLVDTIAI ANFEHYSSNP KEFELLSSLT YPTENWETLG RFTAANAKVS QNFTFLEPKW ARYLKLNLVS HYGSEFYCTL SMLEVYGMDA VEKMLENLIP VENKKTEPDD KTKEPVEQMP LKESSGGKES SQEPLDEDEF EIEDVKPNSD STKNGANDQV SETRTLQAGR IPGDTVLKML MQKVQSLDVS FSVLEKYLVE LNSRYGQIFK DFDADIDSKD VLLERIKSEL KNLESSKDSI TNEIEGILSW KLVASSQLNQ LVLDNALLRS EFETFRQKQT DMENRSLAVI FLSFVFACLA LAKLSIGIMS KFCRFYDFEK IHNVRSGWVV LLLSSCIIST ILIIQ // ID C5XW88_SORBI Unreviewed; 443 AA. AC C5XW88; DT 01-SEP-2009, integrated into UniProtKB/TrEMBL. DT 01-SEP-2009, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Putative uncharacterized protein Sb04g005160 {ECO:0000313|EMBL:EES06365.1}; GN Name=Sb04g005160 {ECO:0000313|EMBL:EES06365.1}; GN ORFNames=SORBIDRAFT_04g005160 {ECO:0000313|EMBL:EES06365.1}; OS Sorghum bicolor (Sorghum) (Sorghum vulgare). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; OC PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; OC Sorghum. OX NCBI_TaxID=4558 {ECO:0000313|Proteomes:UP000000768}; RN [1] {ECO:0000313|EMBL:EES06365.1, ECO:0000313|Proteomes:UP000000768} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768}; RX PubMed=19189423; DOI=10.1038/nature07723; RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J., RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., RA Schmutz J., Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., RA Chapman J., Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., RA Maher C.A., Martis M., Narechania A., Otillar R.P., Penning B.W., RA Salamov A.A., Wang Y., Zhang L., Carpita N.C., Freeling M., RA Gingle A.R., Hash C.T., Keller B., Klein P., Kresovich S., RA McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman M., Ware D., RA Westhoff P., Mayer K.F.X., Messing J., Rokhsar D.S.; RT "The Sorghum bicolor genome and the diversification of grasses."; RL Nature 457:551-556(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000763; EES06365.1; -; Genomic_DNA. DR RefSeq; XP_002453389.1; XM_002453344.1. DR UniGene; Sbi.284; -. DR STRING; 4558.Sb04g005160.1; -. DR EnsemblPlants; Sb04g005160.1; Sb04g005160.1; Sb04g005160. DR GeneID; 8077203; -. DR KEGG; sbi:SORBI_04g005160; -. DR Gramene; C5XW88; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000237750; -. DR InParanoid; C5XW88; -. DR KO; K19347; -. DR OMA; MEIARHS; -. DR Proteomes; UP000000768; Chromosome 4. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000768}; KW Reference proteome {ECO:0000313|Proteomes:UP000000768}. FT COILED 176 210 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 443 AA; 48870 MW; 7C46B4DA2ED7DEE7 CRC64; MSASTAAIPT ANTNGNHALS TDSHSSQDAR RRTAGITRRK ALPPILAKIP SNDLSHTIRG ESVLDKSKHS SEARKDAVAS AAAVRQKKSP TKQEKAKWVT ALSVLVKLCL LISATAWMGQ VFWRWQSGEL SLTLDMESRL SKVEGFKKTA KMLQLQLDVL DKKLGNEIDK AKRDITKQFE DKGSKIEKKM KTLEDKTDKL DKSLAELSDM GFLSKNEFEE ILSQLKKKKG FGGTDDEISL DDIRLYAKEV VEMEIARHSA DGLGMVDYAL GSGGAKVVSH SEPFMNGKNY LPGRSIVHTT AQKMLEPSFG QPGECFALKG SSGFVKVKLR TGIIPEAVTL EHVDKSVAYD RSSAPKDFQV RGWYQGSHDD SEKHSNVMAA LGEFSYDLDK SNAQTFQLER TADSRVVNMV QLDFSSNHGN LELTCIYRFR VHGREPGSLN TGA // ID C6HPV2_AJECH Unreviewed; 865 AA. AC C6HPV2; DT 01-SEP-2009, integrated into UniProtKB/TrEMBL. DT 01-SEP-2009, sequence version 1. DT 16-SEP-2015, entry version 15. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EER37563.1}; GN ORFNames=HCDG_08233 {ECO:0000313|EMBL:EER37563.1}; OS Ajellomyces capsulatus (strain H143) (Darling's disease fungus) OS (Histoplasma capsulatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Histoplasma. OX NCBI_TaxID=544712 {ECO:0000313|Proteomes:UP000002624}; RN [1] {ECO:0000313|Proteomes:UP000002624} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H143 {ECO:0000313|Proteomes:UP000002624}; RA Champion M., Cuomo C.A., Ma L.-J., Henn M.R., Sil A., Goldman B., RA Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L., RA Berlin A.M., Borenstein D., Chen Z., Engels R., Freedman E., RA Gellesch M., Goldberg J., Griggs A., Gujja S., Heiman D.I., RA Hepburn T.A., Howarth C., Jen D., Larson L., Lewis B., Mehta T., RA Park D., Pearson M., Roberts A., Saif S., Shea T.D., Shenoy N., RA Sisk P., Stolte C., Sykes S., Walk T., White J., Yandava C., Klein B., RA McEwen J.G., Puccia R., Goldman G.H., Felipe M.S., Nino-Vega G., RA San-Blas G., Taylor J.W., Mendoza L., Galagan J.E., Nusbaum C., RA Birren B.W.; RT "The genome sequence of Ajellomyces capsulatus strain H143."; RL Submitted (MAY-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG692434; EER37563.1; -; Genomic_DNA. DR EnsemblFungi; EER37563; EER37563; HCDG_08233. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002624; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002624}; KW Reference proteome {ECO:0000313|Proteomes:UP000002624}. SQ SEQUENCE 865 AA; 95513 MW; FEC0745C86414FA6 CRC64; MAWHFPLFRQ HVHTECRATD NLLYFWTIAL LAAVRAGGDM DSKNHNISPL SLDATCPPRA FSGIQHPVCL EPRWVGIGKI ENYTSNSSGE TDFYASITSA ASPSLSPTIT VTGSGSGSSG VDQELDTESP LDNANFLSFE EWKKQNLAKV GQSVENVRGD RQSAGSSGDG KRQRPTGIDN SLDSLGEDGE IALEFGGFGP EDSGPASWER KVGKDQPPDV DGAGSVTKGA EGETQIEATT RGGASRRKDA GTTCKERFNY ASFDCAATVL KTNPQCTGAS SVLIENKDSY MLNECKAKEK FLILELCDDI LIDTIVLANY EFFSSIFRTF RVSVSDRYPP KQPDMWKELG TYEAVNSREV QAFAVENPLI WARYVKIEFL THYGNEFYCP VSLIRVHGTT MLEEYKNDGE ANRLEDHNSH QIQGSRTLES GPDNSTTDPS KIAEDSEGPA EAGRFDMQPT RVLEDICLLK DAEVGGILLR SVVRAEDRMC AVHETPRAYN RTDDAVQPDL VQSHGPAQAV DNATPTTPSV EPSSNAVTPP TPVSTPTLTD TRAQKPTENE TSSNTHKTEY NGSSESPKPS TTVQYHQPNP TTQESFFKSV NKRLHMLETN SSLSLQYIEE QSRILRDAFN KVEKRQLAKT TTFLENLNTS VLQELREFRH QYDQVWHSVA VEFEQQRLQY RQEVFAMSSQ LGVLADELVF QKRISIIQSV FVLICFGLVL FSSSPIGSYL ELPRVHNMVS RSQSFRSSTH SFETPSASPL SRPNSPYQDN KRVSSSHTRT HSMESREDDL AVNPTICYSP PTPTSDSGGH ELRRRLSEQT NSTSSSVVVA PQARFLRSES SPPDLCESYE GSDGGSSSEA PQLST // ID C6LYW1_GIAIB Unreviewed; 849 AA. AC C6LYW1; DT 22-SEP-2009, integrated into UniProtKB/TrEMBL. DT 22-SEP-2009, sequence version 1. DT 14-OCT-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EES98800.1}; GN ORFNames=GL50581_3990 {ECO:0000313|EMBL:EES98800.1}; OS Giardia intestinalis (strain ATCC 50581 / GS clone H7) (Giardia OS lamblia). OC Eukaryota; Diplomonadida; Hexamitidae; Giardiinae; Giardia. OX NCBI_TaxID=598745 {ECO:0000313|EMBL:EES98800.1, ECO:0000313|Proteomes:UP000002488}; RN [1] {ECO:0000313|EMBL:EES98800.1, ECO:0000313|Proteomes:UP000002488} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 50581 / GS clone H7 {ECO:0000313|Proteomes:UP000002488}; RX PubMed=19696920; DOI=10.1371/journal.ppat.1000560; RA Franzen O., Jerlstrom-Hultqvist J., Castro E., Sherwood E., RA Ankarklev J., Reiner D.S., Palm D., Andersson J.O., Andersson B., RA Svard S.G.; RT "Draft genome sequencing of giardia intestinalis assemblage B isolate RT GS: is human giardiasis caused by two different species?"; RL PLoS Pathog. 5:E1000560-E1000560(2009). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EES98800.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACGJ01002917; EES98800.1; -; Genomic_DNA. DR EnsemblProtists; EES98800; EES98800; GL50581_3990. DR Proteomes; UP000002488; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002488}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 242 264 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 340 360 {ECO:0000256|SAM:Coils}. FT COILED 387 407 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 849 AA; 92463 MW; 7E1A45EC374C13B6 CRC64; MPPRTRSSGA LVRMNPTRED EAYTKEVLRK TSDGFRAELT SGKSERITQN LRNQDEGTLR LQEYLMAAEV DARPPKKIDS LMDQSDALKL QKSLTALQVA PIQRSRVQEP PRYEITSEDK DSSSSKPHQA PLSETSSTST PPESSSTPSP PPSPPPRKSP KDVPKPATPT PVKLPAAPIK TKSPTIQPSQ EDEYTDEYVM VKQRVKRKPR APSKLSAGTK VVTAGEPGML YAYRKAWGCA EYVYASVCGL LALMVLFIFS YLLMRLWTGC AGLGIVGTSA PLVPVYTGDG TAGVSCTGPS LREIQSLLES NNKILLKEWG NTRTFALDSK EVQSIATQVA TKLSDQHQRL ELKIKEIVDK HTARTADQGI TTSQLKDLAK TIREEVVSAT KERISTLSAA IAKFEKEMLD AQSVTTELLK DLFKNSSFSV SSASAKKASE KIKAAVGQMS DAADAAVAAG SANFTALFTM IESLSRKLET SYRSENITKD LQSLHQSILL DIQTQLKEVA ATTTQMIGDS LTSIRAQTDN LSATLIDFVA KSAAGAEASI SGTAPVTFAS ESLSSMQRAI DRIVIALGEL SSQVASGVNN KDVHGDSHSS LATLTDIEEM QRKTSQHITQ QAENVTLTIL ERIDASDNRM SQEANEHVDH LKTGLAGVIK KLLLKQTDYS SGSMGTTPQG VVDELRLQYT DFTKQSFGTR IAGKSDDITN IAESLKTLIS GNERVRLMFN DNMSPGSCWP TKKTGYVVLR FKHPVTLYYG SISHPAAPKL STGRTTVPRD LTFTGRTTTG KEVQLGSFVF DVDGPEQQAF RLQENHDIIQ VRVGFTNNGG EYTCIYNLGL FGEKDSKNP // ID C7GN70_YEAS2 Unreviewed; 587 AA. AC C7GN70; DT 13-OCT-2009, integrated into UniProtKB/TrEMBL. DT 13-OCT-2009, sequence version 1. DT 14-OCT-2015, entry version 18. DE SubName: Full=Slp1p {ECO:0000313|EMBL:EEU07759.1}; GN Name=SLP1 {ECO:0000313|EMBL:EEU07759.1}; GN ORFNames=C1Q_01723 {ECO:0000313|EMBL:EEU07759.1}; OS Saccharomyces cerevisiae (strain JAY291) (Baker's yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=574961 {ECO:0000313|EMBL:EEU07759.1, ECO:0000313|Proteomes:UP000008073}; RN [1] {ECO:0000313|EMBL:EEU07759.1, ECO:0000313|Proteomes:UP000008073} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JAY291 {ECO:0000313|EMBL:EEU07759.1, RC ECO:0000313|Proteomes:UP000008073}; RX PubMed=19812109; DOI=10.1101/gr.091777.109; RA Argueso J.L., Carazzolle M.F., Mieczkowski P.A., Duarte F.M., RA Netto O.V.C., Missawa S.K., Galzerani F., Costa G.G.L., Vidal R.O., RA Noronha M.F., Dominska M., Andrietta M.G.S., Andrietta S.R., RA Cunha A.F., Gomes L.H., Tavares F.C.A., Alcarde A.R., Dietrich F.S., RA McCusker J.H., Petes T.D., Pereira G.A.G.; RT "Genome structure of a Saccharomyces cerevisiae strain widely used in RT bioethanol production."; RL Genome Res. 19:2258-2270(2009). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EEU07759.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFL01000070; EEU07759.1; -; Genomic_DNA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008073; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008073}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 587 AA; 67353 MW; 4A43AF40124071AD CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGLHDTTV ITTGSTTNVQ KEHSSPLSTG SLRTHDFRQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGPERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNEWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWTILG EFEAGNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKQMISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IVQQAIR // ID C7YY81_NECH7 Unreviewed; 971 AA. AC C7YY81; DT 13-OCT-2009, integrated into UniProtKB/TrEMBL. DT 13-OCT-2009, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEU43143.1}; GN ORFNames=NECHADRAFT_95388 {ECO:0000313|EMBL:EEU43143.1}; OS Nectria haematococca (strain 77-13-4 / ATCC MYA-4622 / FGSC 9596 / OS MPVI) (Fusarium solani subsp. pisi). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Nectriaceae; OC Fusarium; Fusarium solani species complex. OX NCBI_TaxID=660122 {ECO:0000313|Proteomes:UP000005206}; RN [1] {ECO:0000313|EMBL:EEU43143.1, ECO:0000313|Proteomes:UP000005206} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=77-13-4 / ATCC MYA-4622 / FGSC 9596 / MPVI RC {ECO:0000313|Proteomes:UP000005206}; RX PubMed=19714214; DOI=10.1371/journal.pgen.1000618; RA Coleman J.J., Rounsley S.D., Rodriguez-Carres M., Kuo A., RA Wasmann C.C., Grimwood J., Schmutz J., Taga M., White G.J., Zhou S., RA Schwartz D.C., Freitag M., Ma L.-J., Danchin E.G.J., Henrissat B., RA Coutinho P.M., Nelson D.R., Straney D., Napoli C.A., Barker B.M., RA Gribskov M., Rep M., Kroken S., Molnar I., Rensing C., Kennell J.C., RA Zamora J., Farman M.L., Selker E.U., Salamov A., Shapiro H., RA Pangilinan J., Lindquist E., Lamers C., Grigoriev I.V., Geiser D.M., RA Covert S.F., Temporini E., VanEtten H.D.; RT "The genome of Nectria haematococca: contribution of supernumerary RT chromosomes to gene expansion."; RL PLoS Genet. 5:E1000618-E1000618(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG698903; EEU43143.1; -; Genomic_DNA. DR RefSeq; XP_003048856.1; XM_003048810.1. DR EnsemblFungi; NechaT95388; NechaP95388; NechaG95388. DR GeneID; 9667797; -. DR KEGG; nhe:NECHADRAFT_95388; -. DR InParanoid; C7YY81; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000005206; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005206}; KW Reference proteome {ECO:0000313|Proteomes:UP000005206}. SQ SEQUENCE 971 AA; 109904 MW; C4A5DC20F5AAA53A CRC64; MPPRAAAGRR PRFASREPEQ TSTHESPNAL IRPQLPPLQG TPSSRRQYTY GSGVEPPPRV SAGFQRMDIS TAVNQALSKR DDTDVFVRPP KPRAATVEDE ETSRDNGNRL SAGGLPRGTV AGSDADSLRS FGMESDYYED ATIGSAPTLT PGPQTRQRTS KTARTQKDVS EEHEDEEEED PRTTNVVNHG KKINTPAPSR TRQTRSNQLP ARNTAREEDQ EDESGYEEAT GLEPEAELQG VHRVAANRRP RHGSDETSEK SEGTFQRRPQ HRITSFDKAN EIPIDPRERD SLIQQEIRDV EDQVARERAE RETRTRFVVQ HETWKQWFQQ QIAWVLTLWP FRLFMGQRNN LDDFDDFDED HDVPAVQLWR IFHPMTYVRT LEWLSDKLMD YIFNFINRVC GVQLRGSQTG LTMYWTMLSF LALLIGGAVI HMGMGTSMPS VPSFPGLGSS GTLPWPSSSG FFERIGNMIP SMPSWSTDEE PNIWDKPEER GSGSFEKFLA SYKKAVSTLK DKDNIHEQAI KKLEAIVPHI VHMDTKGGKP VISQQFWHAL RDLMKADGSF LSFEKKKNGN LEVSSDGQWQ ALLARLTKDP SFTSTINKSA TGAAEQVESK LSGWWDRWIK NNDNKIQEIL DKAMDKRQSA GSEREFDERL TKIVNEQLKG KNQAVVSREQ FLKQVEGEFT KHRKQIAAEM TELRTKMDER VKEMIRAATF DAPKQVTKTE ISKLVHEIVK KALADLSLQA VAKGEIKVNW DAVLKNQVNF FGVGAGATID PKRTAPVWDP WNKGVASVEA YEKGLVGVNP LPPIVALHPW QDEGDCYCAA RTINHRGNAH SASIAVHLAH LMVPQQVVIE HILPGATTDP EARPRQIEVW ANIEADEREK VRDFSQTHFP DNKEDWDFTP PNFEDSFVKI SQFVYESDEL HNGVHIHHLS PELEDLGAMT DHVIIRAVSN YGAKTHTCFY RVRLFGRRAD E // ID C7ZA05_NECH7 Unreviewed; 812 AA. AC C7ZA05; DT 13-OCT-2009, integrated into UniProtKB/TrEMBL. DT 13-OCT-2009, sequence version 1. DT 14-OCT-2015, entry version 27. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEU39596.1}; DE Flags: Fragment; GN ORFNames=NECHADRAFT_21098 {ECO:0000313|EMBL:EEU39596.1}; OS Nectria haematococca (strain 77-13-4 / ATCC MYA-4622 / FGSC 9596 / OS MPVI) (Fusarium solani subsp. pisi). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Nectriaceae; OC Fusarium; Fusarium solani species complex. OX NCBI_TaxID=660122 {ECO:0000313|Proteomes:UP000005206}; RN [1] {ECO:0000313|EMBL:EEU39596.1, ECO:0000313|Proteomes:UP000005206} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=77-13-4 / ATCC MYA-4622 / FGSC 9596 / MPVI RC {ECO:0000313|Proteomes:UP000005206}; RX PubMed=19714214; DOI=10.1371/journal.pgen.1000618; RA Coleman J.J., Rounsley S.D., Rodriguez-Carres M., Kuo A., RA Wasmann C.C., Grimwood J., Schmutz J., Taga M., White G.J., Zhou S., RA Schwartz D.C., Freitag M., Ma L.-J., Danchin E.G.J., Henrissat B., RA Coutinho P.M., Nelson D.R., Straney D., Napoli C.A., Barker B.M., RA Gribskov M., Rep M., Kroken S., Molnar I., Rensing C., Kennell J.C., RA Zamora J., Farman M.L., Selker E.U., Salamov A., Shapiro H., RA Pangilinan J., Lindquist E., Lamers C., Grigoriev I.V., Geiser D.M., RA Covert S.F., Temporini E., VanEtten H.D.; RT "The genome of Nectria haematococca: contribution of supernumerary RT chromosomes to gene expansion."; RL PLoS Genet. 5:E1000618-E1000618(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG698912; EEU39596.1; -; Genomic_DNA. DR RefSeq; XP_003045309.1; XM_003045263.1. DR EnsemblFungi; NechaT21098; NechaP21098; NechaG21098. DR GeneID; 9672083; -. DR KEGG; nhe:NECHADRAFT_21098; -. DR InParanoid; C7ZA05; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000005206; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005206}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005206}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 652 672 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 617 644 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EEU39596.1}. FT NON_TER 812 812 {ECO:0000313|EMBL:EEU39596.1}. SQ SEQUENCE 812 AA; 88197 MW; BB635CF461BE726C CRC64; TTVSSCDART INYITHTLPQ SCLTSSWSSA TASAANATAD TPSNVTSSDA PQSSSADAES QSPTPAATAE GEKDAAGEEP SKPFMSFEDW KEMMLRKTGQ DPQDLRSRNN QPRRAEDRIP PDMGHAGLGE EDEISLNFDS YLDNAGEQKN RPSSGDVEIP RRDGAGKEVV YEIHKTKDAG KTCKERFSYS SFDAGATVLK SSPGAKNAKA ILVENKDSYM LLECSAPQKF VIVELSEEVL IDTVVIANFE FFSSMVRLFR VSVSDRYPVK PDKWKELGSF EARNSRDIQP FLVENPQIWA KYVRIEFLTQ YGNEYYCPVS LIRIHGSRML ESWLRDDENH DDHEEPQQLP SPEDESAKPQ EIEKPQESIV PTSTEKMPYC EVEDPTMLFL APLVCPASFN ATTGVPSQPD TSSIEGISAS SQNVSTEDRE RSGASTPHVR RAEYPATEDA TSSSSSSTAS PSATPAISPT SPSSISSSSA STGSNSTAPA TSAKAASTSS TSTSSSASTS IAKPSPANTA NPKNRTTATT SNSPASPTVQ EGFFKAISKR LHQVESNLTL SLKYVEDQAR HMSDTLHRTE QKQISKATLF LDNLNQTVLA ELRSVREQYD QIWQSTVIAL ESQREQSNRE IVALSTRLNL LADEVVFQKR MAIVQAVLLL SCLLLVIFSR GVSLPYLAPF MDQASLASYD ASASSPGRVR ALYGNSYDTD GEDPSLLTPG SKRGYMPLSS SARDVSSADL RRRTSFVDDS RLECEQLSPP PTPGPADGYT SSTDAPLSSH ESQPAVLRRS PAMHPNSSRK PLPALPENPS SP // ID C8ZGR2_YEAS8 Unreviewed; 587 AA. AC C8ZGR2; DT 03-NOV-2009, integrated into UniProtKB/TrEMBL. DT 03-NOV-2009, sequence version 1. DT 14-OCT-2015, entry version 19. DE SubName: Full=Slp1p {ECO:0000313|EMBL:CAY86441.1}; GN ORFNames=EC1118_1O4_3686g {ECO:0000313|EMBL:CAY86441.1}; OS Saccharomyces cerevisiae (strain Lalvin EC1118 / Prise de mousse) OS (Baker's yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=643680 {ECO:0000313|EMBL:CAY86441.1, ECO:0000313|Proteomes:UP000000286}; RN [1] {ECO:0000313|EMBL:CAY86441.1, ECO:0000313|Proteomes:UP000000286} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Lalvin EC1118 / Prise de mousse RC {ECO:0000313|Proteomes:UP000000286}; RX PubMed=19805302; DOI=10.1073/pnas.0904673106; RA Novo M., Bigey F., Beyne E., Galeote V., Gavory F., Mallet S., RA Cambon B., Legras J.-L., Wincker P., Casaregola S., Dequin S.; RT "Eukaryote-to-eukaryote gene transfer events revealed by the genome RT sequence of the wine yeast Saccharomyces cerevisiae EC1118."; RL Proc. Natl. Acad. Sci. U.S.A. 106:16333-16338(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN394216; CAY86441.1; -; Genomic_DNA. DR EnsemblFungi; CAY86441; CAY86441; EC1118_1O4_3686g. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000286; Chromosome XV, Scaffold EC1118_1O4. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000286}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 587 AA; 67254 MW; 8C28CE40124071A8 CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGLHDTTV ITTGSTTNVQ KEHSSPLSTG SLRTHDFGQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGPERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNEWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWTILG EFEAGNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKQMISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IVQQAIR // ID C9JJZ6_HUMAN Unreviewed; 214 AA. AC C9JJZ6; DT 03-NOV-2009, integrated into UniProtKB/TrEMBL. DT 14-OCT-2015, sequence version 6. DT 11-NOV-2015, entry version 43. DE SubName: Full=Sperm-associated antigen 4 protein {ECO:0000313|Ensembl:ENSP00000396670}; DE Flags: Fragment; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSP00000396670}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000396670, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000396670, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=11780052; DOI=10.1038/414865a; RA Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., RA Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., RA Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., RA Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M., Brown A.J., RA Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., RA Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., RA Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., RA Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., RA Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., RA Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., RA Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., RA Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., RA Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., RA Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., RA Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., RA Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., RA Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., RA Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., RA Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., RA Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., RA Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., RA Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., RA Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., RA Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., RA Rogers J.; RT "The DNA sequence and comparative analysis of human chromosome 20."; RL Nature 414:865-871(2001). RN [2] {ECO:0000313|Ensembl:ENSP00000396670} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000396670}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AL109827; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; C9JJZ6; -. DR STRING; 9606.ENSP00000363391; -. DR PaxDb; C9JJZ6; -. DR PRIDE; C9JJZ6; -. DR Ensembl; ENST00000454819; ENSP00000396670; ENSG00000061656. DR HGNC; HGNC:11214; SPAG4. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000246956; -. DR NextBio; 35486772; -. DR Proteomes; UP000005640; Chromosome 20. DR Bgee; C9JJZ6; -. DR ExpressionAtlas; C9JJZ6; baseline and differential. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:C9JJZ6}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 41 64 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 77 111 {ECO:0000256|SAM:Coils}. FT NON_TER 214 214 {ECO:0000313|Ensembl:ENSP00000396670}. SQ SEQUENCE 214 AA; 24479 MW; 68B69DA2E2B30D81 CRC64; MPPPRVFKSF LSLLFQGLSV LLSLAGDVLV SMYREVCSIR FLFTAVSLLS LFLSAFWLGL LYLVSPLENE PKEMLTLSEY HERVRSQGQQ LQQLQAELDK LHKEVSTVRA ANSERVAKLV FQRLNEDFVR KPDYALSSVG ASIDLQKTSH DYADRNTAYF WNRFSFWNYA RPPTVILEPH VFPGNCWAFE GDQGQVVIQL PGRVQLSDIT LQHP // ID C9SR37_VERA1 Unreviewed; 801 AA. AC C9SR37; DT 24-NOV-2009, integrated into UniProtKB/TrEMBL. DT 24-NOV-2009, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EEY20839.1}; GN ORFNames=VDBG_06949 {ECO:0000313|EMBL:EEY20839.1}; OS Verticillium alfalfae (strain VaMs.102 / ATCC MYA-4576 / FGSC 10136) OS (Verticillium wilt of alfalfa) (Verticillium albo-atrum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Glomerellales; OC Plectosphaerellaceae; mitosporic Plectosphaerellaceae; Verticillium. OX NCBI_TaxID=526221 {ECO:0000313|Proteomes:UP000008698}; RN [1] {ECO:0000313|Proteomes:UP000008698} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=VaMs.102 / ATCC MYA-4576 / FGSC 10136 RC {ECO:0000313|Proteomes:UP000008698}; RX PubMed=21829347; DOI=10.1371/journal.ppat.1002137; RA Klosterman S.J., Subbarao K.V., Kang S., Veronese P., Gold S.E., RA Thomma B.P.H.J., Chen Z., Henrissat B., Lee Y.-H., Park J., RA Garcia-Pedrajas M.D., Barbara D.J., Anchieta A., de Jonge R., RA Santhanam P., Maruthachalam K., Atallah Z., Amyotte S.G., Paz Z., RA Inderbitzin P., Hayes R.J., Heiman D.I., Young S., Zeng Q., Engels R., RA Galagan J., Cuomo C.A., Dobinson K.F., Ma L.-J.; RT "Comparative genomics yields insights into niche adaptation of plant RT vascular wilt pathogens."; RL PLoS Pathog. 7:E1002137-E1002137(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS985222; EEY20839.1; -; Genomic_DNA. DR RefSeq; XP_003002378.1; XM_003002332.1. DR STRING; 526221.XP_003002378.1; -. DR EnsemblFungi; EEY20839; EEY20839; VDBG_06949. DR GeneID; 9527860; -. DR KEGG; val:VDBG_06949; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008698; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008698}; KW Reference proteome {ECO:0000313|Proteomes:UP000008698}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 801 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003000955. FT COILED 544 564 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 801 AA; 88909 MW; 043FF2533A10CB00 CRC64; MFWHAQGWAV LAWLMLATFP AGSSESDTPI PSANQPPSTC HARHINYVTD TLLDQCYRVR WEQADSILPK DGDQTAEHIA HDLSFLDTRS VDRKENDDQT PTASVEPTNT VDAHTEATTF MSFEDWKDLK AREAARETHD TNLDSEKALP PPQGHDSREE SDIALKVEAV SEELSSIAAP SRQSLAGAGD NEQPSEPVLY DDGKAQYYRS KDAGKTCKER FSYSSFDAGA TVLKTIAGAK NAKAILVENK DSYMLLECAA VNKFAIVELT DDILIDTVVL ANFEFFSSMI RHFKVSVSDR YPVKVDKWKD LGIFEAKNSR DIQPFLVENP LIWAKYVRIE FLTHYGNEYY CPVSLLRVHG TRMLDSWKDT EAPPDEDDTE EEHVDAVPDL TQDADIDQAP VPTPGNNEAD EVTLMTPRPT SVPHISSQNA SRITSGSPKR PSPVAQAPAR SKNGTSTSAP PASPTVQESF HKAVSKRLQL LESNVTLSLE YLEEQSRFLQ QSQRASERRQ LAKVDLLLDS LNHTVLSELR HVRQQYDQIW QSTIMALENQ RDQSQRELVA LGSRLNVLAD EVVFQKRMAI LQAILLLSCL VMVIFSRATV SISPNQMDMS FRSSRQYQYR LPRALHSSGL GRSHNSHLST SIVEYDHEGD TQQQGQIYGT DPFVRVHQHS HKGSLPDARR TSGPRPGCAR LATSPISPSE DDSWVTRQSE DVPISPSTSA TPPTTSDRHI GRSFNPHSSE QVLTPTSSDA QDSQSDGIGS DPSSEDGSRI PPSPLDRLAN THDLGAGFRK PLPALPEHLP S // ID C9ZQI5_TRYB9 Unreviewed; 492 AA. AC C9ZQI5; DT 24-NOV-2009, integrated into UniProtKB/TrEMBL. DT 24-NOV-2009, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBH11665.1}; GN ORFNames=TbgDal_VI1430 {ECO:0000313|EMBL:CBH11665.1}; OS Trypanosoma brucei gambiense (strain MHOM/CI/86/DAL972). OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; Trypanosoma. OX NCBI_TaxID=679716 {ECO:0000313|EMBL:CBH11665.1, ECO:0000313|Proteomes:UP000002316}; RN [1] {ECO:0000313|Proteomes:UP000002316} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MHOM/CI/86/DAL972 {ECO:0000313|Proteomes:UP000002316}; RX PubMed=20404998; DOI=10.1371/journal.pntd.0000658; RA Jackson A.P., Sanders M., Berry A., McQuillan J., Aslett M.A., RA Quail M.A., Chukualim B., Capewell P., MacLeod A., Melville S.E., RA Gibson W., Barry J.D., Berriman M., Hertz-Fowler C.; RT "The genome sequence of Trypanosoma brucei gambiense, causative agent RT of chronic human african trypanosomiasis."; RL PLoS Negl. Trop. Dis. 4:E658-E658(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN554969; CBH11665.1; -; Genomic_DNA. DR RefSeq; XP_011773950.1; XM_011775648.1. DR GeneID; 23861776; -. DR Proteomes; UP000002316; Chromosome 6. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002316}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 456 480 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 368 395 {ECO:0000256|SAM:Coils}. FT COILED 417 437 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 492 AA; 55493 MW; 3405F62896A05F59 CRC64; MKRYYLPAAA FVVVVAAAYA STFFKPSEPG WRDEKPHERS KGFTTNYASA YLGATLTDFS PECLDASSVL NEDNEKYMLC PCNTQRKYFT VQLIRGIEVR IMTLVSQEHF SSRVKNFTVL GSSRYPTNEW RVLGHFKADP WRGTQHFDVA NQQPVRFLRF LWATSYGEHS WCALTTFKVF GVDVLETLTE DYTVSVEEQQ QQHEQEQEHS IPPTPLTEPL IIVSPPQDDK HTAIGIDYGT SGAGVTAAVI STVEDHHETN SRSPGGGLLK HSNYEGNLCV DLNGCKDDGS KTKKCNGTTF NSMYLDTIAQ RYCSTVLPPE NASRTCLPHE RNLYVIHLLS FCVSRVALSN KITALSKPHT SSSVLLMLAQ MSKQIKTLQQ EVVDLNSRHK DMELKAAQRE ITLQWLGMQV KDFKRSNNEN RDKLQDVMKQ IEVLKSKLSL QLHLGQNCED DSLVRVMVVG SLTLSLFSSV LSCITVRTFY RSRRRTSATH LG // ID D0NWM4_PHYIT Unreviewed; 581 AA. AC D0NWM4; DT 15-DEC-2009, integrated into UniProtKB/TrEMBL. DT 15-DEC-2009, sequence version 1. DT 14-OCT-2015, entry version 23. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEY67087.1}; GN ORFNames=PITG_17690 {ECO:0000313|EMBL:EEY67087.1}; OS Phytophthora infestans (strain T30-4) (Potato late blight fungus). OC Eukaryota; Stramenopiles; Oomycetes; Peronosporales; Phytophthora. OX NCBI_TaxID=403677 {ECO:0000313|Proteomes:UP000006643}; RN [1] {ECO:0000313|Proteomes:UP000006643} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=T30-4 {ECO:0000313|Proteomes:UP000006643}; RX PubMed=19741609; DOI=10.1038/nature08358; RG The Broad Institute Genome Sequencing Platform; RA Haas B.J., Kamoun S., Zody M.C., Jiang R.H., Handsaker R.E., RA Cano L.M., Grabherr M., Kodira C.D., Raffaele S., Torto-Alalibo T., RA Bozkurt T.O., Ah-Fong A.M., Alvarado L., Anderson V.L., RA Armstrong M.R., Avrova A., Baxter L., Beynon J., Boevink P.C., RA Bollmann S.R., Bos J.I., Bulone V., Cai G., Cakir C., Carrington J.C., RA Chawner M., Conti L., Costanzo S., Ewan R., Fahlgren N., RA Fischbach M.A., Fugelstad J., Gilroy E.M., Gnerre S., Green P.J., RA Grenville-Briggs L.J., Griffith J., Grunwald N.J., Horn K., RA Horner N.R., Hu C.H., Huitema E., Jeong D.H., Jones A.M., Jones J.D., RA Jones R.W., Karlsson E.K., Kunjeti S.G., Lamour K., Liu Z., Ma L., RA Maclean D., Chibucos M.C., McDonald H., McWalters J., Meijer H.J., RA Morgan W., Morris P.F., Munro C.A., O'Neill K., Ospina-Giraldo M., RA Pinzon A., Pritchard L., Ramsahoye B., Ren Q., Restrepo S., Roy S., RA Sadanandom A., Savidor A., Schornack S., Schwartz D.C., Schumann U.D., RA Schwessinger B., Seyer L., Sharpe T., Silvar C., Song J., RA Studholme D.J., Sykes S., Thines M., van de Vondervoort P.J., RA Phuntumart V., Wawra S., Weide R., Win J., Young C., Zhou S., Fry W., RA Meyers B.C., van West P., Ristaino J., Govers F., Birch P.R., RA Whisson S.C., Judelson H.S., Nusbaum C.; RT "Genome sequence and analysis of the Irish potato famine pathogen RT Phytophthora infestans."; RL Nature 461:393-398(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS028176; EEY67087.1; -; Genomic_DNA. DR RefSeq; XP_002896539.1; XM_002896493.1. DR EnsemblProtists; PITG_17690T0; PITG_17690T0; PITG_17690. DR GeneID; 9471243; -. DR KEGG; pif:PITG_17690; -. DR EuPathDB; FungiDB:PITG_17690; -. DR HOGENOM; HOG000181364; -. DR InParanoid; D0NWM4; -. DR Proteomes; UP000006643; Partially assembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006643}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006643}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 462 484 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 423 443 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 581 AA; 64113 MW; 12946EEA6FECA1C1 CRC64; MAVCSVRGIV TPDAPSDPAS SIPQRIEESD EPKADAELAD EDVDPLMDVP SGLFEVVDAD SVDNRKRQNY ASLDAGAIIL DAAPDTKSPT NLLVPDKDRY MLTPCSNPRK WVVISLSEDV HADAIAIANY EKFSSPVKEF IVLGSVNYPT DTWLVLGNFT ATHTNGEQIF QLDAQQHVRY IKFRFLSHYG SEYYCTLSQL RVFGRTFTQV ISQLEKSIDA EVEVLDVQAS IPAPQVMALS GSVEVSVART PNPTELTSQC LMEKNNTVVA IFYNESQRLE HYRSNGMCCL VDYTPEQIEA EIAATFSTYE HSATSSTEAV DADTSEGSAQ SPDSLPPNGA SSTSTNTSSV PVANANTTAP SSSPAASSLL STSHGFPASS TQGLGRLESI FVRITKKIQA LEVNQSVMVR QLEEFHTHQW AAIKVLQANQ ESLNEQLIEI RSMIVDLNEM LIVREVITTM KAGILCAIVL SGFIILFYLF RLLFRCVSKC KERADLREWF WRMENHESSS EDPGKKTTDA NMTAGALRVN RKAQFGSSWD DSAIERKTLV SDMVGDGPQK FRRHRAKRAS QPSVSVKRSR K // ID D0NXG7_PHYIT Unreviewed; 654 AA. AC D0NXG7; DT 15-DEC-2009, integrated into UniProtKB/TrEMBL. DT 15-DEC-2009, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEY67767.1}; GN ORFNames=PITG_17997 {ECO:0000313|EMBL:EEY67767.1}; OS Phytophthora infestans (strain T30-4) (Potato late blight fungus). OC Eukaryota; Stramenopiles; Oomycetes; Peronosporales; Phytophthora. OX NCBI_TaxID=403677 {ECO:0000313|Proteomes:UP000006643}; RN [1] {ECO:0000313|Proteomes:UP000006643} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=T30-4 {ECO:0000313|Proteomes:UP000006643}; RX PubMed=19741609; DOI=10.1038/nature08358; RG The Broad Institute Genome Sequencing Platform; RA Haas B.J., Kamoun S., Zody M.C., Jiang R.H., Handsaker R.E., RA Cano L.M., Grabherr M., Kodira C.D., Raffaele S., Torto-Alalibo T., RA Bozkurt T.O., Ah-Fong A.M., Alvarado L., Anderson V.L., RA Armstrong M.R., Avrova A., Baxter L., Beynon J., Boevink P.C., RA Bollmann S.R., Bos J.I., Bulone V., Cai G., Cakir C., Carrington J.C., RA Chawner M., Conti L., Costanzo S., Ewan R., Fahlgren N., RA Fischbach M.A., Fugelstad J., Gilroy E.M., Gnerre S., Green P.J., RA Grenville-Briggs L.J., Griffith J., Grunwald N.J., Horn K., RA Horner N.R., Hu C.H., Huitema E., Jeong D.H., Jones A.M., Jones J.D., RA Jones R.W., Karlsson E.K., Kunjeti S.G., Lamour K., Liu Z., Ma L., RA Maclean D., Chibucos M.C., McDonald H., McWalters J., Meijer H.J., RA Morgan W., Morris P.F., Munro C.A., O'Neill K., Ospina-Giraldo M., RA Pinzon A., Pritchard L., Ramsahoye B., Ren Q., Restrepo S., Roy S., RA Sadanandom A., Savidor A., Schornack S., Schwartz D.C., Schumann U.D., RA Schwessinger B., Seyer L., Sharpe T., Silvar C., Song J., RA Studholme D.J., Sykes S., Thines M., van de Vondervoort P.J., RA Phuntumart V., Wawra S., Weide R., Win J., Young C., Zhou S., Fry W., RA Meyers B.C., van West P., Ristaino J., Govers F., Birch P.R., RA Whisson S.C., Judelson H.S., Nusbaum C.; RT "Genome sequence and analysis of the Irish potato famine pathogen RT Phytophthora infestans."; RL Nature 461:393-398(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS028182; EEY67767.1; -; Genomic_DNA. DR RefSeq; XP_002997929.1; XM_002997883.1. DR UniGene; Pin.4387; -. DR EnsemblProtists; PITG_17997T0; PITG_17997T0; PITG_17997. DR GeneID; 9463884; -. DR KEGG; pif:PITG_17997; -. DR EuPathDB; FungiDB:PITG_17997; -. DR HOGENOM; HOG000182096; -. DR InParanoid; D0NXG7; -. DR KO; K19347; -. DR Proteomes; UP000006643; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006643}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006643}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 172 194 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 654 AA; 73705 MW; FDA8F368B93645A9 CRC64; MADGNTYTRR LRSRRRSDST SSEEEEDPQR VTRSGSRRYG IYTPEPVQRT LELRSDEFED DDEEEDSDFE ELDDYGETVY RSTTYRPPQV YEQDVVELED VHEDDEVDIE VQETPEHEAQ RSELKRRAAG AAAYFKKSNK VDRMWQKVTD SKAMKTISKY LRRFWRFMLR NSFMAVNVLW LLAPLCCFVV AITVPHHLTT AFQYVDDLSS SWIGGRGNAD AGFEKGAMRS VVQEIVDMKL VGLNEEIGML RQTVQTQEHE IEALKLLHVT LRLDLDEQRQ KFSLSEPDSA INVHIEKVVT KHTEELWEKI IDRTSQLQQD LQNATKQQSV ISSVLKEQEE KMDSVQTIVE KTASTPAPDA ASENARAMKK EFTQWRQSFE IELQSEMQRK VQAIESRMSR VLQDEKDALS SSADALRGLD ATDPGILRVI EVAVQAVEIK KTGRVDHAAL ANGASVIHSE RDLLYQDSSS PVQLLAQLVG LSDSDGDSRF TSPSYRRAPA PFLGQLLSSG ENPWWLSRHN GRPETALSET MEIGSCWGIS GSSGRLSVKF AQQIVADAIT IDHIPAQIAS DFSSAPNQFR VLGISGHPLR ETVELISFGN FSYASNGPAS QTFKLTSSLS QRSAIDGITL EVLSNHGNPE YTCLYRFRVH GQPA // ID D2A235_TRICA Unreviewed; 2552 AA. AC D2A235; DT 09-FEB-2010, integrated into UniProtKB/TrEMBL. DT 09-FEB-2010, sequence version 1. DT 11-NOV-2015, entry version 34. DE SubName: Full=Putative uncharacterized protein GLEAN_07046 {ECO:0000313|EMBL:EFA01492.1}; GN Name=GLEAN_07046 {ECO:0000313|EMBL:EFA01492.1}; GN ORFNames=TcasGA2_TC007046 {ECO:0000313|EMBL:EFA01492.1}; OS Tribolium castaneum (Red flour beetle). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Coleoptera; Polyphaga; OC Cucujiformia; Tenebrionidae; Tenebrionidae incertae sedis; Tribolium. OX NCBI_TaxID=7070 {ECO:0000313|Proteomes:UP000007266}; RN [1] {ECO:0000313|EMBL:EFA01492.1, ECO:0000313|Proteomes:UP000007266} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Georgia GA2 {ECO:0000313|EMBL:EFA01492.1, RC ECO:0000313|Proteomes:UP000007266}; RX PubMed=18362917; DOI=10.1038/nature06784; RG Tribolium Genome Sequencing Consortium; RA Richards S., Gibbs R.A., Weinstock G.M., Brown S.J., Denell R., RA Beeman R.W., Gibbs R., Beeman R.W., Brown S.J., Bucher G., RA Friedrich M., Grimmelikhuijzen C.J., Klingler M., Lorenzen M., RA Richards S., Roth S., Schroder R., Tautz D., Zdobnov E.M., Muzny D., RA Gibbs R.A., Weinstock G.M., Attaway T., Bell S., Buhay C.J., RA Chandrabose M.N., Chavez D., Clerk-Blankenburg K.P., Cree A., Dao M., RA Davis C., Chacko J., Dinh H., Dugan-Rocha S., Fowler G., Garner T.T., RA Garnes J., Gnirke A., Hawes A., Hernandez J., Hines S., Holder M., RA Hume J., Jhangiani S.N., Joshi V., Khan Z.M., Jackson L., Kovar C., RA Kowis A., Lee S., Lewis L.R., Margolis J., Morgan M., Nazareth L.V., RA Nguyen N., Okwuonu G., Parker D., Richards S., Ruiz S.J., RA Santibanez J., Savard J., Scherer S.E., Schneider B., Sodergren E., RA Tautz D., Vattahil S., Villasana D., White C.S., Wright R., Park Y., RA Beeman R.W., Lord J., Oppert B., Lorenzen M., Brown S., Wang L., RA Savard J., Tautz D., Richards S., Weinstock G., Gibbs R.A., Liu Y., RA Worley K., Weinstock G., Elsik C.G., Reese J.T., Elhaik E., Landan G., RA Graur D., Arensburger P., Atkinson P., Beeman R.W., Beidler J., RA Brown S.J., Demuth J.P., Drury D.W., Du Y.Z., Fujiwara H., RA Lorenzen M., Maselli V., Osanai M., Park Y., Robertson H.M., Tu Z., RA Wang J.J., Wang S., Richards S., Song H., Zhang L., Sodergren E., RA Werner D., Stanke M., Morgenstern B., Solovyev V., Kosarev P., RA Brown G., Chen H.C., Ermolaeva O., Hlavina W., Kapustin Y., RA Kiryutin B., Kitts P., Maglott D., Pruitt K., Sapojnikov V., RA Souvorov A., Mackey A.J., Waterhouse R.M., Wyder S., Zdobnov E.M., RA Zdobnov E.M., Wyder S., Kriventseva E.V., Kadowaki T., Bork P., RA Aranda M., Bao R., Beermann A., Berns N., Bolognesi R., Bonneton F., RA Bopp D., Brown S.J., Bucher G., Butts T., Chaumot A., Denell R.E., RA Ferrier D.E., Friedrich M., Gordon C.M., Jindra M., Klingler M., RA Lan Q., Lattorff H.M., Laudet V., von Levetsow C., Liu Z., Lutz R., RA Lynch J.A., da Fonseca R.N., Posnien N., Reuter R., Roth S., RA Savard J., Schinko J.B., Schmitt C., Schoppmeier M., Schroder R., RA Shippy T.D., Simonnet F., Marques-Souza H., Tautz D., Tomoyasu Y., RA Trauner J., Van der Zee M., Vervoort M., Wittkopp N., Wimmer E.A., RA Yang X., Jones A.K., Sattelle D.B., Ebert P.R., Nelson D., Scott J.G., RA Beeman R.W., Muthukrishnan S., Kramer K.J., Arakane Y., Beeman R.W., RA Zhu Q., Hogenkamp D., Dixit R., Oppert B., Jiang H., Zou Z., RA Marshall J., Elpidina E., Vinokurov K., Oppert C., Zou Z., Evans J., RA Lu Z., Zhao P., Sumathipala N., Altincicek B., Vilcinskas A., RA Williams M., Hultmark D., Hetru C., Jiang H., Grimmelikhuijzen C.J., RA Hauser F., Cazzamali G., Williamson M., Park Y., Li B., Tanaka Y., RA Predel R., Neupert S., Schachtner J., Verleyen P., Raible F., Bork P., RA Friedrich M., Walden K.K., Robertson H.M., Angeli S., Foret S., RA Bucher G., Schuetz S., Maleszka R., Wimmer E.A., Beeman R.W., RA Lorenzen M., Tomoyasu Y., Miller S.C., Grossmann D., Bucher G.; RT "The genome of the model beetle and pest Tribolium castaneum."; RL Nature 452:949-955(2008). RN [2] {ECO:0000313|EMBL:EFA01492.1, ECO:0000313|Proteomes:UP000007266} RP GENOME REANNOTATION. RC STRAIN=Georgia GA2 {ECO:0000313|EMBL:EFA01492.1, RC ECO:0000313|Proteomes:UP000007266}; RX PubMed=19820115; DOI=10.1093/nar/gkp807; RA Kim H.S., Murphy T., Xia J., Caragea D., Park Y., Beeman R.W., RA Lorenzen M.D., Butcher S., Manak J.R., Brown S.J.; RT "BeetleBase in 2010: revisions to provide comprehensive genomic RT information for Tribolium castaneum."; RL Nucleic Acids Res. 38:D437-D442(2010). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000279; EFA01492.1; -; Genomic_DNA. DR STRING; 7070.TC007046-PA; -. DR EnsemblMetazoa; TC007046-RA; TC007046-PA; TC007046. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; D2A235; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; D2A235; -. DR Proteomes; UP000007266; Linkage group 4. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 5. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 6. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000007266}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007266}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2552 AA; 282438 MW; 57E6D88E527C5484 CRC64; MADPDPETLL EWLNSGLGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCRI FLDEQAPENV LEVTARAITY YLDVSAECTR RIVAIEGAVR AICNRLVVAE LSSRTSKDLA EQCVKVLELI CTREAGAVFD AGGLSAILPF IRDNGNRVHK DTLHSAMAVV SRLCTKMEPA DVQLPTCVQA LSTLLRHEDS HVADGALRCF ASVADRFTRR GVDPAPLAQH GLVNELLSRL SNAAGPSVAT GTQNTSGKAS STTNATAAVP DAKATAASVS TIISLLSTLC RGSPAITHDL LRSELPDAIE KSLKGDERCA LDSMRLVDLL LVLLFEGRRA LGKSGGATTS GQLLPRMRRM DPAAEKSHRQ LIDCIRSKDT DALIEAIESG GVEVNFMDDV GQTLLNWASA FGTQEMVEYL CDRGADVNKG QRSSSLHYAA CFGRPAIAKV LLRHGANPDL RDEDGKTPLD KARERADEGH REVAAILQSP GEWMIPIDKD RNRKSESDND DNIEPRGDPE MAPVYLKRLL PVFCTTFQST MLASVRKASL GLIKKMIHYI LPSLLEELCN NESSPNFGTQ LVEVIATVLD NEDDEDGHLI VLAIIQDLMA KCQEIFLDHF ARLGIFSKVQ ALAGPPEVQE NEENEQQTTE EVTEQSHVED AKEILPGKAY HWRDWSVCRG RDCLYIWSDA AALELSNGSN GWFRFILDGK LATMYSSGSP EGGADTSGKG RAADTLTTEE NRGEFLEKLQ RARAAVRSNV SSIPILSKPG PTRLVVGNWS LTSRKDAEIH IHNSDGQHQT TILREDLPGF IFESNRGTKH SFTAETSLGP EFSAGWTNKK GKRLRSKTEA TKIKVKYLAH SIYEQYFRAA QAQPRGVVAK LGNIVAQIER ACQKQCSYGN NREGGNSWKE ILRNALDDLT QILEDDGVVS AYELHSSGLI QALLSLLSTS YWDQGLKSSK MNKYQKQRVQ VFKQCFKSRA NEEKNSIQIL VHKLVAVLES IEKLPVYLYD SPGSGYGLQI LTRRLRFRLE KAPGESSLID RTGRGLKMEP LSTVAQLERY LLKMVAKQWY DYDRSTFQFL KKLRESKYQT FKHSHDFDEN GIIYFIGTNG KTSSEWVNPA QYGLVTVTSS DGRNLPYGRV EDILSRDSSA LNCHTNDDKK AWFSIDLGLY VIPSGYTLRH ARGYGRSALR NWYFQMSRDG ITWTTLSTHT DDTSLNDPGS TNSWPIELSS PDEQGWRHVR IQQAGKNASG QTHYLSLSGF EIYGQVVGVC EDLGKAAKEA EAHLRKQRRL LKSQLLKHMT VGARVVRGID WKWRGQDGNP PGEGTVTGEL HSGWIDVTWD HGGSNSYRMG AEGKYDLKLA PGYDVEATTS SKNATSKPKD SKDKQSVLTS RKSSSTPSLP EATDVKTSVA STEQAASADN LAAKQAAETI AESVLNMARN EALVAVTSES QAANNDSELS VVVHPLRDPH HDLSTINNSS DLATIVESLA LSDTKPATVT TRRQNSDDRP NTSEKASMVT SNSTSNLNKS GHSNRSKINN SQVAAAAAQT FVEAVEALDK LREGSDMLRN NTNNFLSGEL LQTALSLGQT SQQSISSLNT GVRISVSGNP ETDEEKPVRR KPDEPSTTKD PCSEKDEVTN NARNTSNNTI VVTNPMSVSV PNLTSTEANS QIEPTSTAGL LETFAALARR RTLGSVTTTN SSNANPNNSG AITNSNAQNN QNTGSLFPRG PNSVSSLVRL ALSSNFPGGL LSTAQSYPSL SSSNNTAAQQ GGISTTAGTV QGLSQALTMS LTSTSSDSEQ VSLEDFLESC RAPTLLAELD DDEEMGDEDD NDDDENEDDA DYDEVMVSRN LLSFMEEEGF ETSSRPSKRR SWDDEYVLKR QFSALIPAFD PRPGRTNVNQ TTDLEVPPPG QEENGSYLEH ELLPQPKLQL VLRGPNMPGI PDVEIELNDP SWTIFRAVQE LMQMTEFGSK QEKLRRIWEP TYIIVYRELK DDGTYDSEEG RATPVVLYSR SGSSSVAGTT LSPSTPVPGT PSVSNCTVED VLQLLRHLFV ITTTKENDLN SLDNNIEITP DQYTSKKITN KLLQQIQDPL VLSSSSLPAW CEELNHSCPF LFPFETRHLY FNCTAFGASR SIVWLQTQRD VTMERQRTTG LSPRRDEPHE FRVGRLKHER VKVPRGEEIL SWAMQVMKIH SDRKSILEVE FLGEEGTGLG PTLEFYALVA AELQRRDLGM WICDDDAIDM QEIVDLGEGV KPPGYYVRRV SGLFPAPLPQ NSEICDKAVK YFWFLGVFLA KVLQDNRLVD LPLSQSFLKL MCHGEIQNTV NERIGFAGVK KSSEDDIMTS SLISEESEKE LELDPPKMII EDKKPWYLNI LGEEDLHDID YIRANFLKQI RELVKQKHKV MQDHNLSPEA KTHQIQNMCL NHASGPVLLE DLALTFTYAP SSSAFGFSAV ELVSNGADIE VNIENIEEYA ELTTSFCLDK GIARQLEAFH KGFCTVFPME KLAAFSPDEM RIMLCGDQNP QWTRDDLINY TEPKLGYTKD RRKTVNFDSV EE // ID D2GZK3_AILME Unreviewed; 700 AA. AC D2GZK3; DT 09-FEB-2010, integrated into UniProtKB/TrEMBL. DT 09-FEB-2010, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFB26740.1}; DE Flags: Fragment; GN ORFNames=PANDA_002522 {ECO:0000313|EMBL:EFB26740.1}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646; RN [1] {ECO:0000313|EMBL:EFB26740.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL192392; EFB26740.1; -; Genomic_DNA. DR STRING; 9646.ENSAMEP00000011330; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000253025; -. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 197 218 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 346 384 {ECO:0000256|SAM:Coils}. FT COILED 387 414 {ECO:0000256|SAM:Coils}. FT COILED 461 481 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EFB26740.1}. FT NON_TER 700 700 {ECO:0000313|EMBL:EFB26740.1}. SQ SEQUENCE 700 AA; 78506 MW; 8F7FA6EA17EA9558 CRC64; RTLKRKSSNM KRLSPAPQLG PSSDAHTSYY SESVVRESYF GSPRASSLAR SSILDDHLHS DPYWSEDLRG RRRRGTGGTE SSKLNGLAEN KSSEDFLGSS SGYSSEDDFA GRSEDRPSGG LSVSAGYLET DHRSSGSRLR NAVSWAASCF WTLVTSPGRL FGLLYWWVGT TWYRLTTAAS LLDVFVLTRR FSSVKTFLWF LLLLLLMTGL TYGAWYFYPY GLQTLQPAVV SWWAAKSSSG RQDMWESRDS SPFQAEQHIM SRVHSLERRL EALAAEFSSN WQKEAVRLER LELRQGAAGG GGHVGLSQED TLALLEGLVS RREAALKEDF RRDTAAWIQE ELVSLRAEHQ QDSEDLFKKI VQASQESEAR IQQLKSEWQR MTQESFRENS MKELARLEGQ LAGLRQELAA LSLKQSSVAD QVGLLPQQLQ AVRDDVESQF PAWVSQFLLR GGGTRTGLVQ REELQAQLQE LESKILAHVA EMQGRSASEA AASLGLTLQK EGVIGVTEEQ VQRIVNQALK RYSEDRIGMV DYALESGGAS VISTRCSETY ETKTALLSLF GIPLWYHSQS PRVILQPDVH PGNCWAFQGP QGFAVVRLSA RIRPTAVTLE HVPKSLSPNS TISSAPKDFS IFGFDEDLQQ EGTLLGQFTY DQDGEPIQTF YFQDTKMATY QVVELRILTN WGHPEYTCIY RFRVHGEPTH // ID D2H453_AILME Unreviewed; 436 AA. AC D2H453; DT 09-FEB-2010, integrated into UniProtKB/TrEMBL. DT 09-FEB-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFB21644.1}; DE Flags: Fragment; GN ORFNames=PANDA_004538 {ECO:0000313|EMBL:EFB21644.1}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646; RN [1] {ECO:0000313|EMBL:EFB21644.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL192479; EFB21644.1; -; Genomic_DNA. DR STRING; 9646.ENSAMEP00000010386; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000246956; -. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 136 159 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 165 190 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 203 237 {ECO:0000256|SAM:Coils}. FT NON_TER 436 436 {ECO:0000313|EMBL:EFB21644.1}. SQ SEQUENCE 436 AA; 48044 MW; 1D026ACA9F5AE4E9 CRC64; MRRSPRPGSA TSPHKHTPNF YSDNSNSSVS VTSGDSSGHR SAGPGEPEGR RARGSSCGEP ALSAGVPGGT TWAGSSRQKP APGSHNVDLP CARPQWRPWA PAEPAGSPVV SEEQLDLLST LDLRQEIPPP RVSKNFLSLL LQVLSVLLSL VGDVLVIVYR EVCSIRFLLT AVSLLSLFLA ALWWGLLYLV PPSENEPKEM LTLSEYHERV RSQGQQLQQL QAELNKLHKE VSSVRAANSE RVAKIVFQRL NEDFVRKPDY ALSSVGASID LEKTSHDYED ANTAYFWNRF SFWNYARPPT VILEPDVFPG NCWAFEGDQG QVVIRLPGRV QLSDITLQHP PPSVAHSGGA NSAPRDFAVY GLQVDDETEV FLGKFTFDVE KSEIQTFHLQ NDPPNAFPKV KIQILSNWGH PRFTCLYRVR AHGMRISEGA GDSATG // ID D2H8Z5_AILME Unreviewed; 239 AA. AC D2H8Z5; DT 09-FEB-2010, integrated into UniProtKB/TrEMBL. DT 09-FEB-2010, sequence version 1. DT 14-OCT-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFB14887.1}; DE Flags: Fragment; GN ORFNames=PANDA_006758 {ECO:0000313|EMBL:EFB14887.1}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646; RN [1] {ECO:0000313|EMBL:EFB14887.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL192588; EFB14887.1; -; Genomic_DNA. DR HOGENOM; HOG000007503; -. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 15 35 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EFB14887.1}. FT NON_TER 239 239 {ECO:0000313|EMBL:EFB14887.1}. SQ SEQUENCE 239 AA; 27153 MW; BA1D7554812F2229 CRC64; VRMYQEKVRH HTGEIQDLRG NMTQLIAKLQ LMEAMSDEQK MAQKIMKMIQ GDFIEKPDFA LKSIGASIDF EQTSATYNHD KARSYWNWIR LWNYAQPPDV ILEAGGLGDE ERVAGPNMTP GNCWAFSGDR GQVTIRLAQK VYLSNLTLQH IPKTISLSGS LDTAPKDFVI YGMEGSPREE VFLGAFQFQP ENIIQMFQLQ NQPVRAFGAV KVKISSNWGN PRFTCLYRVR VHGSVTPPR // ID D2H9M5_AILME Unreviewed; 2610 AA. AC D2H9M5; DT 09-FEB-2010, integrated into UniProtKB/TrEMBL. DT 09-FEB-2010, sequence version 1. DT 11-NOV-2015, entry version 53. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMEP00000016685}; DE Flags: Fragment; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSAMEP00000016685}; GN ORFNames=PANDA_007038 {ECO:0000313|EMBL:EFB13214.1}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646; RN [1] {ECO:0000313|EMBL:EFB13214.1, ECO:0000313|Ensembl:ENSAMEP00000016685} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). RN [2] {ECO:0000313|Ensembl:ENSAMEP00000016685} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACTA01138443; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01146442; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01154441; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01162440; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01170438; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01178436; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; GL192604; EFB13214.1; -; Genomic_DNA. DR RefSeq; XP_002918561.1; XM_002918515.2. DR STRING; 9646.ENSAMEP00000016685; -. DR Ensembl; ENSAMET00000017374; ENSAMEP00000016685; ENSAMEG00000015770. DR GeneID; 100472253; -. DR KEGG; aml:100472253; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR HOGENOM; HOG000018061; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000008912; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008912}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008912}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. FT NON_TER 2610 2610 {ECO:0000313|EMBL:EFB13214.1}. SQ SEQUENCE 2610 AA; 289208 MW; 067CF52969A3EFB8 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNTMDF DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RAPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENAK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAIST LQSSDILNLA KEPPQAKAGN GQSSCGVEDV LQLLRILYIV ASDPCSRISQ EEGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAVKR RQILSNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID D2HH22_AILME Unreviewed; 301 AA. AC D2HH22; DT 09-FEB-2010, integrated into UniProtKB/TrEMBL. DT 09-FEB-2010, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFB30037.1}; DE Flags: Fragment; GN ORFNames=PANDA_010371 {ECO:0000313|EMBL:EFB30037.1}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646; RN [1] {ECO:0000313|EMBL:EFB30037.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL192826; EFB30037.1; -; Genomic_DNA. DR STRING; 9646.ENSAMEP00000005034; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000007503; -. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; FT NON_TER 1 1 {ECO:0000313|EMBL:EFB30037.1}. FT NON_TER 301 301 {ECO:0000313|EMBL:EFB30037.1}. SQ SEQUENCE 301 AA; 34346 MW; 661CF7E8B45BF4F5 CRC64; LVFSFTGLGN HMWLKETEFP QRSRQFYALI AEYGSRLYNY QARLRMPKEQ LELLKKESQT LENNFREILF LIEQIDVLKA LLRDTRDGLH YSWNADGGKD PEPLEATEEE MSNLVNYVLK KLREDQVQMA DYALKSAGAS VIEAGTSESY KNNKAKLYWH GIGFLTYEMP PDIILQPDVH PGKCWAFPGS QGHALIKLAR KIKPTAITME HISEKVSPSG NISSAPKEFS VYGISKQCEG EEIFLGQFVY NKTGSTVQTF KLQHDVSESL LCVKLKILSN WGHPKYTCLY RFRVHGTPGE N // ID D2I6M8_AILME Unreviewed; 832 AA. AC D2I6M8; DT 09-FEB-2010, integrated into UniProtKB/TrEMBL. DT 09-FEB-2010, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFB26971.1}; DE Flags: Fragment; GN ORFNames=PANDA_021492 {ECO:0000313|EMBL:EFB26971.1}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646 {ECO:0000313|Proteomes:UP000008912}; RN [1] {ECO:0000313|EMBL:EFB26971.1, ECO:0000313|Proteomes:UP000008912} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL195043; EFB26971.1; -; Genomic_DNA. DR STRING; 9646.ENSAMEP00000006247; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 279 296 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 302 323 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 335 352 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 420 447 {ECO:0000256|SAM:Coils}. FT COILED 473 507 {ECO:0000256|SAM:Coils}. FT COILED 522 542 {ECO:0000256|SAM:Coils}. FT NON_TER 832 832 {ECO:0000313|EMBL:EFB26971.1}. SQ SEQUENCE 832 AA; 92466 MW; D8D2765CDB528283 CRC64; MDFSWLHMYT PPQCVPENTG YTYALSSSYS SDALAFETEH RLDPVFDSPR MSRRSLRLVT TACAVEDGQA GDACSCVSST ASLKDRVARA AKQRRSVSKP AVSVNHTSRK VVSCAAGQSA ASMLSGAACL RPPVLDESLI REQTKVDHFW GLDDDGDLKG GNKAATQGNG DLAAEGTRSN GYTCSDCLLL AERKDTLTAH SAPRGTSPRL YSRDMNQKHE SVSLKGDDCK RKELLEMHTA VRLQSSSPKS VAGAIWHVFS YTGHLLVQTL QRIGASGWSV LKMLLSVLWL AVLAPGKAAS GIFWWLGIGW YQFVTLISWL NVFLLTRCLR NICKFLLLLI PLLLLLAAGL SLCGQGDFLS GLPVLNWTRI YGAQRVDGPE STFTPGESHL SQLLEDGDEA FRWFRRSEVE RQLTSLSGQC RSHDEKLREL AAVLQKLQAQ VDQMDGDSEA TLSLSVAYLP LVFLSKTDTM SFHQEHELRL SNLEDVLGKL TEKSEAIRKE LEQTKLRTAS GAEEEQYLLS MVKHLELELG QLKSELSSWQ HLKTSCEEVD TIHGKASDAQ VRETIRRMFS GEEKGGSLEW LLQTVSSRFV SKDDLQVLLR DLELQILKNV THYISVTKRV PDSETVVSAA KEAGISGITE AQARVIVNNA LKLYSQDKTG MVDFALESGG GSVLSTRCSE TYETKTALIS LFGIPLWYFS QSPRVVIQPD IHPGNCWAFR GSQGYLVVRL SMKIRPTTFT LEHIPKTLSP TGNITSAPKD FAVYGLENEY QEEGQLLGQF MYDQEGESLQ MFHVLERPDG TFQIVELRIL SNWGHPEYTC LYRFRVHGEP VK // ID D2JWS5_9TREE Unreviewed; 825 AA. AC D2JWS5; DT 09-FEB-2010, integrated into UniProtKB/TrEMBL. DT 09-FEB-2010, sequence version 1. DT 14-OCT-2015, entry version 9. DE SubName: Full=Putative Sad1 protein {ECO:0000313|EMBL:ACZ80628.1}; OS Filobasidiella depauperata. OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella. OX NCBI_TaxID=5208 {ECO:0000313|EMBL:ACZ80628.1}; RN [1] {ECO:0000313|EMBL:ACZ80628.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=CBS7855 {ECO:0000313|EMBL:ACZ80628.1}; RX PubMed=20224779; DOI=10.1371/journal.pone.0009620; RA Rodriguez-Carres M., Findley K., Sun S., Dietrich F.S., Heitman J.; RT "Morphological and genomic characterization of Filobasidiella RT depauperata: a homothallic sibling species of the pathogenic RT cryptococcus species complex."; RL PLoS ONE 5:E9620-E9620(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GU131347; ACZ80628.1; -; Genomic_DNA. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 279 335 {ECO:0000256|SAM:Coils}. FT COILED 510 537 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 825 AA; 92400 MW; 23FA070B9C8EE225 CRC64; MGVVDTSTTA SKAESTVQDT KDESVGSQDL LSFEEWKRIK MEEDEQATVS QDVSSEDSVT HTSALEASSI ATDIEIGSST ESPNNLPQTI ADFVKVESTQ PLPAATHHNK YNYASLDCSA RIHSSSPQTQ HASSILHKSR DRYMLTPCKA KEHWVVVELC DEIRIEAVEI AVWEFFSSVV REVRVSVGGE DEEEEAAREK DEGKSHRWKE VGSFVGKNIR GSQTFTLFQP TSFHRFIRLD FPTFFGTZYY CPVSSLKVYG MNQMEAFKWE QKKLNAAVKE KDRNGNKEKE RELEELRMVE RQEKEKRERE EKDRQEARER ELDELEKLLH EQAKRVIPDL LTESGLISSV DESVPTVTPS PVDPASLSLN SSTSSLKSDT SMNATAETLE NSNSSTMRAT NSSNSAIFSS TKSLEAKTAT SSTSTFSRVS IPRSDSSESI YAFIVRRLNA LEGNSSLVAR YIEEQAKAMR FMLRRVETRW DEWKADWEGD DHGRWQQERM RQEDRLGKVI SQLEQQRIAF ENDRKEMQAQ LRGLANELSY ERRRGIAQLI AMIIIVILGV ITRTTTIDNI LTPLLVEARR RRNVYTRRST SGPLTGLCID MGDGRSPKVI GQGTQFEHNA DSTSQLPSPS YTPRAKHSLS RSGSGNRPNH LGKRRALQAP FSSLRSASAT DHTLSSNSSH ASPISAFTNP RPRVSLPSAR GLRKLARSSH LHSMDATMRD NQDQHANALH AAESAYVSDA PVGLKRRRPR ISLLYNADVV SSPISSPSPK AANEKDTGMG RLEDSQGEWN TDLDTEASEV ENEVVRKDMS DKANLVKIQA RGKHG // ID D2VL48_NAEGR Unreviewed; 709 AA. AC D2VL48; DT 02-MAR-2010, integrated into UniProtKB/TrEMBL. DT 02-MAR-2010, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EFC42478.1}; GN ORFNames=NAEGRDRAFT_69660 {ECO:0000313|EMBL:EFC42478.1}; OS Naegleria gruberi (Amoeba). OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; Naegleria. OX NCBI_TaxID=5762 {ECO:0000313|Proteomes:UP000006671}; RN [1] {ECO:0000313|EMBL:EFC42478.1, ECO:0000313|Proteomes:UP000006671} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=NEG-M {ECO:0000313|EMBL:EFC42478.1, RC ECO:0000313|Proteomes:UP000006671}; RX PubMed=20211133; DOI=10.1016/j.cell.2010.01.032; RA Fritz-Laylin L.K., Prochnik S.E., Ginger M.L., Dacks J.B., RA Carpenter M.L., Field M.C., Kuo A., Paredez A., Chapman J., Pham J., RA Shu S., Neupane R., Cipriano M., Mancuso J., Tu H., Salamov A., RA Lindquist E., Shapiro H., Lucas S., Grigoriev I.V., Cande W.Z., RA Fulton C., Rokhsar D.S., Dawson S.C.; RT "The genome of Naegleria gruberi illuminates early eukaryotic RT versatility."; RL Cell 140:631-642(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG738879; EFC42478.1; -; Genomic_DNA. DR STRING; 5762.XP_002675222.1; -. DR EnsemblProtists; EFC42478; EFC42478; NAEGRDRAFT_69660. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D2VL48; -. DR Proteomes; UP000006671; Unassembled WGS sequence. DR InterPro; IPR028119; Snapin/Pallidin/Snn1. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR Pfam; PF14712; Snapin_Pallidin; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006671}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006671}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 156 176 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 533 553 {ECO:0000256|SAM:Coils}. FT COILED 594 614 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 709 AA; 79538 MW; 695E032EB9DB101C CRC64; MAPIPELEDD GSLHDSERDV SMKETPAPTR KKNQSHKRKS TDDEAVEDAK KRKSNPTEEE QTILEEETFR TDTPFQSPDV RRKSTGLRTK PQQLNQQVFV NRKHKSVPSN VLDANTSYQT PKKTNIEAEV PPSISTFIPE KQFAISRNKA SKTGTYTILA SLVAIISIVY LIYVQYVTLD IPKTNVTEES NLSNNTQNSA PIIKYFYVTN NVTSNVNHSE IFEKLIEKHS KQFKNNMLDL IDTRLYELER KLNNNIRSSI SSSSVSVKKE QQKELEKLKE SLLSKISTEV DIILSQKLKS INDINEQSIS EIKGRLDTII PGVKNLIDES LDKYDADKIG LTDYALSSLG SKIVEHSPTY SPSKFWPQLF VPVKTPDMII KPDTTIGNCW PMKGSSGFVV IEIAHSIIPT SFSIDHVPKA LSPNISSAPK QISVFGYENE TTLTKLSAFE YDVHGSPTQT FPVNESTNKK YNKFRFQISG NYGNSFYTCI YRFRIHGDSQ AMIDKTFADG LIRVLEPLTT EYDSKVKDIQ ISQTKLAEEI DRLAKKLDSC KEEAQFVNVA PYLQKLANSR KRVINISTTL GHISDRLSRL NKLAKQKYPE LEQLRQQRER LKQQGSSGKL VVQQQSSSSS EVVPATSTTT PSSTTEQQSS STTTPTEQTP PEQSSTTLSN EDETTKSTPS TSEEAQDTNT NITTSSATTE NPVEAVDQQ // ID D2VWJ9_NAEGR Unreviewed; 624 AA. AC D2VWJ9; DT 02-MAR-2010, integrated into UniProtKB/TrEMBL. DT 02-MAR-2010, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EFC38854.1}; GN ORFNames=NAEGRDRAFT_73406 {ECO:0000313|EMBL:EFC38854.1}; OS Naegleria gruberi (Amoeba). OC Eukaryota; Heterolobosea; Schizopyrenida; Vahlkampfiidae; Naegleria. OX NCBI_TaxID=5762 {ECO:0000313|Proteomes:UP000006671}; RN [1] {ECO:0000313|EMBL:EFC38854.1, ECO:0000313|Proteomes:UP000006671} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=NEG-M {ECO:0000313|EMBL:EFC38854.1, RC ECO:0000313|Proteomes:UP000006671}; RX PubMed=20211133; DOI=10.1016/j.cell.2010.01.032; RA Fritz-Laylin L.K., Prochnik S.E., Ginger M.L., Dacks J.B., RA Carpenter M.L., Field M.C., Kuo A., Paredez A., Chapman J., Pham J., RA Shu S., Neupane R., Cipriano M., Mancuso J., Tu H., Salamov A., RA Lindquist E., Shapiro H., Lucas S., Grigoriev I.V., Cande W.Z., RA Fulton C., Rokhsar D.S., Dawson S.C.; RT "The genome of Naegleria gruberi illuminates early eukaryotic RT versatility."; RL Cell 140:631-642(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG738904; EFC38854.1; -; Genomic_DNA. DR STRING; 5762.XP_002671598.1; -. DR EnsemblProtists; EFC38854; EFC38854; NAEGRDRAFT_73406. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; D2VWJ9; -. DR Proteomes; UP000006671; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006671}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006671}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 42 60 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 320 340 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 624 AA; 71237 MW; FF8DC06EEB7B7E4E CRC64; MSSSHSTTTT TTQYVHQNKP KFKSRRLLSE LSDTSSQLSR TITTPIISFL VFVLIILIYL DNSSSSTYNL FVHASTSPTA TINNTQQQPN TNTNTNTNNN NIGSSSSSAT IEQQSTTSST PTATAPPPPR QPKINVGFNF ASEEAGAKIL SSNREAKKVS RILNEDSDKY CLIPRSVPKK WIVVELSEEI LMKSIAIANY EYYSCSFKHF KVYASVKYPC KEKSNCWELL GTFQAANSRK VQHFTFKKPS ITRYVKLEFL SHYGENEYYC TLSLLRVHGS TLLEDLKKSL QKSSKKNSES TQITNEQTQS NNHGSIEYDI KKEEANLNEL IKENTEQIAS IHNSDKENSK RLDQLEESFG KLLNDDISNL IKEKITNSRM NSKTEFWAML VKKKFQSFGF CNIGRLNDGN DRISFIIEQE RDVINCTCFK VLKKDFIERV GFLKNSDKGF SLYNRTLHSN NTMNSQDMNT ISNKNREIWG VCQYRVTYFS LKCITNGTLK FENIQPTKKP SAKDSKKKQK VVVDDETDGL DEKTPMKSTL SHLLNEKQNI LKSLFNAIKI HGENENILKN QIKALEMKNA KTFQYIHQIL EKNGEDHKLI ANSFISLTQK NREQIINDMV CFFV // ID D3B2Q5_POLPA Unreviewed; 629 AA. AC D3B2Q5; DT 23-MAR-2010, integrated into UniProtKB/TrEMBL. DT 23-MAR-2010, sequence version 1. DT 14-OCT-2015, entry version 18. DE SubName: Full=SUN domain-containing protein 1 {ECO:0000313|EMBL:EFA83603.1}; GN Name=sun1 {ECO:0000313|EMBL:EFA83603.1}; GN ORFNames=PPL_02669 {ECO:0000313|EMBL:EFA83603.1}; OS Polysphondylium pallidum (Cellular slime mold). OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. OX NCBI_TaxID=13642 {ECO:0000313|Proteomes:UP000001396}; RN [1] {ECO:0000313|Proteomes:UP000001396} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PN500 {ECO:0000313|Proteomes:UP000001396}; RA Gloeckner G., Schaap P., Noegel A.A., Felder M., Eichinger L., RA Heidel A.J., Platzer M.; RT "Living fossils from the dawn of multicellularity."; RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases. CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFA83603.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADBJ01000010; EFA83603.1; -; Genomic_DNA. DR EnsemblProtists; EFA83603; EFA83603; PPL_02669. DR InParanoid; D3B2Q5; -. DR Proteomes; UP000001396; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR GO; GO:0005089; F:Rho guanyl-nucleotide exchange factor activity; IEA:InterPro. DR GO; GO:0035023; P:regulation of Rho protein signal transduction; IEA:InterPro. DR InterPro; IPR000219; DH-domain. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS50010; DH_2; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001396}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001396}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 125 147 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 167 186 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 336 356 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 629 AA; 72009 MW; FE7EE8F3EDDC53CD CRC64; MSTTSRSRFF DSNNSNDLHQ STTRARPISS TTSSTLYSNN NNNNVQFDAD DNYNESSSLT TTTTTTNTNT TSSSNINNNN SLNNSSNSKY KQPISTKQQQ HNNNNNSNSN NNNRSIVYKY TIAPILYLLR LVILPIVWLD VFIMSIFRGP VDMTYQTKGK IENKCRIISW VTLVSLVSLF AVYLLLVRPT PFDINTNNNT VIPPTKIDKE YLQQILHELL NSNNAKINKI IDEKMDLIKL AYMDEISGNN QKLIDVITQK IDYFKQKEHV PLFDKVNRLE EDVKLQSSVS IGKDINELFI QYKQDSATVY EKFLEQIKEL ARMEREQLST NTKSFIDQLI AEKNTLIQQL QSQSKEKFES LIKDFESTTQ SHTLQLISQL EKSQENEKDK LYSSLQEISL KINSIQQWIK DSPELQSLES SLITVEKIQS LIDNALEVYA SDKTARLDYA LRRGGASIQY GLVHHPNTET YPEITIPALL RVATQWIRSD HRPNVPEIIL DQSRNLLGDC WAFKGQNGSI AIKLAQPIIV KAITIEHPNP KISYHFESAL QEFSVVGVRN ETDTGTHLGT FRFEKNNKHI QTFLINNEEV FPIVVLKVLS NFGYDYTCIY RTRVHGEPTS IYKDPVLSF // ID D3BIE0_POLPA Unreviewed; 884 AA. AC D3BIE0; DT 23-MAR-2010, integrated into UniProtKB/TrEMBL. DT 23-MAR-2010, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=SUN domain-containing protein 2 {ECO:0000313|EMBL:EFA79040.1}; GN Name=sun2 {ECO:0000313|EMBL:EFA79040.1}; GN ORFNames=PPL_08510 {ECO:0000313|EMBL:EFA79040.1}; OS Polysphondylium pallidum (Cellular slime mold). OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Polysphondylium. OX NCBI_TaxID=13642 {ECO:0000313|Proteomes:UP000001396}; RN [1] {ECO:0000313|Proteomes:UP000001396} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PN500 {ECO:0000313|Proteomes:UP000001396}; RA Gloeckner G., Schaap P., Noegel A.A., Felder M., Eichinger L., RA Heidel A.J., Platzer M.; RT "Living fossils from the dawn of multicellularity."; RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases. CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFA79040.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADBJ01000037; EFA79040.1; -; Genomic_DNA. DR EnsemblProtists; EFA79040; EFA79040; PPL_08510. DR InParanoid; D3BIE0; -. DR Proteomes; UP000001396; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001396}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001396}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 25 {ECO:0000256|SAM:SignalP}. FT CHAIN 26 884 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003041167. FT TRANSMEM 732 752 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 237 {ECO:0000256|SAM:Coils}. FT COILED 248 282 {ECO:0000256|SAM:Coils}. FT COILED 340 360 {ECO:0000256|SAM:Coils}. FT COILED 705 732 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 884 AA; 101165 MW; 9F2EE236BB9B5108 CRC64; MNILLNRTFY CFFVFLLLLL CYCRADDNIL QQDLANKDIK EQQQEQKLDS YQKIVEQYLQ HQQHQQQQQQ QQQESTTTQL QELNVDNGDG VTSEKEKEIP AVQQTHIDVA ITEEEEEVDI SVVVDQDKPV VEDIKEDTSR EDITIKFSAT SDNNNDVNSN YNNNNNNNNK NENNNDSPTT DSNQNNQKEQ SIIPEYIGNQ ILQDIENKIN NNNNNNNNNK EEKINEQSTV SEEKNEESNE NTKNQQPNQT EKDTEKKIEN ENENKEKDKD NEKEKENNNE TKVGSEDKDT QQQQTEIEKV KEKEIVDEEN NEKNNEIPRI STVLETVHRD VITSIEQKEK EIEQQKIADT QRETKLQEEQ DTNNINNEIK LEPFNKFTQK VIFSLADTES NSALALTHNN TVSNQTSYPT VRTPKDLPDK FNYAGAECGA TVLAANSEAR EISKLANYEF FSSMFKDFVV MGINKYPSST WHFLGNFTAE NIRKPQYFVL KEKSWYKYLK IKMLTNYGNQ MYCTLSDIKV YGSTMIDDLK NQVGINIQEV ESILNRMNNN ITFGTSSSSK LKTRKENETI SWTQLQQVSM NLTQTRESYS PPDKNGGGSG NGNGQSSATS SATEENDEHG ATHVESASPQ SILQLLANRV KSAEINQSIS NKYLEKLETH YSERFRSLDE DFTKIMSVIN GIAELGSDLE RRVAIEQQSI EKKISQDIAK ELNLLRERIN RLEQKHEEDK NYYITLLASS FILAIIISYL IIKANIEETS FKHSSLRIPW NNSNSNSNNN ARRNSLPVFS DQPTTPLTSL SSSTSMIRPN TPNTQFNINT SPTSTGNGFD YVRNSPFSNQ QSLLKASEVL LSPPIVSNSN TNDIFKSKKK KKKANIVQFN NNKY // ID D3KCC3_MAIZE Unreviewed; 639 AA. AC D3KCC3; DT 23-MAR-2010, integrated into UniProtKB/TrEMBL. DT 23-MAR-2010, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=SUN4 {ECO:0000313|EMBL:ADB78704.1}; GN Name=Sun4 {ECO:0000313|EMBL:ADB78704.1}; OS Zea mays (Maize). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; OC PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Tripsacinae; OC Zea. OX NCBI_TaxID=4577 {ECO:0000313|EMBL:ADB78704.1}; RN [1] {ECO:0000313|EMBL:ADB78704.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Central region of pre-emerged {ECO:0000313|EMBL:ADB78704.1}; RX PubMed=21143845; DOI=10.1186/1471-2229-10-269; RA Murphy S.P., Simmons C.R., Bass H.W.; RT "Structure and expression of the maize (Zea mays L.) SUN-domain RT protein gene family: evidence for the existence of two divergent RT classes of SUN proteins in plants."; RL BMC Plant Biol. 10:269-269(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GU453173; ADB78704.1; -; mRNA. DR RefSeq; NP_001183941.1; NM_001197012.1. DR UniGene; Zm.17612; -. DR STRING; 4577.GRMZM2G005483_P01; -. DR PaxDb; D3KCC3; -. DR GeneID; 100502541; -. DR KEGG; zma:100502541; -. DR Gramene; D3KCC3; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 57 79 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 580 600 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 621 638 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 510 537 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 639 AA; 70585 MW; CA1D4B86BB8F195B CRC64; MSLSCWRVRF PGADVREAGR GREGMQRSRK ALLRRTAAAQ VQSAVAEAAG NGRKRRLYGF SVSLVVTLWV AVLLLHSLVG HGDGQRDGGG SGVDITFIEP ALNGGPVNSA VQEVHGENLA VPSDTCVGSV ENAVLPEDTL VQAAQLCSND EARSENTEAL TKNNQVELSG DQCGYLPQPD FDSGVQPGEK VESEDLPRPP RLSRVAPPDL DEFKTRAIAE RGPGISSQPG NVVHRREPSG KLYNYAAASK GAKVLDFNKE AKGASNILDK DKDKYLRNAC SAEGKFVIIE LSEETLVDTI AIANFEHYSS NPKEFELLSS LTYPTENWET LGRFTAANAR LAQNFTFLEP KWARYLKLNL VSHYGSEFYC TLSMLEVYGM DAVEKMLENL IPVENKKTEP DGKIKEPIEQ IPLKESAGGK ESSQEPLDED EFELEDGKPN GHGDSSKNGA NDPVSETRTL QAGRIPGDTV LKVLMQKVQS LDVSFSVLER YLVELNNRYG QIFKDFDSDI DSKDALLEKI KTELKNLESS KDSITNEIEG IISWKVVASS QLNQLVLDNA LLRSEFETFR QKQADMENRS LAVIFLSFVF ACLALAKLSI GIMSRFCRFY DFEKFHNVRS GWLVLLLSSC IISTILIIQ // ID D3Z0V9_MOUSE Unreviewed; 757 AA. AC D3Z0V9; DT 20-APR-2010, integrated into UniProtKB/TrEMBL. DT 20-APR-2010, sequence version 1. DT 11-NOV-2015, entry version 46. DE SubName: Full=SUN domain-containing protein 1 {ECO:0000313|Ensembl:ENSMUSP00000106506}; DE SubName: Full=Sun1 isoform eta {ECO:0000313|EMBL:ADP89697.1}; GN Name=Sun1 {ECO:0000313|EMBL:ADP89697.1, GN ECO:0000313|Ensembl:ENSMUSP00000106506, ECO:0000313|MGI:MGI:1924303}; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090 {ECO:0000313|Ensembl:ENSMUSP00000106506, ECO:0000313|Proteomes:UP000000589}; RN [1] {ECO:0000313|Ensembl:ENSMUSP00000106506, ECO:0000313|Proteomes:UP000000589} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C57BL/6J {ECO:0000313|Ensembl:ENSMUSP00000106506, RC ECO:0000313|Proteomes:UP000000589}; RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112; RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., RA She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., RA Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., RA Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., RA Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., RA Lindblad-Toh K., Eichler E.E., Ponting C.P.; RT "Lineage-specific biology revealed by a finished genome assembly of RT the mouse."; RL PLoS Biol. 7:E1000112-E1000112(2009). RN [2] {ECO:0000213|PubMed:21183079} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001; RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R., RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.; RT "A tissue-specific atlas of mouse protein phosphorylation and RT expression."; RL Cell 143:1174-1189(2010). RN [3] {ECO:0000313|EMBL:ADP89697.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=C57BL/6J {ECO:0000313|EMBL:ADP89697.1}; RC TISSUE=Testis {ECO:0000313|EMBL:ADP89697.1}; RX PubMed=20711465; DOI=10.1371/journal.pone.0012072; RA Gob E., Schmitt J., Benavente R., Alsheimer M.; RT "Mammalian sperm head formation involves different polarization of two RT novel LINC complexes."; RL PLoS ONE 5:E12072-E12072(2010). RN [4] {ECO:0000313|EMBL:ADP89697.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=C57BL/6J {ECO:0000313|EMBL:ADP89697.1}; RC TISSUE=Testis {ECO:0000313|EMBL:ADP89697.1}; RA Goeb E.; RL Submitted (OCT-2010) to the EMBL/GenBank/DDBJ databases. RN [5] {ECO:0000313|Ensembl:ENSMUSP00000106506} RP IDENTIFICATION. RC STRAIN=C57BL/6J {ECO:0000313|Ensembl:ENSMUSP00000106506}; RG Ensembl; RL Submitted (MAY-2011) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC125065; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; HQ402597; ADP89697.1; -; mRNA. DR RefSeq; NP_001243047.1; NM_001256118.1. DR UniGene; Mm.210845; -. DR STRING; 10090.ENSMUSP00000056655; -. DR Ensembl; ENSMUST00000110882; ENSMUSP00000106506; ENSMUSG00000036817. DR GeneID; 77053; -. DR UCSC; uc012efr.2; mouse. DR CTD; 23353; -. DR MGI; MGI:1924303; Sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000253025; -. DR NextBio; 346388; -. DR Proteomes; UP000000589; Chromosome 5. DR GO; GO:0002080; C:acrosomal membrane; IDA:MGI. DR GO; GO:0005737; C:cytoplasm; IDA:MGI. DR GO; GO:0016021; C:integral component of membrane; IDA:MGI. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IDA:MGI. DR GO; GO:0043231; C:intracellular membrane-bounded organelle; ISO:MGI. DR GO; GO:0034993; C:LINC complex; ISO:MGI. DR GO; GO:0005635; C:nuclear envelope; IDA:MGI. DR GO; GO:0031965; C:nuclear membrane; ISO:MGI. DR GO; GO:0005634; C:nucleus; IDA:MGI. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; ISO:MGI. DR GO; GO:0006998; P:nuclear envelope organization; ISO:MGI. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; ISO:MGI. DR GO; GO:0007129; P:synapsis; IMP:MGI. DR InterPro; IPR012919; SUN_dom. DR InterPro; IPR015880; Znf_C2H2-like. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00355; ZnF_C2H2; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000589}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|MaxQB:D3Z0V9, KW ECO:0000213|PeptideAtlas:D3Z0V9}; KW Reference proteome {ECO:0000313|Proteomes:UP000000589}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 228 248 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 260 279 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 345 365 {ECO:0000256|SAM:Coils}. FT COILED 407 441 {ECO:0000256|SAM:Coils}. FT COILED 455 475 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 757 AA; 84671 MW; 764BD7C53D24CCA3 CRC64; MDFSRLHTYT PPQCVPENTG YTYALSSSYS SDALDFETEH KLEPVFDSPR MSRRSLRLVT TASYSSGDSQ AIDSHISTSR ATPAKGRETR TVKQRRSASK PAFSINHLSG KGLSSSTSHD SSCSLRSATV LRHPVLDESL IREQTKVDHF WGLDDDGDLK GGNKAATQGN GELAAEVASS NGYTCRDCRM LSARTDALTA HSAIHGTTSR VYSRDRTLKP RKAASGTFWW LGSGWYQFVT LISWLNVFLL TRCLRNICKV FVLLLPLLLL LGAGVSLWGQ GNFFSLLPVL NWTAMQPTQR VDDSKGMHRP GPLPPSPPPK VDHKASQWPQ ESDMGQKVAS LSAQCHNHDE RLAELTVLLQ KLQIRVDQVD DGREGLSLWV KNVVGQHLQE MGTIEPPDAK TDFMTFHHDH EVRLSNLEDV LRKLTEKSEA IQKELEETKL KAGSRDEEQP LLDRVQHLEL ELNLLKSQLS DWQHLKTSCE QAGARIQETV QLMFSEDQQG GSLEWLLEKL SSRFVSKDEL QVLLHDLELK LLQNITHHIT VTGQAPTSEA IVSAVNQAGI SGITEAQAHI IVNNALKLYS QDKTGMVDFA LESGGGSILS TRCSETYETK TALLSLFGVP LWYFSQSPRV VIQPDIYPGN CWAFKGSQGY LVVRLSMKIY PTTFTMEHIP KTLSPTGNIS SAPKDFAVYG LETEYQEEGQ PLGRFTYDQE GDSLQMFHTL ERPDQAFQIV ELRVLSNWGH PEYTCLYRFR VHGEPIQ // ID D3Z805_RAT Unreviewed; 314 AA. AC D3Z805; DT 20-APR-2010, integrated into UniProtKB/TrEMBL. DT 20-APR-2010, sequence version 1. DT 11-NOV-2015, entry version 41. DE SubName: Full=Protein Sun5 {ECO:0000313|Ensembl:ENSRNOP00000029493}; DE SubName: Full=Sperm associated antigen 4-like (Predicted) {ECO:0000313|EMBL:EDL85988.1}; GN Name=Sun5 {ECO:0000313|Ensembl:ENSRNOP00000029493, GN ECO:0000313|RGD:1306357}; GN Synonyms=Spag4l_predicted {ECO:0000313|EMBL:EDL85988.1}; GN ORFNames=rCG_37429 {ECO:0000313|EMBL:EDL85988.1}; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000029493, ECO:0000313|Proteomes:UP000002494}; RN [1] {ECO:0000313|Ensembl:ENSRNOP00000029493, ECO:0000313|Proteomes:UP000002494} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000029493, RC ECO:0000313|Proteomes:UP000002494}; RX PubMed=15057822; DOI=10.1038/nature02426; RG Rat Genome Sequencing Project Consortium; RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J., RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G., RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G., RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G., RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S., RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T., RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., RA Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A., RA Doucette-Stamm L., Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., RA Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., RA Zhu B., Marra M., Schein J., Bosdet I., Fjell C., Jones S., RA Krzywinski M., Mathewson C., Siddiqui A., Wye N., McPherson J., RA Zhao S., Fraser C.M., Shetty J., Shatsman S., Geer K., Chen Y., RA Abramzon S., Nierman W.C., Havlak P.H., Chen R., Durbin K.J., Egan A., RA Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J., RA D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R., RA Kalafus K.J., McLeod M.P., Milosavljevic A., Virk D., Volkov A., RA Wheeler D.A., Zhang Z., Bailey J.A., Eichler E.E., Tuzun E., RA Birney E., Mongin E., Ureta-Vidal A., Woodwark C., Zdobnov E., RA Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J., RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., RA Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., RA Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., RA Poliakov A., Huebner N., Ganten D., Goesele C., Hummel O., RA Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H., RA Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S., RA Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J., RA Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M., RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., RA Webber C., Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., RA Elnitski L., Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., RA Miller W., Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., RA Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., RA Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper G.M., RA Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A., RA Bourque G., Lopez-Otin C., Puente X.S., Chakrabarti K., Chatterji S., RA Dewey C., Pachter L., Bray N., Yap V.B., Caspi A., Tesler G., RA Pevzner P.A., Haussler D., Roskin K.M., Baertsch R., Clawson H., RA Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R., RA Trumbower H., Weirauch M., Cooper D.N., Stenson P.D., Ma B., Brent M., RA Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H., RA Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S., RA Collins F.S.; RT "Genome sequence of the Brown Norway rat yields insights into RT mammalian evolution."; RL Nature 428:493-521(2004). RN [2] {ECO:0000313|EMBL:EDL85988.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=BN {ECO:0000313|EMBL:EDL85988.1}; RX PubMed=15632090; DOI=10.1101/gr.2889405; RA Florea L., Di Francesco V., Miller J., Turner R., Yao A., Harris M., RA Walenz B., Mobarry C., Merkulov G.V., Charlab R., Dew I., Deng Z., RA Istrail S., Li P., Sutton G.; RT "Gene and alternative splicing annotation with AIR."; RL Genome Res. 15:54-66(2005). RN [3] {ECO:0000313|EMBL:EDL85988.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=BN {ECO:0000313|EMBL:EDL85988.1}; RA Mural R.J., Li P.W., Adams M.D., Amanatides P.G., Baden-Tillson H., RA Barnstead M., Chin S.H., Dew I., Evans C.A., Ferriera S., Flanigan M., RA Fosler C., Glodek A., Gu Z., Holt R.A., Jennings D., Kraft C.L., RA Lu F., Nguyen T., Nusskern D.R., Pfannkoch C.M., Sitter C., RA Sutton G.G., Venter J.C., Wang Z., Woodage T., Zheng X.H., Zhong F.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. RN [4] {ECO:0000313|Ensembl:ENSRNOP00000029493} RP IDENTIFICATION. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000029493}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AABR06027115; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AABR06027116; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CH474050; EDL85988.1; -; Genomic_DNA. DR RefSeq; NP_001100000.1; NM_001106530.1. DR UniGene; Rn.218586; -. DR STRING; 10116.ENSRNOP00000029493; -. DR Ensembl; ENSRNOT00000037589; ENSRNOP00000029493; ENSRNOG00000027221. DR GeneID; 296289; -. DR KEGG; rno:296289; -. DR UCSC; RGD:1306357; rat. DR CTD; 140732; -. DR RGD; 1306357; Sun5. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 640938; -. DR Proteomes; UP000002494; Chromosome 3. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002494}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002494}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 78 95 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 314 AA; 36142 MW; 0E68857BF7981FF4 CRC64; MPRTRNTGDL CPISEDTTHS GRSRRSAQRS YINRMAEATP ANMTWLTYLA CFLRTQAQQV LLNTCRFKLL FQKLIEKMGL LVLCVFGFWM FSMHLPSKME VWQDLRGSMN LLIAKLQQME AMSDEQKMTQ KIMKMIQGDF IEKPDFALKS IGASIDFEHT SPTYNHDKAR SYWNWIRLWN YAQPPDPSVT PGNCWAFAGD RGQVTIRLAQ KVYLSNVTLQ HIPKTISLSG CLDTAPKDFV IYGMEHPPRE EVFLGAFQFQ PENIIQTFQL QNQPPRGFAA VKVKISSNWG NPRFTCLYRV RVHGCATPPK RSYL // ID D3ZLS5_RAT Unreviewed; 2610 AA. AC D3ZLS5; DT 20-APR-2010, integrated into UniProtKB/TrEMBL. DT 22-JUL-2015, sequence version 3. DT 11-NOV-2015, entry version 50. DE SubName: Full=Protein Hectd1 {ECO:0000313|Ensembl:ENSRNOP00000008459}; GN Name=Hectd1 {ECO:0000313|Ensembl:ENSRNOP00000008459, GN ECO:0000313|RGD:1561653}; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000008459, ECO:0000313|Proteomes:UP000002494}; RN [1] {ECO:0000313|Ensembl:ENSRNOP00000008459, ECO:0000313|Proteomes:UP000002494} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000008459, RC ECO:0000313|Proteomes:UP000002494}; RX PubMed=15057822; DOI=10.1038/nature02426; RG Rat Genome Sequencing Project Consortium; RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J., RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G., RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G., RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G., RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S., RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T., RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., RA Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A., RA Doucette-Stamm L., Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., RA Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., RA Zhu B., Marra M., Schein J., Bosdet I., Fjell C., Jones S., RA Krzywinski M., Mathewson C., Siddiqui A., Wye N., McPherson J., RA Zhao S., Fraser C.M., Shetty J., Shatsman S., Geer K., Chen Y., RA Abramzon S., Nierman W.C., Havlak P.H., Chen R., Durbin K.J., Egan A., RA Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J., RA D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R., RA Kalafus K.J., McLeod M.P., Milosavljevic A., Virk D., Volkov A., RA Wheeler D.A., Zhang Z., Bailey J.A., Eichler E.E., Tuzun E., RA Birney E., Mongin E., Ureta-Vidal A., Woodwark C., Zdobnov E., RA Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J., RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., RA Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., RA Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., RA Poliakov A., Huebner N., Ganten D., Goesele C., Hummel O., RA Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H., RA Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S., RA Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J., RA Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M., RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., RA Webber C., Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., RA Elnitski L., Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., RA Miller W., Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., RA Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., RA Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper G.M., RA Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A., RA Bourque G., Lopez-Otin C., Puente X.S., Chakrabarti K., Chatterji S., RA Dewey C., Pachter L., Bray N., Yap V.B., Caspi A., Tesler G., RA Pevzner P.A., Haussler D., Roskin K.M., Baertsch R., Clawson H., RA Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R., RA Trumbower H., Weirauch M., Cooper D.N., Stenson P.D., Ma B., Brent M., RA Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H., RA Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S., RA Collins F.S.; RT "Genome sequence of the Brown Norway rat yields insights into RT mammalian evolution."; RL Nature 428:493-521(2004). RN [2] {ECO:0000313|Ensembl:ENSRNOP00000008459} RP IDENTIFICATION. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000008459}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [3] {ECO:0000213|PubMed:22673903} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=22673903; RA Lundby A., Secher A., Lage K., Nordsborg N.B., Dmytriyev A., RA Lundby C., Olsen J.V.; RT "Quantitative maps of protein phosphorylation sites across 14 RT different rat organs and tissues."; RL Nat. Commun. 3:876-876(2012). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSRNOP00000008459}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AABR07064269; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AABR07064270; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_006240148.1; XM_006240086.1. DR UniGene; Rn.36797; -. DR ProteinModelPortal; D3ZLS5; -. DR STRING; 10116.ENSRNOP00000008459; -. DR PaxDb; D3ZLS5; -. DR Ensembl; ENSRNOT00000008459; ENSRNOP00000008459; ENSRNOG00000006905. DR GeneID; 362736; -. DR UCSC; RGD:1561653; rat. DR RGD; 1561653; Hectd1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; D3ZLS5; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR NextBio; 681057; -. DR PRO; PR:D3ZLS5; -. DR Proteomes; UP000002494; Chromosome 6. DR Genevisible; D3ZLS5; RN. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 1: Evidence at protein level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002494}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Proteomics identification {ECO:0000213|PeptideAtlas:D3ZLS5}; KW Reference proteome {ECO:0000313|Proteomes:UP000002494}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289034 MW; 23173C24619E1084 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSAAADS KLSNQVSTIV SLLSTLCRGS PLVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPVY LKRLLPVFAQ TFQHTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDAGH NLPTALVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSALAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSVIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE GENTWRDLMK TALENLIVLL KDESTISPYE MCSSGLVQAL LTVLNSSIDL DMKQDCSQLV ERINVFKTAF SESEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGVWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPAKDEKQGW RHVRLKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RAPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPSGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLSNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTR SGLNQGASSS LQSSDILNLT KEQPQAKAGN GQNPCGVEDV LQLLRILYIV ASDPYSRISQ EDGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GTWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILGNKSLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID D3ZTT7_RAT Unreviewed; 730 AA. AC D3ZTT7; DT 20-APR-2010, integrated into UniProtKB/TrEMBL. DT 20-APR-2010, sequence version 1. DT 11-NOV-2015, entry version 50. DE SubName: Full=Protein Sun2 {ECO:0000313|Ensembl:ENSRNOP00000044360}; DE SubName: Full=Similar to SUN2 (Predicted), isoform CRA_a {ECO:0000313|EMBL:EDM15784.1}; GN Name=Sun2 {ECO:0000313|Ensembl:ENSRNOP00000044360, GN ECO:0000313|RGD:1563141}; GN Synonyms=RGD1563141_predicted {ECO:0000313|EMBL:EDM15784.1}; GN ORFNames=rCG_59961 {ECO:0000313|EMBL:EDM15784.1}; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000044360, ECO:0000313|Proteomes:UP000002494}; RN [1] {ECO:0000313|Ensembl:ENSRNOP00000044360, ECO:0000313|Proteomes:UP000002494} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000044360, RC ECO:0000313|Proteomes:UP000002494}; RX PubMed=15057822; DOI=10.1038/nature02426; RG Rat Genome Sequencing Project Consortium; RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J., RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G., RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G., RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G., RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S., RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T., RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., RA Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A., RA Doucette-Stamm L., Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., RA Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., RA Zhu B., Marra M., Schein J., Bosdet I., Fjell C., Jones S., RA Krzywinski M., Mathewson C., Siddiqui A., Wye N., McPherson J., RA Zhao S., Fraser C.M., Shetty J., Shatsman S., Geer K., Chen Y., RA Abramzon S., Nierman W.C., Havlak P.H., Chen R., Durbin K.J., Egan A., RA Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J., RA D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R., RA Kalafus K.J., McLeod M.P., Milosavljevic A., Virk D., Volkov A., RA Wheeler D.A., Zhang Z., Bailey J.A., Eichler E.E., Tuzun E., RA Birney E., Mongin E., Ureta-Vidal A., Woodwark C., Zdobnov E., RA Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J., RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., RA Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., RA Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., RA Poliakov A., Huebner N., Ganten D., Goesele C., Hummel O., RA Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H., RA Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S., RA Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J., RA Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M., RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., RA Webber C., Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., RA Elnitski L., Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., RA Miller W., Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., RA Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., RA Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper G.M., RA Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A., RA Bourque G., Lopez-Otin C., Puente X.S., Chakrabarti K., Chatterji S., RA Dewey C., Pachter L., Bray N., Yap V.B., Caspi A., Tesler G., RA Pevzner P.A., Haussler D., Roskin K.M., Baertsch R., Clawson H., RA Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R., RA Trumbower H., Weirauch M., Cooper D.N., Stenson P.D., Ma B., Brent M., RA Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H., RA Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S., RA Collins F.S.; RT "Genome sequence of the Brown Norway rat yields insights into RT mammalian evolution."; RL Nature 428:493-521(2004). RN [2] {ECO:0000313|EMBL:EDM15784.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=BN {ECO:0000313|EMBL:EDM15784.1}; RX PubMed=15632090; DOI=10.1101/gr.2889405; RA Florea L., Di Francesco V., Miller J., Turner R., Yao A., Harris M., RA Walenz B., Mobarry C., Merkulov G.V., Charlab R., Dew I., Deng Z., RA Istrail S., Li P., Sutton G.; RT "Gene and alternative splicing annotation with AIR."; RL Genome Res. 15:54-66(2005). RN [3] {ECO:0000313|EMBL:EDM15784.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=BN {ECO:0000313|EMBL:EDM15784.1}; RA Mural R.J., Li P.W., Adams M.D., Amanatides P.G., Baden-Tillson H., RA Barnstead M., Chin S.H., Dew I., Evans C.A., Ferriera S., Flanigan M., RA Fosler C., Glodek A., Gu Z., Holt R.A., Jennings D., Kraft C.L., RA Lu F., Nguyen T., Nusskern D.R., Pfannkoch C.M., Sitter C., RA Sutton G.G., Venter J.C., Wang Z., Woodage T., Zheng X.H., Zhong F.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. RN [4] {ECO:0000313|Ensembl:ENSRNOP00000044360} RP IDENTIFICATION. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000044360}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [5] {ECO:0000213|PubMed:22673903} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=22673903; RA Lundby A., Secher A., Lage K., Nordsborg N.B., Dmytriyev A., RA Lundby C., Olsen J.V.; RT "Quantitative maps of protein phosphorylation sites across 14 RT different rat organs and tissues."; RL Nat. Commun. 3:876-876(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC128476; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CH473950; EDM15784.1; -; Genomic_DNA. DR RefSeq; XP_006226220.1; XM_006226158.2. DR RefSeq; XP_006242107.1; XM_006242045.2. DR UniGene; Rn.2240; -. DR STRING; 10116.ENSRNOP00000044360; -. DR Ensembl; ENSRNOT00000046399; ENSRNOP00000044360; ENSRNOG00000015177. DR GeneID; 315135; -. DR CTD; 25777; -. DR RGD; 1563141; Sun2. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Reactome; R-RNO-1221632; Meiotic synapsis. DR NextBio; 33815504; -. DR PRO; PR:D3ZTT7; -. DR Proteomes; UP000002494; Chromosome 7. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002494}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:D3ZTT7}; KW Reference proteome {ECO:0000313|Proteomes:UP000002494}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 175 192 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 226 247 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 286 306 {ECO:0000256|SAM:Coils}. FT COILED 365 385 {ECO:0000256|SAM:Coils}. FT COILED 417 444 {ECO:0000256|SAM:Coils}. FT COILED 491 511 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 730 AA; 81527 MW; EA839DC4BED2C735 CRC64; MSRRSQRLTR YSQDDNDGSS SSGASSVAGG QSTVFKDSPL RTLKRKSSNM KRLSPAPQLP PPSDSHTSYY SESVVRESYI GSPRAVSLAR SALLDDHLHS EPYWSGDLRG RRRRGTGGSE SSKANGLTME NKASEDFFGS SSGYSSEDDL AGYTDSDQHS SGSGLRSAAS RAGSFVWTLV TLPGRLFGLL YWWVGTTWYR LTTAASLLDV FVLTRSRHFS PNLKSFLWFL LLLLLLTGLT YGAWHFYPLG LQTLQPAVAS WWAAKESRRQ PEVWDTRDAS SHLQAEQRIL SRVHSLERRL EALAAEFSSN WQKEAIRLER LELRQGAAGH GGGSSLSHED ALSLLEGLVS RREAALKEDL RRDTVARIQE ELATLRAEHH QDSEDLFKKI VQASQESEAR VQQLKTEWRS MTQEAFQESS VKELERLEAQ LAGLRQELAA LTLKQNSVAD EVGLLPQKIQ AARADVESQF PDWISQFLLR DRGARSGLLQ RDEMHAQLQE LENKILANMA EMQGKSAREA AASLGQTLQK EGIVGVTEEQ VHRIVKQALQ RYSEDRIGMV DYALESGGAS VISTRCSETY ETKTALLSLF GIPLWYHSQS PRVILQPDVH PGNCWAFQGP QGFAVVRLSA RIRPTAVTLE HVPKALSPNS TISSAPKDFA IFGFDEDLQQ EGTLLGTFAY DQDGEPIQTF YFQASKMATY QVVELRILTN WGHPEYTCIY RFRVHGEPAH // ID D4AIH1_ARTBC Unreviewed; 872 AA. AC D4AIH1; DT 18-MAY-2010, integrated into UniProtKB/TrEMBL. DT 18-MAY-2010, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Sad1/UNC domain protein {ECO:0000313|EMBL:EFE36544.1}; GN ORFNames=ARB_04066 {ECO:0000313|EMBL:EFE36544.1}; OS Arthroderma benhamiae (strain ATCC MYA-4681 / CBS 112371) OS (Trichophyton mentagrophytes). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Arthroderma. OX NCBI_TaxID=663331 {ECO:0000313|EMBL:EFE36544.1, ECO:0000313|Proteomes:UP000008866}; RN [1] {ECO:0000313|Proteomes:UP000008866} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4681 / CBS 112371 {ECO:0000313|Proteomes:UP000008866}; RX PubMed=21247460; DOI=10.1186/gb-2011-12-1-r7; RA Burmester A., Shelest E., Gloeckner G., Heddergott C., Schindler S., RA Staib P., Heidel A., Felder M., Petzold A., Szafranski K., RA Feuermann M., Pedruzzi I., Priebe S., Groth M., Winkler R., Li W., RA Kniemeyer O., Schroeckh V., Hertweck C., Hube B., White T.C., RA Platzer M., Guthke R., Heitman J., Woestemeyer J., Zipfel P.F., RA Monod M., Brakhage A.A.; RT "Comparative and functional genomics provide insights into the RT pathogenicity of dermatophytic fungi."; RL Genome Biol. 12:R7.1-R7.16(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFE36544.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABSU01000001; EFE36544.1; -; Genomic_DNA. DR RefSeq; XP_003017189.1; XM_003017143.1. DR STRING; 663331.XP_003017189.1; -. DR EnsemblFungi; EFE36544; EFE36544; ARB_04066. DR GeneID; 9522273; -. DR KEGG; abe:ARB_04066; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008866; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008866}; KW Reference proteome {ECO:0000313|Proteomes:UP000008866}. FT COILED 395 433 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 872 AA; 96349 MW; 7AD22DDA461BF0D7 CRC64; MAVDKSNTIC QAHVSDLGAE YIRYPICLET RWNAAASASA AASATTGGEG PSTTRSASGI YADSKDASGS VTVPVPGTAS SDSSNSKESG SSGSGDDADV ESPLDNSNFL SFEEWKNQNL AKAGQSAETM RRHRQDKGQQ ARRRHTRSPQ MNDPLDGLGE ESEIDLEFGG FSTDESGVAS WERKDGGKAS PDNIDSVTAP SGAVVGGKED KHPSQPIFEL DGQDAENMPR KGIGRRKHAG TTCKERFNYA SFDCAATVLK TNPQCTGSSA VLNENKDSYM LNECRAKDKF LIMELCDDIL VDTVVLANYE FFSSIFRSFR VSVSDRYPIK ADKWRVLGTY EAANARQVQA FAVENPLIWA RYLKIDFLSH YGNEFYCPVS LVRVHGTTMM EEYKNDGEAA RADEEEDANA QEEAEQQRRQ NEQQQQLEQK EADVVVHPDV SVPEVVINDQ MVPLSNLSDR ELDELRCFVE RNETESILLG LVSSKMCAIQ ERAAHIASQP VTATRVKDEA AAPASDSITS TNTPEQIRSV SSTRTPTTSD REETRRSSTG SSIAANGSHT EPTRMNSATY SPSPASPPPN PSTQESFFKS VNKRLQMLES NSTLSLLYIE EQSRILRDAF NKVEKRQLAK TSTFLENLNS TVLQELKEFR QQYDHLWHSV FIEFEQQRQQ YHREVYSVAT QLGVLADELV FQKRVAVIQS IFVLVCFGLV LFSRSSGTPY LEFPRNIVTR TRSFRSSSVT YGSPAPSASP SPPPMSRMGS SILSRSEADD DNLHHNHSRH HRSPSEQTDY EVGNPTFTYS PPTPTSRTTT PERTRKLRFS PEPQSGLAAS ATGSPATMSD PELSLRKRPI KSVEVKHESE SDAEQPEGDS FT // ID D4B2M7_ARTBC Unreviewed; 664 AA. AC D4B2M7; DT 18-MAY-2010, integrated into UniProtKB/TrEMBL. DT 18-MAY-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EFE30338.1}; GN ORFNames=ARB_02710 {ECO:0000313|EMBL:EFE30338.1}; OS Arthroderma benhamiae (strain ATCC MYA-4681 / CBS 112371) OS (Trichophyton mentagrophytes). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Arthroderma. OX NCBI_TaxID=663331 {ECO:0000313|EMBL:EFE30338.1, ECO:0000313|Proteomes:UP000008866}; RN [1] {ECO:0000313|Proteomes:UP000008866} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4681 / CBS 112371 {ECO:0000313|Proteomes:UP000008866}; RX PubMed=21247460; DOI=10.1186/gb-2011-12-1-r7; RA Burmester A., Shelest E., Gloeckner G., Heddergott C., Schindler S., RA Staib P., Heidel A., Felder M., Petzold A., Szafranski K., RA Feuermann M., Pedruzzi I., Priebe S., Groth M., Winkler R., Li W., RA Kniemeyer O., Schroeckh V., Hertweck C., Hube B., White T.C., RA Platzer M., Guthke R., Heitman J., Woestemeyer J., Zipfel P.F., RA Monod M., Brakhage A.A.; RT "Comparative and functional genomics provide insights into the RT pathogenicity of dermatophytic fungi."; RL Genome Biol. 12:R7.1-R7.16(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFE30338.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABSU01000030; EFE30338.1; -; Genomic_DNA. DR RefSeq; XP_003010978.1; XM_003010932.1. DR EnsemblFungi; EFE30338; EFE30338; ARB_02710. DR GeneID; 9525095; -. DR KEGG; abe:ARB_02710; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008866; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 3. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008866}; KW Reference proteome {ECO:0000313|Proteomes:UP000008866}. SQ SEQUENCE 664 AA; 72271 MW; AF05AAB5AE9336FF CRC64; MAPPRRTRRL TPSAAGGASN EADNPYLPSI ETQQTFSYGG SATPALPRPL GSLPAANTAA DVAASIEAAI TRPARPARPA ARPLTESAGF HQIEDEARKS PEKQRVTRGQ QRRAESMTPP REPVRRMTPD IQLMGSLREA SGEPEDHDQQ QQQQQQQQEQ QSDDPVDLLA DAIDGSSISW NTERHLLANE RPAFGLTGWP RPTSMRPQMS PSQASSTSIH QTTQQQYQQQ QHPRQQQSLQ KHHYQQLRGQ PQRSRATAER IERGIAIGPP VGLTTITTTT NNNNNAATTA SPSVRPETPS DQPAAIHTPQ SEHTPASSRP PSALDNAAAP TSPTSPSPST SPSGVTNFGF MHVVCILLSI MMALNGYLLR DEIASAARSI IYSPSGHYGM ANCTESISQM MAAVDQRLTS MTKDISFLKQ EVNKATTSPP PPKPPVNPLE PRRPNFFSLG FGATVDPYLS SPTLSSTTSY LDRLRRLAGG IRPGPSHVSA LQPWDDIGDC WCASTTSITS TTTSSTTTSK KENRIQLAVE LGRPIVPEEV IVEHMPREAT LDNGAAAPQL MELWGEFSDS SSVNSDEVRS ALAAVWPGEA ESAYAHEPSL GPSFVRLGRW QYDIHAAHHI QRFTIAASAV HDLPATSKVV VVARSNWGRR EYTCLYRLRL HGRL // ID D4DCK4_TRIVH Unreviewed; 874 AA. AC D4DCK4; DT 18-MAY-2010, integrated into UniProtKB/TrEMBL. DT 18-MAY-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Sad1/UNC domain protein {ECO:0000313|EMBL:EFE40376.1}; GN ORFNames=TRV_04859 {ECO:0000313|EMBL:EFE40376.1}; OS Trichophyton verrucosum (strain HKI 0517). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Trichophyton. OX NCBI_TaxID=663202 {ECO:0000313|EMBL:EFE40376.1, ECO:0000313|Proteomes:UP000008383}; RN [1] {ECO:0000313|Proteomes:UP000008383} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=HKI 0517 {ECO:0000313|Proteomes:UP000008383}; RX PubMed=21247460; DOI=10.1186/gb-2011-12-1-r7; RA Burmester A., Shelest E., Gloeckner G., Heddergott C., Schindler S., RA Staib P., Heidel A., Felder M., Petzold A., Szafranski K., RA Feuermann M., Pedruzzi I., Priebe S., Groth M., Winkler R., Li W., RA Kniemeyer O., Schroeckh V., Hertweck C., Hube B., White T.C., RA Platzer M., Guthke R., Heitman J., Woestemeyer J., Zipfel P.F., RA Monod M., Brakhage A.A.; RT "Comparative and functional genomics provide insights into the RT pathogenicity of dermatophytic fungi."; RL Genome Biol. 12:R7.1-R7.16(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFE40376.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACYE01000248; EFE40376.1; -; Genomic_DNA. DR RefSeq; XP_003020994.1; XM_003020948.1. DR STRING; 663202.XP_003020994.1; -. DR EnsemblFungi; EFE40376; EFE40376; TRV_04859. DR GeneID; 9577846; -. DR KEGG; tve:TRV_04859; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008383; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008383}. FT COILED 397 435 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 874 AA; 96583 MW; 3799A57A46554510 CRC64; MAVDKSNTIC QARVSDLGAE YIRYPICLET RWNAAASASA SAAASATTGS EGPSTTRSAS GIYADSKDTS GSVTVPVPGT ASSDSSSSKE TGSSGNGDDA DVESPLDNSN FLSFEEWKNQ NFAKAGQSAE TMRRHRQDKG QQARRRHTRS PQMNDPLDGL GEESEIDLEF GGFSTDESGV ASWERKDGGR ASPDNIDSVT APGGAIVGGK EDKHPSQPIF ELDGQDAENM PRKGIGRRKH AGTTCKERFN YASFDCAATV LKTNPQCTGS SAVLNENKDS YMLNECRAKD KFLIMELCDD ILVDTVVLAN YEFFSSIFRS FRVSVSDRYP IKADKWRVLG TYEAANARQV QAFAVENPLI WARYLKIDFL SHYGNEFYCP VSLVRVHGTT MMEEYKNDGE AARADEEEDA NAQEEAEQQR QQDEQQQQLE QKEADVVVHP DVSVPEVVIN DQMVPLSNLS DRELDELRCF VERNETESIL LGLVSSKMCA IQERAAHIAS QPVTATRVKD EAAAPASGSI TSTNTPEQIR SVSSTRTPTT SDREETRRSS TGSSIAANGS HTEPTRMNSA TYSPSPASPP PNPSTQESFF KSVNKRLQML ESNSTLSLLY IEEQSRILRD AFNKVEKRQL AKTSTFLENL NSTVLQELKE FRQQYDHLWH SVFIEFEQQR QQYHREVYSV ATQLGVLADE LVFQKRVAVI QSIFVLVCFG LVLFSRSSGT PYLEFPRNIV TRTRSFRSSS VTYGSPAPSA SPSPPPMSRM GSSILSRSEA DDDNLHHNHS RHHRSPSEQT DYEVGNPTFT YSPPTPTSRT TTPERTRKLR FSSEPQSGLA ASETGSPATM SDPELSLRKR PIKSVEVKHE SESDAEQAEG DSFT // ID D4G7C7_DROME Unreviewed; 563 AA. AC D4G7C7; DT 18-MAY-2010, integrated into UniProtKB/TrEMBL. DT 18-MAY-2010, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=RT06825p {ECO:0000313|EMBL:ADE06685.1}; DE SubName: Full=RT07126p {ECO:0000313|EMBL:ADE58551.1}; GN Name=koi-RA {ECO:0000313|EMBL:ADE06685.1}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:ADE06685.1}; RN [1] {ECO:0000313|EMBL:ADE06685.1} RP NUCLEOTIDE SEQUENCE. RA Carlson J., Booth B., Frise E., Park S., Wan K., Yu C., Celniker S.; RL Submitted (MAR-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:ADE58551.1} RP NUCLEOTIDE SEQUENCE. RA Carlson J., Booth B., Frise E., Park S., Wan K., Yu C., Celniker S.; RL Submitted (APR-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BT122079; ADE06685.1; -; mRNA. DR EMBL; BT122181; ADE58551.1; -; mRNA. DR STRING; 7227.FBpp0292403; -. DR PaxDb; D4G7C7; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR OMA; LEHEKDQ; -. DR PhylomeDB; D4G7C7; -. DR NextBio; 794196; -. DR Bgee; D4G7C7; -. DR ExpressionAtlas; D4G7C7; differential. DR Genevisible; D4G7C7; DM. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 183 210 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 563 AA; 63875 MW; 8511DB810D1EA8B7 CRC64; MEVPTVRSPQ REAEAIKVNM ASIEQNIQKA LTAEEYENIL NHVNSYVQQL VELKMQQHSK ELAPQQIELF VKLMKENLKQ IMYKTELSEK DLSDLAIKLK LELQSSGGWQ DGAKLSQANL EEITKLIKAE VHLHESHYTI QLDRIDFASL LERILAAPAL ADFVDARISL RVGELEPKES SGSSDAEVQI ERLNREIAFI KLALSDKQAE NADLHQSISN LKLGQEDLLE RIQQHELSQD RRFHGLLAEI ENKLSALNDS QFALLNKQIK LSLVEILGFK QSTAGGSAGQ LDDFDLQTWV RSMFVAKDYL EQQLLELNKR TNNNIRDEIE RSSILLMSDI SQRLKREILL VVEAKHNEST KALKGHIREE EVRQIVKTVL AIYDADKTGL VDFALESAGG QILSTRCTES YQTKSAQISV FGIPLWYPTN TPRVAISPNV QPGECWAFQG FPGFLVLKLN SLVYVTGFTL EHIPKSLSPT GRIESAPRNF TVWGLEQEKD QEPVLFGDYQ FEDNGASLQY FAVQNLDIKR PYEIVELRIE TNHGHPTYTC LYRFRVHGKP PAT // ID D5GG91_TUBMM Unreviewed; 126 AA. AC D5GG91; DT 15-JUN-2010, integrated into UniProtKB/TrEMBL. DT 15-JUN-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAZ83534.1}; GN ORFNames=GSTUM_00007266001 {ECO:0000313|EMBL:CAZ83534.1}; OS Tuber melanosporum (strain Mel28) (Perigord black truffle). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Pezizomycetes; OC Pezizales; Tuberaceae; Tuber. OX NCBI_TaxID=656061 {ECO:0000313|Proteomes:UP000006911}; RN [1] {ECO:0000313|EMBL:CAZ83534.1, ECO:0000313|Proteomes:UP000006911} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Mel28 {ECO:0000313|EMBL:CAZ83534.1, RC ECO:0000313|Proteomes:UP000006911}; RX PubMed=20348908; DOI=10.1038/nature08867; RA Martin F., Kohler A., Murat C., Balestrini R., Coutinho P.M., RA Jaillon O., Montanini B., Morin E., Noel B., Percudani R., Porcel B., RA Rubini A., Amicucci A., Amselem J., Anthouard V., Arcioni S., RA Artiguenave F., Aury J.M., Ballario P., Bolchi A., Brenna A., Brun A., RA Buee M., Cantarel B., Chevalier G., Couloux A., Da Silva C., RA Denoeud F., Duplessis S., Ghignone S., Hilselberger B., Iotti M., RA Marcais B., Mello A., Miranda M., Pacioni G., Quesneville H., RA Riccioni C., Ruotolo R., Splivallo R., Stocchi V., Tisserant E., RA Viscomi A.R., Zambonelli A., Zampieri E., Henrissat B., Lebrun M.H., RA Paolocci F., Bonfante P., Ottonello S., Wincker P.; RT "Perigord black truffle genome uncovers evolutionary origins and RT mechanisms of symbiosis."; RL Nature 464:1033-1038(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN430253; CAZ83534.1; -; Genomic_DNA. DR RefSeq; XP_002839343.1; XM_002839297.1. DR EnsemblFungi; CAZ83534; CAZ83534; GSTUM_00007266001. DR GeneID; 9184596; -. DR KEGG; tml:GSTUM_00007266001; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000006911; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006911}; KW Reference proteome {ECO:0000313|Proteomes:UP000006911}. SQ SEQUENCE 126 AA; 14506 MW; ABD4EB9983348AAB CRC64; MFVHKFSAIG RTLRFYVKPH LETESGAASI FVENKDSCLL NKCGAEKKFF IMELCDDILV DTVVLANFES FSSMFRSLKI FDRDRYPIKR NGWKDVSTFE ARNSRQAQAF LIGRGTSYGV FDPVRE // ID D5GMJ0_TUBMM Unreviewed; 500 AA. AC D5GMJ0; DT 15-JUN-2010, integrated into UniProtKB/TrEMBL. DT 15-JUN-2010, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAZ85733.1}; GN ORFNames=GSTUM_00010749001 {ECO:0000313|EMBL:CAZ85733.1}; OS Tuber melanosporum (strain Mel28) (Perigord black truffle). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Pezizomycetes; OC Pezizales; Tuberaceae; Tuber. OX NCBI_TaxID=656061 {ECO:0000313|Proteomes:UP000006911}; RN [1] {ECO:0000313|EMBL:CAZ85733.1, ECO:0000313|Proteomes:UP000006911} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Mel28 {ECO:0000313|EMBL:CAZ85733.1, RC ECO:0000313|Proteomes:UP000006911}; RX PubMed=20348908; DOI=10.1038/nature08867; RA Martin F., Kohler A., Murat C., Balestrini R., Coutinho P.M., RA Jaillon O., Montanini B., Morin E., Noel B., Percudani R., Porcel B., RA Rubini A., Amicucci A., Amselem J., Anthouard V., Arcioni S., RA Artiguenave F., Aury J.M., Ballario P., Bolchi A., Brenna A., Brun A., RA Buee M., Cantarel B., Chevalier G., Couloux A., Da Silva C., RA Denoeud F., Duplessis S., Ghignone S., Hilselberger B., Iotti M., RA Marcais B., Mello A., Miranda M., Pacioni G., Quesneville H., RA Riccioni C., Ruotolo R., Splivallo R., Stocchi V., Tisserant E., RA Viscomi A.R., Zambonelli A., Zampieri E., Henrissat B., Lebrun M.H., RA Paolocci F., Bonfante P., Ottonello S., Wincker P.; RT "Perigord black truffle genome uncovers evolutionary origins and RT mechanisms of symbiosis."; RL Nature 464:1033-1038(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN430356; CAZ85733.1; -; Genomic_DNA. DR RefSeq; XP_002841542.1; XM_002841496.1. DR EnsemblFungi; CAZ85733; CAZ85733; GSTUM_00010749001. DR GeneID; 9182159; -. DR KEGG; tml:GSTUM_00010749001; -. DR InParanoid; D5GMJ0; -. DR KO; K19347; -. DR OMA; ERYSADT; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000006911; Unassembled WGS sequence. DR GO; GO:0000780; C:condensed nuclear chromosome, centromeric region; IEA:EnsemblFungi. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0031021; C:interphase microtubule organizing center; IEA:EnsemblFungi. DR GO; GO:0034993; C:LINC complex; IEA:EnsemblFungi. DR GO; GO:0035974; C:meiotic spindle pole body; IEA:EnsemblFungi. DR GO; GO:0044732; C:mitotic spindle pole body; IEA:EnsemblFungi. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:EnsemblFungi. DR GO; GO:0035861; C:site of double-strand break; IEA:EnsemblFungi. DR GO; GO:0072766; P:centromere clustering at the nuclear envelope; IEA:EnsemblFungi. DR GO; GO:0090307; P:mitotic spindle assembly; IEA:EnsemblFungi. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006911}; KW Reference proteome {ECO:0000313|Proteomes:UP000006911}. FT COILED 32 52 {ECO:0000256|SAM:Coils}. FT COILED 194 214 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 500 AA; 55973 MW; C58B7C660B36D501 CRC64; MESSTDRDKI RNEIDSLSST LDTVRVLQDA MLEDNRRNLD SLLSAQQALD RELSDFRALQ KARNEAMQSF EEALPKQMVA TLGPNGKVQL TDEFQDSLQD VFSKIFPKHF NDAIAKTGPS GIGKIPSWEA FMKGNEDKLK DLIQKHASAG DGVGKNGKPG GVVLSKAFVM AMIQEEAEKY HKKWEVETFL PDFESKFESQ LEELQRRIRR ENTEHFSSAS SSILAAASAI ASGTANRAAR NMAQEFADSR GGRGTSGRIS QGGKERWTTL PDYASIITGA SIWPYITSPS YDWTGGRGYH HFVWRLFGRG ARISPPPALA ITPSTEVGEC WPFPERSGDI GIKLAKPIYV SHVTIDHVPK QQAIEISSAP KNIEFWIRVP AERKEELQKA VGKPASEWFR QDSDTQKIQQ QQLQMSGNNG GEWVRVHEFM YDIHTAGSPV QTFELPVDLT RLNVTSHFVA FRIVDNWGHP NFTCLYRVRV HGYPPKRDLP IGDGERAEGL // ID D6RM79_COPC7 Unreviewed; 878 AA. AC D6RM79; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Spindle pole body-associated protein sad1 {ECO:0000313|EMBL:EFI27877.1}; GN ORFNames=CC1G_11655 {ECO:0000313|EMBL:EFI27877.1}; OS Coprinopsis cinerea (strain Okayama-7 / 130 / ATCC MYA-4618 / FGSC OS 9003) (Inky cap fungus) (Hormographiella aspergillata). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. OX NCBI_TaxID=240176 {ECO:0000313|EMBL:EFI27877.1, ECO:0000313|Proteomes:UP000001861}; RN [1] {ECO:0000313|EMBL:EFI27877.1, ECO:0000313|Proteomes:UP000001861} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Okayama-7 / 130 / ATCC MYA-4618 / FGSC 9003 RC {ECO:0000313|Proteomes:UP000001861}; RX PubMed=20547848; DOI=10.1073/pnas.1003391107; RA Stajich J.E., Wilke S.K., Ahren D., Au C.H., Birren B.W., RA Borodovsky M., Burns C., Canbaeck B., Casselton L.A., Cheng C.K., RA Deng J., Dietrich F.S., Fargo D.C., Farman M.L., Gathman A.C., RA Goldberg J., Guigo R., Hoegger P.J., Hooker J.B., Huggins A., RA James T.Y., Kamada T., Kilaru S., Kodira C., Kuees U., Kupfer D., RA Kwan H.S., Lomsadze A., Li W., Lilly W.W., Ma L.-J., Mackey A.J., RA Manning G., Martin F., Muraguchi H., Natvig D.O., Palmerini H., RA Ramesh M.A., Rehmeyer C.J., Roe B.A., Shenoy N., Stanke M., RA Ter-Hovhannisyan V., Tunlid A., Velagapudi R., Vision T.J., Zeng Q., RA Zolan M.E., Pukkila P.J.; RT "Insights into evolution of multicellular fungi from the assembled RT chromosomes of the mushroom Coprinopsis cinerea (Coprinus cinereus)."; RL Proc. Natl. Acad. Sci. U.S.A. 107:11889-11894(2010). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFI27877.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACS02000004; EFI27877.1; -; Genomic_DNA. DR RefSeq; XP_002911371.1; XM_002911325.1. DR STRING; 240176.XP_002911371.1; -. DR EnsemblFungi; EFI27877; EFI27877; CC1G_11655. DR GeneID; 6015305; -. DR KEGG; cci:CC1G_11655; -. DR EuPathDB; FungiDB:CC1G_11655; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D6RM79; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001861; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001861}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001861}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 338 362 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 383 404 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 505 532 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 878 AA; 95798 MW; ED2090B68E22D0EC CRC64; MPIAAQHNPN HSWASSSRSQ QHIPRSTSVE YEEQARTAAA RRLPGPNSRI AGRASSSNTL KPPSRNRSLQ HVPDSEGEDS IGPNGRATSP YQAVIAAAKR ALAPALDPAT YYVRERTPEG EGNSVEHPSG SAPGNDTTYS YEEEERFVQD LQAAKSEQRI SGRRGRISKD NQAYKPTSDD EYSSEDDDDD RKRRRRAKPA IRLNNLPTVA GGKTTRKRRT KSKSNILAAD EDVSEDISQS DIRSQSAAAS QRASVPRISV EPVPPPDDQS LSMAESGLHS IPEVPEDDIR PQSEPPEPEA TAKQSRAQSK PPSTGRPRKN SRSSTPVRSQ QRFSIGAILG RMFNVFFVLL SSITLIIGRG FGTVFHTVFS RPSQWISSAR PGFFRMVGKY IFFAATILSA WYMLQHPALH SLIPSFDRST SPIYVPPPVP PQDISEVTAR LALIEKALSG LTVESEKNKA KVEESAKGFT DIHHKLGEIE GKWSAETKRI LDTESRARGA LGSTVSSVKE EIAALQAQIE AQKKAYEKEK ARVPAGSDEE ARAKLKALED KLLGFEGPLK EALELGKKLA STPAAPAPPA GTAWWNKVIA GSKGRLQITT PDGQDVTGLI SQLVDHSVAN AMNNKELKVD FALHSAGARV IPSLTSPTFE IKPNSIRAQV AGLITNNGKA IGRPPVTALH PDTYSGDCWP MFGSSGRLGV ALAAPVYIDE ITIDHVAKEA AFDMRSAPRQ MEVWGLVEGA DNLAKVKEWQ DSELLSRKVA AEAAGEVVDD AWEKRDREAQ AALLPPGLPK SGGTYIRLAN FTYDIHAPRN VQTFGVDPKI KELGVDFGVV SLRILNNWGR DEYTCLYRFR VHGRKFGEVP PVWAPESEGE GQVEQESS // ID D6RQS4_COPC7 Unreviewed; 2156 AA. AC D6RQS4; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EFI26622.1}; GN ORFNames=CC1G_15394 {ECO:0000313|EMBL:EFI26622.1}; OS Coprinopsis cinerea (strain Okayama-7 / 130 / ATCC MYA-4618 / FGSC OS 9003) (Inky cap fungus) (Hormographiella aspergillata). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. OX NCBI_TaxID=240176 {ECO:0000313|EMBL:EFI26622.1, ECO:0000313|Proteomes:UP000001861}; RN [1] {ECO:0000313|EMBL:EFI26622.1, ECO:0000313|Proteomes:UP000001861} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Okayama-7 / 130 / ATCC MYA-4618 / FGSC 9003 RC {ECO:0000313|Proteomes:UP000001861}; RX PubMed=20547848; DOI=10.1073/pnas.1003391107; RA Stajich J.E., Wilke S.K., Ahren D., Au C.H., Birren B.W., RA Borodovsky M., Burns C., Canbaeck B., Casselton L.A., Cheng C.K., RA Deng J., Dietrich F.S., Fargo D.C., Farman M.L., Gathman A.C., RA Goldberg J., Guigo R., Hoegger P.J., Hooker J.B., Huggins A., RA James T.Y., Kamada T., Kilaru S., Kodira C., Kuees U., Kupfer D., RA Kwan H.S., Lomsadze A., Li W., Lilly W.W., Ma L.-J., Mackey A.J., RA Manning G., Martin F., Muraguchi H., Natvig D.O., Palmerini H., RA Ramesh M.A., Rehmeyer C.J., Roe B.A., Shenoy N., Stanke M., RA Ter-Hovhannisyan V., Tunlid A., Velagapudi R., Vision T.J., Zeng Q., RA Zolan M.E., Pukkila P.J.; RT "Insights into evolution of multicellular fungi from the assembled RT chromosomes of the mushroom Coprinopsis cinerea (Coprinus cinereus)."; RL Proc. Natl. Acad. Sci. U.S.A. 107:11889-11894(2010). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFI26622.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACS02000012; EFI26622.1; -; Genomic_DNA. DR RefSeq; XP_002910116.1; XM_002910070.1. DR EnsemblFungi; EFI26622; EFI26622; CC1G_15394. DR GeneID; 9380065; -. DR KEGG; cci:CC1G_15394; -. DR EuPathDB; FungiDB:CC1G_15394; -. DR eggNOG; ENOG410K177; Eukaryota. DR eggNOG; ENOG4110CQ2; LUCA. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001861; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001861}; KW Reference proteome {ECO:0000313|Proteomes:UP000001861}. SQ SEQUENCE 2156 AA; 234619 MW; E6DAF3E16C6F97BB CRC64; MAPSLHFESM PQGHPESSGQ HSSDPGLPKP FKTTTSSYPA LNRKRSFNDE NEAESSSAEE SSDADIEGGR IFKDHVIDGL KRRKIGHWPG VQQERREQKV DSPGSVPNVA SENTSRSSGI CSRFLIVRTT DGEGGEDITI DMTIEDEEVF IAEKDTPIQE GQAPNLSQST SDAVQPKRAA SVAHSPPSVS PSHEPCSADG GSMMVVNEHN TDSKLDDDES PGSHPGFQTV KSASDGSTSQ AGAAITPSAL STSLASHPGS QPSDGYQDRS PAPSRYSNDA IIRGFQQTIE RMPTSFPELD SDTSILPDVG NNAISSTTHD HAQPSAHACR IRSSLHSAIP PVSPSIIDVD PSTAPPAFTP SSGRLDAGNN NVSVDARQSI ARLTDYDAPN PLQTVGSSRV NEAEDDTMSV GSRRSMASIS DPVDSETGPF DEVRASASPP VAEATQAPTP EYSPFEGNEP TISFLAAEPD GIEEETMSIA SRRSMASISH REESPKPFDG VRTSTSPQAA PLETHTPTSE DGPFVETELP ISTSSAARPD VAEEGTMEED TMSIASRRSM ASISHREESP KPFDGVRTST SPQAAPLETH TPTSEDGPFV ETELLISTSS AARPDVAEED TMSIASRRSM ASISHREESP KPFDGVRTST SPQATPLETH TPTSEDGPFV ETELPISTSS AARPDVAEEG TMVEDTMSIA SRRSMASISD REESPKPFDG VRTSTSPQAA PLETHTPTSE DGPFVETELP ISTSSAARPD VAEEGTMVED TMSIASRRSM ASISDREESP KPFDGVRTST SPQATPLETH TPTSEDGPFV ETELPISTSS AARPDVAEED TMEEDTMSIA SRRSMASISH REESPKPFDG VHTSTSPQAA PLETHTPTSE DTMSIASRRS MASISDREES SKAFDGTRGS MFPPPVTNQS EKELMSVGSI RSIASVPDYG EDPKPQENVK PSVFSLLVKP HESYPTTISS GTRRSMANSR GQNEVPGPLE GVSTLSSSGG PEVTEAGDDA MSVSRAGSIA SILDHDEDPK PFEGVTALLP SSCPNVTESD TEAMPRGARR NNVTDPEQLD GPKHFPDVSI SNPPSGPKLL ESEEERMSVD DTRSVASVSD HGNDPHPFEG IEAVSVSHSP VNVNPSRGLE VHASSQHPGR GRSRGLTSQM ALKTSSAPGL DENNPRSQGH ECGMTCGSDN EADDEDSDSE GSIAISLFAD EVCLPDQLDV DVGPFLLIEE DFLPNNRGRE YSVSPDPFDE DEDGDEEEYY EGDIASQKPF AKAARFGSQT DFEETPPNRR VASMIETRIK AAPICFAQAK PRRSQSSPPQ KADIRKHCLE LMRRPTPRAK VESVPFESVL AFGESEDKDD GPSVRNFGVF LDFKSLTTVM HRIYANSWNR RAAVVFAQSF VSSGLYDCDD KDAVESQFLV HLIHLGKLFA RQGKKNSAAA IKASADARRK SRRDRFEETA DFYSNIRGME HYPELWAKIH WRVLSGDETD PADPDGPRVI THLKWRHRRI RRWMQNWAYL EDHRRHPDNE NDQGKDPDPR RDPSQEDPPR KVRERDGHPQ KCLPRNFYTK SFLKKVEGGE YLEDYKIQEK ENLSFPPSIM GIIKKQAVKR MKRNTHRGSR ARKEADESQG QQQVATPQPK VPSNKGAPPP LRERGKAGPR KSQSSSGKGK STRGPASSTT NLFAKADPTY GHPGQWFVAR HEWPCWKQSH ACNSEVGPLP EYPTVLVSNP QADQRSRETP EKAPISLVTF WDIFELLRLF GSFKIHRSIP AVIGHSRDNP GQSHAQVAFS AGCHGAAAGS RHLRALRVCG VNSVTQFLFT SHSEPLALST TSRAAVTTER ICCANDGTTV EEDYPLRDTR VTILCPHSCI TMQMFFSWIP TWVTKCEKLG DSLGGAGCSD DGLHTDPLPA QTFNRYQESP TCAMLTSIPI VGDLFHLQRR LKKVVTRLDD IDTRLDDHLP LEYPSRDLAL LGLGASIIPA LTSETDSSLS SHPGVVLPAA ILEESPSIGE CWEFRGSSGQ VGIRLPEKAN ITALSVSYVR PSRLTKPLRE KIPKSFKLWG LVSKEDFDNL APGAPSRPLQ QALRRDRRPK RNAAPDKASG LASVDSTFIH LVSGHYDPLS KRSRQVFPAE HRQNPTGVRG MTKVHW // ID D6WJR5_TRICA Unreviewed; 1081 AA. AC D6WJR5; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFA03108.1}; GN ORFNames=TcasGA2_TC013018 {ECO:0000313|EMBL:EFA03108.1}; OS Tribolium castaneum (Red flour beetle). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Coleoptera; Polyphaga; OC Cucujiformia; Tenebrionidae; Tenebrionidae incertae sedis; Tribolium. OX NCBI_TaxID=7070 {ECO:0000313|Proteomes:UP000007266}; RN [1] {ECO:0000313|EMBL:EFA03108.1, ECO:0000313|Proteomes:UP000007266} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Georgia GA2 {ECO:0000313|EMBL:EFA03108.1, RC ECO:0000313|Proteomes:UP000007266}; RX PubMed=18362917; DOI=10.1038/nature06784; RG Tribolium Genome Sequencing Consortium; RA Richards S., Gibbs R.A., Weinstock G.M., Brown S.J., Denell R., RA Beeman R.W., Gibbs R., Beeman R.W., Brown S.J., Bucher G., RA Friedrich M., Grimmelikhuijzen C.J., Klingler M., Lorenzen M., RA Richards S., Roth S., Schroder R., Tautz D., Zdobnov E.M., Muzny D., RA Gibbs R.A., Weinstock G.M., Attaway T., Bell S., Buhay C.J., RA Chandrabose M.N., Chavez D., Clerk-Blankenburg K.P., Cree A., Dao M., RA Davis C., Chacko J., Dinh H., Dugan-Rocha S., Fowler G., Garner T.T., RA Garnes J., Gnirke A., Hawes A., Hernandez J., Hines S., Holder M., RA Hume J., Jhangiani S.N., Joshi V., Khan Z.M., Jackson L., Kovar C., RA Kowis A., Lee S., Lewis L.R., Margolis J., Morgan M., Nazareth L.V., RA Nguyen N., Okwuonu G., Parker D., Richards S., Ruiz S.J., RA Santibanez J., Savard J., Scherer S.E., Schneider B., Sodergren E., RA Tautz D., Vattahil S., Villasana D., White C.S., Wright R., Park Y., RA Beeman R.W., Lord J., Oppert B., Lorenzen M., Brown S., Wang L., RA Savard J., Tautz D., Richards S., Weinstock G., Gibbs R.A., Liu Y., RA Worley K., Weinstock G., Elsik C.G., Reese J.T., Elhaik E., Landan G., RA Graur D., Arensburger P., Atkinson P., Beeman R.W., Beidler J., RA Brown S.J., Demuth J.P., Drury D.W., Du Y.Z., Fujiwara H., RA Lorenzen M., Maselli V., Osanai M., Park Y., Robertson H.M., Tu Z., RA Wang J.J., Wang S., Richards S., Song H., Zhang L., Sodergren E., RA Werner D., Stanke M., Morgenstern B., Solovyev V., Kosarev P., RA Brown G., Chen H.C., Ermolaeva O., Hlavina W., Kapustin Y., RA Kiryutin B., Kitts P., Maglott D., Pruitt K., Sapojnikov V., RA Souvorov A., Mackey A.J., Waterhouse R.M., Wyder S., Zdobnov E.M., RA Zdobnov E.M., Wyder S., Kriventseva E.V., Kadowaki T., Bork P., RA Aranda M., Bao R., Beermann A., Berns N., Bolognesi R., Bonneton F., RA Bopp D., Brown S.J., Bucher G., Butts T., Chaumot A., Denell R.E., RA Ferrier D.E., Friedrich M., Gordon C.M., Jindra M., Klingler M., RA Lan Q., Lattorff H.M., Laudet V., von Levetsow C., Liu Z., Lutz R., RA Lynch J.A., da Fonseca R.N., Posnien N., Reuter R., Roth S., RA Savard J., Schinko J.B., Schmitt C., Schoppmeier M., Schroder R., RA Shippy T.D., Simonnet F., Marques-Souza H., Tautz D., Tomoyasu Y., RA Trauner J., Van der Zee M., Vervoort M., Wittkopp N., Wimmer E.A., RA Yang X., Jones A.K., Sattelle D.B., Ebert P.R., Nelson D., Scott J.G., RA Beeman R.W., Muthukrishnan S., Kramer K.J., Arakane Y., Beeman R.W., RA Zhu Q., Hogenkamp D., Dixit R., Oppert B., Jiang H., Zou Z., RA Marshall J., Elpidina E., Vinokurov K., Oppert C., Zou Z., Evans J., RA Lu Z., Zhao P., Sumathipala N., Altincicek B., Vilcinskas A., RA Williams M., Hultmark D., Hetru C., Jiang H., Grimmelikhuijzen C.J., RA Hauser F., Cazzamali G., Williamson M., Park Y., Li B., Tanaka Y., RA Predel R., Neupert S., Schachtner J., Verleyen P., Raible F., Bork P., RA Friedrich M., Walden K.K., Robertson H.M., Angeli S., Foret S., RA Bucher G., Schuetz S., Maleszka R., Wimmer E.A., Beeman R.W., RA Lorenzen M., Tomoyasu Y., Miller S.C., Grossmann D., Bucher G.; RT "The genome of the model beetle and pest Tribolium castaneum."; RL Nature 452:949-955(2008). RN [2] {ECO:0000313|EMBL:EFA03108.1, ECO:0000313|Proteomes:UP000007266} RP GENOME REANNOTATION. RC STRAIN=Georgia GA2 {ECO:0000313|EMBL:EFA03108.1, RC ECO:0000313|Proteomes:UP000007266}; RX PubMed=19820115; DOI=10.1093/nar/gkp807; RA Kim H.S., Murphy T., Xia J., Caragea D., Park Y., Beeman R.W., RA Lorenzen M.D., Butcher S., Manak J.R., Brown S.J.; RT "BeetleBase in 2010: revisions to provide comprehensive genomic RT information for Tribolium castaneum."; RL Nucleic Acids Res. 38:D437-D442(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000280; EFA03108.1; -; Genomic_DNA. DR ProteinModelPortal; D6WJR5; -. DR STRING; 7070.TC013018-PA; -. DR EnsemblMetazoa; TC013018-RA; TC013018-PA; TC013018. DR eggNOG; KOG0034; Eukaryota. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; D6WJR5; -. DR OMA; RNATINE; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; D6WJR5; -. DR Proteomes; UP000007266; Linkage group 5. DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro. DR Gene3D; 1.10.238.10; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR011992; EF-hand-dom_pair. DR InterPro; IPR018247; EF_Hand_1_Ca_BS. DR InterPro; IPR002048; EF_hand_dom. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF13499; EF-hand_7; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00054; EFh; 3. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS00018; EF_HAND_1; 2. DR PROSITE; PS50222; EF_HAND_2; 3. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007266}; KW Reference proteome {ECO:0000313|Proteomes:UP000007266}. FT COILED 833 853 {ECO:0000256|SAM:Coils}. FT COILED 855 889 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1081 AA; 121048 MW; 5835CD1115F6083B CRC64; MGQGRSQFSE EELQDYQDLT YFTKKEVLYA HQKFKVLAPE KVGHNKNAKL PMSKMLNYPE LNVNPFGDRI CKVFSSSRDG DCTFEDFLDM MSVFSEAAPK SVKAEHAFRI FDFDGDDMLG ISDLKQIIER LIGKDNHLGE QEMERLIQNV LEEADLDDDG ALSFAEFEHI IDRSSDFLNR GLVCEDAAPT PPPSAPIGEN VSSDSIQDSK KTIESEKSDD VIRGLEPSAD VVSNFNEKTE DTLQVFAEAT SQNSGVANDL TTSGQPVVVT LSDVATAELQ QRAILPSQDT PPEPQSTNHT FKLNQSRSDL PTVKDNLTEE IPSFSEWAQK QLQEAEKNKS NTSTHHPNGN KQASGAKLRW KNYASLDCGA KVVASNPEAV SPSAILSPSR DEYKLNTCTS RIWFIVELCE AIQAKKIDLA NFELFSSSPK DFAVFVSDRF PTREWSNVGH FTAKDERDVQ SFDLHPHLFG KYIKVEVKSH YGSEHYCPIS LFRVYGTSEF EVLQKEDQAH EDRGDDDDDD DGSLDLENGD ARKNLFSSAT DAVISMVKKA AEVLGNKGNY SNQTEILNNT KQAVPLVRVC TSPSHLVVCD NCSDALFGRV YELLSCHDSQ IWGLINLAFI RNVLVESSLC QKFGFCKVSD KFEAHAKYIE ALFPEYLLGA MCNMAAVYQN KVVLNVSNHS NDTIIDDDQM INIETSEPHV LPLETTRKEP EVTDALSLQK SEAVSTPSLP IVTLTSQIKP TKTLNSESDS RQSNTEASSV RSEIKSIVTS TEPVQSNLTE PEEPEVENVT ESVELDDPLD AATPQAQKES VFLRLSNRIK ALERNMSLSG QYLEELSKRY KKQVEEIQKL LDKTILSLNE ESQKKDERNK QLEERLNVLT SNLEALLAER RSWSSTISCV VISSLVTLFI VTFCCKIPEP VAARRPELKR RKSIDVVQHT APKKKRRPSD QALKIVRSSM EASDNLRKKK KRKNMLRRSN SISTLGGEEK KAWPEAGSID WVEGKRFEEV PFVLEESEHS TLEPIALEEK IAPPSFVQTA VMARAVRVNG EAKREENGGV AVEGTPKKEK KGLKRIFRKV F // ID D6WVN4_TRICA Unreviewed; 435 AA. AC D6WVN4; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFA09315.1}; GN ORFNames=TcasGA2_TC030748 {ECO:0000313|EMBL:EFA09315.1}; OS Tribolium castaneum (Red flour beetle). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Coleoptera; Polyphaga; OC Cucujiformia; Tenebrionidae; Tenebrionidae incertae sedis; Tribolium. OX NCBI_TaxID=7070 {ECO:0000313|Proteomes:UP000007266}; RN [1] {ECO:0000313|EMBL:EFA09315.1, ECO:0000313|Proteomes:UP000007266} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Georgia GA2 {ECO:0000313|EMBL:EFA09315.1, RC ECO:0000313|Proteomes:UP000007266}; RX PubMed=18362917; DOI=10.1038/nature06784; RG Tribolium Genome Sequencing Consortium; RA Richards S., Gibbs R.A., Weinstock G.M., Brown S.J., Denell R., RA Beeman R.W., Gibbs R., Beeman R.W., Brown S.J., Bucher G., RA Friedrich M., Grimmelikhuijzen C.J., Klingler M., Lorenzen M., RA Richards S., Roth S., Schroder R., Tautz D., Zdobnov E.M., Muzny D., RA Gibbs R.A., Weinstock G.M., Attaway T., Bell S., Buhay C.J., RA Chandrabose M.N., Chavez D., Clerk-Blankenburg K.P., Cree A., Dao M., RA Davis C., Chacko J., Dinh H., Dugan-Rocha S., Fowler G., Garner T.T., RA Garnes J., Gnirke A., Hawes A., Hernandez J., Hines S., Holder M., RA Hume J., Jhangiani S.N., Joshi V., Khan Z.M., Jackson L., Kovar C., RA Kowis A., Lee S., Lewis L.R., Margolis J., Morgan M., Nazareth L.V., RA Nguyen N., Okwuonu G., Parker D., Richards S., Ruiz S.J., RA Santibanez J., Savard J., Scherer S.E., Schneider B., Sodergren E., RA Tautz D., Vattahil S., Villasana D., White C.S., Wright R., Park Y., RA Beeman R.W., Lord J., Oppert B., Lorenzen M., Brown S., Wang L., RA Savard J., Tautz D., Richards S., Weinstock G., Gibbs R.A., Liu Y., RA Worley K., Weinstock G., Elsik C.G., Reese J.T., Elhaik E., Landan G., RA Graur D., Arensburger P., Atkinson P., Beeman R.W., Beidler J., RA Brown S.J., Demuth J.P., Drury D.W., Du Y.Z., Fujiwara H., RA Lorenzen M., Maselli V., Osanai M., Park Y., Robertson H.M., Tu Z., RA Wang J.J., Wang S., Richards S., Song H., Zhang L., Sodergren E., RA Werner D., Stanke M., Morgenstern B., Solovyev V., Kosarev P., RA Brown G., Chen H.C., Ermolaeva O., Hlavina W., Kapustin Y., RA Kiryutin B., Kitts P., Maglott D., Pruitt K., Sapojnikov V., RA Souvorov A., Mackey A.J., Waterhouse R.M., Wyder S., Zdobnov E.M., RA Zdobnov E.M., Wyder S., Kriventseva E.V., Kadowaki T., Bork P., RA Aranda M., Bao R., Beermann A., Berns N., Bolognesi R., Bonneton F., RA Bopp D., Brown S.J., Bucher G., Butts T., Chaumot A., Denell R.E., RA Ferrier D.E., Friedrich M., Gordon C.M., Jindra M., Klingler M., RA Lan Q., Lattorff H.M., Laudet V., von Levetsow C., Liu Z., Lutz R., RA Lynch J.A., da Fonseca R.N., Posnien N., Reuter R., Roth S., RA Savard J., Schinko J.B., Schmitt C., Schoppmeier M., Schroder R., RA Shippy T.D., Simonnet F., Marques-Souza H., Tautz D., Tomoyasu Y., RA Trauner J., Van der Zee M., Vervoort M., Wittkopp N., Wimmer E.A., RA Yang X., Jones A.K., Sattelle D.B., Ebert P.R., Nelson D., Scott J.G., RA Beeman R.W., Muthukrishnan S., Kramer K.J., Arakane Y., Beeman R.W., RA Zhu Q., Hogenkamp D., Dixit R., Oppert B., Jiang H., Zou Z., RA Marshall J., Elpidina E., Vinokurov K., Oppert C., Zou Z., Evans J., RA Lu Z., Zhao P., Sumathipala N., Altincicek B., Vilcinskas A., RA Williams M., Hultmark D., Hetru C., Jiang H., Grimmelikhuijzen C.J., RA Hauser F., Cazzamali G., Williamson M., Park Y., Li B., Tanaka Y., RA Predel R., Neupert S., Schachtner J., Verleyen P., Raible F., Bork P., RA Friedrich M., Walden K.K., Robertson H.M., Angeli S., Foret S., RA Bucher G., Schuetz S., Maleszka R., Wimmer E.A., Beeman R.W., RA Lorenzen M., Tomoyasu Y., Miller S.C., Grossmann D., Bucher G.; RT "The genome of the model beetle and pest Tribolium castaneum."; RL Nature 452:949-955(2008). RN [2] {ECO:0000313|EMBL:EFA09315.1, ECO:0000313|Proteomes:UP000007266} RP GENOME REANNOTATION. RC STRAIN=Georgia GA2 {ECO:0000313|EMBL:EFA09315.1, RC ECO:0000313|Proteomes:UP000007266}; RX PubMed=19820115; DOI=10.1093/nar/gkp807; RA Kim H.S., Murphy T., Xia J., Caragea D., Park Y., Beeman R.W., RA Lorenzen M.D., Butcher S., Manak J.R., Brown S.J.; RT "BeetleBase in 2010: revisions to provide comprehensive genomic RT information for Tribolium castaneum."; RL Nucleic Acids Res. 38:D437-D442(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000283; EFA09315.1; -; Genomic_DNA. DR STRING; 7070.TC030748-PA; -. DR EnsemblMetazoa; TC030748-RA; TC030748-PA; TC030748. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D6WVN4; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; D6WVN4; -. DR Proteomes; UP000007266; Linkage group 8. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007266}; KW Reference proteome {ECO:0000313|Proteomes:UP000007266}. FT COILED 112 132 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 435 AA; 49073 MW; 23AB91D058DE5ED1 CRC64; MLADTWLLRK SSAPRKTGAV VALCLLPLLV FAAWFLLSSL GSAIYWSFTN STSIPVEQQV AEKIISAPPV PSPTQINTDE IIEKILQNPK IHNIIINNHK GQESDEKFQH IIEELRLEID RIKSEQQNQN ADLGRIIAQI RTENVRNLAR LTQKLNRCCS RQIIDLEPYI TRVFTNLLND PHFLSNQNGL TDWLHTLFVA KTDLENRLLN LTTNFDVSDA ANQVMEKVVG KLSKLDPSSL DETQIRKIVQ SALRIYDADK TGLVDYAMER LGGEIVTTRC TESYFAGTAV ISVLGIPIWY PSISPRTVIT PGINPGECWA FQNFPGLLVI KLASRVRIEA FSLEHVSRLL VPEGKIDSAP KEFEVFGLNG ENDKDPVWLG EFVYDYDGDP LQFFAVREPK VCSMIEIRIK SNHGNPNYTC LYRFRVHGKV SNEPA // ID D6WWR3_TRICA Unreviewed; 376 AA. AC D6WWR3; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFA08092.1}; GN ORFNames=TcasGA2_TC005696 {ECO:0000313|EMBL:EFA08092.1}; OS Tribolium castaneum (Red flour beetle). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Coleoptera; Polyphaga; OC Cucujiformia; Tenebrionidae; Tenebrionidae incertae sedis; Tribolium. OX NCBI_TaxID=7070 {ECO:0000313|Proteomes:UP000007266}; RN [1] {ECO:0000313|EMBL:EFA08092.1, ECO:0000313|Proteomes:UP000007266} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Georgia GA2 {ECO:0000313|EMBL:EFA08092.1, RC ECO:0000313|Proteomes:UP000007266}; RX PubMed=18362917; DOI=10.1038/nature06784; RG Tribolium Genome Sequencing Consortium; RA Richards S., Gibbs R.A., Weinstock G.M., Brown S.J., Denell R., RA Beeman R.W., Gibbs R., Beeman R.W., Brown S.J., Bucher G., RA Friedrich M., Grimmelikhuijzen C.J., Klingler M., Lorenzen M., RA Richards S., Roth S., Schroder R., Tautz D., Zdobnov E.M., Muzny D., RA Gibbs R.A., Weinstock G.M., Attaway T., Bell S., Buhay C.J., RA Chandrabose M.N., Chavez D., Clerk-Blankenburg K.P., Cree A., Dao M., RA Davis C., Chacko J., Dinh H., Dugan-Rocha S., Fowler G., Garner T.T., RA Garnes J., Gnirke A., Hawes A., Hernandez J., Hines S., Holder M., RA Hume J., Jhangiani S.N., Joshi V., Khan Z.M., Jackson L., Kovar C., RA Kowis A., Lee S., Lewis L.R., Margolis J., Morgan M., Nazareth L.V., RA Nguyen N., Okwuonu G., Parker D., Richards S., Ruiz S.J., RA Santibanez J., Savard J., Scherer S.E., Schneider B., Sodergren E., RA Tautz D., Vattahil S., Villasana D., White C.S., Wright R., Park Y., RA Beeman R.W., Lord J., Oppert B., Lorenzen M., Brown S., Wang L., RA Savard J., Tautz D., Richards S., Weinstock G., Gibbs R.A., Liu Y., RA Worley K., Weinstock G., Elsik C.G., Reese J.T., Elhaik E., Landan G., RA Graur D., Arensburger P., Atkinson P., Beeman R.W., Beidler J., RA Brown S.J., Demuth J.P., Drury D.W., Du Y.Z., Fujiwara H., RA Lorenzen M., Maselli V., Osanai M., Park Y., Robertson H.M., Tu Z., RA Wang J.J., Wang S., Richards S., Song H., Zhang L., Sodergren E., RA Werner D., Stanke M., Morgenstern B., Solovyev V., Kosarev P., RA Brown G., Chen H.C., Ermolaeva O., Hlavina W., Kapustin Y., RA Kiryutin B., Kitts P., Maglott D., Pruitt K., Sapojnikov V., RA Souvorov A., Mackey A.J., Waterhouse R.M., Wyder S., Zdobnov E.M., RA Zdobnov E.M., Wyder S., Kriventseva E.V., Kadowaki T., Bork P., RA Aranda M., Bao R., Beermann A., Berns N., Bolognesi R., Bonneton F., RA Bopp D., Brown S.J., Bucher G., Butts T., Chaumot A., Denell R.E., RA Ferrier D.E., Friedrich M., Gordon C.M., Jindra M., Klingler M., RA Lan Q., Lattorff H.M., Laudet V., von Levetsow C., Liu Z., Lutz R., RA Lynch J.A., da Fonseca R.N., Posnien N., Reuter R., Roth S., RA Savard J., Schinko J.B., Schmitt C., Schoppmeier M., Schroder R., RA Shippy T.D., Simonnet F., Marques-Souza H., Tautz D., Tomoyasu Y., RA Trauner J., Van der Zee M., Vervoort M., Wittkopp N., Wimmer E.A., RA Yang X., Jones A.K., Sattelle D.B., Ebert P.R., Nelson D., Scott J.G., RA Beeman R.W., Muthukrishnan S., Kramer K.J., Arakane Y., Beeman R.W., RA Zhu Q., Hogenkamp D., Dixit R., Oppert B., Jiang H., Zou Z., RA Marshall J., Elpidina E., Vinokurov K., Oppert C., Zou Z., Evans J., RA Lu Z., Zhao P., Sumathipala N., Altincicek B., Vilcinskas A., RA Williams M., Hultmark D., Hetru C., Jiang H., Grimmelikhuijzen C.J., RA Hauser F., Cazzamali G., Williamson M., Park Y., Li B., Tanaka Y., RA Predel R., Neupert S., Schachtner J., Verleyen P., Raible F., Bork P., RA Friedrich M., Walden K.K., Robertson H.M., Angeli S., Foret S., RA Bucher G., Schuetz S., Maleszka R., Wimmer E.A., Beeman R.W., RA Lorenzen M., Tomoyasu Y., Miller S.C., Grossmann D., Bucher G.; RT "The genome of the model beetle and pest Tribolium castaneum."; RL Nature 452:949-955(2008). RN [2] {ECO:0000313|EMBL:EFA08092.1, ECO:0000313|Proteomes:UP000007266} RP GENOME REANNOTATION. RC STRAIN=Georgia GA2 {ECO:0000313|EMBL:EFA08092.1, RC ECO:0000313|Proteomes:UP000007266}; RX PubMed=19820115; DOI=10.1093/nar/gkp807; RA Kim H.S., Murphy T., Xia J., Caragea D., Park Y., Beeman R.W., RA Lorenzen M.D., Butcher S., Manak J.R., Brown S.J.; RT "BeetleBase in 2010: revisions to provide comprehensive genomic RT information for Tribolium castaneum."; RL Nucleic Acids Res. 38:D437-D442(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000283; EFA08092.1; -; Genomic_DNA. DR STRING; 7070.TC005696-PA; -. DR EnsemblMetazoa; TC005696-RA; TC005696-PA; TC005696. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D6WWR3; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; D6WWR3; -. DR Proteomes; UP000007266; Linkage group 8. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007266}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007266}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 20 41 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 376 AA; 41715 MW; D7E4D76E7C41686E CRC64; MDYCCQPGPP LRRIRNSSPF CTFLLFCFYL LLLAGAFFVI YSQYLITKEV GKLKEDITKM KGGPNRREDP HAKLSQELDK KCSETLKLLL KDPVGRPDFA LESSGGKVLS VEAAPYSNPK SLFGFSLCEG DHGPSSMIQA TTAPGQCWAF KGQTGKAVLQ LIDYVLIDSV TLEHIPPTIS PSGGIDSAPK DFKLWGLEKP NGAKLFLGEF RYTAEGGTVQ TFNVNNVSSF YKYVEFEIVT NHGNKDFTCV YRFVLDFLSL LLRLICKDSA AWDRKEVRLC RCRRDRSAPY PFPALVKSSA QESAESSPRF GSRALDGALW PLAGMPLLIS QTISKSLHRL VLLNVSRPMD TQRSKAAPDA FSMQKAQRDA PFPAGQ // ID D7FIC3_ECTSI Unreviewed; 412 AA. AC D7FIC3; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBJ28747.1}; GN ORFNames=Esi_0119_0043 {ECO:0000313|EMBL:CBJ28747.1}; OS Ectocarpus siliculosus (Brown alga). OC Eukaryota; Stramenopiles; PX clade; Phaeophyceae; Ectocarpales; OC Ectocarpaceae; Ectocarpus. OX NCBI_TaxID=2880 {ECO:0000313|EMBL:CBJ28747.1, ECO:0000313|Proteomes:UP000002630}; RN [1] {ECO:0000313|EMBL:CBJ28747.1, ECO:0000313|Proteomes:UP000002630} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Ec32 / CCAP1310/4 {ECO:0000313|Proteomes:UP000002630}; RX PubMed=20520714; DOI=10.1038/nature09016; RA Cock J.M., Sterck L., Rouze P., Scornet D., Allen A.E., Amoutzias G., RA Anthouard V., Artiguenave F., Aury J.M., Badger J.H., Beszteri B., RA Billiau K., Bonnet E., Bothwell J.H., Bowler C., Boyen C., RA Brownlee C., Carrano C.J., Charrier B., Cho G.Y., Coelho S.M., RA Collen J., Corre E., Da Silva C., Delage L., Delaroque N., RA Dittami S.M., Doulbeau S., Elias M., Farnham G., Gachon C.M., RA Gschloessl B., Heesch S., Jabbari K., Jubin C., Kawai H., Kimura K., RA Kloareg B., Kupper F.C., Lang D., Le Bail A., Leblanc C., Lerouge P., RA Lohr M., Lopez P.J., Martens C., Maumus F., Michel G., RA Miranda-Saavedra D., Morales J., Moreau H., Motomura T., Nagasato C., RA Napoli C.A., Nelson D.R., Nyvall-Collen P., Peters A.F., Pommier C., RA Potin P., Poulain J., Quesneville H., Read B., Rensing S.A., RA Ritter A., Rousvoal S., Samanta M., Samson G., Schroeder D.C., RA Segurens B., Strittmatter M., Tonon T., Tregear J.W., Valentin K., RA von Dassow P., Yamagishi T., Van de Peer Y., Wincker P.; RT "The Ectocarpus genome and the independent evolution of RT multicellularity in brown algae."; RL Nature 465:617-621(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN647870; CBJ28747.1; -; Genomic_DNA. DR InParanoid; D7FIC3; -. DR Proteomes; UP000002630; Chromosome LG23. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002630}; KW Reference proteome {ECO:0000313|Proteomes:UP000002630}. FT COILED 17 40 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 412 AA; 44511 MW; 7B7658B4035C2C64 CRC64; MEARSFEEFR ARRAGALKEI EEIAARLRAS VLEVEELEAL GKAVHEDNSG DAGLTTEVTR AQESLRDLHR SVTDELGSFQ RRLEEQSGTM REVRDGTARV SQLAERREDR IAAAVHMKLE SKLDDILADL KDFYHTVEEH VDSLSPDVMT ASDLEALLES AAAQPSERTD TSRDGGGDAA EHRIRALSKA AVDLAVKTRV LGGGSRDGGD ESGGGGGCVS EGAVREEVDV AIGKLMADGT GMRDYANAAL GGKVLTSKGM VSDTYTPSSW WGPSRYWHGA GVENGVGPVE SVISEGSSLG ACWAMSGSEG LVTIQLPKKI TVDGVSVEHV SRMVTTESGS APKELEVWGM KNKKDKKPAK LGSAVYDVDG RPIQTFRIEP PGEQIELIQF KILSNWGNPD YTCLYRLRVH GR // ID D7FYE3_ECTSI Unreviewed; 529 AA. AC D7FYE3; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBJ32485.1}; GN ORFNames=Esi_0342_0020 {ECO:0000313|EMBL:CBJ32485.1}; OS Ectocarpus siliculosus (Brown alga). OC Eukaryota; Stramenopiles; PX clade; Phaeophyceae; Ectocarpales; OC Ectocarpaceae; Ectocarpus. OX NCBI_TaxID=2880 {ECO:0000313|EMBL:CBJ32485.1, ECO:0000313|Proteomes:UP000002630}; RN [1] {ECO:0000313|EMBL:CBJ32485.1, ECO:0000313|Proteomes:UP000002630} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Ec32 / CCAP1310/4 {ECO:0000313|Proteomes:UP000002630}; RX PubMed=20520714; DOI=10.1038/nature09016; RA Cock J.M., Sterck L., Rouze P., Scornet D., Allen A.E., Amoutzias G., RA Anthouard V., Artiguenave F., Aury J.M., Badger J.H., Beszteri B., RA Billiau K., Bonnet E., Bothwell J.H., Bowler C., Boyen C., RA Brownlee C., Carrano C.J., Charrier B., Cho G.Y., Coelho S.M., RA Collen J., Corre E., Da Silva C., Delage L., Delaroque N., RA Dittami S.M., Doulbeau S., Elias M., Farnham G., Gachon C.M., RA Gschloessl B., Heesch S., Jabbari K., Jubin C., Kawai H., Kimura K., RA Kloareg B., Kupper F.C., Lang D., Le Bail A., Leblanc C., Lerouge P., RA Lohr M., Lopez P.J., Martens C., Maumus F., Michel G., RA Miranda-Saavedra D., Morales J., Moreau H., Motomura T., Nagasato C., RA Napoli C.A., Nelson D.R., Nyvall-Collen P., Peters A.F., Pommier C., RA Potin P., Poulain J., Quesneville H., Read B., Rensing S.A., RA Ritter A., Rousvoal S., Samanta M., Samson G., Schroeder D.C., RA Segurens B., Strittmatter M., Tonon T., Tregear J.W., Valentin K., RA von Dassow P., Yamagishi T., Van de Peer Y., Wincker P.; RT "The Ectocarpus genome and the independent evolution of RT multicellularity in brown algae."; RL Nature 465:617-621(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN648534; CBJ32485.1; -; Genomic_DNA. DR InParanoid; D7FYE3; -. DR Proteomes; UP000002630; Unplaced LGUn. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002630}; KW Reference proteome {ECO:0000313|Proteomes:UP000002630}. FT COILED 22 56 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 529 AA; 55396 MW; A65AE2012BCB2ED3 CRC64; MVRQSEGFRS LESVLVERRR YLKEMEAQDE QEQETLRKLR GVVRRMDSQM QLLREQAEAG VASRGAPEDP AASESVSLSQ LDELKTMREE VTVLAGSTSN AVEEGEQWTT GAAQELEALV SRSPSSHQHV STEETEESEL DADETWKAGF EGPLEEAFLD LEGILGTVNS IMDVAAAKPG VMEGGAYVRL VEDAGLSAST AADAAAAADG DRNLQTRVEA IAAQEVALKW GEAEATGEHG GRDAVSKEAG VLATVEEAEE LVAREVEMFS SGGTGMPDYA SLTPGAKVVY GPFPVAPAYD GEAQQGDGMV EKKGWLTSKT LASTELEWHD YLLHMLRVPG RVYAEDGDAA LSASNSLGSC FAFKGGEGRL TVELARPPPP PPRSSGGDGN PGAATTTAAG FVRVTHVSIE HARAASAPTA AQSAPRAFRI LGWDADPAGT ATTTASLVGG GGDGGGEALL SPHVLLAGAE YQVGEGAPGV QTFAVGEGRG EEAQAVVPPV GWVTLEVQSN HGGAWTCLYG FRVHGDPVR // ID D7KMK5_ARALL Unreviewed; 634 AA. AC D7KMK5; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFH66800.1}; GN ORFNames=ARALYDRAFT_335524 {ECO:0000313|EMBL:EFH66800.1}; OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694}; RN [1] {ECO:0000313|Proteomes:UP000008694} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694}; RX PubMed=21478890; DOI=10.1038/ng.807; RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M., RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G., RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., RA Schneeberger K., Spannagl M., Wang X., Yang L., Nasrallah M.E., RA Bergelson J., Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., RA Van de Peer Y., Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.; RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome RT size change."; RL Nat. Genet. 43:476-481(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL348713; EFH66800.1; -; Genomic_DNA. DR RefSeq; XP_002890541.1; XM_002890495.1. DR EnsemblPlants; fgenesh1_pg.C_scaffold_1002089; fgenesh1_pg.C_scaffold_1002089; fgenesh1_pg.C_scaffold_1002089. DR GeneID; 9326603; -. DR KEGG; aly:ARALYDRAFT_335524; -. DR Proteomes; UP000008694; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008694}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008694}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 583 603 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 615 633 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 481 501 {ECO:0000256|SAM:Coils}. FT COILED 524 582 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 634 AA; 71254 MW; 96C73390AE41B4D2 CRC64; MQRSRRRVSV NNFNGRNSFY KVSLSLVFLL WVLLFLSTLL ISLGDGAKDT PLNDSVGMAD PDDGQSGEKV VSFDGPLSLE SASVHVTSDL SRNDDITLSE DSEDKEKSVK EAEIKSTVSG NDLESKDSYN SKQSEITKKD TGIDAGSKED DFLMQSQMSI DNDTESKDNV FLKQNQVNKT DPGNDTEINA SKVDQPSRAV PLGLDEFKSR ASNSRNKSLS DQVSGVIHRM EPGGKEYNYA SASKGAKVLS SNKEAKGAPS ILSRDNDKYL RNPCSTEGKF VVVELSEETL VNTIKIANFE HYSSNLKEFQ LQGTLVYPTD TWVHMGNFTA SNVKHEQNFT LLEPKWVRYL KLNFLSHYGS EFYCTLSLIE VYGVDAVERM LEDLISVQDN KNAFKTREGD FEQKEKPVQQ TESLEGDDSA SRSMQRENER EAPPENMLAK TEASMAKSSN KLADPVEEMR HHQPGSRMPG DTVLKILMQK LRSLDLNLSV LERYLEELNT RYGNIFKEMD REAGVREKAI ATLRLDLEGM KERQERMVSE AEEMKEWRKR VEAEMEKAEK EKENTRESLE EVSKRLEWME KKGLMVFTVC LGFGTIAVIA VVVGVGTGRA EKTGIGAWLL LLISSTFIMF VLSL // ID D7KYV0_ARALL Unreviewed; 666 AA. AC D7KYV0; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 14-OCT-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFH65092.1}; GN ORFNames=ARALYDRAFT_339371 {ECO:0000313|EMBL:EFH65092.1}; OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694}; RN [1] {ECO:0000313|Proteomes:UP000008694} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694}; RX PubMed=21478890; DOI=10.1038/ng.807; RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M., RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G., RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., RA Schneeberger K., Spannagl M., Wang X., Yang L., Nasrallah M.E., RA Bergelson J., Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., RA Van de Peer Y., Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.; RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome RT size change."; RL Nat. Genet. 43:476-481(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL348714; EFH65092.1; -; Genomic_DNA. DR RefSeq; XP_002888833.1; XM_002888787.1. DR EnsemblPlants; fgenesh1_pg.C_scaffold_2001470; fgenesh1_pg.C_scaffold_2001470; fgenesh1_pg.C_scaffold_2001470. DR GeneID; 9324896; -. DR KEGG; aly:ARALYDRAFT_339371; -. DR Proteomes; UP000008694; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008694}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008694}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 95 117 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 616 635 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 647 665 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 545 607 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 666 AA; 74935 MW; 906824857CC3904A CRC64; MIDRDEVDRR NFIDRVSLDR RLIGDVGVEK GFVTEKVPES SSSFGLSSSI SDSLLSLYNF ALVVNLMCIV MNRSRRALLV RRRVSETTSN GRNRFYKVSL SLVFLIWGLV FLSTLWISHV DGDKGRSLVD AVENGEPDDE RADETAKPVD APSLESASVH STPDLSLDVD IAAAGEIKGS ETILKQIEVD NTIVIAGNVT ESKDNESMKE SEINNNTVPG DDTETTGSKL DQLSRAVPLG LDEFKSRASI SRDKSLSGQV TGVIHRMEPG GKEYNYAAAS KGAKVLSSNK EAKGASSIIC RDKDKYLRNP CSTEGKFVVI ELSEETLVNT IKIANFEHYS SNLKDFEILG TLVYPTDTWV HLGNFTALNM KHEQNFTLVD PKWVRYLKLN LLSHYGSEFY CTLSLLEVYG VDAVERMLED LISIQDKNIL KPQEGDIEQK EKKTIKAKES FESDEDKSKQ KEKEQEASPE NAVVKDEVSI ERRKLPDPVE EIKHQPGSRM PGDTVLKILM QKIRSLDVSL SVLESYLEER SSKYGMIFKE MDVEANKREK EVETMRLEVE GMKEREESTK KEAMEMREWR RRVETELEKA ENEKGKVKER LEQVLERMEW MEKKCVVVFT ICVGFGAIAV VAVVLGKGRG RAENPGGLAW LLLLISSTFV LFILSL // ID D7L9Q1_ARALL Unreviewed; 456 AA. AC D7L9Q1; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 14-OCT-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFH61069.1}; GN ORFNames=ARALYDRAFT_478408 {ECO:0000313|EMBL:EFH61069.1}; OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694}; RN [1] {ECO:0000313|Proteomes:UP000008694} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694}; RX PubMed=21478890; DOI=10.1038/ng.807; RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M., RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G., RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., RA Schneeberger K., Spannagl M., Wang X., Yang L., Nasrallah M.E., RA Bergelson J., Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., RA Van de Peer Y., Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.; RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome RT size change."; RL Nat. Genet. 43:476-481(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL348715; EFH61069.1; -; Genomic_DNA. DR RefSeq; XP_002884810.1; XM_002884764.1. DR EnsemblPlants; fgenesh2_kg.3__1109__AT3G10730.1; fgenesh2_kg.3__1109__AT3G10730.1; fgenesh2_kg.3__1109__AT3G10730.1. DR GeneID; 9320876; -. DR KEGG; aly:ARALYDRAFT_478408; -. DR KO; K19347; -. DR Proteomes; UP000008694; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008694}; KW Reference proteome {ECO:0000313|Proteomes:UP000008694}. FT COILED 206 226 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 456 AA; 49742 MW; C0895F34BFC40E06 CRC64; MSASTVSITA SPTRAIRRTP VLSGENKSNF DFPPSESHAN AAIGESSAGT NKDLIRSEAA AERSNTYDVG PVTRKSGSTA TGTNTTTTQR RTRKSQGNKT DKGQWKTVVR VFAKQFGALL LLVGLIQLIR KLTLKDSSLS SSNFPIETEM VLSELESRIS AVDGLVKTTT KMMQVQVEFL DKKMESESRA LRQTIDSTSS VLQSGLKKVE SKTERLQVSV DELNAKPLVS REELERVYEE LKKGKVGDSD VNIDELRAYA RDVVEKEIGK HAADGLGRVD YALASGGAFV MGHSDPFLVG SGGNWFGTSR RRVHSKAVKM LTPSFGEPGQ CFPLKGSNGY VLVRLRAPII PEAVTLEHVS EAVAYDRSSA PKDCRVSGWL GDIDMETETM PILTEFSYDL DRSNAQTFDI AESAHSGLVN TVRLDFNSNH GSSSHTCIYR FRVHGCQLDS VSVVHA // ID D7LY82_ARALL Unreviewed; 476 AA. AC D7LY82; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 14-OCT-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFH49439.1}; GN ORFNames=ARALYDRAFT_908391 {ECO:0000313|EMBL:EFH49439.1}; OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694}; RN [1] {ECO:0000313|Proteomes:UP000008694} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694}; RX PubMed=21478890; DOI=10.1038/ng.807; RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M., RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G., RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., RA Schneeberger K., Spannagl M., Wang X., Yang L., Nasrallah M.E., RA Bergelson J., Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., RA Van de Peer Y., Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.; RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome RT size change."; RL Nat. Genet. 43:476-481(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL348718; EFH49439.1; -; Genomic_DNA. DR RefSeq; XP_002873180.1; XM_002873134.1. DR EnsemblPlants; scaffold_600448.1; scaffold_600448.1; scaffold_600448.1. DR GeneID; 9309248; -. DR KEGG; aly:ARALYDRAFT_908391; -. DR KO; K19347; -. DR Proteomes; UP000008694; Unassembled WGS sequence. DR GO; GO:0005783; C:endoplasmic reticulum; IEA:EnsemblPlants/Gramene. DR GO; GO:0005635; C:nuclear envelope; IEA:EnsemblPlants/Gramene. DR GO; GO:0090435; P:protein localization to nuclear envelope; IEA:EnsemblPlants/Gramene. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008694}; KW Reference proteome {ECO:0000313|Proteomes:UP000008694}. FT COILED 181 226 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 476 AA; 51954 MW; CD0CE231B94BCCD8 CRC64; MSASTVSITA NTAAATRRTP ILSGEKKSNF DYPQSESLAN GGGVGEAGGT SRDLSRGEAI VDRSHGQDLG PVTRRSGSAA TGTNTTTTQR RTRKVATPKP EKARWKTVVR IFAKQLGALL IIVGLIQLTR KMILKASSPS SPISSYETEM AFSGLESRIA EVDGLVKATT STMQVQVELL DKKMEREAKT LRQEIERKAS AFQSELKKIE SRTESLEKSV GEVNAKPWVT KDELERIYEE LKKGNVDDSA FSEISIDELR AYARDIMEKE IEKHAADGLG RVDYALASGG AFVMQHSDPY LVGKGSSWFA TTMRRAHTNA VKMLSPSFGE PGQCFPLKGS DGYVQIRLRG PIIPEAFTLE HVAKSVAYDR SSAPKDCRVS GWLQGQGKGL ESSAENENMQ LLTEFTYDLD RSNAQTFNIL DSSNSGPIDT VRLDFTSNHG SDSHTCIYRF RVHGRASDPV PVVETSLDQD SSPGSE // ID D7M8Z4_ARALL Unreviewed; 577 AA. AC D7M8Z4; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFH46011.1}; GN ORFNames=ARALYDRAFT_354376 {ECO:0000313|EMBL:EFH46011.1}; OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694}; RN [1] {ECO:0000313|Proteomes:UP000008694} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694}; RX PubMed=21478890; DOI=10.1038/ng.807; RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M., RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G., RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., RA Schneeberger K., Spannagl M., Wang X., Yang L., Nasrallah M.E., RA Bergelson J., Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., RA Van de Peer Y., Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.; RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome RT size change."; RL Nat. Genet. 43:476-481(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL348719; EFH46011.1; -; Genomic_DNA. DR RefSeq; XP_002869752.1; XM_002869706.1. DR EnsemblPlants; fgenesh1_pg.C_scaffold_7001549; fgenesh1_pg.C_scaffold_7001549; fgenesh1_pg.C_scaffold_7001549. DR GeneID; 9305823; -. DR KEGG; aly:ARALYDRAFT_354376; -. DR Proteomes; UP000008694; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008694}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008694}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 517 539 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 559 576 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 577 AA; 65531 MW; 4C971FC85F83544C CRC64; MTRRGMCSTI CLNEKLQRFR IVRISEKADN VNSRSGSFFE RSISLVLLLW CFLFLVYSKL GQSHDYDYGN GEIAILKKVI YFVDRIGNYT DGSVSKTLNT TSSVFPQASG KENNYCLLRN GQLQDVYEHV LGNNALLICK IVLPERRISK KTLEARDPRY GNLEDKSLKV NGSGLPSQLV NNVTHYRVEP DGTGYNYAAA MKGAKVVDHN KEAKGASNVL GKDHDKYLRN PCSVSDKYVV IELAEETLVD TVRIANLEHY SSNPKEFNMS GSLSYPTDMW TPAGSFMAAN VKQIQTFRLP EPKWLRYLKL NLISHYGSEF YCTLSIVEVF GIDALEQMLE DLFVPSETPP SKPAMLELKT ADEKEVGEVK SNRTDQIGKE TEAQKKKDDV VKTINIIGDK KYEVREKHNV LKVMMQKVKL IEMNLSVLED SVKEMHEKQP EVSLEMQKTL VLVEKSKADI REITEWKGKM EKELRDLELW KTLVASRVES LARGNTALRL DVEKIVKEQA NLESKELGVL LISLFFVVLA TIRLVSTRLW SFLGMSFTDK ARTLWPDSGW VMILLSSSIM IFITLLS // ID D7MXR9_ARALL Unreviewed; 456 AA. AC D7MXR9; DT 10-AUG-2010, integrated into UniProtKB/TrEMBL. DT 10-AUG-2010, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFH38664.1}; GN ORFNames=ARALYDRAFT_497464 {ECO:0000313|EMBL:EFH38664.1}; OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694}; RN [1] {ECO:0000313|Proteomes:UP000008694} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694}; RX PubMed=21478890; DOI=10.1038/ng.807; RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M., RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G., RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., RA Schneeberger K., Spannagl M., Wang X., Yang L., Nasrallah M.E., RA Bergelson J., Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., RA Van de Peer Y., Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.; RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome RT size change."; RL Nat. Genet. 43:476-481(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL349067; EFH38664.1; -; Genomic_DNA. DR RefSeq; XP_002862406.1; XM_002862360.1. DR EnsemblPlants; fgenesh2_kg.532__1__AT3G10730.1; fgenesh2_kg.532__1__AT3G10730.1; fgenesh2_kg.532__1__AT3G10730.1. DR GeneID; 9298482; -. DR KEGG; aly:ARALYDRAFT_497464; -. DR KO; K19347; -. DR Proteomes; UP000008694; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008694}; KW Reference proteome {ECO:0000313|Proteomes:UP000008694}. FT COILED 206 226 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 456 AA; 49968 MW; 14E68094AA820EA9 CRC64; MSASTVSITA SPTRAIRRTP VLSGENKSNF DFPPSESHAN AAIGESSAGT NKDLIRSEAA AERSNTYDVG PVTRKSGSTA TGTNTTTTQR RTRKSQGNKT DKGQWKTVVR VFAKQFGALL LLVGLIQLIR KLTLKDSSLS ASNFPIETEM VLSELESRIS AVDGLVKTTT KMMQVQVEFL DKKMESESRA LRQTIDSTSS VLQSWLKKVE SKTERLQVSV DELNAKPLVS REELERVYEE LKKGKVGDSD LNIDELRAYA RDVVEKEIGK HAADGLGRVD YALASGGAFV MGHSDPFLVG SGGNWFRTSR RRVHSKAVKM LTPSFGEPGQ CFPLKGSNGY VLVRLRAPII PEAVTLEHVS EAVAYDRSSA PKDCRVSGWL GDIDMETETM PILTEFSYDL DRSNAQTFDI AESAHSGLVN TVRLDFNSNH GSSSHTCIYR FRVHGCQLDS VSVVHA // ID D8LC58_ECTSI Unreviewed; 126 AA. AC D8LC58; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Sad1/Unc-84 domain-containing protein, putative {ECO:0000313|EMBL:CBN79241.1}; GN ORFNames=Esi_0010_0187 {ECO:0000313|EMBL:CBN79241.1}; OS Ectocarpus siliculosus (Brown alga). OC Eukaryota; Stramenopiles; PX clade; Phaeophyceae; Ectocarpales; OC Ectocarpaceae; Ectocarpus. OX NCBI_TaxID=2880 {ECO:0000313|EMBL:CBN79241.1, ECO:0000313|Proteomes:UP000002630}; RN [1] {ECO:0000313|EMBL:CBN79241.1, ECO:0000313|Proteomes:UP000002630} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Ec32 / CCAP1310/4 {ECO:0000313|Proteomes:UP000002630}; RX PubMed=20520714; DOI=10.1038/nature09016; RA Cock J.M., Sterck L., Rouze P., Scornet D., Allen A.E., Amoutzias G., RA Anthouard V., Artiguenave F., Aury J.M., Badger J.H., Beszteri B., RA Billiau K., Bonnet E., Bothwell J.H., Bowler C., Boyen C., RA Brownlee C., Carrano C.J., Charrier B., Cho G.Y., Coelho S.M., RA Collen J., Corre E., Da Silva C., Delage L., Delaroque N., RA Dittami S.M., Doulbeau S., Elias M., Farnham G., Gachon C.M., RA Gschloessl B., Heesch S., Jabbari K., Jubin C., Kawai H., Kimura K., RA Kloareg B., Kupper F.C., Lang D., Le Bail A., Leblanc C., Lerouge P., RA Lohr M., Lopez P.J., Martens C., Maumus F., Michel G., RA Miranda-Saavedra D., Morales J., Moreau H., Motomura T., Nagasato C., RA Napoli C.A., Nelson D.R., Nyvall-Collen P., Peters A.F., Pommier C., RA Potin P., Poulain J., Quesneville H., Read B., Rensing S.A., RA Ritter A., Rousvoal S., Samanta M., Samson G., Schroeder D.C., RA Segurens B., Strittmatter M., Tonon T., Tregear J.W., Valentin K., RA von Dassow P., Yamagishi T., Van de Peer Y., Wincker P.; RT "The Ectocarpus genome and the independent evolution of RT multicellularity in brown algae."; RL Nature 465:617-621(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN647683; CBN79241.1; -; Genomic_DNA. DR InParanoid; D8LC58; -. DR Proteomes; UP000002630; Chromosome LG08. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002630}; KW Reference proteome {ECO:0000313|Proteomes:UP000002630}. SQ SEQUENCE 126 AA; 13810 MW; F72D6F710FECC981 CRC64; MHISDVIVPG SDGRLTVKLA RTIKVESISL EHAPREVLLN KGVSAPKDFT VVGYPKGRVV RDDDPGDVLV SGGEYRLEGD VIQFFEVAEE YRQVDYGVIA LVVDSNHGEG AYTCIYRLRV HGTPSP // ID D8LTA6_ECTSI Unreviewed; 264 AA. AC D8LTA6; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBN77977.2}; DE Flags: Fragment; GN ORFNames=Esi_0081_0075 {ECO:0000313|EMBL:CBN77977.2}; OS Ectocarpus siliculosus (Brown alga). OC Eukaryota; Stramenopiles; PX clade; Phaeophyceae; Ectocarpales; OC Ectocarpaceae; Ectocarpus. OX NCBI_TaxID=2880 {ECO:0000313|EMBL:CBN77977.2, ECO:0000313|Proteomes:UP000002630}; RN [1] {ECO:0000313|EMBL:CBN77977.2, ECO:0000313|Proteomes:UP000002630} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Ec32 / CCAP1310/4 {ECO:0000313|Proteomes:UP000002630}; RX PubMed=20520714; DOI=10.1038/nature09016; RA Cock J.M., Sterck L., Rouze P., Scornet D., Allen A.E., Amoutzias G., RA Anthouard V., Artiguenave F., Aury J.M., Badger J.H., Beszteri B., RA Billiau K., Bonnet E., Bothwell J.H., Bowler C., Boyen C., RA Brownlee C., Carrano C.J., Charrier B., Cho G.Y., Coelho S.M., RA Collen J., Corre E., Da Silva C., Delage L., Delaroque N., RA Dittami S.M., Doulbeau S., Elias M., Farnham G., Gachon C.M., RA Gschloessl B., Heesch S., Jabbari K., Jubin C., Kawai H., Kimura K., RA Kloareg B., Kupper F.C., Lang D., Le Bail A., Leblanc C., Lerouge P., RA Lohr M., Lopez P.J., Martens C., Maumus F., Michel G., RA Miranda-Saavedra D., Morales J., Moreau H., Motomura T., Nagasato C., RA Napoli C.A., Nelson D.R., Nyvall-Collen P., Peters A.F., Pommier C., RA Potin P., Poulain J., Quesneville H., Read B., Rensing S.A., RA Ritter A., Rousvoal S., Samanta M., Samson G., Schroeder D.C., RA Segurens B., Strittmatter M., Tonon T., Tregear J.W., Valentin K., RA von Dassow P., Yamagishi T., Van de Peer Y., Wincker P.; RT "The Ectocarpus genome and the independent evolution of RT multicellularity in brown algae."; RL Nature 465:617-621(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN649047; CBN77977.2; -; Genomic_DNA. DR InParanoid; D8LTA6; -. DR Proteomes; UP000002630; Chromosome LG26. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002630}; KW Reference proteome {ECO:0000313|Proteomes:UP000002630}. FT NON_TER 264 264 {ECO:0000313|EMBL:CBN77977.2}. SQ SEQUENCE 264 AA; 27968 MW; C864299F91160EDB CRC64; MAAAAASAAA TARAMVPDVE DDGAKQPKRV NLQNYASRDS GAVLLEASPA SKGMQNLLLD SKDKYAISPC EDKQWAVLGL SEDILVRSLV IGSHEKYSSL LKEFQVLASQ TYPVNEWLDL GTFTAKFVQG EQTFEIPQPA FARYLKFKFL SHYGDEFYCT VSQVKVHGST MLESFQHEWQ QSSAEVREVQ DFMMKKDPKP SAVGTVGAGA GVGGGAADTT VEALASEGGG GAHGPTGVSN VEPATPGSAP KSPGVQTAPF CDCV // ID D8QIF4_SCHCM Unreviewed; 797 AA. AC D8QIF4; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFI92223.1}; GN ORFNames=SCHCODRAFT_258669 {ECO:0000313|EMBL:EFI92223.1}; OS Schizophyllum commune (strain H4-8 / FGSC 9210) (Split gill fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Schizophyllaceae; OC Schizophyllum. OX NCBI_TaxID=578458 {ECO:0000313|Proteomes:UP000007431}; RN [1] {ECO:0000313|EMBL:EFI92223.1, ECO:0000313|Proteomes:UP000007431} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H4-8 / FGSC 9210 {ECO:0000313|Proteomes:UP000007431}; RX PubMed=20622885; DOI=10.1038/nbt.1643; RA Ohm R.A., de Jong J.F., Lugones L.G., Aerts A., Kothe E., RA Stajich J.E., de Vries R.P., Record E., Levasseur A., Baker S.E., RA Bartholomew K.A., Coutinho P.M., Erdmann S., Fowler T.J., RA Gathman A.C., Lombard V., Henrissat B., Knabe N., Kuees U., RA Lilly W.W., Lindquist E., Lucas S., Magnuson J.K., Piumi F., RA Raudaskoski M., Salamov A., Schmutz J., Schwarze F.W.M.R., RA vanKuyk P.A., Horton J.S., Grigoriev I.V., Woesten H.A.B.; RT "Genome sequence of the model mushroom Schizophyllum commune."; RL Nat. Biotechnol. 28:957-963(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377313; EFI92223.1; -; Genomic_DNA. DR RefSeq; XP_003027126.1; XM_003027080.1. DR STRING; 578458.XP_003027126.1; -. DR EnsemblFungi; EFI92223; EFI92223; SCHCODRAFT_258669. DR GeneID; 9596692; -. DR KEGG; scm:SCHCODRAFT_258669; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D8QIF4; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000007431; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007431}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007431}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 345 363 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 375 395 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 430 450 {ECO:0000256|SAM:Coils}. FT COILED 466 490 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 797 AA; 87823 MW; 3B9D2923E9F1DD26 CRC64; MSFAGTPLGQ GRRLDHDTFL NKPQAKSRPP STGYSYPANG SRSPPKLHPE KWAYKDTSVN VANAFHLAAE DMNPNQAWAA GPSRANVPRS TSVEYENQVR QTTGRGLPPP ASRFVKPPSH DRSNGTITQS ENSFAREKSP FDQIAEKIHQ SIPGPLQFIV RQREPEDSTS YEYSMEERDY QQQTKRQTHR RNRISTDNKA YKPSQSDLEE SDEDFDDDDR TRRRKKKKKE TGGGLLTTLP VIGQDKRRKR KKKNVTTGED EESEPEQDQQ QQRGPSMPRA QSLHPADVSE ELEAGVRSLD SIAEMDESTL PDPTAEEELY PSRPRGSIIS VLAGRLVHIL IQVPFVWLGR IVGTLADLII RICRKTWLLA GPWPIVMLGI SLVLYNGLGM LGSLFHRGKY VPPQTAPADI SELSNRLLSI ESAVLALADR EEVYRKLRAL ETRVDNADLR VGQTESSLRM VAGNDLQNIR KELQELGAKL TAEMERERAV PMPSTEDEEA RSRLKALEER VGGVEGGVKE ALDASKKAAT APGWWKKQPG SAPSADAVRD VVLRMYSGDM LERVDHALFT GGGAIVPTLT SPSLRQSLVA PRSGFSSVLS WISGGHGDIV ERSPVIALTP GVQPGQCWAF AAGEGRLGVK LRLPARIDEI TIEHAAGSIA YDLRSAPRDM EVWALIDGAD NAEKWAAVVA AREAAGVPTD ESEAAFAAEM ARLTHGALYV RIAQFAYDIE AGRYAQSFAV AEDVREQGMD FGIVMLRVLT NWGHPGHTCL YRFRVHGETA ATIEPPHVPG DEADGEP // ID D8R0A2_SELML Unreviewed; 301 AA. AC D8R0A2; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ34940.1}; DE Flags: Fragment; GN ORFNames=SELMODRAFT_64465 {ECO:0000313|EMBL:EFJ34940.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ34940.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377569; EFJ34940.1; -; Genomic_DNA. DR RefSeq; XP_002964607.1; XM_002964561.1. DR STRING; 88036.EFJ34940; -. DR EnsemblPlants; EFJ34940; EFJ34940; SELMODRAFT_64465. DR GeneID; 9654846; -. DR KEGG; smo:SELMODRAFT_64465; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D8R0A2; -. DR KO; K19347; -. DR OMA; SSHAAKM; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}. FT NON_TER 1 1 {ECO:0000313|EMBL:EFJ34940.1}. FT NON_TER 301 301 {ECO:0000313|EMBL:EFJ34940.1}. SQ SEQUENCE 301 AA; 33880 MW; 031073275579E8E8 CRC64; AKVSEVEDFM RKTSKWLQVQ LDVVDDKIGK EVSGVRSELE EKLSERTRGF EKDIGSIKAQ VQKVDNSLKM LYSQELLSRE ETLELVKSAM DQRAREGSDK AISLDDVRAA ARKVVQSELE THAADGIGRV DFALESGGGK VVHHSDGYFQ GLHWTRLGIH VLPGVFRRHP MADRLLRPSF GEPGQCLPLK GSNVTVEIRL RAHIFPEAVT LEHLSKKVAY DPRSAPRDFE IFAWRTVKDD VLDARNVTSL GRFTYDLDKG SIQTFDLSDK PTSSVNMIRL HVLSNYGSPT HTCLYRLRVH G // ID D8RAQ3_SELML Unreviewed; 291 AA. AC D8RAQ3; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ30666.1}; GN ORFNames=SELMODRAFT_89851 {ECO:0000313|EMBL:EFJ30666.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ30666.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377575; EFJ30666.1; -; Genomic_DNA. DR RefSeq; XP_002968412.1; XM_002968366.1. DR STRING; 88036.EFJ30666; -. DR EnsemblPlants; EFJ30666; EFJ30666; SELMODRAFT_89851. DR GeneID; 9650069; -. DR KEGG; smo:SELMODRAFT_89851; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D8RAQ3; -. DR KO; K19347; -. DR OMA; EYDIRST; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}. FT COILED 38 58 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 291 AA; 32197 MW; 1FDC371948731A8C CRC64; MQAQLDLIDM KIGKEMAGLK RDVEHMIDAE AFSIASKVHN LKSQMDSIES SLNFLKQEGI LTRQETLQLI GSAADDRATD GSGKALSLDD VRAAAKKIIE DELERHRADG IGRTDYALAI GGGRVIDYSE GYFSLTPWSG LLGILPGDYR RHPKANKILE PSFGEPGQCL PLKGSNVFVD IRLRTAIYAD SITLEHVSKR VAYDTGSAPR DFQVFGWLEA SAAHKKGERV LLGSFRYDIE SSSVQTFRLY KTASKLLVNT VRVHVVSNYG SSTHTCLYRV RVHGTEPHSE E // ID D8RB19_SELML Unreviewed; 439 AA. AC D8RB19; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ30414.1}; GN ORFNames=SELMODRAFT_440338 {ECO:0000313|EMBL:EFJ30414.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ30414.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377575; EFJ30414.1; -; Genomic_DNA. DR RefSeq; XP_002968160.1; XM_002968114.1. DR STRING; 88036.EFJ30414; -. DR EnsemblPlants; EFJ30414; EFJ30414; SELMODRAFT_440338. DR GeneID; 9650126; -. DR KEGG; smo:SELMODRAFT_440338; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; D8RB19; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 439 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003121659. FT COILED 369 396 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 439 AA; 48091 MW; B27588903424943C CRC64; MVSDLRLAAL VLLVILFFCY ASGPGIGIEL HSGDPEPDEH SIAEISEAQH QIVESDEQLL EIPSNLTNDI TNNRDLAEEK FPALFTENSS KGEEVTAFDP VHQNSSKGEG VELGTELVVS GEERKTVRFS LVGLDEYKRQ ATIEASANEN PVEENTTVRH KLEAEGKEYN FAAASHGAKV VSSNKDGKGG GNILVKDNDK YFRSPCSAEE KFVVVELSEE TLVDTIVIAN YELYSSNPRE LELLGSLMFP TEEWKLLGKF EAENVRQPQR FVLPKPEWAR YLKLRILSHY GAEFYCTLSA VEVFGVAIER MLEGWIGRKS NEDSGGDPSR KPDVGDKRDA STTPGGPTNA GPAGGSTGGS HGSSTTSLNK FLIEKLKQLE REHRVLENLC RMQVARMADR ELVAFSVALL AATLLILPKT KFPIVLAVAG SLVMLILAV // ID D8RB20_SELML Unreviewed; 286 AA. AC D8RB20; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ30415.1}; GN ORFNames=SELMODRAFT_409313 {ECO:0000313|EMBL:EFJ30415.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ30415.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377575; EFJ30415.1; -; Genomic_DNA. DR RefSeq; XP_002968161.1; XM_002968115.1. DR STRING; 88036.EFJ30415; -. DR EnsemblPlants; EFJ30415; EFJ30415; SELMODRAFT_409313. DR GeneID; 9631305; -. DR KEGG; smo:SELMODRAFT_409313; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 286 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003121658. FT TRANSMEM 254 275 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 286 AA; 32304 MW; 3E4B82D47F8CBFE7 CRC64; MALVFFVILL LCSIFQLGIP TLGGGPVPKF QVTSLAEYKR MVNTNEYEGP FVRLEEHNYA AAANGARVVS FNKEAQGGGN ILNRDKDRHY SSPCSAEDKF VVVELSKETF VGAILIANYD EDSSYPRDLE LLGSLEYPTE EWTLLGRLEA KDDIGAFQAF ILPRTDHWVR YLKLRILSHH REESRCTLGT MMVYEPLIKR TRPKVFDSSF KPQQPPSPAP PTCTCKSAKE LLGEETQKIL AKARSCDAWT FDPYIALLAV PLILFQFSLA YRCGLVDQRN NHDMKC // ID D8RB21_SELML Unreviewed; 431 AA. AC D8RB21; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ30724.1}; GN ORFNames=SELMODRAFT_409314 {ECO:0000313|EMBL:EFJ30724.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ30724.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377575; EFJ30724.1; -; Genomic_DNA. DR RefSeq; XP_002968470.1; XM_002968424.1. DR STRING; 88036.EFJ30724; -. DR EnsemblPlants; EFJ30724; EFJ30724; SELMODRAFT_409314. DR GeneID; 9631306; -. DR KEGG; smo:SELMODRAFT_409314; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; D8RB21; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 22 {ECO:0000256|SAM:SignalP}. FT CHAIN 23 431 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003121652. FT TRANSMEM 332 351 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 431 AA; 48738 MW; B560D06A3D7F84F0 CRC64; MAGDLRLPVV LILLFWFVSG SGIGIELHYG DPETNEAQIF EAQLSDELQA GEERDEMQVG DFRGCRADFA GSCAEPEDLG TVASETSDIL VEENYAAASL GAKVIGANKE AEGDYVLNKD KDKHFRSPCS AEEKFVVVEL SEETLVVTIA IANYELLSSN PRELELLGSL EHPTEEWKLL GRFEAKDVRI PPRFTLSVPV WARYLKLRYL SHYGTNFYCT LSTIEVFGDG VERMIEGWMS KRPSLNISSS SEETQSQVRE LVARRKILLS YLEYMRMVLL ENYHEEMPNV LLRLDGVMLE FQAAKENLDN VTVLKFPLER CMRAVVKVRD KHLLMIIVAV IVLSVSLSVY ISTSTILKVY CPWATFSQHD YGKTFMDNEE GAWVVREGRQ AIHASRLAPA LKGRAGDPTS LQFLQECWVA RIRISVVMDW F // ID D8RB22_SELML Unreviewed; 464 AA. AC D8RB22; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ30416.1}; GN ORFNames=SELMODRAFT_440339 {ECO:0000313|EMBL:EFJ30416.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ30416.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377575; EFJ30416.1; -; Genomic_DNA. DR RefSeq; XP_002968162.1; XM_002968116.1. DR UniGene; Smo.10521; -. DR STRING; 88036.EFJ30416; -. DR EnsemblPlants; EFJ30416; EFJ30416; SELMODRAFT_440339. DR GeneID; 9650127; -. DR KEGG; smo:SELMODRAFT_440339; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; D8RB22; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 464 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003121665. FT COILED 343 363 {ECO:0000256|SAM:Coils}. FT COILED 375 399 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 464 AA; 51896 MW; 89E67CD97D50EA0E CRC64; MVSDLRLVVL VLLIVSGLGI GMELHSKLPE VDEASIPGIL QAHHPILELE EQRLDISSRD RAKFQNSSNG EEEVSASEER SVPLASPDEH MQMPRIEFPG SENQRLFEAI SPVENSGNGQ EASEEVSAST QLAEERSVPL AKRSVPLASA TRDEHMRTEF LASENQDDEN RHKLNRKQHH NYAAASLGAK VLGVNKEGKG GGNILIKDND KYFRNPCGAK DKFVIVELAE EILVETFVIA NYELYSSNPR ELELLGSLSY PSSGWKLLGK FEAKNVRQPQ RFILAKQEWA RYLKLRMLSH YGTEFYCTLS SVEVFGVAIG RMLEDLIGSS PGDSSSVSTT SVNKFLIEQL KQLESEQKTF EEYVEVSNSQ NTAAVNDYQE ELTNVMEQLK SLNDSVQEQH RLVKEMYSES MEGLSLCRKE VDRIANGQVH SLMLTLVAML VVVSKLRILV RLALTGFLVI LVLM // ID D8RB23_SELML Unreviewed; 289 AA. AC D8RB23; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ30725.1}; GN ORFNames=SELMODRAFT_409316 {ECO:0000313|EMBL:EFJ30725.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ30725.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377575; EFJ30725.1; -; Genomic_DNA. DR RefSeq; XP_002968471.1; XM_002968425.1. DR STRING; 88036.EFJ30725; -. DR EnsemblPlants; EFJ30725; EFJ30725; SELMODRAFT_409316. DR GeneID; 9631307; -. DR KEGG; smo:SELMODRAFT_409316; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 27 {ECO:0000256|SAM:SignalP}. FT CHAIN 28 289 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003121662. FT TRANSMEM 236 255 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 267 286 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 289 AA; 32567 MW; 1EDD609AA95019CD CRC64; MALLFFEILL LCSVFQLGIL TSGTSLAEYE YDGPYVRFEE HNYAAAANGA RVVSLNIEAR GGGNILNRYK DQYYSSPCSA EDKFVVVELS KEIFVGAILI ASYNDDSSHP RDLEILGSLE YPAEEWKLLG RLEAKDDIGA FQVFILPRSD HSVRYLKLRI LSHHREETLC TLGTMMVYEP LIKRTRPQVF GAPFKPQQPP SPKVPPVGDT CTCKSAKELL GEEIQKILAK VESCNIWSFD RFVALLALPF FLFLFPSLVD KCGLDQGFFR GIIFSSGMLY LIMVVSKYF // ID D8RB24_SELML Unreviewed; 443 AA. AC D8RB24; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ30726.1}; GN ORFNames=SELMODRAFT_409317 {ECO:0000313|EMBL:EFJ30726.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ30726.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377575; EFJ30726.1; -; Genomic_DNA. DR RefSeq; XP_002968472.1; XM_002968426.1. DR STRING; 88036.EFJ30726; -. DR EnsemblPlants; EFJ30726; EFJ30726; SELMODRAFT_409317. DR GeneID; 9650128; -. DR KEGG; smo:SELMODRAFT_409317; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; D8RB24; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 443 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003121661. FT TRANSMEM 319 339 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 443 AA; 49992 MW; 992448720FEA6513 CRC64; MARLPALPRV LFVILLLSRS ATGLHSSITQ ICDGKRSRHG QSSPEAEHQI LESEELYFGG LERDEAEISV ELDGVEILSA RTALAAAENH EKPMENTAVR KRLKKVTPIS LKDYKRMVSW ASGSDKWNTT IKHRLEPGGE EFNFAAASHG AQIVTSSSDG GNLLKDKYFR SPCKAKEKSF VLKLAEEVLV DTVVIENHEL YSSNPRELEV LGTLSYPTEN WRLLGNVEAQ NICRPQRFVL PKPEWARYLK LRILSHYGNE YYCTLNHLQI FGSGLEGLIE EAQPYCELQQ QSFHFQDATQ IHKLQEELRL LKSQMADDVL YVAIAVLVLV SAVLVYPWMG TIDYEVMKGL KDCLSLRRYF PSKGEVRQVQ YGGQRPNAQS SLPGDAHTDW GGLEQQRQAF VVSKFVDRSI CREMLAICLL QEEAGLSMVF VGNLMSEPKF QPC // ID D8RH39_SELML Unreviewed; 291 AA. AC D8RH39; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ28703.1}; GN ORFNames=SELMODRAFT_93925 {ECO:0000313|EMBL:EFJ28703.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ28703.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377579; EFJ28703.1; -; Genomic_DNA. DR RefSeq; XP_002970573.1; XM_002970527.1. DR STRING; 88036.EFJ28703; -. DR EnsemblPlants; EFJ28703; EFJ28703; SELMODRAFT_93925. DR GeneID; 9632483; -. DR KEGG; smo:SELMODRAFT_93925; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D8RH39; -. DR KO; K19347; -. DR OMA; RVSGWYQ; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}. FT COILED 38 58 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 291 AA; 32171 MW; B3ADE1CEDE26881F CRC64; MQAQLDLIDM KIGKEMAGLK RDVEHMIDAE AFSIASKVHN LKSQMDSIES SLNFLKQEGI LTRQETLQLI GSAADDRATD GSGKALSLDD VRAAAKKIIE DELERHRADG IGRTDYALAS GGGRVIDYSE GYFSLTPWSG LLGILPGDYR RHPKANKILE PSFGEPGQCL PLKGSNVFVD IRLRTAIYAD SITLEHVSKR VAYDTGSAPR DFQVFGWLEA SAAHKKGERV LLGSFRYDIE SSSVQTFRLY KTASKLLVNT VRVHVVSNYG SSTHTCLYRV RVHGTEPHSE E // ID D8RHG5_SELML Unreviewed; 492 AA. AC D8RHG5; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ28523.1}; GN ORFNames=SELMODRAFT_441228 {ECO:0000313|EMBL:EFJ28523.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ28523.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377579; EFJ28523.1; -; Genomic_DNA. DR RefSeq; XP_002970393.1; XM_002970347.1. DR UniGene; Smo.4639; -. DR STRING; 88036.EFJ28523; -. DR EnsemblPlants; EFJ28523; EFJ28523; SELMODRAFT_441228. DR GeneID; 9632705; -. DR KEGG; smo:SELMODRAFT_441228; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; D8RHG5; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 492 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003121813. FT COILED 369 396 {ECO:0000256|SAM:Coils}. FT COILED 401 432 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 492 AA; 54449 MW; D396FDCC0E6DB2D5 CRC64; MVSDLRLAAL VLLVILFFCY ASGPGIGIEL HSGDPEPDEH SIAEISEAQH QIVESDEQLL EIPSNLINDI TNERDLAEEK FPALFTENSS KGEEVTAFDR VHQNSSKGEE LELGTELVVS GEERKTVRFS LVGLDEYKRQ ATIEASANEN PVEENTTVRH KLEAEGKEYN FAAASHGAKV VSSNKDGKGG GNILVKDNDK YFRNPCSAED KFVVVELSEE TLVDTIVIAN YELYSSNPRE LELLGSLMFP TEEWKLLGKF EAENVRQPQR FVLPKPEWAR YLKLRILSHY GAEFYCTLSA VEVFGVAIER MLEGWIGRKS NEDSGGDPSR KPDVGDKRDA STTPGGPTNA GPAGGSTGGS HGSSTTSLNK FLIEKLKQLE REHRVLEKYV DDLSTHHSAV LKEYDQELSN VMKRLKSMND AWKKQLRVAE EMFSESTGEL SLCRMQVARM ADRELVAFSV ALLAATLLIL PKTKFPIVLA VAGSLVMLIL AV // ID D8RHG6_SELML Unreviewed; 585 AA. AC D8RHG6; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ28524.1}; GN ORFNames=SELMODRAFT_411383 {ECO:0000313|EMBL:EFJ28524.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ28524.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377579; EFJ28524.1; -; Genomic_DNA. DR RefSeq; XP_002970394.1; XM_002970348.1. DR STRING; 88036.EFJ28524; -. DR EnsemblPlants; EFJ28524; EFJ28524; SELMODRAFT_411383. DR GeneID; 9631964; -. DR KEGG; smo:SELMODRAFT_411383; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}. FT COILED 343 363 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 585 AA; 65867 MW; CEFA168AB0A72F22 CRC64; MEVTTLIQFP GQIMNGYMIL LRSRSIKLDC FMGKSSWRRT APRYLDTVRG DENSPHGNLA YQETLESQIF KSLSIYTRRF SHDLKGYLRQ KPEDLPRAEL TQTENYRRKT LRVLWMKINV LDLTRQVKQY RVRGMLQSWK RSTKLSSRMK IPSNLTNATA KDKFPALFTE TSKRQATIEA SANERGEYHC PAQTRSRGRK VVRTTNFGNA EEKFVVVELS EKTLVDTIDY DLYSSNVGAA GKPDVCAAKG GVARYLKLRI LAHYGAEFYC TLSAVEVFGV TIERMLEGWI GRKSNVDTGG DPSRKPDVGD KSDASTTPGG PTNAGPAGGS TGGSHGSSTT AVLKEYDQEL SNVMKRLKSM NDSWKKQGGG PVPKFQVTSL AEYKRMVNLN EYEGPFVRLE EYNYAAAANG ARVVSFNKEA QGGGNILNRI KDQHYSSPCS AEDKFVVVEL SKEIYVGAII IVNFDEDSSY PRDLELLGSL EYPTEEWKLL GRFEAKDDIR SFQAFILPRT DHSVRYLKLR ILSHHREESI CTLSTMMVYE PLIKRTRPQV FDAPFKPEPP PSPEGHISCT CLGEEIEKVI GHPQV // ID D8RHG7_SELML Unreviewed; 469 AA. AC D8RHG7; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ28774.1}; GN ORFNames=SELMODRAFT_411384 {ECO:0000313|EMBL:EFJ28774.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ28774.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377579; EFJ28774.1; -; Genomic_DNA. DR RefSeq; XP_002970644.1; XM_002970598.1. DR STRING; 88036.EFJ28774; -. DR EnsemblPlants; EFJ28774; EFJ28774; SELMODRAFT_411384. DR GeneID; 9631965; -. DR KEGG; smo:SELMODRAFT_411384; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; D8RHG7; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 469 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003121810. FT COILED 322 346 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 469 AA; 52602 MW; 88476630ABFAC693 CRC64; MVSNLRLAVL LLLVVSGLGI GMELHSELPE VDEAPIPEIL EAHHQILDLE EQFLDISCNR TAESSSNGEE VSPFDPDRAT ALAEERSVPL ASPELNPLHS HILVSASENQ DDDNRHKRDR QVGGNEHNYA AASLGAKVLG ANKEGKGAGN ILVKGNDKYF RNPCSAKEKF VMVELAEEIL VETFVIANYE LYSSNPRELE LLGSLSYPTS GWRLLGKFEA RNVGQPQRFI LSKPEWARYL QLRILSHYGT EFYCTLSTFE VFGVALGRML EDLIGSSPGD STSGSTTSVN KFLIEQILQL ESDQKTFEEY VEFSNSQNTA AVNEYQEELT NIMRQLKSLN DTVQKQHRLA DDVFSKSMKE LRLCRKEVAR INRGQIYSLT VALVAMLVVV AKIRILVRLA LTGFLGARYV PPEHSCLAAI GVKFLPFARI FSQWILKVVL YNREPLSKEF CGKFMTSPKW HAITWHKEG // ID D8RHG9_SELML Unreviewed; 757 AA. AC D8RHG9; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ28525.1}; GN ORFNames=SELMODRAFT_411386 {ECO:0000313|EMBL:EFJ28525.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ28525.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377579; EFJ28525.1; -; Genomic_DNA. DR RefSeq; XP_002970395.1; XM_002970349.1. DR STRING; 88036.EFJ28525; -. DR EnsemblPlants; EFJ28525; EFJ28525; SELMODRAFT_411386. DR GeneID; 9632707; -. DR KEGG; smo:SELMODRAFT_411386; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; D8RHG9; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 2. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR SUPFAM; SSF49785; SSF49785; 2. DR PROSITE; PS51469; SUN; 2. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 757 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003121820. FT TRANSMEM 362 382 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 724 744 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 757 AA; 85624 MW; 9D7FA38BEEFF29F7 CRC64; MAGDRRLPVV LIFEILLFWF VSGAGIGIEL HYGDPETHEA QIFEAQLSDE LQAGEERDEM QVGDFRGFRA DFAGSCAEPG DLGTAASETS DILVEENYAA ASLGAKVIGA NKEAEGGGYV LNKDKDKHFR SPCSAEEKFV VVELSEETLV VTIAIANYEL LSSNPRELEL LGSLEYPTEE WKLLGRFEAK DVRIPQRFTL SVPEWARYLK LRYLSHYGTE FYCTLSTFEV FGDGVERMIE GWMGKRPILN ISSSSEENQS QAPFQKVFPR VPQGEKGFSQ VVLVGKVREL AMKRKILLDY IDYVRMVLLE NSHREMPKVV SRLESVKIEV QAARVLVDNI TVLQFPIKRC VKQIRNISDR RYFVVTMAFL VTVYLFLSSM AIRQAIKLMW RRIFGSTRYE HLLRMKVGIL KVTEIESPLP GSAPGYISAS SGRSWLLPAS ICDGKRARHE ESSPDVEYQI LESDELYFGD LERDEAEISV ELDGVEILSA RTALSAAESH EEPMENTTAC ERLKKVTTVS LKDYKRMVSW ASESDKWNTT IKHRLEPGGE EFNFAAASHG AQIVTSSSDG GNLLKDKYFR SPCKAKEKSF VLQLAEEVLV DTVVIENHEL YSSNPRELEV LGTLRYPTES WRLLGNVEAQ NICRPQRFVL PKPEWARYLK LRILSHYGNE YYCTLNHLQI FGSGLEGLIE EAQPYCEQQQ QDATQIHKLQ EELRLLKSQM ADDVLYVAIA VLVVVSAVLV YPWMGTIVYN GKAWSSG // ID D8SQF9_SELML Unreviewed; 336 AA. AC D8SQF9; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ13391.1}; GN ORFNames=SELMODRAFT_122417 {ECO:0000313|EMBL:EFJ13391.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ13391.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377633; EFJ13391.1; -; Genomic_DNA. DR RefSeq; XP_002985517.1; XM_002985471.1. DR STRING; 88036.EFJ13391; -. DR EnsemblPlants; EFJ13391; EFJ13391; SELMODRAFT_122417. DR GeneID; 9653378; -. DR KEGG; smo:SELMODRAFT_122417; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D8SQF9; -. DR KO; K19347; -. DR OMA; WVHTSPR; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}. FT COILED 60 94 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 336 AA; 36996 MW; EB7D7A4CB37531DD CRC64; MATTAWKWSR DPGVAPGISI AQMKELEDFS RKTTKWMQVQ LELVDMKIGK EIEGLRRDVE DKIEEQALAL ETSIMNLKSQ VEDMDSSVQR LTQDGLLTKK EGMELIASII QQRAAEESGK SLTLDDVRIA ARQIVEAELE KHSADGIGRV DYALGSGGGK IIEHSEGFFT GGRAGWLSIL GAGLSAGGAV RHPMAHKVLE PSYGEPGQCL PLKGSNVFVE IALRTHIHPD AITIEHVPKS VAYDVTSAPK DFRVFGWLER SIQAGTIARP VKKLLLGEFS YSLDGSNIQT FSFPEEVSRE LINTVRIHIT SNHGSASHTC LYRVRVHGFE PSPQHF // ID D8SZM9_SELML Unreviewed; 301 AA. AC D8SZM9; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ10049.1}; DE Flags: Fragment; GN ORFNames=SELMODRAFT_44381 {ECO:0000313|EMBL:EFJ10049.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ10049.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377656; EFJ10049.1; -; Genomic_DNA. DR RefSeq; XP_002988787.1; XM_002988741.1. DR STRING; 88036.EFJ10049; -. DR EnsemblPlants; EFJ10049; EFJ10049; SELMODRAFT_44381. DR GeneID; 9661413; -. DR KEGG; smo:SELMODRAFT_44381; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D8SZM9; -. DR KO; K19347; -. DR OMA; EICAWRT; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}. FT NON_TER 1 1 {ECO:0000313|EMBL:EFJ10049.1}. FT NON_TER 301 301 {ECO:0000313|EMBL:EFJ10049.1}. SQ SEQUENCE 301 AA; 33836 MW; 06153677412FDA8D CRC64; AKVSEVEDFM RKTSKWLQVQ LDVVDDKIGK EVSGVRSELE EKLSERTRGF EKDIGSIKAQ VQKVDNSLKM LYSQELLSRE ETLELVKSAM DQRAREGSDK AISLDDVRAA ARKVVQSELE THAADGIGRV DFALESGGGK VVHHSDGYFQ GLHWTRLGIH VLPGVFRRHP MADRLLRPSF GEPGQCLPLK GSNVTVDIRL RAHIFPEAVT LEHLSKKVAY DPRSAPRDFE ICAWRTVKDD VLDARNVTSL GRFTYDLEKG SIQTFDLSDK PTSSVNMIRL HVLSNYGSPT HTCLYRLRVH G // ID D8T8N8_SELML Unreviewed; 336 AA. AC D8T8N8; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ06924.1}; GN ORFNames=SELMODRAFT_134561 {ECO:0000313|EMBL:EFJ06924.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ06924.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377691; EFJ06924.1; -; Genomic_DNA. DR RefSeq; XP_002991956.1; XM_002991910.1. DR STRING; 88036.EFJ06924; -. DR EnsemblPlants; EFJ06924; EFJ06924; SELMODRAFT_134561. DR GeneID; 9645205; -. DR KEGG; smo:SELMODRAFT_134561; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D8T8N8; -. DR KO; K19347; -. DR OMA; VKHSEPF; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}. FT COILED 60 94 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 336 AA; 37012 MW; EB7D7A4CAD74D1DD CRC64; MATTAWKWSR DPGVAPGISI AQMKELEDFS RKTTKWMQVQ LELVDMKIGK EIEGLRRDVE DKIEEQALAL ETSIMNLKSQ VEDMDSSVQR LTQDGLLTKK EGMELIASII QQRAAEESGK SLTLDDVRIA ARQIVEAELE KHSADGIGRV DYALGSGGGK IIEHSEGFFT GGRAGWLSIL GAGLSAGGAV RHPMAHKVLE PSYGEPGQCL PLKGSNVFVE IALRTHIHPD AITIEHVPKS VAYDVTSAPK DFRVFGWLER SIQAGTIARP VKKLLLGEFS YSLDGSNIQT FSFPEEVSRE LINTVRIHIT SNHGSSSHTC LYRVRVHGFE PSPQHF // ID D8TER8_SELML Unreviewed; 317 AA. AC D8TER8; DT 05-OCT-2010, integrated into UniProtKB/TrEMBL. DT 05-OCT-2010, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFJ04843.1}; GN ORFNames=SELMODRAFT_138182 {ECO:0000313|EMBL:EFJ04843.1}; OS Selaginella moellendorffii (Spikemoss). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Lycopodiidae; Selaginellales; Selaginellaceae; Selaginella. OX NCBI_TaxID=88036 {ECO:0000313|Proteomes:UP000001514}; RN [1] {ECO:0000313|EMBL:EFJ04843.1, ECO:0000313|Proteomes:UP000001514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551031; DOI=10.1126/science.1203810; RA Banks J.A., Nishiyama T., Hasebe M., Bowman J.L., Gribskov M., RA dePamphilis C., Albert V.A., Aono N., Aoyama T., Ambrose B.A., RA Ashton N.W., Axtell M.J., Barker E., Barker M.S., Bennetzen J.L., RA Bonawitz N.D., Chapple C., Cheng C., Correa L.G., Dacre M., RA DeBarry J., Dreyer I., Elias M., Engstrom E.M., Estelle M., Feng L., RA Finet C., Floyd S.K., Frommer W.B., Fujita T., Gramzow L., RA Gutensohn M., Harholt J., Hattori M., Heyl A., Hirai T., Hiwatashi Y., RA Ishikawa M., Iwata M., Karol K.G., Koehler B., Kolukisaoglu U., RA Kubo M., Kurata T., Lalonde S., Li K., Li Y., Litt A., Lyons E., RA Manning G., Maruyama T., Michael T.P., Mikami K., Miyazaki S., RA Morinaga S., Murata T., Mueller-Roeber B., Nelson D.R., Obara M., RA Oguri Y., Olmstead R.G., Onodera N., Petersen B.L., Pils B., RA Prigge M., Rensing S.A., Riano-Pachon D.M., Roberts A.W., Sato Y., RA Scheller H.V., Schulz B., Schulz C., Shakirov E.V., Shibagaki N., RA Shinohara N., Shippen D.E., Soerensen I., Sotooka R., Sugimoto N., RA Sugita M., Sumikawa N., Tanurdzic M., Theissen G., Ulvskov P., RA Wakazuki S., Weng J.K., Willats W.W., Wipf D., Wolf P.G., Yang L., RA Zimmer A.D., Zhu Q., Mitros T., Hellsten U., Loque D., Otillar R., RA Salamov A., Schmutz J., Shapiro H., Lindquist E., Lucas S., RA Rokhsar D., Grigoriev I.V.; RT "The Selaginella genome identifies genetic changes associated with the RT evolution of vascular plants."; RL Science 332:960-963(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL377747; EFJ04843.1; -; Genomic_DNA. DR RefSeq; XP_002994096.1; XM_002994050.1. DR STRING; 88036.EFJ04843; -. DR EnsemblPlants; EFJ04843; EFJ04843; SELMODRAFT_138182. DR GeneID; 9637516; -. DR KEGG; smo:SELMODRAFT_138182; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; D8TER8; -. DR KO; K19347; -. DR OMA; MEIARHS; -. DR Proteomes; UP000001514; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001514}; KW Reference proteome {ECO:0000313|Proteomes:UP000001514}. FT COILED 55 75 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 317 AA; 34939 MW; 24E4B6436DE995B3 CRC64; MAQMKELEDF SRKTTKWMQV QLELVDMKIG KEIEGLRRDV GDKIEEQALA LETSILNLRS QVEDMDSSVQ RLTQDGLLTK KEGMELIASI IQQRAAEESG KSLTLDDVRI AARQIVEAEL EKHSADGIGR VDYALGSGGG KIIEHSEGFF TGGRAGWLSI LGAGLSAGGA VRHPMAHKVL EPSYGEPGQC LPLKGSNVFV EIALRTHIHP DAITIEHVPK SVAYDVTSAP KDFRVFGWLE RSIQAGTIAR PVKKLLLGEF SYSLDGSNIQ TFSFPEEVSR ELINTVRIHI TSNHGSASHT CLYRVRVHGF EPSPQHF // ID E0SA25_ENCIT Unreviewed; 264 AA. AC E0SA25; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 02-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Spindle pole body associated protein {ECO:0000313|EMBL:ADM12647.1}; GN ORFNames=Eint_111480 {ECO:0000313|EMBL:ADM12647.1}; OS Encephalitozoon intestinalis (strain ATCC 50506) (Microsporidian OS parasite) (Septata intestinalis). OC Eukaryota; Fungi; Microsporidia; Unikaryonidae; Encephalitozoon. OX NCBI_TaxID=876142 {ECO:0000313|EMBL:ADM12647.1, ECO:0000313|Proteomes:UP000002313}; RN [1] {ECO:0000313|EMBL:ADM12647.1, ECO:0000313|Proteomes:UP000002313} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 50506 {ECO:0000313|EMBL:ADM12647.1, RC ECO:0000313|Proteomes:UP000002313}; RX PubMed=20865802; DOI=10.1038/ncomms1082; RA Corradi N., Pombert J.-F., Farinelli L., Didier E.S., Keeling P.J.; RT "The complete sequence of the smallest known nuclear genome from the RT microsporidian Encephalitozoon intestinalis."; RL Nat. Commun. 1:77-77(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP001952; ADM12647.1; -; Genomic_DNA. DR RefSeq; XP_003074007.1; XM_003073961.1. DR EnsemblFungi; ADM12647; ADM12647; Eint_111480. DR GeneID; 9699716; -. DR KEGG; ein:Eint_111480; -. DR EuPathDB; MicrosporidiaDB:Eint_111480; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000149206; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000002313; Chromosome XI. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002313}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 58 76 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 85 105 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 264 AA; 30075 MW; 9A016BCDF4837AA8 CRC64; MNRRDRLQVK RTPDSTLNLG MDETIILQAA APKARANKSV EPMEGNGYGQ ERKGIKDYPI YMAIAIPYVL FFYMAIKRPM DSMILTNLME EINILREENS RISSQIEMMK HIKEVNYAKI EEGARIRIES MSQLFSYGFL GFRKHKEPST IFDENVGIGE CLAFKGAGCK FSIDLEKEAA ISKIGLYHPV TKDTSSAIRE FEVFSNSPEG NLLLGRFEYD TSTCGFQTFE WEETPISSVE IVVRSNGGNK KYTCIYKVYL FGNK // ID E0VFB3_PEDHC Unreviewed; 2686 AA. AC E0VFB3; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 02-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Hect E3 ubiquitin ligase, putative {ECO:0000313|EMBL:EEB12069.1}; GN ORFNames=Phum_PHUM154190 {ECO:0000313|EMBL:EEB12069.1}; OS Pediculus humanus subsp. corporis (Body louse). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Paraneoptera; Phthiraptera; Anoplura; OC Pediculidae; Pediculus. OX NCBI_TaxID=121224 {ECO:0000313|Proteomes:UP000009046}; RN [1] {ECO:0000313|Proteomes:UP000009046} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=USDA {ECO:0000313|Proteomes:UP000009046}; RX PubMed=20566863; DOI=10.1073/pnas.1003379107; RA Kirkness E.F., Haas B.J., Sun W., Braig H.R., Perotti M.A., RA Clark J.M., Lee S.H., Robertson H.M., Kennedy R.C., Elhaik E., RA Gerlach D., Kriventseva E.V., Elsik C.G., Graur D., Hill C.A., RA Veenstra J.A., Walenz B., Tubio J.M., Ribeiro J.M., Rozas J., RA Johnston J.S., Reese J.T., Popadic A., Tojo M., Raoult D., Reed D.L., RA Tomoyasu Y., Krause E., Mittapalli O., Margam V.M., Li H.M., RA Meyer J.M., Johnson R.M., Romero-Severson J., Vanzee J.P., RA Alvarez-Ponce D., Vieira F.G., Aguade M., Guirao-Rico S., Anzola J.M., RA Yoon K.S., Strycharz J.P., Unger M.F., Christley S., Lobo N.F., RA Seufferheld M.J., Wang N., Dasch G.A., Struchiner C.J., Madey G., RA Hannick L.I., Bidwell S., Joardar V., Caler E., Shao R., Barker S.C., RA Cameron S., Bruggner R.V., Regier A., Johnson J., Viswanathan L., RA Utterback T.R., Sutton G.G., Lawson D., Waterhouse R.M., Venter J.C., RA Strausberg R.L., Berenbaum M.R., Collins F.H., Zdobnov E.M., RA Pittendrigh B.R.; RT "Genome sequences of the human body louse and its primary endosymbiont RT provide insights into the permanent parasitic lifestyle."; RL Proc. Natl. Acad. Sci. U.S.A. 107:12168-12173(2010). CC -!- SIMILARITY: Contains 2 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS235110; EEB12069.1; -; Genomic_DNA. DR RefSeq; XP_002424807.1; XM_002424762.1. DR EnsemblMetazoa; PHUM154190-RA; PHUM154190-PA; PHUM154190. DR GeneID; 8236459; -. DR KEGG; phu:Phum_PHUM154190; -. DR VectorBase; PHUM154190; Pediculus humanus. DR CTD; 8236459; -. DR InParanoid; E0VFB3; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; E0VFB3; -. DR Proteomes; UP000009046; Partially assembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 2. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009046}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:EEB12069.1}; KW Reference proteome {ECO:0000313|Proteomes:UP000009046}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1258 1285 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2686 AA; 296904 MW; 78C3AFC88B93DC84 CRC64; MAEVDPETLL EWLSIGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCRI FLDECAPDNV LEVTARAITY YLDVSAECTR RIIAIDGAVK AICNRLVVSD FSERTSKDLA EQCIKVLELI CVREAGAVFE GGGLGCVLMF IRDNGWRVHK DTLHSAMAVV SRLCTKMEPQ DSSLPACVEA LSTLLKHEDT HVSDGALRCF ASLSDRFTRR GVDPAPLAEH GLVNELLSRL SNAAGPAAVT NGPVTLGANT STSAPETTKC SSSVSTIISL LSALCRGSHS ITHDLLRSQL PDAIEKALQG DERCSLDSMR LVDLLLILLF EGRKALLRAG SSSGQLLPRM RRMDSTGEKC HRQLIDCIRS KDTDALVEAI DNAGVEVNFM DDVGQTLLNW ASAFGTQEMV EFLCERGADV NKGQRSSSLH YAACFGRPAI AKVLLRHGAN PDLRDEDGKT PLDKARERID EGHREVAAIL QSPGEWVIPG DKDKKCEAVE SEDVNEPKGD PEMAPIYLKR LLPVFCATFQ STMLPSVRKA SLSLIKKMVH YIQPSLLIEA CNSDVSTHNL GTMLVEVIAT VLDNEEDEDG HLVVLQIIQD LTIKAQDIFL DHFARLGVFS KVLQLAGPQD VMTPPIKKPD KNEETNQEPV LEDAKEILSG KAYHWRDWCI CRGRDCLYIW SDAAALELSN GSNGWFRFIL DGKLATMYSS GSPEGGSDSS ENRGEFLEKL QRARAQVKPN SVSQPLLSNP GPTRLVVGNW ALSSRKEGEI HILNSDGQQS VWCGFQQATI LREDLPGFIF ESNRGTKHSF TAETSLGPEF AAGWSGKKGK RLRSKIEAIK QKVKIQAQEI YDKYFKVAQA QPRGVVAKLG NIVAQIERAC QKQHTNFRDG STWREILKSA LDELSQLLSE EGVVSAYELH SSGLVQALLS LLSTGPWDEG QKSNKTSKLQ KQRVRVFKNC FKEKDGENGS INAGMILVHK LISVLESIEK LPVYLYDTPG SGYGLQILTR RLRFRLEKGA GESSLIDRTG RGLKMEPLST VMQLEKYLLK MVAKQWYDYD RSTFSFVRKL KETNQMIHFK HQTDFDENGV IYWIGTNGKT SSEWVNPAQY GLVMVTSSDG RNLPYGRLED ILSRDSSALN CHTNDDRKAW FAIDLGLWLI PSCYTIRHAR GYGRSALRNW LFQVSKDGVN WTTLYTHTDD TSLNEPGSTA SWPLDPPLDE TQGWRHVRLQ QTGKNASGQA HYLSLSGFEV YGMVTGVCED LGKAAKEAEA NLRRQRRLLR NQILKHLVVG ARVVRGLDWK WRDQDGPPPG EGTVSGELHD GWIDVTWDHG GSNSYRMGAE GKYDLRLSVM SESDSNFVPV SSSKPVVPSG KAKSEGKTSV LTSRKSSSTP SLPDATESHS KTSVASTDQA ASADNLAAKQ AAEAIADSVL SIARAEAKVA IAGNKKNNST PGSELSVVVH TLRGPHHDLS SISSAGSSDL ATIVETLTLD TKSNYVVGGT SNQQVKRQSS CSDEIQTNNT TNTYNNGNKG NVEATVGKTN LLVSSNALSN SSVLNKSFYC PSRTNKMSSS NPPPGEVHTL LSAETIEVFD KMREGQDLLR NNTNSFLSGL MAANLTPCVR ISVTGGESDN VSDERAIRIK NRHQPIQPSV TNAGAESVNT KRDKENKDNK DKDKDKDKDK DKDKDKDRKD GNNSVVVTNP MSVSVPNLTS GNNTIENTGT TGLLETFAAM ARRRTLGSVA GNTSVVNDNQ TVISTNSSAN NTNSTNHRNS MFPRGPSSVS SLVRLALSPN FPGGLLSTAQ SYPSLTSSGP TGTGTCVSTT AGTGPCLSQA LTMSLTSTSS ESEQVSFEDF LDSCRAPTLL AELDDDDMPD DNDENDDNDD ENEDDDYEEV TVSRNLLAFM EEETFDTRNT GGKRRSWDDE YILKRQFSAL IPAFDPRPGR TNVNQTTDLE IPAPGLASEA TTSTETCLTT QPKLQLILRG PNLPGVQDVE IELKDPKWTI FMAVQELIQA VDLGSRQEKL RRIWEPTYTI IYREIKEEES EINSVVTPVV TLYSRHGKNS GLFSSLLSPQ TPVSPSMSCT VEDVLHLLRH LFVIMTYRDE AMCNILDDEI LHPDEFTSKK ITNKLLQQIQ DPLVLSSGAL PTWCEELNHS CPFLFPFETR QLYFSCTAFG ASRSIVWLQT QRDVTLERQR TPGLSPRRDD PHEFRVGRLK HERVKVPRGE KLLDWAIQVM KIHADRKSIL EVEFQGEEGT GLGPTLEFYA LVAAELQRKD LGMWLCDDDV NTSNLDSDIS IDLGEGMKPP GYYVRRSSGL FPAPLPQDSP ACDRACTYFW FLGVFLAKVL QDNRLVDLPL STPFLKLMCH GEIHNNVNER IGLLPGTNSC KKSIDDDLMV SSYISEESEK DWDSDPLKGH PDDSKPWFSG ILNCEDLTEV DPVRGRFLRQ IQSLIARKSR IAQDNTLSPE ARAHQIQNLA LVSSSGPVVR LEDLAITFTY LPSSQVFGFS AAELTPGSCD KEVTMENVEE YSDLTTAFCL EKGITRQLNA FHSGFNKVFP ITKLKAFSPD EVRIMLCGDQ NPHWTREDVL NYTEPKLGYT RDSPGFLRFV NVLVKMSADE RKAFLQFTTG CSSLPPGGLA NLYPRLTVVR KVDAGEGSYP SVNTCVHYLK LPDYPTEELL RERLLAATRE KGFHLN // ID E0VIY0_PEDHC Unreviewed; 576 AA. AC E0VIY0; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 02-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEB13336.1}; GN ORFNames=Phum_PHUM235450 {ECO:0000313|EMBL:EEB13336.1}; OS Pediculus humanus subsp. corporis (Body louse). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Paraneoptera; Phthiraptera; Anoplura; OC Pediculidae; Pediculus. OX NCBI_TaxID=121224 {ECO:0000313|Proteomes:UP000009046}; RN [1] {ECO:0000313|Proteomes:UP000009046} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=USDA {ECO:0000313|Proteomes:UP000009046}; RX PubMed=20566863; DOI=10.1073/pnas.1003379107; RA Kirkness E.F., Haas B.J., Sun W., Braig H.R., Perotti M.A., RA Clark J.M., Lee S.H., Robertson H.M., Kennedy R.C., Elhaik E., RA Gerlach D., Kriventseva E.V., Elsik C.G., Graur D., Hill C.A., RA Veenstra J.A., Walenz B., Tubio J.M., Ribeiro J.M., Rozas J., RA Johnston J.S., Reese J.T., Popadic A., Tojo M., Raoult D., Reed D.L., RA Tomoyasu Y., Krause E., Mittapalli O., Margam V.M., Li H.M., RA Meyer J.M., Johnson R.M., Romero-Severson J., Vanzee J.P., RA Alvarez-Ponce D., Vieira F.G., Aguade M., Guirao-Rico S., Anzola J.M., RA Yoon K.S., Strycharz J.P., Unger M.F., Christley S., Lobo N.F., RA Seufferheld M.J., Wang N., Dasch G.A., Struchiner C.J., Madey G., RA Hannick L.I., Bidwell S., Joardar V., Caler E., Shao R., Barker S.C., RA Cameron S., Bruggner R.V., Regier A., Johnson J., Viswanathan L., RA Utterback T.R., Sutton G.G., Lawson D., Waterhouse R.M., Venter J.C., RA Strausberg R.L., Berenbaum M.R., Collins F.H., Zdobnov E.M., RA Pittendrigh B.R.; RT "Genome sequences of the human body louse and its primary endosymbiont RT provide insights into the permanent parasitic lifestyle."; RL Proc. Natl. Acad. Sci. U.S.A. 107:12168-12173(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS235212; EEB13336.1; -; Genomic_DNA. DR RefSeq; XP_002426074.1; XM_002426029.1. DR EnsemblMetazoa; PHUM235450-RA; PHUM235450-PA; PHUM235450. DR GeneID; 8230203; -. DR KEGG; phu:Phum_PHUM235450; -. DR VectorBase; PHUM235450; Pediculus humanus. DR CTD; 8230203; -. DR InParanoid; E0VIY0; -. DR KO; K19347; -. DR OMA; GHIESAP; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000009046; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009046}; KW Reference proteome {ECO:0000313|Proteomes:UP000009046}. FT COILED 228 255 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 576 AA; 66531 MW; 8C422E3AF9C45012 CRC64; MFFFFFFFFS KRRRFFLFKA EEKSGIISDQ FKALDCQTNG LLTSLESKVK PIWKTGYVTY KKATAAVNCY ALDTYDRLKL PFVQGGQWMN SYFSNVHDDP LTSVALGFWK TFSTGLCTIA NALQNAIFHF CYVLIDFFTD VLSAASNLFV YTLTYLKDVF FTVFEILKRF LGNFYDALKK SVLNVWSYLY LLLFSTKDDD KILRNEMSEK VMDTSLHYKS SPVKSDELKN IAWDLSNFKK RFDELESEFR KEKQSVKMRN DFYNKQFENN NLALQKYETV LNRAVSSAKK PDSITNSTVQ YLLDNHFNEL KQWILENFML KIDSPNLDEN KIVSIIAKHL NKTGEELMIL KEEFKRFTHD HDVELKSVKK DFGSLITSNM GGDISDSYIR KIVNDAIKLY DADKTGRVDF ALESAGGQIV SIRCTETYMA SPKSYVLMGW PLFTKASSPR DAITHTTAAG QCWAFVGSQG FLVIQLSHNI KVTGFTMEHM SRLLAPNGHI ESAPKNFSMW GLKMEHDSKP YLFGEYIYLD NDETLQYFPV AHPTSEKYQM VELKIASNHG NPNYTCLYRI RVHGNL // ID E0VJM7_PEDHC Unreviewed; 981 AA. AC E0VJM7; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 02-NOV-2010, sequence version 1. DT 14-OCT-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEB13583.1}; GN ORFNames=Phum_PHUM248060 {ECO:0000313|EMBL:EEB13583.1}; OS Pediculus humanus subsp. corporis (Body louse). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Paraneoptera; Phthiraptera; Anoplura; OC Pediculidae; Pediculus. OX NCBI_TaxID=121224 {ECO:0000313|Proteomes:UP000009046}; RN [1] {ECO:0000313|Proteomes:UP000009046} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=USDA {ECO:0000313|Proteomes:UP000009046}; RX PubMed=20566863; DOI=10.1073/pnas.1003379107; RA Kirkness E.F., Haas B.J., Sun W., Braig H.R., Perotti M.A., RA Clark J.M., Lee S.H., Robertson H.M., Kennedy R.C., Elhaik E., RA Gerlach D., Kriventseva E.V., Elsik C.G., Graur D., Hill C.A., RA Veenstra J.A., Walenz B., Tubio J.M., Ribeiro J.M., Rozas J., RA Johnston J.S., Reese J.T., Popadic A., Tojo M., Raoult D., Reed D.L., RA Tomoyasu Y., Krause E., Mittapalli O., Margam V.M., Li H.M., RA Meyer J.M., Johnson R.M., Romero-Severson J., Vanzee J.P., RA Alvarez-Ponce D., Vieira F.G., Aguade M., Guirao-Rico S., Anzola J.M., RA Yoon K.S., Strycharz J.P., Unger M.F., Christley S., Lobo N.F., RA Seufferheld M.J., Wang N., Dasch G.A., Struchiner C.J., Madey G., RA Hannick L.I., Bidwell S., Joardar V., Caler E., Shao R., Barker S.C., RA Cameron S., Bruggner R.V., Regier A., Johnson J., Viswanathan L., RA Utterback T.R., Sutton G.G., Lawson D., Waterhouse R.M., Venter J.C., RA Strausberg R.L., Berenbaum M.R., Collins F.H., Zdobnov E.M., RA Pittendrigh B.R.; RT "Genome sequences of the human body louse and its primary endosymbiont RT provide insights into the permanent parasitic lifestyle."; RL Proc. Natl. Acad. Sci. U.S.A. 107:12168-12173(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS235226; EEB13583.1; -; Genomic_DNA. DR RefSeq; XP_002426321.1; XM_002426276.1. DR EnsemblMetazoa; PHUM248060-RA; PHUM248060-PA; PHUM248060. DR GeneID; 8238768; -. DR KEGG; phu:Phum_PHUM248060; -. DR VectorBase; PHUM248060; Pediculus humanus. DR CTD; 8238768; -. DR InParanoid; E0VJM7; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; E0VJM7; -. DR Proteomes; UP000009046; Partially assembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009046}; KW Reference proteome {ECO:0000313|Proteomes:UP000009046}. FT COILED 94 114 {ECO:0000256|SAM:Coils}. FT COILED 706 735 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 981 AA; 111295 MW; 33B4E093131058C1 CRC64; METVSATQSD EYSISINQYE ADSQEVTHEK ISLFTASESS HVTNKIIGKS NNEETLDYIK PDSNNRPLLN EEKFSSSLVS NNEQSVTFSP TDVSEELNNN LVKSQVNIEK FNNTQQISDS YENTENIYNQ DELKLEEHIM ENASTLNQQQ NESGHHIVSS QEEFKRAPPH HEDIPSFSEW TQKQLAEAEK KKTDQNETTD KFPRLRGKFR IKNYASPDCG AKIVAANPES LSSSSVLSSL KDEYMLNYCT NRIWFIIELC EAIQAKQLDL ANFELFSSSP KHFSVFVSHR FPTREWSSVG KFTAQDSRDV QTFNLHPHFF GKYVKVEMHS HYGKEQFCPV SWVGVYGTSE FEVLAKEDER NSLSEDDDGD PYDEELMFHH KKDSPKNLFS SATDAVLSIV KKATAPFMKS DNNQTESNNR VKTESSDDSL CITPRFVVIC NNCTINKNQM VLNVTLHGVQ TLKNLTKKKD NCEDNGPTKS HFQDSFLPVK SQIPDESMEK ASVEIQFTTS LTDETIKSKE NCGMKIEPTK TLNEEEMSIL ISANLKSEKD PEAPRVLDKS KDLLKTEYIN PSLATDTLSE ISASVVEYEK KEEKDKTYSG ENNTEVQKDA EAENNNNSNS NNNNNNNNSN TTNQDSSFDS IISDLNAIEK VESSTSAPFT ASNLPSESIF LRLANRIKNL EVNMSLSSTY LEELSKRYRT QLELISKTLN LTIQKVEERA KLEEEKERKR EKEIMLIRMQ LANLTRDVST FLREQESWKP QASSVSQHIF FILIEILLLW FFFSYWKKPG SRVLQETEGK KLHKSYERRS SLEGVKGHDG PKKKVRRPSD EALDIALSRT YKIDKKRKRK KKDSKCSLNN FNPKRKTSEG DASQMFKERC SLQGEEKHVK NLNNATENKA EIIQNVFPIE SNNVFNKLST ASNVPVISGG SGSGGGGSTG KSFQGITSPP YIKTAASSRD FRLRLGSNNN NNNNNNKESS H // ID E0VN23_PEDHC Unreviewed; 237 AA. AC E0VN23; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 02-NOV-2010, sequence version 1. DT 14-OCT-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEB14789.1}; GN ORFNames=Phum_PHUM327380 {ECO:0000313|EMBL:EEB14789.1}; OS Pediculus humanus subsp. corporis (Body louse). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Paraneoptera; Phthiraptera; Anoplura; OC Pediculidae; Pediculus. OX NCBI_TaxID=121224 {ECO:0000313|Proteomes:UP000009046}; RN [1] {ECO:0000313|Proteomes:UP000009046} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=USDA {ECO:0000313|Proteomes:UP000009046}; RX PubMed=20566863; DOI=10.1073/pnas.1003379107; RA Kirkness E.F., Haas B.J., Sun W., Braig H.R., Perotti M.A., RA Clark J.M., Lee S.H., Robertson H.M., Kennedy R.C., Elhaik E., RA Gerlach D., Kriventseva E.V., Elsik C.G., Graur D., Hill C.A., RA Veenstra J.A., Walenz B., Tubio J.M., Ribeiro J.M., Rozas J., RA Johnston J.S., Reese J.T., Popadic A., Tojo M., Raoult D., Reed D.L., RA Tomoyasu Y., Krause E., Mittapalli O., Margam V.M., Li H.M., RA Meyer J.M., Johnson R.M., Romero-Severson J., Vanzee J.P., RA Alvarez-Ponce D., Vieira F.G., Aguade M., Guirao-Rico S., Anzola J.M., RA Yoon K.S., Strycharz J.P., Unger M.F., Christley S., Lobo N.F., RA Seufferheld M.J., Wang N., Dasch G.A., Struchiner C.J., Madey G., RA Hannick L.I., Bidwell S., Joardar V., Caler E., Shao R., Barker S.C., RA Cameron S., Bruggner R.V., Regier A., Johnson J., Viswanathan L., RA Utterback T.R., Sutton G.G., Lawson D., Waterhouse R.M., Venter J.C., RA Strausberg R.L., Berenbaum M.R., Collins F.H., Zdobnov E.M., RA Pittendrigh B.R.; RT "Genome sequences of the human body louse and its primary endosymbiont RT provide insights into the permanent parasitic lifestyle."; RL Proc. Natl. Acad. Sci. U.S.A. 107:12168-12173(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS235332; EEB14789.1; -; Genomic_DNA. DR RefSeq; XP_002427527.1; XM_002427482.1. DR EnsemblMetazoa; PHUM327380-RA; PHUM327380-PA; PHUM327380. DR GeneID; 8236165; -. DR KEGG; phu:Phum_PHUM327380; -. DR VectorBase; PHUM327380; Pediculus humanus. DR CTD; 8236165; -. DR InParanoid; E0VN23; -. DR KO; K19347; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; E0VN23; -. DR Proteomes; UP000009046; Partially assembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009046}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009046}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 50 70 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 237 AA; 27524 MW; B5A79D0876620F2B CRC64; MKEKKSEFDP VVRLETIDLL KNPKNLKSTF YLKGNKILLK KFNIKFNSRL ILIIIFFIFI TSTFLCHFGA KILSTFDTES YGTVSLKWWN YKLNIPYQFF NNFFHLSKGP NLVLVPWTKS GDCWAFQGSK GRIAIELSEK IKILAVTIDH IRLSENELQS APKKFSVFGV LDKSIDLNSD KIHLGTFQYD VTGSTMQTFL ISSKLKIRFK IIQVQFESNH GNKYYTCIYK IRVHGHS // ID E0VSH7_PEDHC Unreviewed; 223 AA. AC E0VSH7; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 02-NOV-2010, sequence version 1. DT 16-SEP-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EEB16333.1}; GN ORFNames=Phum_PHUM418710 {ECO:0000313|EMBL:EEB16333.1}; OS Pediculus humanus subsp. corporis (Body louse). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Paraneoptera; Phthiraptera; Anoplura; OC Pediculidae; Pediculus. OX NCBI_TaxID=121224 {ECO:0000313|Proteomes:UP000009046}; RN [1] {ECO:0000313|Proteomes:UP000009046} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=USDA {ECO:0000313|Proteomes:UP000009046}; RX PubMed=20566863; DOI=10.1073/pnas.1003379107; RA Kirkness E.F., Haas B.J., Sun W., Braig H.R., Perotti M.A., RA Clark J.M., Lee S.H., Robertson H.M., Kennedy R.C., Elhaik E., RA Gerlach D., Kriventseva E.V., Elsik C.G., Graur D., Hill C.A., RA Veenstra J.A., Walenz B., Tubio J.M., Ribeiro J.M., Rozas J., RA Johnston J.S., Reese J.T., Popadic A., Tojo M., Raoult D., Reed D.L., RA Tomoyasu Y., Krause E., Mittapalli O., Margam V.M., Li H.M., RA Meyer J.M., Johnson R.M., Romero-Severson J., Vanzee J.P., RA Alvarez-Ponce D., Vieira F.G., Aguade M., Guirao-Rico S., Anzola J.M., RA Yoon K.S., Strycharz J.P., Unger M.F., Christley S., Lobo N.F., RA Seufferheld M.J., Wang N., Dasch G.A., Struchiner C.J., Madey G., RA Hannick L.I., Bidwell S., Joardar V., Caler E., Shao R., Barker S.C., RA Cameron S., Bruggner R.V., Regier A., Johnson J., Viswanathan L., RA Utterback T.R., Sutton G.G., Lawson D., Waterhouse R.M., Venter J.C., RA Strausberg R.L., Berenbaum M.R., Collins F.H., Zdobnov E.M., RA Pittendrigh B.R.; RT "Genome sequences of the human body louse and its primary endosymbiont RT provide insights into the permanent parasitic lifestyle."; RL Proc. Natl. Acad. Sci. U.S.A. 107:12168-12173(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS235751; EEB16333.1; -; Genomic_DNA. DR RefSeq; XP_002429071.1; XM_002429026.1. DR EnsemblMetazoa; PHUM418710-RA; PHUM418710-PA; PHUM418710. DR GeneID; 8234450; -. DR KEGG; phu:Phum_PHUM418710; -. DR VectorBase; PHUM418710; Pediculus humanus. DR CTD; 8234450; -. DR InParanoid; E0VSH7; -. DR KO; K19347; -. DR OMA; NEEENQY; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; E0VSH7; -. DR Proteomes; UP000009046; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009046}; KW Reference proteome {ECO:0000313|Proteomes:UP000009046}. SQ SEQUENCE 223 AA; 25737 MW; 1E1DDCF76E843EFF CRC64; MKAEIKTWIQ AEIEKNSKLC SKTNKINPEQ VRTMIKEEMK KQQKVQLKPD FASASAGMVL EATPTWTEGY SYIYFFFIPI RSTAPPPSII LEPTRQAGDC WPMEGQKGKL LIQLAARANI IGFTMEHIDP AMSLSGGTPE APNNFSVFGL EYFNKDGERH FFGTYSYDNK NEEENQYFRV QNVVRKSFLY IQLEISSNHG NDKYTCLYRF QVHGFIDRLH HCQ // ID E1BLD1_BOVIN Unreviewed; 2610 AA. AC E1BLD1; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 2. DT 11-NOV-2015, entry version 38. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSBTAP00000053792}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSBTAP00000053792}; OS Bos taurus (Bovine). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; OC Pecora; Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913 {ECO:0000313|Ensembl:ENSBTAP00000053792, ECO:0000313|Proteomes:UP000009136}; RN [1] {ECO:0000313|Ensembl:ENSBTAP00000053792, ECO:0000313|Proteomes:UP000009136} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000053792, RC ECO:0000313|Proteomes:UP000009136}; RX PubMed=19393038; DOI=10.1186/gb-2009-10-4-r42; RA Zimin A.V., Delcher A.L., Florea L., Kelley D.R., Schatz M.C., RA Puiu D., Hanrahan F., Pertea G., Van Tassell C.P., Sonstegard T.S., RA Marcais G., Roberts M., Subramanian P., Yorke J.A., Salzberg S.L.; RT "A whole-genome assembly of the domestic cow, Bos taurus."; RL Genome Biol. 10:R42.01-R42.10(2009). RN [2] {ECO:0000313|Ensembl:ENSBTAP00000053792} RP IDENTIFICATION. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000053792}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSBTAP00000053792}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DAAA02052668; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002696742.1; XM_002696696.4. DR RefSeq; XP_005222128.1; XM_005222071.2. DR ProteinModelPortal; E1BLD1; -. DR STRING; 9913.ENSBTAP00000053792; -. DR PaxDb; E1BLD1; -. DR Ensembl; ENSBTAT00000061351; ENSBTAP00000053792; ENSBTAG00000032477. DR GeneID; 536240; -. DR KEGG; bta:536240; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; E1BLD1; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR NextBio; 20876915; -. DR Proteomes; UP000009136; Chromosome 21. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009136}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000009136}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289338 MW; 6F297B4625349330 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTISGP SSACKPSRST TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATV LKEDLPGFVF ESNRGTRHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSA RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTGSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAIST LQSSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EEGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID E1BQ86_CHICK Unreviewed; 932 AA. AC E1BQ86; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 34. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000005991}; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000005991, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Ensembl:ENSGALP00000005991, ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000005991, RC ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000005991} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000005991}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000005991}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AADN03006965; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9031.ENSGALP00000005991; -. DR PaxDb; E1BQ86; -. DR Ensembl; ENSGALT00000006001; ENSGALP00000005991; ENSGALG00000003781. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; E1BQ86; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR PRO; PR:E1BQ86; -. DR Proteomes; UP000000539; Chromosome 14. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 356 379 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 399 421 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 433 453 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 582 616 {ECO:0000256|SAM:Coils}. FT COILED 624 644 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 932 AA; 105508 MW; A452C5395D77AF45 CRC64; MDFSRLHTYT PPQCVPENTG YTYALSSSYS SDALDFEIEH KIDPVFDSPR MSRRSLRLAA AGFNKPDSAR NDLLHDSSYA GNVTFRDQSS KMVKQRKSIN KQSGSVRDVP RKNLSSSPIF NQSSFLSRAS DTSMVSTVLD ESSIREQTEV DHFWGLDDDG DPKGSDATLL QRNGDIATAE TQTTMINGYT CSDCSMLSER KEVLTAYSAS SVPSSRIYSR DRSQKHASRG TYFYMSKILR LVRHTAASFA SLLVQLFQMV LLKQSYESKV LKISGSSYST AYSELHYLSC KGYIAAHSDY CGSMNIKEFY REDSHLDVNE ESICDDCKGK KQLEIHTTEH MQSSRAKRVA RTISHIFSYA GYFVLHMLRT VGAAGWLVSQ KVLSLLWLAI LSPGRAASGI FRLLRAGWNQ LLTLLSLLKV FILRKCLPKI SRLLLFLIPL LFLLGIEGLW FWGFDTFIAL LPLLNRTRID KVQSVDDSTY VPDPQPDSSR FVQPPKDTIN IFDSARISEL EKQMAFVSDR CHHHDEEYSK VLLLLHNLQD QVAQMGNRNE ILKLIKNVMD QHLKEYAAFV FPVFKTDFLA LHQEHNLRII TLEDLLRKLS AESKDIQKEL EIAKAKTIRD GDEHNQLLSR VKKLELELSQ VKSELLTGES VKTSCEKVDA QVKESVKMML FGDQHKDFPE SLLQWLTSNF VTRSDLQTLL QDLELQILKN ITVQMAVTNQ RITSEVVTNA VNNAGISGIT EAQAQIIVNN ALKLYSQDKT GMVDFALESG GGSILSTRCS ETYETKTALI SLFGIPLWYF SQSPRVVIQP DMYPGNCWAF KGSEGYLVVR LSMKIYPTAF SLEHIPKTLS PSGNITSAPR KFSVYGLDDE YQEEGTFLGQ YVYDQEGEPL QMFTVEKSEN VFQIVELRIL SNWGHAEYTC LYRFRVHGKP AE // ID E1C040_CHICK Unreviewed; 2570 AA. AC E1C040; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 39. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000016152}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSGALP00000016152}; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000016152, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Ensembl:ENSGALP00000016152, ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000016152, RC ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000016152} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000016152}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000016152}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AADN03004639; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_421227.3; XM_421227.4. DR UniGene; Gga.3240; -. DR STRING; 9031.ENSGALP00000016152; -. DR PaxDb; E1C040; -. DR Ensembl; ENSGALT00000016171; ENSGALP00000016152; ENSGALG00000009946. DR GeneID; 423310; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; E1C040; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000000539; Chromosome 5. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2570 AA; 284962 MW; 182EE39451710EC0 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTASGP SSACKPGRTS TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDEKKKKDA NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDAGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDL FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPSTA SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESAW ELHTNRQCIE GENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQSL LTVLNNNVDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERASGET SLIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRAS FVFVRKLREG QTFVFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DSSALNCHTN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTTL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGIDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DSAASPKPVS STVSGTTQSW SSLVKNNCPD KTTAAAGSSS RKGSSSSVCS VASSSDISLG STKMERRSES VMEQNIVSGT DVHEPIVVLS SADNVPQAEV GSSSSASTST LTADMGNENT ERKLGPDNSI RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TTGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMEEEEY ETKGGRRRTW DDDYVLKRQF SALVPAFDPR PGRTNVQQTT DLEIPPPGTP HSELLEEVEC MPSPRLALTL KVSGLGTTRE VELPLTNFRF TIFYYVQKLL QLSCNGSVKS DKLRRIWEPT YTIMYREMKD SDKEKESGKM GCWSVEHVEQ YLGTDELPKN DLITYLQKNA DSAFLRHWKL TGTNKSIRKN RNCSQLIAAY KDFCEHGSKS GLSQGAISTL QNSDILSLAK EQPQAKAGSG QNSCGVEDVL QLLRILYIVA SDPYTARTSQ EEGDEHPQFN FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSTVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDDFP DDESRQVDIG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RSDRDLHCTE SQSEASTEEG HDSLSVGSLE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LINPHRARFL KEIKDLAIKR RQILSNKSLS EDEKNTKLQE LMLKNPSGSG PPLSIEDLGL NFQFCPSSKV YGFTAVDLKP GGEDETVTMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN RVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID E1C5H2_CHICK Unreviewed; 246 AA. AC E1C5H2; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 30. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000036843}; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000036843, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Ensembl:ENSGALP00000036843, ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000036843, RC ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000036843} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000036843}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000036843}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AADN03002009; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR Ensembl; ENSGALT00000037637; ENSGALP00000036843; ENSGALG00000013105. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; E1C5H2; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000000539; Chromosome 2. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}. FT COILED 1 21 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 246 AA; 27450 MW; 3A686AFBB1BC01EC CRC64; MVSLDEELAK VQRQLHVLQW RARDITERAL HEALRRTELP GFTGEAVQKI INQVLEKLEE SPFQMTNYAS KTSGAAIIRS KTSPSWIGSG RVFWQSLPLM AYMRPPEVIL EPDNHPGNCW PFPGSQGHVF IKLPVAVFPT AVTINHGIPA AAYHADSISS APKDFAVYGL QEEDDEKGTL LGEFIFTPGQ APGQTFQLKN EHSGFIKYVR LQVLSNWGHP DYTCVYQFRL HGDPSHDGDA RGKLSA // ID E1C5H3_CHICK Unreviewed; 246 AA. AC E1C5H3; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 14-OCT-2015, entry version 26. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000036842}; GN Name=LOC776146 {ECO:0000313|Ensembl:ENSGALP00000036842}; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000036842, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Ensembl:ENSGALP00000036842, ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000036842, RC ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000036842} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000036842}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000036842}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AADN03002452; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_001232383.3; XM_001232382.3. DR Ensembl; ENSGALT00000037636; ENSGALP00000036842; ENSGALG00000023084. DR GeneID; 769743; -. DR KEGG; gga:769743; -. DR GeneTree; ENSGT00390000011587; -. DR OMA; NEEENQY; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000000539; Chromosome 2. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}. FT COILED 1 21 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 246 AA; 27404 MW; 6D7C30CE283A1BA4 CRC64; MVSLDEELAK VQRQLHVLQW RARDITERAL HEALRRTELP GFTGEAVQKI IDQVLEKLEE SPFQMTNYAS RTSGAAIVRS KTSPSWIGSG RVFWQSLPLV AYMRPPEVIL EPDNHPGNCW PFPGSQGHVF IKLPVAVFPT AVTINHGVPA AAYHADSISS APKDFAVYGL QAEDDEKGTL LGEFIFTPGQ APGQTFQLKN EHSGFIKYVR LQVLSNWGHR DYTCVYQFRL HGDPAHDGDA RGKLSA // ID E1C6B8_CHICK Unreviewed; 152 AA. AC E1C6B8; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 28. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000029232}; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000029232, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Ensembl:ENSGALP00000029232, ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000029232, RC ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000029232} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000029232}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000029232}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9031.ENSGALP00000035871; -. DR PaxDb; E1C6B8; -. DR Ensembl; ENSGALT00000029878; ENSGALP00000029232; ENSGALG00000028672. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; NIMMHIE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000000539; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}. SQ SEQUENCE 152 AA; 17062 MW; BF44D0881E46EC26 CRC64; MERTSKSYGG NRWLSPFFSS AKPPETILQP DISAGNCWAF QGSRGHVVIR LPEKIWPTAF TIWHISKAVS PSGEVSTAPR EFVVSGVDED EGEALLGSFI YDVDGEIAQT FQVQEEPRKA FRQIKLEVRS NWGNKEYTCL YRADVHGNPE KA // ID E1C801_CHICK Unreviewed; 70 AA. AC E1C801; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 28. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000035871}; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000035871, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Ensembl:ENSGALP00000035871, ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000035871, RC ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000035871} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000035871}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000035871}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AADN03001984; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR PaxDb; E1C801; -. DR Ensembl; ENSGALT00000036660; ENSGALP00000035871; ENSGALG00000029116. DR InParanoid; E1C801; -. DR TreeFam; TF323915; -. DR Proteomes; UP000000539; Chromosome 2. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 20 38 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 70 AA; 7642 MW; D0BF3B38AA296E6E CRC64; MLAQLGRQRA AEERGRRTPV LLLAVLSAGA FLWILTAWKS NVMQVSAGPV HELPCLGNSP VCLELKMKTI // ID E1C967_CHICK Unreviewed; 202 AA. AC E1C967; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 30. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000036286}; DE Flags: Fragment; GN Name=Gga.49934 {ECO:0000313|Ensembl:ENSGALP00000036286}; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000036286, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Ensembl:ENSGALP00000036286, ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000036286, RC ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000036286} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000036286}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000036286}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AADN03002845; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR Ensembl; ENSGALT00000037074; ENSGALP00000036286; ENSGALG00000013110. DR GeneTree; ENSGT00390000011587; -. DR OMA; NINSAPK; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000000539; Chromosome 2. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSGALP00000036286}. SQ SEQUENCE 202 AA; 22495 MW; 3A6F2958141EEDCB CRC64; QAVQKIIHQV LEKLEESPFQ MTNYASKTSG ATIIRSKTSP SWIGSGRVFW QSLPLMAYMR PPEVILEPDN HPGNCWPFPG SQGHVFIKLP VAIFPMAVTI NHGVPAAAYH ADSISSAPKD FAVYGLQEED DEKRTLLGEF TFMPGQAPGQ TFQLKNEHSG FIKYVRLQVL SNWGHPDYTC LYQFRLHGDP AHDGDARGKL SA // ID E1EXQ4_GIAIA Unreviewed; 849 AA. AC E1EXQ4; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EFO65041.1}; GN ORFNames=GLP15_2585 {ECO:0000313|EMBL:EFO65041.1}; OS Giardia intestinalis (strain P15) (Giardia lamblia). OC Eukaryota; Diplomonadida; Hexamitidae; Giardiinae; Giardia. OX NCBI_TaxID=658858 {ECO:0000313|EMBL:EFO65041.1, ECO:0000313|Proteomes:UP000008974}; RN [1] {ECO:0000313|EMBL:EFO65041.1, ECO:0000313|Proteomes:UP000008974} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=P15 {ECO:0000313|EMBL:EFO65041.1, RC ECO:0000313|Proteomes:UP000008974}; RX PubMed=20929575; DOI=10.1186/1471-2164-11-543; RA Jerlstrom-Hultqvist J., Franzen O., Ankarklev J., Xu F., Nohynkova E., RA Andersson J.O., Svard S.G., Andersson B.; RT "Genome analysis and comparative genomics of a Giardia intestinalis RT assemblage E isolate."; RL BMC Genomics 11:543-543(2010). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFO65041.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACVC01000052; EFO65041.1; -; Genomic_DNA. DR ProteinModelPortal; E1EXQ4; -. DR EnsemblProtists; EFO65041; EFO65041; GLP15_2585. DR Proteomes; UP000008974; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008974}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 242 264 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 340 360 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 849 AA; 92785 MW; 0099770724EBFFD9 CRC64; MPPRTRSSGA LVRMNPTRED EAYTKEVLRK TSDGFRAELT SGKSERITQN LRNQDEGTLR LQEYLMAAEV DARPPKKIDT LMDQSDALKL QKSLTALQVA PIQRSRVQEP PKYEITSEDK DSSSSRPQQA SLSETSSTST PPESSPTPSP PLPPPARKSP KDVPKPATPT PLRQSVTPVK KPSPAIQPSQ EDEYTDEYVM IKQRVKRKPR PASKLSTGAR VVTAGEPGML YAYKKAWGCA EYVYASTCGV LALMVLFIFS YLLMRLWTGC AGLGVVGTGV PLVPMHTSDG VAGVSCTGPS LKEIQSLLEL NNKVLLKEWS NARTFTLDNK EIHEIATQVA SKLSDQHQKL ELRIAEIIDK HTVRTTSQNI TTNQLKDLAK IVRDEVTLAT KERISTLSDA IAKFEKEMLS AQSATTNLMR DLLKNNSFNV SSTSAKKASE KIKTAVEKMS DAADAAVTAG SANFTALFTM IENLSSRLET SYRNESITKE LQALQQSILL DIQTQLKDAA AATNQVISDS LASIRTQTES LSTVLIDLAA KSAVNAEVPV NGTAPVMFAS ESLNSMQHAI DKIIVALGEL SDRVGSSVGS TGSYGGSQSS SLTLADLEEV HHKLSQHITQ QAQNVTLAIQ ERMDASERKV AQEVNERVDH LKIGIAGIIK KALLKQTGYS SESTSAVSRG IVDDLRLQYT DFTKQSFGTR IAGKSDDIAN IAESLKTLIS GNERVRLMFN DNMSPGACWP TKKNGHVVLR FKDPVTLYYG TISHPAAPKL STGRTTVPRD LTFIGRTTTG KEIQLGSFTF DVDGPEQQAF QLQENHDLIQ VRVQFANNGG EYTCIYNLGL FGEKDSRKL // ID E1FGX9_LOALO Unreviewed; 2930 AA. AC E1FGX9; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 03-OCT-2012, sequence version 2. DT 11-NOV-2015, entry version 30. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EFO28334.2}; GN ORFNames=LOAG_00153 {ECO:0000313|EMBL:EFO28334.2}; OS Loa loa (Eye worm) (Filaria loa). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Spirurida; OC Filarioidea; Onchocercidae; Loa. OX NCBI_TaxID=7209 {ECO:0000313|EMBL:EFO28334.2, ECO:0000313|Proteomes:UP000007040}; RN [1] {ECO:0000313|Proteomes:UP000007040} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Broad Institute Genome Sequencing Platform; RA Nutman T.B., Fink D.L., Russ C., Young S., Zeng Q., Koehrsen M., RA Alvarado L., Berlin A., Borenstein D., Chapman S.B., Chen Z., RA Engels R., Freedman E., Gellesch M., Goldberg J., Griggs A., Gujja S., RA Heilman E.R., Heiman D., Hepburn T., Howarth C., Jen D., Larson L., RA Lewis B., Mehta T., Park D., Pearson M., Richards J., Roberts A., RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Walk T., RA White J., Yandava C., Haas B., Henn M.R., Nusbaum C., Birren B.; RT "The genome sequence of Loa loa."; RL Submitted (SEP-2010) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains 2 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH712150; EFO28334.2; -; Genomic_DNA. DR EnsemblMetazoa; EFO28334.2; EFO28334.2; LOAG_00153. DR InParanoid; E1FGX9; -. DR OMA; NRQCIEG; -. DR Proteomes; UP000007040; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR InterPro; IPR001878; Znf_CCHC. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 2. DR SMART; SM00119; HECTc; 1. DR SMART; SM00343; ZnF_C2HC; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. DR PROSITE; PS50158; ZF_CCHC; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000007040}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007040}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2930 AA; 324333 MW; 37A3C17DDAB525CD CRC64; MEGVDPDTLL EWLQTGVGEE RDIQLMALEQ LCMLLLMSDN IDQCFESCPP RTFLPALCKI FLDETATENV LEVTARAITY YLDVSNECTR RITQMDGAVK AICNRLAVAE MSDRTSKDLA EQCVKLLEHV CQRETSAVYD AGGLQCMLTL VRQHGQCVHK DTIHSAMSVI TRLCSKMEPN DSAMPECSAS LGALLEHDDP KVSECALRCF AALTDRFIRK FMDPVEMVRH GNLVEHLLNS LVPIHPTSVT CTASLSHMRN TNSVGSIDSV TSSLAFGLNR PASFISVVIS LLSNLCRGSA IVTEQVVSSP VLFCALKTVL TSKDERCIMD ALRFCDLLMV LLCEGRSALP KSMGSTVTSR NEPGPNFDRS HRHLIDAIRQ RDTDALIDAI ESGQVDPNFT DDVGQTLLNW SSAFGTVEMV TYLCDKGADV NKGQRSSSLH YAACFGRSDI VKILLKYGAN PDLRDEEGKT ALDKAHERSE EEHQLVASIL ESPSAYMQRV DEEKIVKAES DVARETSNEK IDPVIVRQLL EQLLPVLCDV YQRSLGASVH RSSLSLLRKA VHHISSESME SMVKEDNGVN ISTDICFSRG QRLAENLVSV LMMGLEHEDD TESHENVLQI VKSLFVKNTD FWLEQLVRVG IFEKVEAIAN QPVTPIKSDA LTTMVIDESG RISSSLSITV VDNMFVKADE VSQDESSGIN SRQHLASNPS KSCSDESTMS STVVSSNKEL NDLTMVNFPV HDDNLHNTQI TPSASADPVT PSLEASEIGS VTPVSSKYNI GIDLSSAVDD QWEIVQGKGY RWKDWRMIKS QDSFFIWCDA VAVEFSDGSN GWFRFMLDGQ LSTMYSSGSP EVGSDNAETR GEFLDKLTKA RNAVPPGSLP HSIFSAPDIN KTIGVGSWIL ASPKIGELTV TNRDGNQQRL IIVEDLPGFI FESNRQTKHC FQAEKTLGLD FVTGWAARGG ERRLRFRAET QKAKLQEMAK EIWDNYLKEA QSKPRDALVE LQKASSTVKE ICHQKNGQLS ARMLSELEAA LKCMHSSVVN DRLLSTFELS ISGIVDALLA FLKVIQRDTD GEIAIIFRRV FVEQRSLSAL VCKMVSILEA VQKFPQYLYD TPGGSSFGLQ LLNRRIKLKL EQFNPNTPCQ MQLLDRSGRT MKVEPLNTVK QLKRYILRMV AKQWYDRGRE TFHFVEEIKE AKKHGAKISF TYTNDFDDKG IIYWLGTNGK TVAEWTNPAN VHVVFVTSSD GERLPYGKHE DILSREALNC HTSDDKNAHF TIDLGIYFYP NTYTLRHARG YGRSALRNWL LQGSHNGRIW DILVVHENDA SLNYPGSTAT WPIVCLEGKG PYRYIRIAQN GKNASNQNHY LSLSGFEIYG DVVDVVVDDF KMLEEKPSIS TKRSKKCITS TKEISVKESI TSKAAFASTN TDVRLPASLS DASVAHLPGG QTNNKVVSGI IGSDVILSAT NTMKNRTFRY RRGSRLLAAN IAGRGVRIAN APDAVACPVG SRVVRGPDWK WNDQGSNLEG TVVSLIDNGW VDVQWDDCTS NSYRFGADGK YDVELKSVEW TPSLRRGFRT VGTRHAAAAL SPTSSGQVPP GFARQLPPVP RGFMDYSSTG TLRKAKPGGA PFSRYHQSVN NNTEMGFDRN MSDSCLHCGS QEHRTKDCPN GLDSSNSQKS KINAEGENNP DAKVITTNQL SSAIAQKSMS TTNLFDGSEA MKRASVASTN QAASAESLQH QTPSLENLLA RSKIFDDRIP EVVSDDMPAV ENQRTTGCLT AVDTDQEGVS TNNYAGLDNS FESSDAIAII EQTHDVNIPM KTSTYVGGSQ PVLLDQHCDS SGEARIIKQR DTCSYPQDSV KSTIQNDLAP VSNSNTISLG EANARNLANL SVSAPNLVIL RQLQQSGTVA NTDQVFMPGD DDLVRNILHD LANDPSRLRS LRRFALAGSG SILINDDDNV IRDSTQRYLV NEEHTSITDE SSGVLTHEVI RKNADMGSEV SDRIDAIIPV TGVENNGHRE AAKGEVSTAG STVFYSTDGK ASLSSCSRTQ FSGDVFDKVD ASIEAVGSNQ PDGSQKASGK PSFETDFRSA SVTQSAACSR RNIQSNGGGF RSRLGSYADV LRTVVMQQII DSGGSLNGLE LEEIEDDIYD DEMQDEGNED EYDEEYASGL SVEALAQAAV ALRKQSGGGS TGSSGELKLN WKQIVIGEAG RLISDRGLRV STSPGDNKQN GGSLSRNWDD EFVLKHQLPA LIPAFDPRPG RTNVNQTQDV ELPKDINESQ SGVAFPSACS VLHPEEEPEL RLYLQGPNLA NINNVTVELD DDDRSIFFYL QQLVQSVEWG QKNERTRRVW EPTYTLIYGD ACDNNELPQT VNITVAEMNG IPENVANTLV VLSNLYQIGA SVMKYEMTTD IFVSEKLTQK LMQELADPLI VSARALPSWC DELVFKYPCL FSVETRNNYF RATAFGTSRS IVWLQTRQDQ LLEQSRGTTS AAAASNLAGT RRDDSYPEFR IGRIKHERIK VPRNDDQLFE YAVRLLEFHA SRKAVLEVEY IDEEGTGLGP TLEFYALMAA EFQRKDLAMW ICDDMDTDQT EELDLGEGMK PPGYYVRRAG GLFPAALPVS STENLRVSKL FRIFGIFLAK VLQDGRLVDI PLSQPFLKMI TNSQLTEKET PDLNGVLNLD DLEEVSPVRG RLLKELTAYV VQKRSIEADH RFDPNTRRRQ IQQLKLNING SECAVEDLSL TFCVNPSSTV FSYKQMELIE NGANIDVTAN NVELYIAACT NFYLISGILN QLKAFREGFD LVFPLRNLRM FVPRELQTLL SGDQCPEWTR EDIISFTEPK LGYTKESPGF LRFVDVLVGM SANERKSFLQ FTTGCSSLPP GGLANLHPRL TIVRKVDSGD GSYPSVNTCV HYLKLPDYSS TEIMRERFLT ATNEKGFHLN // ID E1FI93_LOALO Unreviewed; 874 AA. AC E1FI93; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 03-OCT-2012, sequence version 2. DT 14-OCT-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EFO27874.2}; GN ORFNames=LOAG_00617 {ECO:0000313|EMBL:EFO27874.2}; OS Loa loa (Eye worm) (Filaria loa). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Spirurida; OC Filarioidea; Onchocercidae; Loa. OX NCBI_TaxID=7209 {ECO:0000313|EMBL:EFO27874.2, ECO:0000313|Proteomes:UP000007040}; RN [1] {ECO:0000313|Proteomes:UP000007040} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Broad Institute Genome Sequencing Platform; RA Nutman T.B., Fink D.L., Russ C., Young S., Zeng Q., Koehrsen M., RA Alvarado L., Berlin A., Borenstein D., Chapman S.B., Chen Z., RA Engels R., Freedman E., Gellesch M., Goldberg J., Griggs A., Gujja S., RA Heilman E.R., Heiman D., Hepburn T., Howarth C., Jen D., Larson L., RA Lewis B., Mehta T., Park D., Pearson M., Richards J., Roberts A., RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Walk T., RA White J., Yandava C., Haas B., Henn M.R., Nusbaum C., Birren B.; RT "The genome sequence of Loa loa."; RL Submitted (SEP-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH712076; EFO27874.2; -; Genomic_DNA. DR EnsemblMetazoa; EFO27874.2; EFO27874.2; LOAG_00617. DR InParanoid; E1FI93; -. DR OMA; SHYGREH; -. DR Proteomes; UP000007040; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007040}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007040}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 701 727 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 639 659 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 874 AA; 97828 MW; 7586542F31C833B1 CRC64; MFRNNLHGFF SLYLRFLSVA HTIQFLTSVA IVDTGWMSTY DISPFKKFLS DYRFPICSIY KEKKCDATSN DTRPSVRSPK LNSEAVQSTE SAAKIVKYES TVFLENTEAG NDAIPSVQEV DLTVPFSSEQ PPIVTFDEWT KEKLKKEEIR KAEQQQNHQH KANMHSDAGE TSKMDSSSIP ASKVTEGITR LASSNMISQE ATARNYASKE CGAKVLFSND EAENKNAILN EKEADDYMRN PCERAEHKWL IIELCETIQP TVLEIANFEL FSSGPQNIRI LGSERYPSNE WVALGDFIVG NNREIQRFTV TARSYVKFLR LELLSHYGRE HYCTLSLVRL LGISMVDEYE AEAEAAAVSD TSFSVPVEVQ AINSTPEEKM NIGAVTPTST AEKNESVDDS PLVNAVVNVV GSIGIVDIKG VLESTFLLKR TGASSYNITR RNAAVVELCR KCSLDTVDSY VLFCRAFFGF QYSFIDTGSA SEVKNKQIHI HSSFKNQKKR NRTRVLKFFL TSVLPTYFCD LQSGDFENMQ NSTSHNVAES DVISTISRTR NSPATSNSLN RKSGGGFMIP GGVMSHKESI FLKLNKRISS LELNMSLSSE YLSELSRRYV LQTNESRRHA ELIIKQAEEA AINAIRPSIN ALKIQIDALA ASLRELTETV KGLPQEVTPA HRTMILKHIV SRGGDADASL ARSQAHFYGC FGVWTIGELV CIVAFINVLT LLILLLLHYA STIISRSADG QIMNEQLQNF IQENSDVLST TEAYPEQHLI AEVSSVVCDV VNITEVENVS SPSLALPNET CLLTENEARE ENHLNRPQSV ELLDRPSSQL STHSEQNPWQ TRKRKNKKRR GNRKSENLDE PLYGFRILAN LGIE // ID E1ZER1_CHLVA Unreviewed; 267 AA. AC E1ZER1; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFN55542.1}; GN ORFNames=CHLNCDRAFT_23042 {ECO:0000313|EMBL:EFN55542.1}; OS Chlorella variabilis (Green alga). OC Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Chlorellales; OC Chlorellaceae; Chlorella. OX NCBI_TaxID=554065 {ECO:0000313|Proteomes:UP000008141}; RN [1] {ECO:0000313|EMBL:EFN55542.1, ECO:0000313|Proteomes:UP000008141} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=NC64A {ECO:0000313|EMBL:EFN55542.1, RC ECO:0000313|Proteomes:UP000008141}; RX PubMed=20852019; DOI=10.1105/tpc.110.076406; RA Blanc G., Duncan G., Agarkova I., Borodovsky M., Gurnon J., Kuo A., RA Lindquist E., Lucas S., Pangilinan J., Polle J., Salamov A., Terry A., RA Yamada T., Dunigan D.D., Grigoriev I.V., Claverie J.M., RA Van Etten J.L.; RT "The Chlorella variabilis NC64A genome reveals adaptation to RT photosymbiosis, coevolution with viruses, and cryptic sex."; RL Plant Cell 22:2943-2955(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL433844; EFN55542.1; -; Genomic_DNA. DR RefSeq; XP_005847644.1; XM_005847582.1. DR GeneID; 17355037; -. DR KEGG; cvr:CHLNCDRAFT_23042; -. DR InParanoid; E1ZER1; -. DR KO; K19347; -. DR Proteomes; UP000008141; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008141}; KW Reference proteome {ECO:0000313|Proteomes:UP000008141}. FT COILED 9 29 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 267 AA; 27657 MW; 469928DC874A460F CRC64; MCSTQAGTSA SLEAELAHVQ AELAQLRLAA ASSATAAAAA ADAAPAACEP AGAAGRQLLQ QVVREEVSEA LELFAADRTG LPDYALAAGG AEVVAHSPAH LPSSASLLAT LPRLHSGVVH PQASKLLLAP ISQPGLCLPL VGAAGWVDVR LTRRIHPTAF TYEHIPAAIA FDIRSAPRNL VLTGFLGPPP PHPGPAANSS AEGAEALGVP LGSFSYDAHS RHPVQTFQLG PEWRQVEVDH VRLAFAANNG HPGYTCLYRF RMHGTPA // ID E1ZW84_CAMFO Unreviewed; 1366 AA. AC E1ZW84; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Protein C1orf9-like protein {ECO:0000313|EMBL:EFN74593.1}; GN ORFNames=EAG_11116 {ECO:0000313|EMBL:EFN74593.1}; OS Camponotus floridanus (Florida carpenter ant). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. OX NCBI_TaxID=104421 {ECO:0000313|Proteomes:UP000000311}; RN [1] {ECO:0000313|EMBL:EFN74593.1, ECO:0000313|Proteomes:UP000000311} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20798317; DOI=10.1126/science.1192428; RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G., RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D., RA Wang J., Liebig J.; RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos RT saltator."; RL Science 329:1068-1071(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL434776; EFN74593.1; -; Genomic_DNA. DR InParanoid; E1ZW84; -. DR Proteomes; UP000000311; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000311}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000311}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 71 92 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 947 975 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1366 AA; 152156 MW; 0B25D8D454A132C0 CRC64; MVNKPERRIA TIRRGKTMVK SWCHITSWCE VTCVPEPKVM AKSIVKNSDG GCPLHWVDNP TGQPRHLLRV VVFYLTILAI LWCMPTCLYH RISETGQAVV LTLVDTAAAE LQNLAEPKID RDFSSRSSIK NSSLLNDNQR QTEHIKEIVV PTTQIQPPLM VDETTTTTER PNDTEALEDN EEVLLLKKIS TEEAPEVVVI VRAEQKVNTD ELESRVEEEL GQVQVPEELS SKVDETFTTA PELNDTAARA QLIGGSRDET AAVILDGLVS SGPSDPHEDI PSFSEWAQKR LEEAEKKKTH PNASVQTPGG PGRGVSGMKI RSKNYASPDC GAKIVAANPE ANSAKNVLVS TRDEYMLNAC TSRVWFVVEL CEAIQAKKIE LANFELFSSS PKDFSVYVSD RFPTKDWSPV GQFTAKDVKD IQSFALHPHF FGKFIKVELQ SHYGSEHFCP VSLFRAYGTS EFEVLETETE NQILQERNTE DDDDEDSDEE ELLNSEGGDP PRNLFGSARD AVISLMKKAA EVLVKSSDLT GNNITEIQQS IDDGNILNNS YTSCTTPRYT ILCGNCTDQK FASVFQLVSC RDRQLNDLLK IDLVNRTLRR GRLCSFHGVE IESSWQEKKE DINYNDTTRF NLAEDLQAIF LASVFKPEYI VALCNVLATR DRKVVMNTSY EIPVNNSENA AGENILSTKT DNSDATSHHQ ASVTCTSDSN SPACKSATFP GKELLGQDIK NEETEDINFA SKIKTSASMD SLASQIKPTK TLSKEDMKKE SSVPILEPSK ELTEETLQPE VLTTVPPPSD PTSTLKIVEE PIVGTAVEMT SKIINSPSTN SFDNQKTGDT LVPDTESGEA IVQIKTNKSE NGEQDGKQMK DLNEQEVRLS SQDHLTLDTL FSDLEGDTVN IQNGASSSSS ITQPTASAAP QKESVFLRLS NRIKILERNM SLSGQYLEEL SRRYKKQVEE MQRSLERAVA AMGEESRKGE EREAKRVEEI ATLKEEIAIL SKSVEILLYD RDSWRSRISA IVQHALLICL EVIVIIVILS YCRRGDFEEE EKLQSNTKKE NTRRKSAENF SSHTATKKTK KRRPSEIASH ISGTYHELMI DDQPHETRKE RKKKRKKEIV SAGIKTGINV DAKQEIIRHK SVLNVIPDGT TLSSRRASSI EPSHSKDSQN LIDKRPESAP EAVVGWFDDQ IDKRERIAQP ALPKNKVFKI ESDSELVRQD DELSESNSSS MTGHRSVEGL ATIESKIGGS KNGSFRTSGI LKSAKLSSPS FMKTALSTRN KRKFSSNSIE KWEWSQDLEY SNDRSFPFSS AGLKTLSQTI DGNINGRTAN GLIEESDESG SSNVTPISSK KEKRSAGLKK MVRKFF // ID E2AJ06_CAMFO Unreviewed; 853 AA. AC E2AJ06; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Protein unc-84-like protein A {ECO:0000313|EMBL:EFN66595.1}; GN ORFNames=EAG_15593 {ECO:0000313|EMBL:EFN66595.1}; OS Camponotus floridanus (Florida carpenter ant). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. OX NCBI_TaxID=104421 {ECO:0000313|Proteomes:UP000000311}; RN [1] {ECO:0000313|EMBL:EFN66595.1, ECO:0000313|Proteomes:UP000000311} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20798317; DOI=10.1126/science.1192428; RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G., RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D., RA Wang J., Liebig J.; RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos RT saltator."; RL Science 329:1068-1071(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL439921; EFN66595.1; -; Genomic_DNA. DR RefSeq; XP_011258938.1; XM_011260636.1. DR RefSeq; XP_011258939.1; XM_011260637.1. DR RefSeq; XP_011258940.1; XM_011260638.1. DR RefSeq; XP_011258941.1; XM_011260639.1. DR GeneID; 105252944; -. DR InParanoid; E2AJ06; -. DR Proteomes; UP000000311; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000311}; KW Reference proteome {ECO:0000313|Proteomes:UP000000311}. FT COILED 549 569 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 853 AA; 99425 MW; EC7BB00172998B91 CRC64; MENEQHHYEL RSRSRSRSHT PLIRSLAEPE VTEHHYDLRS KSRERSHTPG EVTSSRRSGS RSLTGSGSKV LDKSMEMIEE NKEGSVTESV TDNQSTKSEG SGVTIRKAER RSERQRAKRQ IFVNGQSESK ENVSNEKSEQ KRKSVTPHRI LTSDYSSEEG EREDPPSRPG SAYEIYKQAG EWWNVFPKTD YTYSQTSQCR YEIAPGVLAM PNMSRRSIHA DESIIPQVSY EKLSQTSQGT TESGISDMDT VDFKQKSSLS YPFDNDHDMS YVPNSSAKTT LYKKTHIEQY KSHKEVIYSQ SPGYSSDFST RYLSWSNTPS RFIGKYNAAD SDTELDETLT SSSVQTNQRW KIVQWLTYFT TFITIWFKKT VEFFKFKKRQ HYYDAQTYRS HNESKWRALW QALDHYTHNT YFFFVRMLVL DTWLLSQFTG VRKWLQEKNS RILWITLLPL LLLFGGWCLV QSFSLLSDIK TVPQTMTEKL LIDLQQDDKS RNVEEKSINK EIHTNDLVDK ANLDSRIKIL ENKQTNQMEY LINITRVLED QKQKDVDFQK EYNDKIINLE NKLDVELNNV MYNELKVIKY EFEKLRELYS ELKSCCNTNA EFITNHDLEK HVERILFSYL PGFSKEDLVK ISQNLLTSYN REDQTASNND VGNANVRVSD EQIRKIAKEV LQIYDADKTG QVDYALESAG GQIVSTRCTQ RYDIKSRAFT LFGFTLYYES NNPRTVIQRN PIQPGVCWAF QDFPGYLLVQ LRSAIYVTGF TVEHVSKLIL SSENMSSAPR KFNVWGLTNE NDPEPVMFGD YEFTYSDDNL QYFPVQNAAI NRPYEYVELR IHSNHGQLDY TCLYRFRVHG RPA // ID E2B1C7_CAMFO Unreviewed; 2551 AA. AC E2B1C7; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:EFN60527.1}; GN ORFNames=EAG_12486 {ECO:0000313|EMBL:EFN60527.1}; OS Camponotus floridanus (Florida carpenter ant). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. OX NCBI_TaxID=104421 {ECO:0000313|Proteomes:UP000000311}; RN [1] {ECO:0000313|EMBL:EFN60527.1, ECO:0000313|Proteomes:UP000000311} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20798317; DOI=10.1126/science.1192428; RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G., RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D., RA Wang J., Liebig J.; RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos RT saltator."; RL Science 329:1068-1071(2010). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL444841; EFN60527.1; -; Genomic_DNA. DR InParanoid; E2B1C7; -. DR Proteomes; UP000000311; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000311}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:EFN60527.1}; KW Reference proteome {ECO:0000313|Proteomes:UP000000311}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 884 904 {ECO:0000256|SAM:Coils}. FT COILED 1746 1768 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2551 AA; 280208 MW; 77AB7913F9B9D97C CRC64; MADVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFECCPP RTFLPALCRI FLDELVPDSV LEVTARAITY YLDVSAECTR RVVAMEGAVK AICSRLSGAG LGSRASRDLA EQCIKVLELV CAREAGAVFE AGGLPCALCF IREHGARVHR DTLHSAMAVV TRLCGKVEPQ DKALPDCVEA LSVLLRHEDA HVADGALRCF ASLADRFSRR GTDPAPLASN GLVSELLYRL SNAAGPGTST TATCSNPKTP PPSSTITTVP APEPKSCASV STIISLLSTL CRGSPSITHD LLRSELPDAI EKALKGDERC ALDSMRLIDL LLVLLFEGRL ALGRSTTGGP SGPLLPRLRR LDSAGEKSHR QLIDCIRSKD TDALIEAIDT GGIEVNFMDD VGQTLLNWAS AFGTQEMVEF LCDRGADVNK GQRSSSLHYA ACFGRPAIAK VLLRHGANPD LRDEDGKTPL DKARERVDEG HREVAAILQS PGEWMLPPSQ EHRKLETEIE EFTEPKGDPE MAPVYLKRLL PVFCATFQST MLPSVRKASL SLIRKMVHYI QPELLIETCR SDRTGGCGAM LVEVIANVLD NEEDEDGHLV VLQMIQDLMI KGKDEFLEHF ARLGVFSKVA ALAGPQETTP EPETESNQSG DDQRMEDAKE LLIGRAYHWR DWCICRGRDC LYIWSDAAAL ELSNGSNGWF RFILDGKLAT MYSSGSPEGG TDTSENRGEF LEKLQRARSQ LKVNFVSQPV LSRPGTTRLV VGNWALSSRK ESELCIHNSD GQQQATILRE DLPGFIFESN RGTKHSFTAE TSLGPEFAAG WTGKRGKRLR SKIEAIKQKV KVQAQEIYER YFKVAQAQPR GVVAKLGTIV SQIEKASQKQ QSGNREWRNV LQSALEQLKI LLNEEGRVSA YELHSSGLVQ ALLVLLAAPP GPSPPTLRAT KLRMQRITVF NSCFQTKDKD KEHNSAKILV HKLVSVLESI EKLPVYLYDT PGSGYGLQIL TRRLRFRLEK ASGESALIDR SGRSLKMEPL STIQQLENHL LKMVAKQWHD HDRSTFTFVK KLKEENKIIF KYQHDFDENG LLYWIGTNAK TCPEWVNPGQ YGLVVITSSD GRNLPYGHLE DILSRDPSAL NCHTNDDKRA WFSIDLGVWI IPNAYTLRHA RGYGRSALRN WMFQASKDGV TWTTLYAHVD DCSLNEPGST ATWTLDPPSE ETQGWRHLRL QQIGKNASGQ THYLSVSGFE VYGEVTGVCE DLGRAAKEAE AGVRKQRRFI KTQVLKHLVA GVRVARGLDW KWRDQDGVPP GEGTVTGELH NGWIDVTWDH GGSNSYRMGA EGKYDLRLVG TSLETDSSVK CKNGGGVLTG RKSNSTPSLP DCTDTAMRSS VASTDQAASA DNLAAKQAAE SIAESVLSVA RAEAVVAVTG ESGANSTGEL SVVLHPRPDT AVTSDLATIV ESLTLNTDCP VNSTSNRASS SKPLFATIRG NKASGGLLSL ETAEVLDRMR EGADRLRNNT NSFLSGELLG LVPVRISVSG ESDENSLRIK SVPRHHPTGI ADVAKDCITR EKEASSSTQN TTGNCPVVVT NPMSVSVPNL ACSDANNTLE STAATGLLET FAAMARRRTL GPTGGQHLAS NSSTSSNPRG PNSVSSLVRL ALSPNFPGGL LSTAQSYPSL TSSGQVAGSG VTTTTGPGLG QALTMSLTST SSDSEQVSLE DFLESCGGVA TSSAGGGRTT GGPTLLTELE DDEDGVLEEE EDNEENDQEE EDEENEEEGD GCEGEYEEVM VSRNLLAAFM EEEAPQSSKR RAWDDEFVLK RQFSALIPAF DPRPGRTNIN QASNTTDLEV PPPGSEAQIN SRIGTLPMPR LSLSLKGPGF PGIPDIEIPL SDPHASIFQA VQELMQLTEL GSRQEKLKRI WEPTYTIIYK ETRDEESSGR ATPIVTLYSR NLTQSTNACT MEDVLQLLRH VFVLSTSRDD GISLEQEDLN DTTYWLHPDD FTSKKITNKI VQQIQDPLAL AAGALPNWCE ELARSCPFLL PFETRRLYFS CTAFGASRSI VWLQTQRDAI LERQRAPGLS PRRDDSHEFR VGRLKHERVS VPRGEKLLDW AEQVLKIHAS RKSILEVAFV GEEGTGLGPT LEFFALVAAE LQRKDLGLWL CDDAADDNGM QILSEEQTCV PGEKIRPAGY YVTRASGLFP APLPQDSAAC DRAVRYFWFL GVFLAKVLQD NRLVDLPLSR PFLKLMCRGD ISNNVNEKIG LTGITQDSMS SSMSSSFISE EGETDVAYSS LEPCPWYAGL LDIEDLAEVD PVRGEFLKEI QTAIAKRDRT FSDGVNSTDE EMSLYITHPS GTSVAIEDLT LTMTYSLSSK VFQHDHVELV KGGSDIAVTI ENAREYTNLT INYCLNQGIY RQLEAFKLGF SKVFPMEKLH VFSPEEMRAM LCGEQNPQWT REDLLNYTEP KLGYTKESPG FQRFVNVLLS LTGSERKAFL QFATGCSALP PGGLCNLHPR LTVVRKVDAG SGGYPSVNTC VHYLKLPEYP TEEILKERLL AATRERGFHL N // ID E2B1L3_CAMFO Unreviewed; 219 AA. AC E2B1L3; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Protein unc-84-like protein B {ECO:0000313|EMBL:EFN60369.1}; GN ORFNames=EAG_02081 {ECO:0000313|EMBL:EFN60369.1}; OS Camponotus floridanus (Florida carpenter ant). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Formicinae; Camponotus. OX NCBI_TaxID=104421 {ECO:0000313|Proteomes:UP000000311}; RN [1] {ECO:0000313|EMBL:EFN60369.1, ECO:0000313|Proteomes:UP000000311} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20798317; DOI=10.1126/science.1192428; RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G., RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D., RA Wang J., Liebig J.; RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos RT saltator."; RL Science 329:1068-1071(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL444953; EFN60369.1; -; Genomic_DNA. DR InParanoid; E2B1L3; -. DR Proteomes; UP000000311; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000311}; KW Reference proteome {ECO:0000313|Proteomes:UP000000311}. SQ SEQUENCE 219 AA; 24301 MW; CA5604ABA57254DE CRC64; MRNELKSKLK EVGNVIPKMS EAILNLRKEV SEGKEEMNLH TKNLLKALSL ETVKDMVRNE LQTYDADKTG RTDYALESSG GTILSTRDTE PYSTGAPVLN LFGIPLCQQQ NTPRAVIQTG VLPGECWAFK GSRGSVVIRL LGQVNVSGIS LEHISPLISP TGETATAPRD FSIWGLTDLE DEKPFSFGTF TYDNTGSPLQ YFEIQVFENY DFFKVMANP // ID E2B589_HARSA Unreviewed; 796 AA. AC E2B589; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Tetratricopeptide repeat protein 39C {ECO:0000313|EMBL:EFN89142.1}; GN ORFNames=EAI_09054 {ECO:0000313|EMBL:EFN89142.1}; OS Harpegnathos saltator (Jerdon's jumping ant). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Ponerinae; Ponerini; Harpegnathos. OX NCBI_TaxID=610380 {ECO:0000313|Proteomes:UP000008237}; RN [1] {ECO:0000313|EMBL:EFN89142.1, ECO:0000313|Proteomes:UP000008237} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=R22 G/1 {ECO:0000313|EMBL:EFN89142.1, RC ECO:0000313|Proteomes:UP000008237}; RX PubMed=20798317; DOI=10.1126/science.1192428; RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G., RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D., RA Wang J., Liebig J.; RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos RT saltator."; RL Science 329:1068-1071(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL445739; EFN89142.1; -; Genomic_DNA. DR InParanoid; E2B589; -. DR Proteomes; UP000008237; Unassembled WGS sequence. DR Gene3D; 1.25.40.10; -; 2. DR InterPro; IPR019412; OMP_Iml2/TPR_39. DR InterPro; IPR012919; SUN_dom. DR InterPro; IPR011990; TPR-like_helical_dom. DR Pfam; PF10300; DUF3808; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008237}; KW Reference proteome {ECO:0000313|Proteomes:UP000008237}. SQ SEQUENCE 796 AA; 89222 MW; 1C0B0BF4228D0CCB CRC64; MATGEDCPSK DWNIARTGIS LLLNNKTEEA EALFTGHPHS FHVKAGRCFV LFMNALMTYE HDKLQQAMLL LKEMERECAG DIGWLKSVKS KVFKAEETGK DYVNKLERQI VLADSQVCSA ILTLLQQELT GYVRGGWMLR KAWRVYQHAY TQISQLYHRT FGTNPTGLSP CCGTPMSNGS SLQSPQSPGS SEWSIPSCNG YANNVTPVSS PSGLRSSLSM FFSLTGITSE QQTPFVEPTE VSRLMSAVSF GYGIYQLCVS LLPPSLLKVI HFLGFEGDRQ AGLTALMYAR LSEDMRAPLA TLSLLWYHTI VRPFFALDGS NVKAGVAVAK QLIAECHTEF HDSALFLFFA GRVERLESNV NGALEAYGKA VEASTQREIR LLCLHEVAWC HLIRLSYEEA YRSLLQLQHQ SRWSKSFYAY LAMVCCGANG KFDVLLTSYD KILHWFHEMN RETQLGAFIL HRVPKLINKD TGSPYTILYY RLLVYEMLYL WNAMPSCSVE SLRGILLECK GSRAEEPMVG LADLVEGAAC SYLGDTEAAI KCYRNCLKRR YPSKDEYDQH VSAFALYELG SSLCNNNNSD EGRGFLMKAQ SQYRDYDFES RLNVRIHSAL KNLEVKNMME IRDELKNKLK EVGSVIPKMS EAIIRLRNEV SEEMILHTRN LLKALSSETV KNMVKSELET YDADKTGRTD YALENSGGEI LSTRDTELYS TGASVLNFFG TPRCQQQNTP RAVIQTGVLP GECWAFKGSS GSIVIRLLGY VHVSGISLEH ISSLISPTGE TDTAPRDFSV WARDNV // ID E2BD04_HARSA Unreviewed; 398 AA. AC E2BD04; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Protein unc-84-like protein A {ECO:0000313|EMBL:EFN86429.1}; GN ORFNames=EAI_05267 {ECO:0000313|EMBL:EFN86429.1}; OS Harpegnathos saltator (Jerdon's jumping ant). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Ponerinae; Ponerini; Harpegnathos. OX NCBI_TaxID=610380 {ECO:0000313|Proteomes:UP000008237}; RN [1] {ECO:0000313|EMBL:EFN86429.1, ECO:0000313|Proteomes:UP000008237} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=R22 G/1 {ECO:0000313|EMBL:EFN86429.1, RC ECO:0000313|Proteomes:UP000008237}; RX PubMed=20798317; DOI=10.1126/science.1192428; RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G., RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D., RA Wang J., Liebig J.; RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos RT saltator."; RL Science 329:1068-1071(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL447495; EFN86429.1; -; Genomic_DNA. DR InParanoid; E2BD04; -. DR Proteomes; UP000008237; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008237}; KW Reference proteome {ECO:0000313|Proteomes:UP000008237}. FT COILED 398 398 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 398 AA; 45808 MW; 5AFEB6361CD7FEAF CRC64; MKCIPGCWCM AEYLSLFPAI TNVPETTTEK VLPNIQWKNR ESRKVGEKLV DNDGTLIDRV ATLEKEQIYQ MEYLMNVTQI LEDYKRSDVD FQKKYNDKIT HVANKLDISE LSNVLNNELK IIRHEFEKLH NLYAELKSCC NANKEVIANV DTEEHVEKIL SGYFPPGTSK EDLAKNIQRI LASRGGADRA VLSNNADDTS VSISEEHVRK IVKEILRIYD ADKTGQVDYA LESAGGQIIS TRCTQRYDIK SRAFSLFGFT LYYESNNPRT VIQGNALQPG ACWAFQDFPG YLLIQLRCVI YVTGFTLEHV SSLILPNENM SSAPKKFDVW GLTDENDPEP MRFGDYEFTY SEDNLQYFPV QNTEIKRPYE FIELRIHSNH GQLDYTCLYR FRVHGRPA // ID E2BQP7_HARSA Unreviewed; 1328 AA. AC E2BQP7; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Protein C1orf9-like protein {ECO:0000313|EMBL:EFN81966.1}; GN ORFNames=EAI_00994 {ECO:0000313|EMBL:EFN81966.1}; OS Harpegnathos saltator (Jerdon's jumping ant). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Ponerinae; Ponerini; Harpegnathos. OX NCBI_TaxID=610380 {ECO:0000313|Proteomes:UP000008237}; RN [1] {ECO:0000313|EMBL:EFN81966.1, ECO:0000313|Proteomes:UP000008237} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=R22 G/1 {ECO:0000313|EMBL:EFN81966.1, RC ECO:0000313|Proteomes:UP000008237}; RX PubMed=20798317; DOI=10.1126/science.1192428; RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G., RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D., RA Wang J., Liebig J.; RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos RT saltator."; RL Science 329:1068-1071(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL449798; EFN81966.1; -; Genomic_DNA. DR RefSeq; XP_011143430.1; XM_011145128.1. DR GeneID; 105185540; -. DR InParanoid; E2BQP7; -. DR Proteomes; UP000008237; Unassembled WGS sequence. DR GO; GO:0003723; F:RNA binding; IEA:InterPro. DR GO; GO:0003968; F:RNA-directed RNA polymerase activity; IEA:InterPro. DR GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR002166; RNA_pol_HCV. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00998; RdRP_3; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008237}; KW Reference proteome {ECO:0000313|Proteomes:UP000008237}. FT COILED 910 930 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1328 AA; 146514 MW; 80AD130123659DA6 CRC64; MAKGIVKNNA GCPLHWVDDS AGQPRHLFQA AALYLTVLAA LWCIPTCLYH RISETGQAVV LTLVDTAAAE LQSLAEPKIN RELSSSPSAK NSTLLSGQPE LKEQLPTTQI PPPPTVEDTT ERPNDTEVEH EDEVLLLKKI SDEPPEVVVI VRAEQKINAN DVELREDDEI GHVILEEPVP KVDDTFTTAP ELNDTAARAQ LVGGSRDETA AVILDGLVSS GPTDPHEDIP SFSEWAQKRL EEAEKKKTHP NASVQTPGGP GRGVSGMKVR SKNYASPDCG AKIVAVNPEA RGAKNVLVST RDEYMLNACT SRVWFVVELC EAIQAKKIEL ANFELFSSSP KDFSVYVSDR FPTRDWSPVG QFTAKDVKDV QSFALHPHFF GKFIKVELQS HYGSEHFCPV SLFRAYGTSE FEVLETETED QILQEANANE DDDDDSDEEE PLDTEGGDPS RNLFGSARDA VLSIVKKAAE VLVKSSDLSG NNITEIQHSI DGGNILDNSY TSCMTPRYTI LCGNCTDQKF ASVFQLVSCR DRHLDGLLKI DLVNRTLRRG KLCGLHGVEI DSFCQKQEEK FAGDDRGKRD DATHFDLAQD LQTSFLASVF KPEYIVALCN VLATKERKVV MNTSYEISAN ASEDVVKEDI HLSTKDTDQS NEGISHHQVS ATCTLDANSA VCKTAPSRDN RRRAAQDINE ETENISMTTI ESTISTESLA SQIKPTKTLS KEDLRKESSV PILEPSKEPT EETLQAEALT TVAPSLNLPT PTLKIAEDLP PITGTSVEIV SQTVTNIPPI NVDGQETGDI PVADTDSAEM GAQVKTERSE NGEQDGKQPK DLSEQEVKLS SQDHLTLDTL FSDLKDFEGD AANMQNGASS ASSMTQPTAS TAPQKESVFL RLSNRIKALE RNMSLSGQYL EELSRRYKKQ VEEMQRSFER AVSAMSEESR KGEEREAKRV EEIAVLREEI AVLSKSVQTL LHDQDSWQSR ISAVGQHALL IWFEVVVITL FLSYCRRTSD LDEDEKVQSK DAKTEGTRRK SAENFGSHDA AKKTKKRRSS EIASHINSTY HELMIDERCH ETKKERKKKR KREAVAAGAR LTANADTRPE ATRQKSVLNV MPGGTTLPSR RASSIDPPRS KESQEASGKR PESAPETSSS WFDDQMESII ERTAQPAMRL RMEPNRNGSS ELSHHGDEFS ESNSSSGAVD RSLEGSVTSE RRIDVPKNSS FRTGGILKSA KLSSPSFMKT ALGTRSKRRL LSNRERWDWN QDSEHSNNIS APSTPTGPSP RINDSANGSA AHGLIEESDE SRSSSATRTS SKKEKKSTGL RKMVRKFF // ID E2BZD0_HARSA Unreviewed; 2600 AA. AC E2BZD0; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:EFN78940.1}; GN ORFNames=EAI_12212 {ECO:0000313|EMBL:EFN78940.1}; OS Harpegnathos saltator (Jerdon's jumping ant). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Ponerinae; Ponerini; Harpegnathos. OX NCBI_TaxID=610380 {ECO:0000313|Proteomes:UP000008237}; RN [1] {ECO:0000313|EMBL:EFN78940.1, ECO:0000313|Proteomes:UP000008237} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=R22 G/1 {ECO:0000313|EMBL:EFN78940.1, RC ECO:0000313|Proteomes:UP000008237}; RX PubMed=20798317; DOI=10.1126/science.1192428; RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G., RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D., RA Wang J., Liebig J.; RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos RT saltator."; RL Science 329:1068-1071(2010). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL451589; EFN78940.1; -; Genomic_DNA. DR InParanoid; E2BZD0; -. DR Proteomes; UP000008237; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008237}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:EFN78940.1}; KW Reference proteome {ECO:0000313|Proteomes:UP000008237}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 929 949 {ECO:0000256|SAM:Coils}. FT COILED 1780 1812 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2600 AA; 284980 MW; B9FDF5AD4D69F504 CRC64; MADVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFEWIFL DELVPDSVLE VTARAITYYL DVSAECTRRV VAMEGAVKAI CSRLSGAGLG SRASRDLAEQ CIKVLELVCA REAGAVFEAG GLPCALCFIR EHGARVHRDT LHSAMAVVTR LCGKVEPQDK ALPDCVEALS VLLRHEDAHV ADGALRCFAS LADRFSRRGT DPAPLASNGL VSELLYRLSN AAGPGTSATA TCSNPKTPPP SSTATTVPAP EPKSCASVST IISLLSTLCR GSPSITHDLL RSELPDAIEK ALKGDERCAL DSMRLVDLLL VLLFEGRSAL GRSTTGGPSG PLLPRLRRLD SAGEKSHRQL IDCIRSKDTD ALIEAIDSGG IEVNFMDDVG QTLLNWASAF GTQEMVEFLC DRGADVNKGQ RSSSLHYAAC FGRPAIAKVL LRHGANPDLR DEDGKTPLDK ARERVDEGHR EVAAILQSPG EWMLPPNQEH RKLEAEVEEF TEPKGDPEMA PVYLKRLLPV FCATFQSTML PSVRKASLSL IRKMVHYIQP ELLIETCGSD STGGCGAMLV EVIANVLDNE ELEIKAPLPP STAFKAFIGP KLPGLTTRKS SSSRNYILDK TEDEDGHLVV LQMIQDLMIK GKDEFLEHFA RLGVFSKVAA LAGPQETTPE PEAESNQSGE EQRMEDAKEL LVGRAYHWRD WCICRGRDCL YIWSDAAALE LSNGSNGWFR FILDGKLATM YSSGSPEGGT DTSENRGEFL EKLQRARSQL KVNFVSQPVL SRPGTTRLVV GNWALSSRKE SELCIHNSDG QQQATILRED LPGFIFESNR GTKHSFTAET SLVNKSLELI ESRDVYIIGP EFAAGWTGKR GKRLRSKIEA IKQKVKVQAQ EIYERYFKAA QAQPRGVVAK LGAIVSQIEK ATQKQQSGNR EWRNILQTAL EQLKVLLNEE GRVSAYELHS SGLVQALLAL LAAPPGPSPP TLRATKLRMQ RITVFKSCFQ TKDMNKEHNS AKILVHKLVS VLESIEKLPV YLYDTPGSGY GLQILTRRLR FRLEKASGES ALIDRSGRSL KMEPLSTIQQ LENHLLKMVA KQWHDHDRST FTFVKKLKEE SRIMFKYQHD FDENGVLYWI GTNAKTCSEW VNPGQYGLVV VTSSDGRNLP YGHLEDILSR DPSALNCHTN DDKRAWFSID LGVWIIPSAY TLRHARGYGR SALRNWMFQA SKDGVTWITL YAHVDDCSLN EPGSTATWTL EPPSEETQGW RHLRLQQIGK NASGQTHYLS VSGFEVYGEV TGVCEDLGRA AKEAEAGVRK QRRFIKTQVL KHLVAGVRVA RGLDWKWRDQ DGVPPGEGTV TGELHNGWID VTWDHGGSNS YRMGAEGKYD LRLVGTTLET DSAAKCKSSG GVLTGRKSSS TPSLPDCTDT AMRGSVASTD QAASADNLAA KQAAESIAES VLSVARAEAV VAVTGEGGAN STSELSVVLH PRPDTTVTSD LATIVESLAL NTDCPANSTS NRASSSKPLF ATLRGNKASG GLLSLEAAEV LDRMREGADR LRNNTNSFLS GELLGLVPVR ISVSGESDEN SLRIKSVSRH HPTGITDVAK DCTREKEASS STQNTTGGCP VVVTNPMSVS VPNLACSDAN NTLEPTAATG LLETFAAMAR RRTLGPTGGQ HLASNSNTSS NSRGPNSVSS LVRLALSPNF PGGLLSTAQS YPSLTSSGQV AGSGVTTTTG PGLGQALTMS LTSTSSDSEQ VSLEDFLESC GGVATSSVGG GRTTGAPTLL AELEDDEDGV LEEEEDNEEN DQEEEDEENE EEGDGCEGEY EEVMVSRNLL AAFMEDETPQ SSKRRAWDDE FVLKRQFSAL IPAFDPRPGR TNINQTTDLE VPPPGSETQI NSRVGSLPMP RLSLSLKGPG FPGVADVEIP LSDPHASIFQ AVQELMQLTE LGSRQEKLRR IWEPTYTIIY KEARDEESSG RATPIVTLYS RNATQNANAC TVEDVLQLLR HVFVLSTVRD DGRPAALEQE ESDDTACWIH PDDFTSKKIT NKIVQQIQDP LALAAGALPN WCEELARSCP FLLPFETRRL YFSCTAFGAS RSIVWLQTQR DAILERQRAP GLSPRRDDSH EFRVGRLKHE RVSVPRGDKL LDWAEQVLKV HASRKSILEV AFVGEEGTGL GPTLEFFALV AAELQRRDLG LWLCDDANDA EGDDENGVRI LSEEQCVTTE EKIRPAGYYV TRASGLFPAP LPQDSAACDR AVRYFWFLGV FLAKVLQDNR LVDLPLSRPF LKLMCRGDIS NNVNEKIGLT GVTQESMSSS MSSSFISEEG ETDAAYSTLE PCPWYAGLLD IEDLAEVDPV RGEFLKEIQN AITKRDRTFS DGPSAVDEET SLLYICHQSG ISVAIEDLAL TMTYSPSSKV FQHDQVELVE NGADVTVTIE NAREYTNLTI NYCLNQGICR QLEAFKSGFS KVFPMEKLHA FSPEEMRAML CGEQHPQWTR EDLLNYTEPK LGYTKESPGF QRFVNVLLSL TGSERKAFLQ FATGCSALPP GGLCNLHPRL TVVRKVDAGS GGYPSVNTCV HYLKLPEYPT EEILKERLLA ATRERGFHLN // ID E2BZQ4_HARSA Unreviewed; 329 AA. AC E2BZQ4; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Protein unc-84-like protein A {ECO:0000313|EMBL:EFN78816.1}; GN ORFNames=EAI_04306 {ECO:0000313|EMBL:EFN78816.1}; OS Harpegnathos saltator (Jerdon's jumping ant). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Ponerinae; Ponerini; Harpegnathos. OX NCBI_TaxID=610380 {ECO:0000313|Proteomes:UP000008237}; RN [1] {ECO:0000313|EMBL:EFN78816.1, ECO:0000313|Proteomes:UP000008237} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=R22 G/1 {ECO:0000313|EMBL:EFN78816.1, RC ECO:0000313|Proteomes:UP000008237}; RX PubMed=20798317; DOI=10.1126/science.1192428; RA Bonasio R., Zhang G., Ye C., Mutti N.S., Fang X., Qin N., Donahue G., RA Yang P., Li Q., Li C., Zhang P., Huang Z., Berger S.L., Reinberg D., RA Wang J., Liebig J.; RT "Genomic comparison of the ants Camponotus floridanus and Harpegnathos RT saltator."; RL Science 329:1068-1071(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL451657; EFN78816.1; -; Genomic_DNA. DR InParanoid; E2BZQ4; -. DR Proteomes; UP000008237; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008237}; KW Reference proteome {ECO:0000313|Proteomes:UP000008237}. FT COILED 50 87 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 329 AA; 38669 MW; 9E8F8FC2F4117AC1 CRC64; MIFEKEQIYQ KEYLMNITFI LDDYQKRQMD FQKECNDKNA DKINKINELR SLVNNELKVI KHELKKLQNL STELQSLIIN YTKTQVEEIL SDLSYFPPKI LKKDLMENNF VSDSDKDVAV LDDNVSDSNV LITEEYVRRI VKEILQIYDA DKTGQVDYAL ESAGGEIIST RDTQEYNIKS RAINIFGFLL YYQSYNNDPR IVIQKNSIHP GSCWAFQNFP GYLLIRLRSA IYVTGFTLEH VSRLIVPAGN MSSAPKNFNV WGLTDENDPK PVMFGDYEFT YFEYNLQYFP IQNTGIERSY EYIELRIQSN HGQLDYTCLY KFRIHGRRT // ID E2R045_CANFA Unreviewed; 318 AA. AC E2R045; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 31-OCT-2012, sequence version 2. DT 11-NOV-2015, entry version 29. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCAFP00000012051}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSCAFP00000012051}; OS Canis familiaris (Dog) (Canis lupus familiaris). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Canidae; OC Canis. OX NCBI_TaxID=9615 {ECO:0000313|Ensembl:ENSCAFP00000012051, ECO:0000313|Proteomes:UP000002254}; RN [1] {ECO:0000313|Ensembl:ENSCAFP00000012051, ECO:0000313|Proteomes:UP000002254} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000012051, RC ECO:0000313|Proteomes:UP000002254}; RX PubMed=16341006; DOI=10.1038/nature04338; RG Broad Sequencing Platform; RA Lindblad-Toh K., Wade C.M., Mikkelsen T.S., Karlsson E.K., Jaffe D.B., RA Kamal M., Clamp M., Chang J.L., Kulbokas E.J. III, Zody M.C., RA Mauceli E., Xie X., Breen M., Wayne R.K., Ostrander E.A., RA Ponting C.P., Galibert F., Smith D.R., deJong P.J., Kirkness E.F., RA Alvarez P., Biagi T., Brockman W., Butler J., Chin C.-W., Cook A., RA Cuff J., Daly M.J., DeCaprio D., Gnerre S., Grabherr M., Kellis M., RA Kleber M., Bardeleben C., Goodstadt L., Heger A., Hitte C., Kim L., RA Koepfli K.-P., Parker H.G., Pollinger J.P., Searle S.M.J., RA Sutter N.B., Thomas R., Webber C., Baldwin J., Abebe A., RA Abouelleil A., Aftuck L., Ait-Zahra M., Aldredge T., Allen N., An P., RA Anderson S., Antoine C., Arachchi H., Aslam A., Ayotte L., RA Bachantsang P., Barry A., Bayul T., Benamara M., Berlin A., RA Bessette D., Blitshteyn B., Bloom T., Blye J., Boguslavskiy L., RA Bonnet C., Boukhgalter B., Brown A., Cahill P., Calixte N., RA Camarata J., Cheshatsang Y., Chu J., Citroen M., Collymore A., RA Cooke P., Dawoe T., Daza R., Decktor K., DeGray S., Dhargay N., RA Dooley K., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., RA Dupes A., Egbiremolen O., Elong R., Falk J., Farina A., Faro S., RA Ferguson D., Ferreira P., Fisher S., FitzGerald M., Foley K., RA Foley C., Franke A., Friedrich D., Gage D., Garber M., Gearin G., RA Giannoukos G., Goode T., Goyette A., Graham J., Grandbois E., RA Gyaltsen K., Hafez N., Hagopian D., Hagos B., Hall J., Healy C., RA Hegarty R., Honan T., Horn A., Houde N., Hughes L., Hunnicutt L., RA Husby M., Jester B., Jones C., Kamat A., Kanga B., Kells C., RA Khazanovich D., Kieu A.C., Kisner P., Kumar M., Lance K., Landers T., RA Lara M., Lee W., Leger J.-P., Lennon N., Leuper L., LeVine S., Liu J., RA Liu X., Lokyitsang Y., Lokyitsang T., Lui A., Macdonald J., Major J., RA Marabella R., Maru K., Matthews C., McDonough S., Mehta T., RA Meldrim J., Melnikov A., Meneus L., Mihalev A., Mihova T., Miller K., RA Mittelman R., Mlenga V., Mulrain L., Munson G., Navidi A., Naylor J., RA Nguyen T., Nguyen N., Nguyen C., Nguyen T., Nicol R., Norbu N., RA Norbu C., Novod N., Nyima T., Olandt P., O'Neill B., O'Neill K., RA Osman S., Oyono L., Patti C., Perrin D., Phunkhang P., Pierre F., RA Priest M., Rachupka A., Raghuraman S., Rameau R., Ray V., Raymond C., RA Rege F., Rise C., Rogers J., Rogov P., Sahalie J., Settipalli S., RA Sharpe T., Shea T., Sheehan M., Sherpa N., Shi J., Shih D., Sloan J., RA Smith C., Sparrow T., Stalker J., Stange-Thomann N., Stavropoulos S., RA Stone C., Stone S., Sykes S., Tchuinga P., Tenzing P., Tesfaye S., RA Thoulutsang D., Thoulutsang Y., Topham K., Topping I., Tsamla T., RA Vassiliev H., Venkataraman V., Vo A., Wangchuk T., Wangdi T., RA Weiand M., Wilkinson J., Wilson A., Yadav S., Yang S., Yang X., RA Young G., Yu Q., Zainoun J., Zembek L., Zimmer A., Lander E.S.; RT "Genome sequence, comparative analysis and haplotype structure of the RT domestic dog."; RL Nature 438:803-819(2005). RN [2] {ECO:0000313|Ensembl:ENSCAFP00000012051} RP IDENTIFICATION. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000012051}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCAFP00000012051}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEX03013911; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9615.ENSCAFP00000012051; -. DR PaxDb; E2R045; -. DR Ensembl; ENSCAFT00000013024; ENSCAFP00000012051; ENSCAFG00000008191. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; E2R045; -. DR OMA; FREVCSI; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 20859757; -. DR Proteomes; UP000002254; Chromosome 24. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002254}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002254}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 57 80 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 93 127 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 318 AA; 35739 MW; 954B624977602168 CRC64; MGRGAGPDPQ RLEPDSHLAE SCSRELGLGP RRGAVGFIKV RGLSLPPFFR EVCSIRFLLT AVSLLSLFLA ALWWGLLYLI PPLENEPKEM LTLSEYHERV RSQGQQLQQL QAELDKLHKE VSSVRAANSE RVAKLVFQRL NEDFVRKPDY ALSSVGASID LEKTSHDYED ANTAYFWNRF SFWNYARPPT VILEPDVFPG NCWAFEGDQG QVVIRLPGRV QLSDITLQHP PPSVAHTGGA KSAPRDFAVY GLQVDDETEV FLGKFTFNVE KSEIQTFHLQ VCVSGGEGAK RMGQQRKTGE KGSPWDSRPW HDPFVSLE // ID E2R357_CANFA Unreviewed; 365 AA. AC E2R357; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 30-NOV-2010, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCAFP00000010865}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSCAFP00000010865}; OS Canis familiaris (Dog) (Canis lupus familiaris). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Canidae; OC Canis. OX NCBI_TaxID=9615 {ECO:0000313|Ensembl:ENSCAFP00000010865, ECO:0000313|Proteomes:UP000002254}; RN [1] {ECO:0000313|Ensembl:ENSCAFP00000010865, ECO:0000313|Proteomes:UP000002254} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000010865, RC ECO:0000313|Proteomes:UP000002254}; RX PubMed=16341006; DOI=10.1038/nature04338; RG Broad Sequencing Platform; RA Lindblad-Toh K., Wade C.M., Mikkelsen T.S., Karlsson E.K., Jaffe D.B., RA Kamal M., Clamp M., Chang J.L., Kulbokas E.J. III, Zody M.C., RA Mauceli E., Xie X., Breen M., Wayne R.K., Ostrander E.A., RA Ponting C.P., Galibert F., Smith D.R., deJong P.J., Kirkness E.F., RA Alvarez P., Biagi T., Brockman W., Butler J., Chin C.-W., Cook A., RA Cuff J., Daly M.J., DeCaprio D., Gnerre S., Grabherr M., Kellis M., RA Kleber M., Bardeleben C., Goodstadt L., Heger A., Hitte C., Kim L., RA Koepfli K.-P., Parker H.G., Pollinger J.P., Searle S.M.J., RA Sutter N.B., Thomas R., Webber C., Baldwin J., Abebe A., RA Abouelleil A., Aftuck L., Ait-Zahra M., Aldredge T., Allen N., An P., RA Anderson S., Antoine C., Arachchi H., Aslam A., Ayotte L., RA Bachantsang P., Barry A., Bayul T., Benamara M., Berlin A., RA Bessette D., Blitshteyn B., Bloom T., Blye J., Boguslavskiy L., RA Bonnet C., Boukhgalter B., Brown A., Cahill P., Calixte N., RA Camarata J., Cheshatsang Y., Chu J., Citroen M., Collymore A., RA Cooke P., Dawoe T., Daza R., Decktor K., DeGray S., Dhargay N., RA Dooley K., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., RA Dupes A., Egbiremolen O., Elong R., Falk J., Farina A., Faro S., RA Ferguson D., Ferreira P., Fisher S., FitzGerald M., Foley K., RA Foley C., Franke A., Friedrich D., Gage D., Garber M., Gearin G., RA Giannoukos G., Goode T., Goyette A., Graham J., Grandbois E., RA Gyaltsen K., Hafez N., Hagopian D., Hagos B., Hall J., Healy C., RA Hegarty R., Honan T., Horn A., Houde N., Hughes L., Hunnicutt L., RA Husby M., Jester B., Jones C., Kamat A., Kanga B., Kells C., RA Khazanovich D., Kieu A.C., Kisner P., Kumar M., Lance K., Landers T., RA Lara M., Lee W., Leger J.-P., Lennon N., Leuper L., LeVine S., Liu J., RA Liu X., Lokyitsang Y., Lokyitsang T., Lui A., Macdonald J., Major J., RA Marabella R., Maru K., Matthews C., McDonough S., Mehta T., RA Meldrim J., Melnikov A., Meneus L., Mihalev A., Mihova T., Miller K., RA Mittelman R., Mlenga V., Mulrain L., Munson G., Navidi A., Naylor J., RA Nguyen T., Nguyen N., Nguyen C., Nguyen T., Nicol R., Norbu N., RA Norbu C., Novod N., Nyima T., Olandt P., O'Neill B., O'Neill K., RA Osman S., Oyono L., Patti C., Perrin D., Phunkhang P., Pierre F., RA Priest M., Rachupka A., Raghuraman S., Rameau R., Ray V., Raymond C., RA Rege F., Rise C., Rogers J., Rogov P., Sahalie J., Settipalli S., RA Sharpe T., Shea T., Sheehan M., Sherpa N., Shi J., Shih D., Sloan J., RA Smith C., Sparrow T., Stalker J., Stange-Thomann N., Stavropoulos S., RA Stone C., Stone S., Sykes S., Tchuinga P., Tenzing P., Tesfaye S., RA Thoulutsang D., Thoulutsang Y., Topham K., Topping I., Tsamla T., RA Vassiliev H., Venkataraman V., Vo A., Wangchuk T., Wangdi T., RA Weiand M., Wilkinson J., Wilson A., Yadav S., Yang S., Yang X., RA Young G., Yu Q., Zainoun J., Zembek L., Zimmer A., Lander E.S.; RT "Genome sequence, comparative analysis and haplotype structure of the RT domestic dog."; RL Nature 438:803-819(2005). RN [2] {ECO:0000313|Ensembl:ENSCAFP00000010865} RP IDENTIFICATION. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000010865}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCAFP00000010865}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEX03013893; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9615.ENSCAFP00000010865; -. DR PaxDb; E2R357; -. DR Ensembl; ENSCAFT00000011724; ENSCAFP00000010865; ENSCAFG00000007323. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; E2R357; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002254; Chromosome 24. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002254}; KW Reference proteome {ECO:0000313|Proteomes:UP000002254}. SQ SEQUENCE 365 AA; 41687 MW; D5BCED298F664745 CRC64; MPRSSRSPVD QCDLPEDARP RRVIPRSRNT CRITEDALSN SNDAFVLPIR IHPPAPGLTQ CILACISWIT CLACFLRTQA HQVLFNTCRC KLLFQKLMEK TGVLVLCAFG FWVFSMHLPS KMEVWQDDSI NSPLQSLRMY QEKVRHHTGE IQDLRGNMTL LIAKLQLMEA MSDEQKMAQK IMKMIQGDYI EKPDFALKSI GASIDFEQTS ATYNHDKARS YWNWIRLWNY AQPPDVILEP NMTPGNCWAF SGDRGQVTIR MAQKVYLSNL TLQHIPKTIS LSGSLDTAPK DFVVYGMEGS PREEVFLGAF QFQPENIIQM FQLQNQPARA FGAVKVKISS NWGNPRFTCL YRVRVHGSVT PPRQP // ID E2R4M4_CANFA Unreviewed; 823 AA. AC E2R4M4; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 31-OCT-2012, sequence version 2. DT 11-NOV-2015, entry version 35. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCAFP00000016600}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSCAFP00000016600}; OS Canis familiaris (Dog) (Canis lupus familiaris). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Canidae; OC Canis. OX NCBI_TaxID=9615 {ECO:0000313|Ensembl:ENSCAFP00000016600, ECO:0000313|Proteomes:UP000002254}; RN [1] {ECO:0000313|Ensembl:ENSCAFP00000016600, ECO:0000313|Proteomes:UP000002254} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000016600, RC ECO:0000313|Proteomes:UP000002254}; RX PubMed=16341006; DOI=10.1038/nature04338; RG Broad Sequencing Platform; RA Lindblad-Toh K., Wade C.M., Mikkelsen T.S., Karlsson E.K., Jaffe D.B., RA Kamal M., Clamp M., Chang J.L., Kulbokas E.J. III, Zody M.C., RA Mauceli E., Xie X., Breen M., Wayne R.K., Ostrander E.A., RA Ponting C.P., Galibert F., Smith D.R., deJong P.J., Kirkness E.F., RA Alvarez P., Biagi T., Brockman W., Butler J., Chin C.-W., Cook A., RA Cuff J., Daly M.J., DeCaprio D., Gnerre S., Grabherr M., Kellis M., RA Kleber M., Bardeleben C., Goodstadt L., Heger A., Hitte C., Kim L., RA Koepfli K.-P., Parker H.G., Pollinger J.P., Searle S.M.J., RA Sutter N.B., Thomas R., Webber C., Baldwin J., Abebe A., RA Abouelleil A., Aftuck L., Ait-Zahra M., Aldredge T., Allen N., An P., RA Anderson S., Antoine C., Arachchi H., Aslam A., Ayotte L., RA Bachantsang P., Barry A., Bayul T., Benamara M., Berlin A., RA Bessette D., Blitshteyn B., Bloom T., Blye J., Boguslavskiy L., RA Bonnet C., Boukhgalter B., Brown A., Cahill P., Calixte N., RA Camarata J., Cheshatsang Y., Chu J., Citroen M., Collymore A., RA Cooke P., Dawoe T., Daza R., Decktor K., DeGray S., Dhargay N., RA Dooley K., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., RA Dupes A., Egbiremolen O., Elong R., Falk J., Farina A., Faro S., RA Ferguson D., Ferreira P., Fisher S., FitzGerald M., Foley K., RA Foley C., Franke A., Friedrich D., Gage D., Garber M., Gearin G., RA Giannoukos G., Goode T., Goyette A., Graham J., Grandbois E., RA Gyaltsen K., Hafez N., Hagopian D., Hagos B., Hall J., Healy C., RA Hegarty R., Honan T., Horn A., Houde N., Hughes L., Hunnicutt L., RA Husby M., Jester B., Jones C., Kamat A., Kanga B., Kells C., RA Khazanovich D., Kieu A.C., Kisner P., Kumar M., Lance K., Landers T., RA Lara M., Lee W., Leger J.-P., Lennon N., Leuper L., LeVine S., Liu J., RA Liu X., Lokyitsang Y., Lokyitsang T., Lui A., Macdonald J., Major J., RA Marabella R., Maru K., Matthews C., McDonough S., Mehta T., RA Meldrim J., Melnikov A., Meneus L., Mihalev A., Mihova T., Miller K., RA Mittelman R., Mlenga V., Mulrain L., Munson G., Navidi A., Naylor J., RA Nguyen T., Nguyen N., Nguyen C., Nguyen T., Nicol R., Norbu N., RA Norbu C., Novod N., Nyima T., Olandt P., O'Neill B., O'Neill K., RA Osman S., Oyono L., Patti C., Perrin D., Phunkhang P., Pierre F., RA Priest M., Rachupka A., Raghuraman S., Rameau R., Ray V., Raymond C., RA Rege F., Rise C., Rogers J., Rogov P., Sahalie J., Settipalli S., RA Sharpe T., Shea T., Sheehan M., Sherpa N., Shi J., Shih D., Sloan J., RA Smith C., Sparrow T., Stalker J., Stange-Thomann N., Stavropoulos S., RA Stone C., Stone S., Sykes S., Tchuinga P., Tenzing P., Tesfaye S., RA Thoulutsang D., Thoulutsang Y., Topham K., Topping I., Tsamla T., RA Vassiliev H., Venkataraman V., Vo A., Wangchuk T., Wangdi T., RA Weiand M., Wilkinson J., Wilson A., Yadav S., Yang S., Yang X., RA Young G., Yu Q., Zainoun J., Zembek L., Zimmer A., Lander E.S.; RT "Genome sequence, comparative analysis and haplotype structure of the RT domestic dog."; RL Nature 438:803-819(2005). RN [2] {ECO:0000313|Ensembl:ENSCAFP00000016600} RP IDENTIFICATION. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000016600}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCAFP00000016600}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEX03004353; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAEX03004354; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAEX03004355; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9615.ENSCAFP00000016590; -. DR PaxDb; E2R4M4; -. DR Ensembl; ENSCAFT00000017926; ENSCAFP00000016600; ENSCAFG00000011273. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR TreeFam; TF323915; -. DR Reactome; R-CFA-1221632; Meiotic synapsis. DR Proteomes; UP000002254; Chromosome 6. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR InterPro; IPR015880; Znf_C2H2-like. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00355; ZnF_C2H2; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002254}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002254}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 268 289 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 295 316 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 328 347 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 399 433 {ECO:0000256|SAM:Coils}. FT COILED 465 499 {ECO:0000256|SAM:Coils}. FT COILED 514 534 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 823 AA; 91676 MW; BDB270EA57EE2462 CRC64; MDFSRLHMYT PPQCVPENTG YTYALSSSYS SDALAFETEH RLDPVFDSPR MSRRSLRLVT TACATEDGQA RDTHSCASST ASLKDRAART VKQRRSASKP AFSINHTSRK VVSCGVGQSG TSTLSSAACL RPPVLDESLI REQTKVDHFW GLDDDGDLKG GNKAATQGNG DLAAEAARSN GYTCSDCLLL SERKDTLTAH SATHGPSPRL YSRDQGQKRN DSKGKELLEM HRAIRQQSSS PKRVAGAIWH IFSYTGHLLV QMLQRIGASG WSVSKTLLCV LWLAMVAPGK AASGIFWWLG IGWYQFVTLI SWLNVFLLTR CLRNICKFLI LLIPLLLLLG GGLSLWGQGD FLSGLPVFNW TRIYGTQRVD SPESMFTPDA SQLSQLSHSD GEAFHEHRMS EVERQMTSLS GQCHSHEEKL RELATVLQTL QARVDQMDGD GEATLSLVQR VVGQHLKETD TVTFRQEHEL RLSNLEDILG KLTQTSEAIQ KELEQTKLRT ASGAEEEQRL LSVVKHLEEE LGHLKSELSS WQHLKTSCEE VDTIHGKVDA QVRETVKLMF SDGEQGGSLK WLLQTVSSQF VSRDDLQVLL RDLELQILKN ITHYISVTKQ VPDSETVVSA ANEAGISGIT EAQARVIVNN ALKLYSQDKT GMVDFALESG GGSILSTRCS ETYETKTALI SLFGIPLWYF SQSPRVVIQP DIHPGNCWAF RGSQGYLVVR LSMKIRPTAF TLEHIPKTLS PTGNITSAPK DFAVYGLEDE YQEEGQLLGQ FTYDQEGESL QMFHVPKRPE GAFQIVELRI FSNWGHPEYT CLYRFRVHGE PIK // ID E2RGH9_CANFA Unreviewed; 2609 AA. AC E2RGH9; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 31-OCT-2012, sequence version 2. DT 11-NOV-2015, entry version 34. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCAFP00000019037}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSCAFP00000019037}; OS Canis familiaris (Dog) (Canis lupus familiaris). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Canidae; OC Canis. OX NCBI_TaxID=9615 {ECO:0000313|Ensembl:ENSCAFP00000019037, ECO:0000313|Proteomes:UP000002254}; RN [1] {ECO:0000313|Ensembl:ENSCAFP00000019037, ECO:0000313|Proteomes:UP000002254} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000019037, RC ECO:0000313|Proteomes:UP000002254}; RX PubMed=16341006; DOI=10.1038/nature04338; RG Broad Sequencing Platform; RA Lindblad-Toh K., Wade C.M., Mikkelsen T.S., Karlsson E.K., Jaffe D.B., RA Kamal M., Clamp M., Chang J.L., Kulbokas E.J. III, Zody M.C., RA Mauceli E., Xie X., Breen M., Wayne R.K., Ostrander E.A., RA Ponting C.P., Galibert F., Smith D.R., deJong P.J., Kirkness E.F., RA Alvarez P., Biagi T., Brockman W., Butler J., Chin C.-W., Cook A., RA Cuff J., Daly M.J., DeCaprio D., Gnerre S., Grabherr M., Kellis M., RA Kleber M., Bardeleben C., Goodstadt L., Heger A., Hitte C., Kim L., RA Koepfli K.-P., Parker H.G., Pollinger J.P., Searle S.M.J., RA Sutter N.B., Thomas R., Webber C., Baldwin J., Abebe A., RA Abouelleil A., Aftuck L., Ait-Zahra M., Aldredge T., Allen N., An P., RA Anderson S., Antoine C., Arachchi H., Aslam A., Ayotte L., RA Bachantsang P., Barry A., Bayul T., Benamara M., Berlin A., RA Bessette D., Blitshteyn B., Bloom T., Blye J., Boguslavskiy L., RA Bonnet C., Boukhgalter B., Brown A., Cahill P., Calixte N., RA Camarata J., Cheshatsang Y., Chu J., Citroen M., Collymore A., RA Cooke P., Dawoe T., Daza R., Decktor K., DeGray S., Dhargay N., RA Dooley K., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., RA Dupes A., Egbiremolen O., Elong R., Falk J., Farina A., Faro S., RA Ferguson D., Ferreira P., Fisher S., FitzGerald M., Foley K., RA Foley C., Franke A., Friedrich D., Gage D., Garber M., Gearin G., RA Giannoukos G., Goode T., Goyette A., Graham J., Grandbois E., RA Gyaltsen K., Hafez N., Hagopian D., Hagos B., Hall J., Healy C., RA Hegarty R., Honan T., Horn A., Houde N., Hughes L., Hunnicutt L., RA Husby M., Jester B., Jones C., Kamat A., Kanga B., Kells C., RA Khazanovich D., Kieu A.C., Kisner P., Kumar M., Lance K., Landers T., RA Lara M., Lee W., Leger J.-P., Lennon N., Leuper L., LeVine S., Liu J., RA Liu X., Lokyitsang Y., Lokyitsang T., Lui A., Macdonald J., Major J., RA Marabella R., Maru K., Matthews C., McDonough S., Mehta T., RA Meldrim J., Melnikov A., Meneus L., Mihalev A., Mihova T., Miller K., RA Mittelman R., Mlenga V., Mulrain L., Munson G., Navidi A., Naylor J., RA Nguyen T., Nguyen N., Nguyen C., Nguyen T., Nicol R., Norbu N., RA Norbu C., Novod N., Nyima T., Olandt P., O'Neill B., O'Neill K., RA Osman S., Oyono L., Patti C., Perrin D., Phunkhang P., Pierre F., RA Priest M., Rachupka A., Raghuraman S., Rameau R., Ray V., Raymond C., RA Rege F., Rise C., Rogers J., Rogov P., Sahalie J., Settipalli S., RA Sharpe T., Shea T., Sheehan M., Sherpa N., Shi J., Shih D., Sloan J., RA Smith C., Sparrow T., Stalker J., Stange-Thomann N., Stavropoulos S., RA Stone C., Stone S., Sykes S., Tchuinga P., Tenzing P., Tesfaye S., RA Thoulutsang D., Thoulutsang Y., Topham K., Topping I., Tsamla T., RA Vassiliev H., Venkataraman V., Vo A., Wangchuk T., Wangdi T., RA Weiand M., Wilkinson J., Wilson A., Yadav S., Yang S., Yang X., RA Young G., Yu Q., Zainoun J., Zembek L., Zimmer A., Lander E.S.; RT "Genome sequence, comparative analysis and haplotype structure of the RT domestic dog."; RL Nature 438:803-819(2005). RN [2] {ECO:0000313|Ensembl:ENSCAFP00000019037} RP IDENTIFICATION. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000019037}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCAFP00000019037}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEX03005651; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAEX03005652; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9615.ENSCAFP00000019037; -. DR PaxDb; E2RGH9; -. DR Ensembl; ENSCAFT00000020512; ENSCAFP00000019037; ENSCAFG00000012911. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; E2RGH9; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000002254; Chromosome 8. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002254}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000002254}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1244 1264 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2609 AA; 289214 MW; 78D13F3EC422870A CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNRVLTF IRDSGHLVHK DTLHSALTEV SRLCGTMEPL DSSLEIVESL SSLLKHEDHQ VSDGALRCFA SLADRFTRRG VDPAPLAKHG LTEELLSRMA AAGGTVSGPS SACKPGRSTA GAPSTAADSK LSNQVSTIVS LLSTLCRGSP VVTHDLLRSE LPDSIESALQ GDERCVLDTM RLVDLLLVLL FEGRKALPKS SAGSTGRIPG LRRLDSSGER SHRQLIDCIR SKDTDALIDA IDTGAFEVNF MDDVGQTLLN WASAFGTQEM VEFLCERGAD VNRGQRSSSL HYAACFGRPQ VAKTLLRHGA NPDLRDEDGK TPLDKARERG HSEVVAILQS PGDWMCPVNK GDDKKKKDTN KDEEECNEPK GDPEMAPIYL KRLLPVFAQT FQQTMLPSIR KASLALIRKM IHFCSEALLK EVCDSDVGHN LPTVLVEITA TVLDQEDDDD GHLLALQIIR DLVDKGGDIF LDQLARLGVI SKVSTLAGPS SDDENEEESK PEKEDEPQED AKELQQGKPY HWRDWSIIRG RDCLYIWSDA AALELSNGSN GWFRFILDGK LATMYSSGSP EGGSDSSESR SEFLEKLQRA RGQVKPSTSS QPILSAPGPT KLTVGNWSLT CLKEGEIAIH NSDGQQATIL KEDLPGFVFE SNRGTKHSFT AETSLGSEFV TGWTGKRGRK LKSKLEKTKQ KVRTMARDLY DDHFKAVESM PRGVVVTLRN IATQLESSWE LHTNRQCIES ENTWRDLMKT ALENLIVLLK DENTISPYEM CSSGLVQALL TVLNNSMDLD MKQDCSQLVE RINVFKTAFS ENEDDESRPA VALIRKLIAV LESIERLPLH LYDTPGSTYN LQILTRRLRF RLERAPGETA LIDRTGRMLK MEPLATVESL EQYLLKMVAK QWYDFDRSSF VFVRKLREGQ NFIFRHQHDF DENGIIYWIG TNAKTAYEWV NPAAYGLVVV TSSEGRNLPY GRLEDILSRD NSALNCHSND DKNAWFAIDL GLWVIPSAYT LRHARGYGRS ALRNWVFQVS KDGQNWTSLY THVDDCSLNE PGSTATWPLD PPKDEKQGWR HVRIKQMGKN ASGQTHYLSL SGFELYGTVN GVCEDQLGKA AKEAEANLRR QRRLVRSQVL KYMVPGARVI RGLDWKWRDQ DGSPQGEGTV TGELHNGWID VTWDAGGSNS YRMGAEGKFD LKLAPGYDPD TVASPKPVSS TVSGTTQSWS SLVKNNCPDK TSAAAGSSSR KGSSSSVCSV ASSSDISLGS TKTERRSEIV MEHSIVSGAD VHEPIVVLSS AENVPQTEVG SSSSASTSTL TAETGSENAE RKLGPDSSVR TPGESSAISM GIVSVSSPDV SSVSELTNKE AASQRPLSSS ASNRLSVSSL LAAGAPMSSS ASVPNLSSRE TSSLESFVRR VANIARTNAT NNMNLSRSSS DNNTNTLGRN VMSTATSPLM GAQSFPNLTT PGTTSTVTMS TSSVTSSSNV ATATTVLSVG QSLSNTLTTS LTSTSSESDT GQEAEYSLYD FLDSCRASTL LAELDDDEDL PEPDEEDDEN EDDNQEDQEY EEVMILRRPS LQRRAGSRSD VTHHAVTSQL PQVPAGAGSR PIGEQEEEEY ETKGGRRRTW DDDYVLKRQF SALVPAFDPR PGRTNVQQTT DLEIPPPGTP HSELLEEVEC TPSPRLALTL KVTGLGTTRE VELPLTNFRS TIFYYVQKLL QLSCNGNVKS DKLRRIWEPT YTIMYREMKD SDKEKENGKM GCWSIEHVEQ YLGTDELPKN DLITYLQKNA DAAFLRHWKL TGTNKSIRKN RNCSQLIAAY KDFCEHGTKS GLNQGAISTL QSSDILNLAK EQPQAKAGNG QNSCGVEDVL QLLRILYIVA SDPYSRISQE EGDEQPQFTF PPDEFTSKKI TTKILQQIEE PLALASGALP DWCEQLTSKC PFLIPFETRQ LYFTCTAFGA SRAIVWLQNR REATVERTRT TSSVRRDDPG EFRVGRLKHE RVKVPRGESL MEWAENVMQI HADRKSVLEV EFLGEEGTGL GPTLEFYALV AAEFQRTDLG AWLCDDNFPD DESRHVDLGG GLKPPGYYVQ RSCGLFTAPF PQDSDELERI TKLFHFLGIF LAKCIQDNRL VDLPISKPFF KLMCMGDIKS NMSKLIYESR GDRDLHCTES QSEASTEEGH DSLSVGSFEE DSKSEFILDP PKPKPPAWFN GILTWEDFEL VNPHRARFLK EIKDLAVKRR QILSNKGLSE DEKNTKLQEL VLKNPSGSGP PLSIEDLGLN FQFCPSSRIY GFTAVDLKPS GEDEMITMDN AEEYVDLMFD FCMHTGIQKQ MEAFRDGFNK VFPMEKLSSF SHEEVQMILC GNQSPSWAAE DIINYTEPKL GYTRDSPGFL RFVRVLCGMS SDERKAFLQF TTGCSTLPPG GLANLHPRLT VVRKVDATDA SYPSVNTCVH YLKLPEYSSE EIMRERLLAA TMEKGFHLN // ID E2RIK8_CANFA Unreviewed; 309 AA. AC E2RIK8; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 31-OCT-2012, sequence version 2. DT 11-NOV-2015, entry version 31. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCAFP00000020836}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSCAFP00000020836}; OS Canis familiaris (Dog) (Canis lupus familiaris). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Canidae; OC Canis. OX NCBI_TaxID=9615 {ECO:0000313|Ensembl:ENSCAFP00000020836, ECO:0000313|Proteomes:UP000002254}; RN [1] {ECO:0000313|Ensembl:ENSCAFP00000020836, ECO:0000313|Proteomes:UP000002254} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000020836, RC ECO:0000313|Proteomes:UP000002254}; RX PubMed=16341006; DOI=10.1038/nature04338; RG Broad Sequencing Platform; RA Lindblad-Toh K., Wade C.M., Mikkelsen T.S., Karlsson E.K., Jaffe D.B., RA Kamal M., Clamp M., Chang J.L., Kulbokas E.J. III, Zody M.C., RA Mauceli E., Xie X., Breen M., Wayne R.K., Ostrander E.A., RA Ponting C.P., Galibert F., Smith D.R., deJong P.J., Kirkness E.F., RA Alvarez P., Biagi T., Brockman W., Butler J., Chin C.-W., Cook A., RA Cuff J., Daly M.J., DeCaprio D., Gnerre S., Grabherr M., Kellis M., RA Kleber M., Bardeleben C., Goodstadt L., Heger A., Hitte C., Kim L., RA Koepfli K.-P., Parker H.G., Pollinger J.P., Searle S.M.J., RA Sutter N.B., Thomas R., Webber C., Baldwin J., Abebe A., RA Abouelleil A., Aftuck L., Ait-Zahra M., Aldredge T., Allen N., An P., RA Anderson S., Antoine C., Arachchi H., Aslam A., Ayotte L., RA Bachantsang P., Barry A., Bayul T., Benamara M., Berlin A., RA Bessette D., Blitshteyn B., Bloom T., Blye J., Boguslavskiy L., RA Bonnet C., Boukhgalter B., Brown A., Cahill P., Calixte N., RA Camarata J., Cheshatsang Y., Chu J., Citroen M., Collymore A., RA Cooke P., Dawoe T., Daza R., Decktor K., DeGray S., Dhargay N., RA Dooley K., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., RA Dupes A., Egbiremolen O., Elong R., Falk J., Farina A., Faro S., RA Ferguson D., Ferreira P., Fisher S., FitzGerald M., Foley K., RA Foley C., Franke A., Friedrich D., Gage D., Garber M., Gearin G., RA Giannoukos G., Goode T., Goyette A., Graham J., Grandbois E., RA Gyaltsen K., Hafez N., Hagopian D., Hagos B., Hall J., Healy C., RA Hegarty R., Honan T., Horn A., Houde N., Hughes L., Hunnicutt L., RA Husby M., Jester B., Jones C., Kamat A., Kanga B., Kells C., RA Khazanovich D., Kieu A.C., Kisner P., Kumar M., Lance K., Landers T., RA Lara M., Lee W., Leger J.-P., Lennon N., Leuper L., LeVine S., Liu J., RA Liu X., Lokyitsang Y., Lokyitsang T., Lui A., Macdonald J., Major J., RA Marabella R., Maru K., Matthews C., McDonough S., Mehta T., RA Meldrim J., Melnikov A., Meneus L., Mihalev A., Mihova T., Miller K., RA Mittelman R., Mlenga V., Mulrain L., Munson G., Navidi A., Naylor J., RA Nguyen T., Nguyen N., Nguyen C., Nguyen T., Nicol R., Norbu N., RA Norbu C., Novod N., Nyima T., Olandt P., O'Neill B., O'Neill K., RA Osman S., Oyono L., Patti C., Perrin D., Phunkhang P., Pierre F., RA Priest M., Rachupka A., Raghuraman S., Rameau R., Ray V., Raymond C., RA Rege F., Rise C., Rogers J., Rogov P., Sahalie J., Settipalli S., RA Sharpe T., Shea T., Sheehan M., Sherpa N., Shi J., Shih D., Sloan J., RA Smith C., Sparrow T., Stalker J., Stange-Thomann N., Stavropoulos S., RA Stone C., Stone S., Sykes S., Tchuinga P., Tenzing P., Tesfaye S., RA Thoulutsang D., Thoulutsang Y., Topham K., Topping I., Tsamla T., RA Vassiliev H., Venkataraman V., Vo A., Wangchuk T., Wangdi T., RA Weiand M., Wilkinson J., Wilson A., Yadav S., Yang S., Yang X., RA Young G., Yu Q., Zainoun J., Zembek L., Zimmer A., Lander E.S.; RT "Genome sequence, comparative analysis and haplotype structure of the RT domestic dog."; RL Nature 438:803-819(2005). RN [2] {ECO:0000313|Ensembl:ENSCAFP00000020836} RP IDENTIFICATION. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000020836}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCAFP00000020836}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEX03011096; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9615.ENSCAFP00000020836; -. DR PaxDb; E2RIK8; -. DR Ensembl; ENSCAFT00000022440; ENSCAFP00000020836; ENSCAFG00000014138. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; E2RIK8; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 20855724; -. DR Proteomes; UP000002254; Chromosome 18. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002254}; KW Reference proteome {ECO:0000313|Proteomes:UP000002254}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 17 {ECO:0000256|SAM:SignalP}. FT CHAIN 18 309 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003164040. SQ SEQUENCE 309 AA; 35475 MW; 2C1A6DE324CB2B83 CRC64; MVLSTLWILT LLLVGLGNHR WLKEAEFPQK SRHFYALVAE YGSRLYNYQA RLRMPKEQLE LLKKESQTLE NNFREILFLI EQIDVLKALL RDTQEDLHSY SWNTDQDEDQ DPMEATEEMS NLVNYVLKKL REDQVQMADY ALKSAGASIV EAGTSESYKN NKAKLYWHGI GFLTYEMPPD IILQPDVHPG KCWAFPGSQG HALIKLARKI VPTAVTMEHI SEKVSPSGNI SSAPKEFSVY GISKQCEGEE IFLGQFVYNK TGSTVQTFEL QHDVFESLLC VKLKILSNWG HPKYTCLYRF RVHGTPGEY // ID E2RL48_CANFA Unreviewed; 762 AA. AC E2RL48; DT 30-NOV-2010, integrated into UniProtKB/TrEMBL. DT 31-OCT-2012, sequence version 2. DT 11-NOV-2015, entry version 32. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCAFP00000001961}; DE Flags: Fragment; GN Name=SUN2 {ECO:0000313|Ensembl:ENSCAFP00000001961}; OS Canis familiaris (Dog) (Canis lupus familiaris). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Canidae; OC Canis. OX NCBI_TaxID=9615 {ECO:0000313|Ensembl:ENSCAFP00000001961, ECO:0000313|Proteomes:UP000002254}; RN [1] {ECO:0000313|Ensembl:ENSCAFP00000001961, ECO:0000313|Proteomes:UP000002254} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000001961, RC ECO:0000313|Proteomes:UP000002254}; RX PubMed=16341006; DOI=10.1038/nature04338; RG Broad Sequencing Platform; RA Lindblad-Toh K., Wade C.M., Mikkelsen T.S., Karlsson E.K., Jaffe D.B., RA Kamal M., Clamp M., Chang J.L., Kulbokas E.J. III, Zody M.C., RA Mauceli E., Xie X., Breen M., Wayne R.K., Ostrander E.A., RA Ponting C.P., Galibert F., Smith D.R., deJong P.J., Kirkness E.F., RA Alvarez P., Biagi T., Brockman W., Butler J., Chin C.-W., Cook A., RA Cuff J., Daly M.J., DeCaprio D., Gnerre S., Grabherr M., Kellis M., RA Kleber M., Bardeleben C., Goodstadt L., Heger A., Hitte C., Kim L., RA Koepfli K.-P., Parker H.G., Pollinger J.P., Searle S.M.J., RA Sutter N.B., Thomas R., Webber C., Baldwin J., Abebe A., RA Abouelleil A., Aftuck L., Ait-Zahra M., Aldredge T., Allen N., An P., RA Anderson S., Antoine C., Arachchi H., Aslam A., Ayotte L., RA Bachantsang P., Barry A., Bayul T., Benamara M., Berlin A., RA Bessette D., Blitshteyn B., Bloom T., Blye J., Boguslavskiy L., RA Bonnet C., Boukhgalter B., Brown A., Cahill P., Calixte N., RA Camarata J., Cheshatsang Y., Chu J., Citroen M., Collymore A., RA Cooke P., Dawoe T., Daza R., Decktor K., DeGray S., Dhargay N., RA Dooley K., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., RA Dupes A., Egbiremolen O., Elong R., Falk J., Farina A., Faro S., RA Ferguson D., Ferreira P., Fisher S., FitzGerald M., Foley K., RA Foley C., Franke A., Friedrich D., Gage D., Garber M., Gearin G., RA Giannoukos G., Goode T., Goyette A., Graham J., Grandbois E., RA Gyaltsen K., Hafez N., Hagopian D., Hagos B., Hall J., Healy C., RA Hegarty R., Honan T., Horn A., Houde N., Hughes L., Hunnicutt L., RA Husby M., Jester B., Jones C., Kamat A., Kanga B., Kells C., RA Khazanovich D., Kieu A.C., Kisner P., Kumar M., Lance K., Landers T., RA Lara M., Lee W., Leger J.-P., Lennon N., Leuper L., LeVine S., Liu J., RA Liu X., Lokyitsang Y., Lokyitsang T., Lui A., Macdonald J., Major J., RA Marabella R., Maru K., Matthews C., McDonough S., Mehta T., RA Meldrim J., Melnikov A., Meneus L., Mihalev A., Mihova T., Miller K., RA Mittelman R., Mlenga V., Mulrain L., Munson G., Navidi A., Naylor J., RA Nguyen T., Nguyen N., Nguyen C., Nguyen T., Nicol R., Norbu N., RA Norbu C., Novod N., Nyima T., Olandt P., O'Neill B., O'Neill K., RA Osman S., Oyono L., Patti C., Perrin D., Phunkhang P., Pierre F., RA Priest M., Rachupka A., Raghuraman S., Rameau R., Ray V., Raymond C., RA Rege F., Rise C., Rogers J., Rogov P., Sahalie J., Settipalli S., RA Sharpe T., Shea T., Sheehan M., Sherpa N., Shi J., Shih D., Sloan J., RA Smith C., Sparrow T., Stalker J., Stange-Thomann N., Stavropoulos S., RA Stone C., Stone S., Sykes S., Tchuinga P., Tenzing P., Tesfaye S., RA Thoulutsang D., Thoulutsang Y., Topham K., Topping I., Tsamla T., RA Vassiliev H., Venkataraman V., Vo A., Wangchuk T., Wangdi T., RA Weiand M., Wilkinson J., Wilson A., Yadav S., Yang S., Yang X., RA Young G., Yu Q., Zainoun J., Zembek L., Zimmer A., Lander E.S.; RT "Genome sequence, comparative analysis and haplotype structure of the RT domestic dog."; RL Nature 438:803-819(2005). RN [2] {ECO:0000313|Ensembl:ENSCAFP00000001961} RP IDENTIFICATION. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000001961}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCAFP00000001961}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEX03007314; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9615.ENSCAFP00000001961; -. DR PaxDb; E2RL48; -. DR Ensembl; ENSCAFT00000002119; ENSCAFP00000001961; ENSCAFG00000001371. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; E2RL48; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Reactome; R-CFA-1221632; Meiotic synapsis. DR Proteomes; UP000002254; Chromosome 10. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002254}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002254}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 232 248 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 257 278 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 316 336 {ECO:0000256|SAM:Coils}. FT COILED 406 444 {ECO:0000256|SAM:Coils}. FT COILED 447 474 {ECO:0000256|SAM:Coils}. FT COILED 514 548 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSCAFP00000001961}. SQ SEQUENCE 762 AA; 85427 MW; 2340E9B0F9290853 CRC64; LASSTSAGRG EEEEPIWASG AFRLSSGGRS PSYLVMSRRS QRLTRYSQGD DDGGSSSGGS SVMGSQSTLF KDSPLRTLKR KSSNMKRLSP APQLGPSSDA HTSYYSESVV RESYFGSPRA ASLARSSILD DHLHGDPYWS EDLRVRRRRG TGGTESSKLN GLAEDKSSED FLGSSSGYSS EDDFAGYSET DHRSSGSRLR NAVSWAASCF WMVVTSPGRL FGLLYWWVGT TWYRLTTAAS LLDVFVLTRR FSSVKTFLWF LLLLLLMTGL TYGAWYFYPY GLQTFHPALV SWWAAKGSSR QHDVWESRDA SHFQTEQRIL SRIHSLERRL EALAAEFSSN WQKEAMRLER LELRQGATGG GGHVGLSQED TLALLEGLVS RREAALKEDF RRDTAARIQE ELVTLRAEHQ QDLEDLFKKI VQASQESEAR LQQLKSEWQR MTQESFRENS MKELGRLEGQ LTGLRQELAA LSLKQSSVAD QVGLLPQQLQ AVRDDVESQF PAWVSQFLLR GGGTRTGLLQ REEMQAQLQE LENKILAHVA EMQGKSAREA AASLGLTLQK EGVIGVTEEQ VQRIVNQALK RYSEDRIGMV DYALESGDLG ASVISTRCSE TYETKTALLS LFGIPLWYHS QSPRVILQPD VHPGNCWAFQ GPQGFAVVRL SARIRPTAVT LEHVPKSLSP NSTISSAPKD FSIFGFDEDL QQEGTLLGQF TYDQDGEPIQ TFYFQDTKMA TYQVVELRIL TNWGHPEYTC IYRFRVHGEP TY // ID E3KEU9_PUCGT Unreviewed; 276 AA. AC E3KEU9; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 2. DT 14-OCT-2015, entry version 23. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP82824.2}; GN ORFNames=PGTG_09792 {ECO:0000313|EMBL:EFP82824.2}; OS Puccinia graminis f. sp. tritici (strain CRL 75-36-700-3 / race SCCL) OS (Black stem rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Pucciniaceae; Puccinia. OX NCBI_TaxID=418459 {ECO:0000313|EMBL:EFP82824.2, ECO:0000313|Proteomes:UP000008783}; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=CRL 75-36-700-3; RG The Broad Institute Genome Sequencing Platform; RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Cuomo C., RA Jaffe D., Butler J., Alvarez P., Gnerre S., Grabherr M., Mauceli E., RA Brockman W., Young S., LaButti K., Sykes S., DeCaprio D., Crawford M., RA Koehrsen M., Engels R., Montgomery P., Pearson M., Howarth C., RA Larson L., White J., Zeng Q., Kodira C., Yandava C., Alvarado L., RA O'Leary S., Szabo L., Dean R., Schein J.; RT "The Genome Sequence of Puccinia graminis f. sp. tritici Strain CRL RT 75-36-700-3."; RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000008783} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CRL 75-36-700-3 / race SCCL RC {ECO:0000313|Proteomes:UP000008783}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS178284; EFP82824.2; -; Genomic_DNA. DR RefSeq; XP_003327243.2; XM_003327195.2. DR EnsemblFungi; EFP82824; EFP82824; PGTG_09792. DR GeneID; 10544526; -. DR KEGG; pgr:PGTG_09792; -. DR EuPathDB; FungiDB:PGTG_09792; -. DR InParanoid; E3KEU9; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000008783; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008783}. FT COILED 43 63 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 276 AA; 30819 MW; 88C72BC584435D40 CRC64; MGLTFFSADK TGLLGEQRSP IVAPLFKTCN ESHLSWLLLN QKIEDVSSRI EALESSFQRF QETNWVEIKK IKRSKFVQSE GVKEVAHAVV KDALRQYSRE QIGQQDFALY SGGARIIPSL TSKTYQVEMK GMKAIFISFW TGAPRIISGR GPIAALSPEI EAGMCWSISG STGQLGVSLA RRILVTSVSV EHISTLSAYE IDSSPKEMEV CPEIPELYST QSSLKSTTIT HKSNRQRFSV FSPNLTAKVT VRDQVQLWKS LLYLYLSIQS AWIDGE // ID E3KL34_PUCGT Unreviewed; 1056 AA. AC E3KL34; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 2. DT 14-OCT-2015, entry version 22. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP85009.2}; GN ORFNames=PGTG_11178 {ECO:0000313|EMBL:EFP85009.2}; OS Puccinia graminis f. sp. tritici (strain CRL 75-36-700-3 / race SCCL) OS (Black stem rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Pucciniaceae; Puccinia. OX NCBI_TaxID=418459 {ECO:0000313|EMBL:EFP85009.2, ECO:0000313|Proteomes:UP000008783}; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=CRL 75-36-700-3; RG The Broad Institute Genome Sequencing Platform; RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Cuomo C., RA Jaffe D., Butler J., Alvarez P., Gnerre S., Grabherr M., Mauceli E., RA Brockman W., Young S., LaButti K., Sykes S., DeCaprio D., Crawford M., RA Koehrsen M., Engels R., Montgomery P., Pearson M., Howarth C., RA Larson L., White J., Zeng Q., Kodira C., Yandava C., Alvarado L., RA O'Leary S., Szabo L., Dean R., Schein J.; RT "The Genome Sequence of Puccinia graminis f. sp. tritici Strain CRL RT 75-36-700-3."; RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000008783} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CRL 75-36-700-3 / race SCCL RC {ECO:0000313|Proteomes:UP000008783}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS178293; EFP85009.2; -; Genomic_DNA. DR RefSeq; XP_003329428.2; XM_003329380.2. DR EnsemblFungi; EFP85009; EFP85009; PGTG_11178. DR GeneID; 10526986; -. DR KEGG; pgr:PGTG_11178; -. DR EuPathDB; FungiDB:PGTG_11178; -. DR InParanoid; E3KL34; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008783; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008783}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 1056 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003173491. FT COILED 358 378 {ECO:0000256|SAM:Coils}. FT COILED 672 692 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1056 AA; 117459 MW; AEBD6BE71E646E57 CRC64; MKNNFLLLPL LLLLTQQTNQ QQQQQNKQPT LNSFNQLLNQ QSKTCPATTN TITTTTTTTT ANNNQKPKLL SFSDWKQQQH HQQQQPVDEN KNISQPTPST WSSTLSQSLS QTSKRATDYL KTTNSADSLL PICQPEHTPL KTNLSEPHQD KQHSIKLNQQ HPPILYQPDP NTGTGTPDDP LKILSTRTNY ASFDCSASIH RSSKHTKSPS SILNEKKDKY LLTPCTNHKS KQHNNFVVFE LCDEIEIDHV VLANFEFFSS MFKLIKMSVS NSGLEGVGRA EWVDVGFLKT HNTRGFQVFP IKHLKGFYRY VRLDFLTHYG SEYYCPLSLV RIYGLTQIDA YRRDEKLEQR NKVDESVIQA QEQDQEESVQ EEEQQQQQLV PNSEEQLPLS QQPELPIQTD NKSTTATSDD PADDHPSADS NIPSPAVQER SNEAFKSNEE EKDLNQNHIE KQQDQSDSPE ATTTPDLVVQ LPETVQPEPT TNQESPPSAH DSQSTSTQPV LPSPSSEPDI LPAPSDPPQP QDDYSLATDT EKVPLPPPAQ SSDSQEHNPT TESNNHNPYE TTTSESIPNP PASSAGRGGE AKKQTLTGHD PTHPTKPIIS SSKSTQAEKP TPPLHHNPPA PGNPTGHGSE SIFGQIMKRL NALESNHLLL LKYTEDQVLG LGDSSLKVQS HLDELEKILK LQTKTHEESV EEIQGIKNQM TVERNFLNHR IESLDSSITF IKRIGLVQFL SIIAILIFLS MTRIQHGSSP TNGRMTREER RGGGSHPWTS HPIRRLSSSP RLLSEHPSLR HQRTPGGAKK LIVGSRREST SSATTAAKNE LFLVSSLHRK LAHHSSPRLP SLKLGLRRPD SPLRRAASLS HHHHQKQLSE SLNGPAYPRT RRARVLEQRE DQKDELGGPD QAVSSIDDDW VKDDDGEEQF PLERPTLPKA HHLDHHLAPR RDADDRPLEQ AATAAAKEIE PSSGPSEEAR PCAFTLPTPP PSSPPSSPSP PPHLQPLPPL PLLDHSSSLL PTVPLLPPSY RSSIRHPLPA SQANSSAEPS ILEKSVEKPV TAHLGQ // ID E3KTE5_PUCGT Unreviewed; 1156 AA. AC E3KTE5; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 2. DT 11-NOV-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP87570.2}; GN ORFNames=PGTG_13941 {ECO:0000313|EMBL:EFP87570.2}; OS Puccinia graminis f. sp. tritici (strain CRL 75-36-700-3 / race SCCL) OS (Black stem rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Pucciniaceae; Puccinia. OX NCBI_TaxID=418459 {ECO:0000313|EMBL:EFP87570.2, ECO:0000313|Proteomes:UP000008783}; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=CRL 75-36-700-3; RG The Broad Institute Genome Sequencing Platform; RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Cuomo C., RA Jaffe D., Butler J., Alvarez P., Gnerre S., Grabherr M., Mauceli E., RA Brockman W., Young S., LaButti K., Sykes S., DeCaprio D., Crawford M., RA Koehrsen M., Engels R., Montgomery P., Pearson M., Howarth C., RA Larson L., White J., Zeng Q., Kodira C., Yandava C., Alvarado L., RA O'Leary S., Szabo L., Dean R., Schein J.; RT "The Genome Sequence of Puccinia graminis f. sp. tritici Strain CRL RT 75-36-700-3."; RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000008783} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CRL 75-36-700-3 / race SCCL RC {ECO:0000313|Proteomes:UP000008783}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS178307; EFP87570.2; -; Genomic_DNA. DR RefSeq; XP_003331989.2; XM_003331941.2. DR EnsemblFungi; EFP87570; EFP87570; PGTG_13941. DR GeneID; 10538630; -. DR KEGG; pgr:PGTG_13941; -. DR EuPathDB; FungiDB:PGTG_13941; -. DR InParanoid; E3KTE5; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000008783; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008783}. FT COILED 751 771 {ECO:0000256|SAM:Coils}. FT COILED 1148 1156 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1156 AA; 128686 MW; 62FAC83B3A1E5A77 CRC64; MFSTDTPPQS RRKSFKAFTK QLVNPLQRVQ QQDQQLQPSS SMSDRTADNQ QQQPQNTNST QQSQSDRNQQ LPPSASLILG QTAKINPRLP RPSASTITLS SVPYSYAYGA AGSPPQPAPK KSYNSAQTVD TTDSISPRSR SSRGRQTALH PDDMSVASVS SEKTGSEPAQ IRYARLNQRL QNTGSANAPT QVPPPKVIVP KPNSTHLNDT SVNIASAFER AQLASRLNKN HNPPQKSTQQ SANNNHTRPE NIQSSYDRQN LNPSNNQDDQ YTTSELNQTH QTNKKRKKGR RSIDPSYKYR PGDSATSDSE AGEGIADKRL RREKKARLEQ EKLEAQTGTS KKRAISKRRR TTSRRSNGSD DLTDSDLDTG PGIAATRGRR IPKESQPGNA SDTSNNVNRR NNGVSRKKGT LNNQRIGDQS DTNDDDIPEG GFIRPKKPTN QNLDTDVTPH ARNQHPSATF FLSDPEASQP PEEGERNQSN DQDMTVATTE SSRSALRLSG DSYDFAEEER IVKALGDKTR QQRPLAAETS IGDGSTSDYH RQESSSNSGW AGSSRDRIRS DEHARRSDSG MREIEEIIAS SRQTAHSRPE KPRVLPLPIP HQKNPISTIV EETHSELRKE SDTSLFHSIP ESLGTKWGKQ FTSLLSHLFR AEFWNRFTWL VLLGSIAFAL LINSYKTGDL NIPLLSLFSS SKRTSPTFSA PTTEIQNLAE LIERLKSLES VVSTISVEQR QDSKNLQQSL VRDKRELDYK VGKLELKLEE EEKVRGGLED KVQRKVDQGV KGLRSDLESV IIGHRHDHGQ LEKIKQTVEG LGGRIESLDN SNSNSVTREE ARVMIEQLWS IKGRDGPGDP EKLKQAVLKA TLKQIGQEGY VTKTEIGKLL ESELQTMGMG FELQLDSKLE KIRQELQAPR SRTRNLDGSV GGEVLDSVQG MIEEAIERYS QDGIGKRDFA LYSGGARVIP QLTSKTYEIS VKTWSQKFIG LITGAESVIR GRGPITALSP EVELGKCWPI AGQQGHLAVY LSRKILVTEV SIEHVSKSLA YELDSAPREI EVWSIEDSTK LLDIVYDPNL PSRIQTFKVP AQFNYLSSFE NQDDHDVPSS SSSSKNEPPT DSPSVQTPIL RQGILFKIKS NHGNPYYTCI YRLRVHGLME SELNLK // ID E3L7I3_PUCGT Unreviewed; 305 AA. AC E3L7I3; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 16-SEP-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP92508.1}; GN ORFNames=PGTG_18506 {ECO:0000313|EMBL:EFP92508.1}; OS Puccinia graminis f. sp. tritici (strain CRL 75-36-700-3 / race SCCL) OS (Black stem rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Pucciniaceae; Puccinia. OX NCBI_TaxID=418459 {ECO:0000313|EMBL:EFP92508.1, ECO:0000313|Proteomes:UP000008783}; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=CRL 75-36-700-3; RG The Broad Institute Genome Sequencing Platform; RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Cuomo C., RA Jaffe D., Butler J., Alvarez P., Gnerre S., Grabherr M., Mauceli E., RA Brockman W., Young S., LaButti K., Sykes S., DeCaprio D., Crawford M., RA Koehrsen M., Engels R., Montgomery P., Pearson M., Howarth C., RA Larson L., White J., Zeng Q., Kodira C., Yandava C., Alvarado L., RA O'Leary S., Szabo L., Dean R., Schein J.; RT "The Genome Sequence of Puccinia graminis f. sp. tritici Strain CRL RT 75-36-700-3."; RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000008783} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CRL 75-36-700-3 / race SCCL RC {ECO:0000313|Proteomes:UP000008783}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS178366; EFP92508.1; -; Genomic_DNA. DR RefSeq; XP_003336927.1; XM_003336879.2. DR EnsemblFungi; EFP92508; EFP92508; PGTG_18506. DR GeneID; 10538503; -. DR KEGG; pgr:PGTG_18506; -. DR EuPathDB; FungiDB:PGTG_18506; -. DR InParanoid; E3L7I3; -. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008783; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008783}. SQ SEQUENCE 305 AA; 34102 MW; 0F4A0483AA2C80B2 CRC64; MVGAIGRAAA SLIKFFMRKW LVALGTCILL HALTNLRQNV FNFQSEVYGI NERLKLLEIE VEEKSKIIRS LMSATSNLVT KHDFSLFQAE HSTNGNSEAH RTSNGMIPNL MDASPRKNFA SFPAGANILQ SWTSPTWYRY LNSDSTFINK RSKVEGNPPI TVLISDLSLG VCWPFSGEKG QIGIQLSRVI HITGITISHV SHTVAYDIRT APNKFELWGL GLNHQENLDL LHVGTYDIKA AQNIQYFAVS SPKTSLYSKI LVKIRSNHGN HDLTCVYQIS IHGETQEHPD GIENLPKELN YIVPT // ID E3LB16_PUCGT Unreviewed; 321 AA. AC E3LB16; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 2. DT 16-SEP-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP93741.2}; GN ORFNames=PGTG_19725 {ECO:0000313|EMBL:EFP93741.2}; OS Puccinia graminis f. sp. tritici (strain CRL 75-36-700-3 / race SCCL) OS (Black stem rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Pucciniaceae; Puccinia. OX NCBI_TaxID=418459 {ECO:0000313|EMBL:EFP93741.2, ECO:0000313|Proteomes:UP000008783}; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=CRL 75-36-700-3; RG The Broad Institute Genome Sequencing Platform; RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Cuomo C., RA Jaffe D., Butler J., Alvarez P., Gnerre S., Grabherr M., Mauceli E., RA Brockman W., Young S., LaButti K., Sykes S., DeCaprio D., Crawford M., RA Koehrsen M., Engels R., Montgomery P., Pearson M., Howarth C., RA Larson L., White J., Zeng Q., Kodira C., Yandava C., Alvarado L., RA O'Leary S., Szabo L., Dean R., Schein J.; RT "The Genome Sequence of Puccinia graminis f. sp. tritici Strain CRL RT 75-36-700-3."; RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000008783} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CRL 75-36-700-3 / race SCCL RC {ECO:0000313|Proteomes:UP000008783}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS178406; EFP93741.2; -; Genomic_DNA. DR RefSeq; XP_003338160.2; XM_003338112.2. DR EnsemblFungi; EFP93741; EFP93741; PGTG_19725. DR GeneID; 10535111; -. DR KEGG; pgr:PGTG_19725; -. DR EuPathDB; FungiDB:PGTG_19725; -. DR InParanoid; E3LB16; -. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008783; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008783}. SQ SEQUENCE 321 AA; 36322 MW; 51A12044B4D9D8DA CRC64; MYTPQSNRPR QLSHDARRTN PKYDGTFGRK AAIGWKWLKK NWFLTGTCFL CQIVTTRKES SLDEKVTLLD QRLVSLEEQG LNTAADIKSL IDGMLDVVKR SEFEEVLLLT ENRWGRMSGQ AKQEGHESIV TPRKDFASFS AGASIIDTLT SPTWSNDREI QPSKFTLFKK SQKVFGSPPL TVLVSDLSLG ICWPFHGTTG QIGIRLSRTI WVEGITIGHV PRSLAYDIRT APKQFELWGL DHGNQEVEGN LLLEGTYSIH GLENVQEFLV PKTRLQLHSK VVLKIKSNHG NPDLTCIYRV QVHGKVKDNH ENYDLDHVTP T // ID E3LBM0_PUCGT Unreviewed; 254 AA. AC E3LBM0; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 2. DT 16-SEP-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP93945.2}; DE Flags: Fragment; GN ORFNames=PGTG_19875 {ECO:0000313|EMBL:EFP93945.2}; OS Puccinia graminis f. sp. tritici (strain CRL 75-36-700-3 / race SCCL) OS (Black stem rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Pucciniaceae; Puccinia. OX NCBI_TaxID=418459 {ECO:0000313|EMBL:EFP93945.2, ECO:0000313|Proteomes:UP000008783}; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=CRL 75-36-700-3; RG The Broad Institute Genome Sequencing Platform; RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Cuomo C., RA Jaffe D., Butler J., Alvarez P., Gnerre S., Grabherr M., Mauceli E., RA Brockman W., Young S., LaButti K., Sykes S., DeCaprio D., Crawford M., RA Koehrsen M., Engels R., Montgomery P., Pearson M., Howarth C., RA Larson L., White J., Zeng Q., Kodira C., Yandava C., Alvarado L., RA O'Leary S., Szabo L., Dean R., Schein J.; RT "The Genome Sequence of Puccinia graminis f. sp. tritici Strain CRL RT 75-36-700-3."; RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000008783} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CRL 75-36-700-3 / race SCCL RC {ECO:0000313|Proteomes:UP000008783}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS178421; EFP93945.2; -; Genomic_DNA. DR RefSeq; XP_003338364.2; XM_003338316.2. DR EnsemblFungi; EFP93945; EFP93945; PGTG_19875. DR GeneID; 10535572; -. DR KEGG; pgr:PGTG_19875; -. DR EuPathDB; FungiDB:PGTG_19875; -. DR InParanoid; E3LBM0; -. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008783; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008783}. FT NON_TER 254 254 {ECO:0000313|EMBL:EFP93945.2}. SQ SEQUENCE 254 AA; 28834 MW; 94804E99569C002C CRC64; MPTPQPHDAR GTNPRSVGTF GRKAAIGWKR LKQKWSVFAR KQSSFDEKVT LLDNRIQDLQ EKGSKTAEKI EGLVGLYHDL VQRSEFEKVL FQVENHSKMS SPWENPSHEF PAYEEGKRCI TQSKQDFASF SAGASIIEHL TSPTWSYYRQ IRPSQFPFAK KFQKISGSPP LTILVSDLSL GTCWPFHGTS GQIAIQLSRT IRVEGVTIGH VSQSLAYDIQ TAPKQFELWG LDHESQEVER NLLLEDCQPQ STFH // ID E3M5G8_CAERE Unreviewed; 184 AA. AC E3M5G8; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92431.1}; GN ORFNames=CRE_10907 {ECO:0000313|EMBL:EFO92431.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268425; EFO92431.1; -; Genomic_DNA. DR RefSeq; XP_003108739.1; XM_003108691.1. DR STRING; 31234.CRE10907; -. DR EnsemblMetazoa; CRE10907; CRE10907; CRE10907. DR GeneID; 9799459; -. DR CTD; 9799459; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 22 {ECO:0000256|SAM:SignalP}. FT CHAIN 23 184 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003174325. SQ SEQUENCE 184 AA; 21940 MW; C706ACAE8DCE4CF3 CRC64; MSRKRFSFIH LLVRIQFFLT QACLDYYCDN LEPLVSNCEY RATRDNKQEQ FCSIPFNQNH SSIGKIQFHF RQNHGNVMKT CAHSIRVYAE TKEIPKVEER TLKQAETCSK LTYDYHHHSW IYNIFDFKNC NAKQMVKLLD LQNPNNRLDS FSRKNADFPP ENVDFQPKNW QPGEFLEKRE GALH // ID E3M5H8_CAERE Unreviewed; 433 AA. AC E3M5H8; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92415.1}; GN ORFNames=CRE_10914 {ECO:0000313|EMBL:EFO92415.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268425; EFO92415.1; -; Genomic_DNA. DR RefSeq; XP_003108723.1; XM_003108675.1. DR STRING; 31234.CRE10914; -. DR EnsemblMetazoa; CRE10914; CRE10914; CRE10914. DR GeneID; 9799454; -. DR CTD; 9799454; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M5H8; -. DR OMA; FANMERK; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 46 65 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 386 419 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 433 AA; 49964 MW; D44F954416C9FDAB CRC64; MDKTSLGRDS ESAYGSEVYS NATFKLQIKE STTKKEIWYE WIRNRLRYYM ILELLFSICL VLILWKQYHI SSQNDKTLEL ISSIQSEFQN FKLDIESDRP SKPTYTMNLD GGNKELEEFV KEVMKDMKNP SIERNQKEYP KQVIPTQDNS SPNNSIFQIN AASIILGASV DLSRSSSSDN NPLIGRDQSG YVLIDRSDPP SDKAWCSNEE NPILTIELAK YIRPISVSYQ HSKWSGMVPD GAPRRYDVLA CLDYYCNNLE PLVSNCEYKA TRDNKQEQFC PIPFNQNHSS IGKIQFHFHQ NHGNVMKTCA HTIRVYGETK EVPKVKERTL EQAETCSKLT YDYHHHSWTY NIFDFKNCIV LYSNDCCTEC PECCDECLIK DTNSEIVGFS IFLIIMSPFI IGILLFLICL IVSPIVLILE CLFCKRDEST LEE // ID E3M5H9_CAERE Unreviewed; 408 AA. AC E3M5H9; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92298.1}; GN ORFNames=CRE_10915 {ECO:0000313|EMBL:EFO92298.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268425; EFO92298.1; -; Genomic_DNA. DR RefSeq; XP_003108606.1; XM_003108558.1. DR STRING; 31234.CRE10915; -. DR EnsemblMetazoa; CRE10915; CRE10915; CRE10915. DR GeneID; 9799836; -. DR CTD; 9799836; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M5H9; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 30 48 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 370 394 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 408 AA; 47559 MW; 8F948BFB80C730F0 CRC64; MNRLNIEEEE EENNTKSAPG MWYQWLEYRL RYYMVLEGII IIILFFSLSN YHDLASRNME LNGKLEIQID NLGKRLDEIY ELLKTNSVPK PEKNEILKNI REESIRPAEV KTNEKFIEKT NSFPITSLNY SRFEMNAANI LMGASVDLSL SSSSVSSEDG FFNNFFYPFT RDQSGYILLD REELPPNKSW CSEEKQPVLT INLAKNTEVL YVSYQHSKWN GLIPDGAPKK YNVLACLDSK CKYLEPLETN CKYEKSVNGQ DIQEQFCRIS SDSVAPPVRK VQFHFLENHG NVKKTCIYSV RVFGIRRNLF RTELKKLQDK KKCEELAWNH KHSSLVYSWQ EKNCTLLYSM ECCSDCPECC SECKMEDFNY MFFGETTLAL IFLLISILAV IWVVREMLKN LKANSVNV // ID E3M5Q5_CAERE Unreviewed; 445 AA. AC E3M5Q5; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92347.1}; GN ORFNames=CRE_11130 {ECO:0000313|EMBL:EFO92347.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268425; EFO92347.1; -; Genomic_DNA. DR RefSeq; XP_003108655.1; XM_003108607.1. DR STRING; 31234.CRE11130; -. DR EnsemblMetazoa; CRE11130; CRE11130; CRE11130. DR GeneID; 9799230; -. DR CTD; 9799230; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M5Q5; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 51 70 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 253 272 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 418 442 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 445 AA; 51201 MW; 514BE41B2DEEC994 CRC64; MDQTSFGRDC ESAYGSEVSS NATFKLQKDR FQIEESTTKK EIWYEWIRNR LRHYMILELL FSICLVLILW KQYHISSQND KTLELISSIQ SEFRNFKIDI ESNRASKPTD TMNLDGGNEK LEEFVEEVMK DIKNPSIERN QKSKEYPKQV IPTQDNSSPN NSVFEINAAS LVLGATVDSS RSSSSDNNPF FGRDQSGYVL IDRSDPPSDK AWCSNENNPI LTIDLAKYIR PISVSYQHSK WSGMVPDGAP SRYDVLVSLF IISCLFNSFL FIQACLDYYC NTLEPLVSNC EYRATRDNEQ EQFCSIPLNK NHSSIGKIQF HFRQNHGNVM KTCAHSIRVY AETKEVPKVK ERTLKQAETC SKLTYDYHLK SWTYNMVCFL NTKLFDFKNC KVLYSNDCCT ECPECCDECL IQDTNGGTVF ICLIPIMFFS MVIISIVVVW IIGPQ // ID E3M5R0_CAERE Unreviewed; 628 AA. AC E3M5R0; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92173.1}; GN ORFNames=CRE_10957 {ECO:0000313|EMBL:EFO92173.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268425; EFO92173.1; -; Genomic_DNA. DR RefSeq; XP_003108481.1; XM_003108433.1. DR STRING; 31234.CRE10957; -. DR EnsemblMetazoa; CRE10957; CRE10957; CRE10957. DR GeneID; 9798964; -. DR CTD; 9798964; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M5R0; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 54 73 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 409 435 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 628 AA; 73096 MW; 4C6A5F46B1539CCF CRC64; MDQTSFGRDC ESAYGSEVSS NATFKLQKDR FQIEESTTKK EIWYEWIRNR LRHYMILELL FSICLVLILW KQYHISSQSD KTLELISSIQ SEFRNFKIDI ESNRASKPTD PMNLDGGNKK LEEFVEEVMK DIKNPSIERN QKWKEYPKQV IPTQDNSSSN NSVFQINAAS LILGASVDSS RSSKSDNNPL FGRDQSGYVL IDRSDPPSDK AWCSNEKNPI LTIDLAKYIR PISVSYQHSK WSGMVPDGAP SRYDVLACLD YYCNTLEPLV SNCEYRATRD NEQEQFCSIP LNRNHSSIGK VQFHFRQNHG NVMKTCAHSI RVYGETKEVP KVKERTLKQA ETCSKLTYDY HHKSWTYNIV CVLNIKSYNY FNYFQIDYKN CTVLYSNDCC NECPECCDEC VIKDINSETV FFCVFFIIIS PVIIGPILFF IALIIDFGKS TKNEEESECE QEKEIPSKTP RMQIEIRLAL KLLMNQDIHQ KFVFSPVSVI LGIYPFFELT NPETRLKIAE YFLGGATEEE MSEYFIDLLS VIKASRLRGP YIPDYYYSCA PNPYYWRFGI HTLDEKTIND FKTRKLQFIE FVSEGKEDMI INSIPYNPLL DDVIHDPIFR ERTFNFTENS VQTILFME // ID E3M5R9_CAERE Unreviewed; 427 AA. AC E3M5R9; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92153.1}; GN ORFNames=CRE_10965 {ECO:0000313|EMBL:EFO92153.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268425; EFO92153.1; -; Genomic_DNA. DR RefSeq; XP_003108461.1; XM_003108413.1. DR STRING; 31234.CRE10965; -. DR EnsemblMetazoa; CRE10965; CRE10965; CRE10965. DR GeneID; 9799489; -. DR CTD; 9799489; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M5R9; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 54 73 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 400 424 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 427 AA; 49201 MW; D61B19A3B88C3B9B CRC64; MDQTSFGRDC ESAYGSEVSS NATFKLQKDR FQIEESGTKK EIWYEWIRNR LRHYMILELL FSICLVLILW KQYHISSQND KTLELISSIQ SEFRNFKLDI ESNKASKPTD TMNLNGGNKK IKEFVEEVMK DIKNPSIERN QKSKEDPKQV IQNEMNSSPN NSVFQINAAS LVLGASVDSS RSSSSDNNPF FGRDQSGYVL IDRSDPPSDK AWCSNEKNPI LTIDLAKYIR PISVSYQHSK WSGMVPDGAP RRYDVLACLD YYCNNLEPLV SNCEYRATRD NEQEQFCSIP FNQNHSSIGK IQFHFRQNHG NVMKTCAHTI RVYAETKVVP KVKGRTLEHA ETCSKLTYDY HLKSWIYKMV CFLNTKLFDF KDCKVLYSND CCAECPECCD ECLIQDIDGV MVFLCIFIAI TLIFPIAIIG NLLYERK // ID E3M6J5_CAERE Unreviewed; 507 AA. AC E3M6J5; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=CRE-SUN-1 protein {ECO:0000313|EMBL:EFO93082.1}; GN Name=Cre-sun-1 {ECO:0000313|EMBL:EFO93082.1}; GN ORFNames=CRE_10151 {ECO:0000313|EMBL:EFO93082.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO93082.1; -; Genomic_DNA. DR RefSeq; XP_003108282.1; XM_003108234.1. DR STRING; 31234.CRE10151; -. DR EnsemblMetazoa; CRE10151; CRE10151; CRE10151. DR GeneID; 9829028; -. DR CTD; 9829028; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6J5; -. DR OMA; VPNHAPK; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}. FT COILED 194 228 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 507 AA; 57871 MW; E20CE0DEFDE6066C CRC64; MALRRPVSPQ LSNRSSPPVT RSVSRNGSRQ PFETSTPLTR RSLQPGMHID TIERVFESAD DTDVDLNSSK FIYREHFTVT ERTSIQKELW YDWLVYHIRM IRRHLLPEFK TVRETMMVLL LLLMITKCRL CKQLCCLNEL FSDARDCYSE EMPKSFEQHQ PADYSSEWIA EIQKIEKSVR KLIISISSIL SVQISSLQAK IDSTESNLHQ LENHLNQFDL KTDQLINQLN DDNGWKESVM EELKRIKVWQ TESDLSMHSL KKEVEEKNCD KVIAAEEEKK PAEPTPSFPS DIQVHSSQTS RRPHVGINVA NSLIGASIDN SCSSRPVSAK DGIFYDVMSY FGSFKEGYVL LDRDVLSPGE AWCTNDDRPT LTIKLARHIV PTSVSYQHVR WSGIVPNHAP KVYDLVACMD SCCIRSEPLV TDCEYKSSND EHDEQEQFCS IPSNPNLSSV NHVQFRFREN HGNMKKTCAY LVRVYGDLAN PPNAQTPDNA TASLLEETVA DSMSESV // ID E3M6K4_CAERE Unreviewed; 411 AA. AC E3M6K4; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO93094.1}; GN ORFNames=CRE_10155 {ECO:0000313|EMBL:EFO93094.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO93094.1; -; Genomic_DNA. DR RefSeq; XP_003108294.1; XM_003108246.1. DR STRING; 31234.CRE10155; -. DR EnsemblMetazoa; CRE10155; CRE10155; CRE10155. DR GeneID; 9828877; -. DR CTD; 9828877; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6K4; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 29 48 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 382 404 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 83 117 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 411 AA; 47657 MW; 1A4D25DC9D4F728B CRC64; MIVRHMSIEE PTTKSEMWYD WLEYRIRRFM VIECVFFVMF VFLISKSFSI SDEIQRTHEM MYQMQLKIDS IENRLSSRHY MKAENIESIS EDSQQEISRL REMNEKLENE MKAASIASIM SHRQSALPSH NLNSTPPNST SCSREIFNAA SMMAGASIVD KLSSHTVSAQ TGGYIRSGEE SYVLLDRKEL PLHKAWCSDE SKPKLTINLA KYIKPISVSY QHTKWSGLIP DGAPRIYDVV NCLDNDCKKW DALVSNCEYK SSGYSIPKQE QTCLIPANRS RTSIKNVQFR FRENYGNKNR TCVYLVRVYG ERSEPPEDRK AIERKEEDRE STCSWISWQY NNFRFLYNAR NKTCPVLYEN NCCIECPECC MECTMTTSFS DAMQAILLFI IVIGTLSLSI YGCLKLCQHL Q // ID E3M6K5_CAERE Unreviewed; 444 AA. AC E3M6K5; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92982.1}; GN ORFNames=CRE_10156 {ECO:0000313|EMBL:EFO92982.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO92982.1; -; Genomic_DNA. DR RefSeq; XP_003108182.1; XM_003108134.1. DR STRING; 31234.CRE10156; -. DR EnsemblMetazoa; CRE10156; CRE10156; CRE10156. DR GeneID; 9829172; -. DR CTD; 9829172; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6K5; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 58 77 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 412 438 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 76 96 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 444 AA; 50852 MW; 74EBB56A3FBC10AE CRC64; MKYKTGAENK PFLRDVESPS DNPSNPFHFR KNEYFGNEKP IAKKESWHKV LNNRLRHYTV LEAFLFVFLV ILLFKIYSLQ SQIDTLERKL DSKKHAESHL MKTKEILEEK KVIHEIVQNV INPSSPFPKE KEGKVKLNSE FNAASLVLGA SIETRQSSHS VSPGNSYFDI VSFALGSDQS EFSLLDRVEL PVDKAWCTDD RKPVLTVNLA DYIKPISVTY QHSKWNRTVP NGAPKLYDVV VSSFRVEILS RISFFQACID GDCNQPLVSN CEYSKSGNQE QKCLISTDLP LVNKIQFRFH ENHGNLNKTC VYLIRVYGEP SGSKEVKIQV KNQKEKEETA KICSRLAWFH DNIPVFYNGL VSFSNPILRN PNNFQASKNC STLYSNNCCR ECPNCCSECQ INDSTLLNNL QFFIIFFVLF FILFPMYIAG ISACCFGLKR FFGT // ID E3M6K6_CAERE Unreviewed; 406 AA. AC E3M6K6; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO93152.1}; GN ORFNames=CRE_10157 {ECO:0000313|EMBL:EFO93152.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO93152.1; -; Genomic_DNA. DR RefSeq; XP_003108352.1; XM_003108304.1. DR STRING; 31234.CRE10157; -. DR EnsemblMetazoa; CRE10157; CRE10157; CRE10157. DR GeneID; 9828976; -. DR CTD; 9828976; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6K6; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 28 46 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 368 393 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 60 80 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 406 AA; 47365 MW; EF3C35ABEF51F8B8 CRC64; MNRLNIEEEE NNTKNAPGMW YQWLEYRLRY YMVLEGIIII ILFFSLSNYH DLASRNMELN AKLEIQIDNL DKRLDEIYEL LKTNSFPKTE RNKMLKNIRE ESIRPVEVKT SEKLIEKSNS FPITSLNYSR FEMNAANILM GASVDLGLSS SSVSSEDGFF NNFFYPFTRD QSGYILLDRE ELPPNKSWCS DEEKPVLTID LVKNTEIMYV SYQHSKWNGV FPDGAPKKYN VLACLDSKCE HLEPLATNCE YEKSVNGQYI QEQMCRISSD SVAPPVRKVQ FHFLENHGNV EKTCIYSIRV FGIRRNLFRT EQKKLEDKKK CEELAWNHKH SSLVYSWQEK NCTLLYSMEC CSDCPECCSE CKMKDFNYVF VGNILLGLFF IFVFVCWIIC AVAECKKNIK ADSVNA // ID E3M6K7_CAERE Unreviewed; 443 AA. AC E3M6K7; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO93229.1}; GN ORFNames=CRE_10158 {ECO:0000313|EMBL:EFO93229.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO93229.1; -; Genomic_DNA. DR RefSeq; XP_003108429.1; XM_003108381.1. DR STRING; 31234.CRE10158; -. DR EnsemblMetazoa; CRE10158; CRE10158; CRE10158. DR GeneID; 9798802; -. DR CTD; 9798802; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6K7; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 414 439 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 443 AA; 50938 MW; EFACED8E679A8F87 CRC64; MVQTSFGRDS ESAYGSEVSS NATFKLQKDR FQIEESTAIK EIWYEWIRNK LRHYMILELL FSICLVLILW KQYHISSQND ITLELISSIQ SEFQNFKIDI ESNTASKPTD TMNLDEGNKK LEELVEEVMK DIKNPSVEIN QKSKEYPKQF IPAQDNSSPN NSVFQINAAS LFLGATVDSS RSSNSDNNPL FGRDQSGYVL IDRSDPPSDK AWCSNEENPI LTIDLAKYIR PISVSYQHSK WSGMVPDGAP SRYDVLACLD YYCNNLEPLV SNCEYKATRD NKQEQFCSIP FNKNHSSIGK IQFHFRQNHG NVIKTCAHSI RVYGETKEVP KVKERTLKQA ETCSELTYDY HHKSWTYNTV CFLNIKSYNN LNYFQIDRKN CTVLYSNGCC TECPECCDEC VIKDINSETI NSCILLIFIS FVLITIFLLP IALIIECLFC KRK // ID E3M6K8_CAERE Unreviewed; 461 AA. AC E3M6K8; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO93155.1}; GN ORFNames=CRE_09970 {ECO:0000313|EMBL:EFO93155.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO93155.1; -; Genomic_DNA. DR RefSeq; XP_003108355.1; XM_003108307.1. DR STRING; 31234.CRE09970; -. DR EnsemblMetazoa; CRE09970; CRE09970; CRE09970. DR GeneID; 9798960; -. DR CTD; 9798960; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6K8; -. DR OMA; WETIAEN; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 51 70 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 431 455 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 76 96 {ECO:0000256|SAM:Coils}. FT COILED 336 380 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 461 AA; 53218 MW; 06FC90F5DCFB6B65 CRC64; MDIKPKEQTN LVGESMSSIN ISMNKGRFSP NKANSSSGLW SQWIRYQVKH YMILEGLFLI SILFLLINSY NVSTQNHKTN EMISNLQNRV AFLENQLNIS TNFETFNEIY PEKEEEKPIE KKVVIEDIEE PETANESISV QENLSTSTPI PVISNDSVPF NAADIILGAS IDYDQSSQVI STREGFLGDV ENFFGTVQSD YVLLDRHELP LNKAWCSLEK YPILTVNLAK PIRLNSVSYQ HSKWNGTIPV DAPKLYEIMV SCLACLNSNC EKWELVASNC EYKMTDEENQ EQNCTIVEKF NWYPINKIRI RFVENQGNVN KTCAYLVRVY GEPIEFEKEE EKEEKEKDSK SQMSEEERQI KQLEKIMNEK KKKKEKEDAI LQHCTQLKWF HDNARVLYNA KTEKNCVPLY SKNCCSVCPE CCLECEMSLG LYNTLLALSI LFGPTLIIIG LYFVLRSFQY M // ID E3M6K9_CAERE Unreviewed; 569 AA. AC E3M6K9; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO93056.1}; GN ORFNames=CRE_09971 {ECO:0000313|EMBL:EFO93056.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO93056.1; -; Genomic_DNA. DR RefSeq; XP_003108256.1; XM_003108208.1. DR STRING; 31234.CRE09971; -. DR EnsemblMetazoa; CRE09971; CRE09971; CRE09971. DR GeneID; 9798805; -. DR CTD; 9798805; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6K9; -. DR OMA; ENEMERT; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 536 561 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 569 AA; 65480 MW; 9BFB9B6BAE787934 CRC64; MRDHILLRGN SESAYGSEES SNATFNLQKD RFQIEESTSK QEIWYEWIKQ RVRRHMILEL FVLICIVLVI SKLHQSLSQN ERNHEFVRRN RAIYDLFFIF QISNIQSELK TFKLDIESKI QSNREDEKYD EEVIEDFESS SAGKINKLKK IPEHPFRDQK NSLREFPGNQ MNAASLILGA TVDSSRSSSS DNNPFFGRDQ SGYVLIDRFD PPSDKAWCSN EENPILTIDL AKYIRPISVS YQHSKWSGIV PDGAPRRYDV FKLEELVEEV IKDIKNPSIE INQKSKEYPK QVIPNEVNSS PNNSVFQINA ASLILGATVD SSRSSNSDNN PLIGRDQSGY VLIDRSDPPS DKAWCSNEEN PILTIDLAKY IRPISVSYQH SKWSGIVPDG APNRYDVLAC LDYYCNNLEP LVSNCEYKAT SDNKQEQFCS IPFNKNHSSI GKIQFHFRQN HGNVMKTCAH SIRVYGETKE VPKVKERTLK QAETCSELTY DYHHKSWTYN MFDYKNCTVL YSNDCCTECP ECCNECLIQD TNIETIVFSI LLIIFSVILI TVIFTLIAII IEISFSKRK // ID E3M6L0_CAERE Unreviewed; 406 AA. AC E3M6L0; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92961.1}; GN ORFNames=CRE_09972 {ECO:0000313|EMBL:EFO92961.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO92961.1; -; Genomic_DNA. DR RefSeq; XP_003108161.1; XM_003108113.1. DR STRING; 31234.CRE09972; -. DR EnsemblMetazoa; CRE09972; CRE09972; CRE09972. DR GeneID; 9829132; -. DR CTD; 9829132; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6L0; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 28 46 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 369 393 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 60 80 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 406 AA; 47382 MW; 616B252CD2550233 CRC64; MNRLNIEEEE NNTKNAPGMW YQWLEYRLRY YMVLEGIIII ILFFSLSNYH DLSSRNMELN AKLEIQIDNL DKRLDEIYDL LKTNSVPKPE KNEMLKNIRE ESIRPVEVKT SEKLIEKSNS FPINSLNYSR FEMNAANILM GASVDLGLSS SSVSSEDGFF NNFFYPFTRD QSGYILLDRE ELPPNKSWCS DEEKPVLTID LVKNTEILYV SYQHSKWNGV IPDGTPKTYN VLACLDSKCE HLEPLATNCN YERSVNGQDF QEQMCRISSD SVAPPVRKVQ FHFLENHGNV EKTCIYSIRV FGIRRNLFKT EQKKLEDKKK CEELAWNHKH SSFVYSWQEK NCTLLYSMEC CSDCPECCSE CKMKDFNSFY FAQIVFGLLF ILFVVIFIIA VVVECLRMIK RNSVNA // ID E3M6L1_CAERE Unreviewed; 414 AA. AC E3M6L1; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92878.1}; GN ORFNames=CRE_09973 {ECO:0000313|EMBL:EFO92878.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO92878.1; -; Genomic_DNA. DR RefSeq; XP_003108078.1; XM_003108030.1. DR STRING; 31234.CRE09973; -. DR EnsemblMetazoa; CRE09973; CRE09973; CRE09973. DR GeneID; 9828873; -. DR CTD; 9828873; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6L1; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}. FT COILED 118 145 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 414 AA; 47596 MW; 823C7C1225D02D75 CRC64; MDKTSFGRDS ESAYGSEVSS NATFKLQKDR FQIEESTSKN EIWQEWIRNR LRHYMILELL FSICLVLILW KQDHISSQND KTIELISSIQ SEFRNFKLDI ESNRASKLTD TMNLDEGNKK LEELVEEVMK DIKNTSIEIN QKSKEYRKQF IPAQDNSSPN NSSLRMNAAS LILGATVDSS RSSNSDNNPL FGRDQSGYVL IDRSDPPSDK AWCSNEENPI LTIDLAKYIR PISVSYQHSK WSGIVPDGAP SRYDVLACLD YYCNNLEPLV SNCEYRATRD HEQEQFCSIP FNQNHSSIGK VQFHFRQNYE NVMKTCVHTI RVYGETKEVP KVKERNLKQA ETCSELTYDF HHHSWTYNFV CVLSIKPAII EIISSLTTRI ARFFTRMTAV LNARNAVPNV SFKTPTGVQS AIVF // ID E3M6P7_CAERE Unreviewed; 683 AA. AC E3M6P7; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO93052.1}; GN ORFNames=CRE_10173 {ECO:0000313|EMBL:EFO93052.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO93052.1; -; Genomic_DNA. DR RefSeq; XP_003108252.1; XM_003108204.1. DR STRING; 31234.CRE10173; -. DR EnsemblMetazoa; CRE10173; CRE10173; CRE10173. DR GeneID; 9829196; -. DR CTD; 9829196; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6P7; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 29 48 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 90 117 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 683 AA; 78909 MW; 02A54F399D3A1E31 CRC64; MIVRHMAIEE PTTKSEMWYD WLEYRIRRFM VIECVFFVMF VFLISKSFSI SDEIQRTHEM MYQMQLKIDR IENRLSSRHY MKTENTESIS GESQQEISRL REINEKLENE MKAASIASIM SHRQSALPSR NLNSIPPNST TSSREIFNAA SMVAGASIVD KLSSHTVSAQ TGGYIRSGEE TYVLLDRKKL PLYKAWCSDQ SKPRLTINLA KYVKPISVSY QHTKWNGLIP DDAPRIYDVV NCLDNNCKKW DVLVSNCEYK SSGYSISKQE QTCLIQSNRS MMPVNTVQFR FRENYGNKNR TCAYSVRVYG ERSEPPEDRK AIERKEEERE STCSWISWQY NNFRILYNAS RKRLLEIAKA YGIEGYDGFC TSFSIESLDF DGMECHMSNV TNPKEPSGMV PEDALENLLK LCCRTFFTFW TKSTLFDWVS DEWAFSHHHS RLTHGGYVKI LVNLERLMNG VCKMKEKDVN LFNLTTMENV ANNEFLNLEE TERSDTVEKE ENIGMVEDER DEPMIEATGE KDQVEDGNGE FSIVNVNRAA SYSAEKVLDD LLAISLESTT CVRLPMKRFG HVLTFSIRSP MDAKILNECI GSEKIWYTRF NGIFMMSGGI RSKKLINRFQ ASVWTPITVE EAVQGVLATS IKKFDDSYQL PENVKDLSGE KLKRYIMIST SYHDIILDLK KIL // ID E3M6S8_CAERE Unreviewed; 440 AA. AC E3M6S8; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO93136.1}; GN ORFNames=CRE_10011 {ECO:0000313|EMBL:EFO93136.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO93136.1; -; Genomic_DNA. DR RefSeq; XP_003108336.1; XM_003108288.1. DR STRING; 31234.CRE10011; -. DR EnsemblMetazoa; CRE10011; CRE10011; CRE10011. DR GeneID; 9829134; -. DR CTD; 9829134; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6S8; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 59 81 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 407 433 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 440 AA; 50301 MW; B64AA67EDB53F954 CRC64; MFLFRYPLNR RTSFMAACAD EIDAELAYGS PVRSKKVLKR QNFWRQSERK QGSRKGCRTI FDTCVVILIL ATISISALLG FQSFKTQKLV SNIQKYINIL EQQLNVRNNS TTETENNLME RAIQSQDNGK SVEKNKKVMM LSTEEIVIER EHQAGSISKT YLNGGIQINS TFGINAANVL LHASIDYELS SKEVSFEDGF IANSFLGSDF GGYVLLDRIE LPLNKAWCTN ELQPVLTINL ASYTNLSAVS YQHTRWNGSV PDGSPKSYDV MSCLDENCRK MERVMSNCVY KNLEDKQEQV CVIPTSLRLS PTKKVQFRIR ENHGNTRKTC VYLVRVYGHI EEKLLTGNQK MKRDHVHTCL SITNRYHNYR ILFDIFDGYC ISLFSKGCCK QCPECCTQCR INPSFSAVVY GIILVLITPL VIISPFFIIL LLVKMYIRFF // ID E3M6T0_CAERE Unreviewed; 467 AA. AC E3M6T0; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92941.1}; GN ORFNames=CRE_10013 {ECO:0000313|EMBL:EFO92941.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268426; EFO92941.1; -; Genomic_DNA. DR RefSeq; XP_003108141.1; XM_003108093.1. DR STRING; 31234.CRE10013; -. DR EnsemblMetazoa; CRE10013; CRE10013; CRE10013. DR GeneID; 9798952; -. DR CTD; 9798952; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3M6T0; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 48 66 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 432 454 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 467 AA; 53703 MW; 8C0EF6B233AFAECF CRC64; MKIKIKDRDV ESAKDSELAD TLIVSEKNYS VKRATRFYAF PLLELQKWFI GTILLFIAII LFLNTWELTT LHSDIQEFSY NIESRLNKME SRAEFISMLK SENEEDKEVE NPIFFHIETR LIQHDSGIQK NGEEEKEIGK NSFEDYFGFE NGTDSEEGDY YDDRIIFHKV LPSESEEETM NTSTNSTNRI NAANALFGAF IDERLSSPPV SPGDGFMDKV WDFFGAVDGG YVLLDREELP VNKSWCSDEE DSILTIQLSQ DISPISISYQ HSKWNGTVPN GAPKSYFVMG CLDTQCESRV VLGPRCEYKS DNQSTQEQEC QVKPQWRVSH IKAVQIQIRE NHGNVEKTCA YLFRVYGISD STQKELKPVS RIQDISIRDE MCSYAASEYY SLPSFFYNAM NFNCTKLYSN DCCSYCPECC TECNMSLTNE SVFVFAVIIF GFFGFVLLME FLLIRAAKFL WVSEDSH // ID E3M739_CAERE Unreviewed; 2775 AA. AC E3M739; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=CRE-HECD-1 protein {ECO:0000313|EMBL:EFO93814.1}; GN Name=Cre-hecd-1 {ECO:0000313|EMBL:EFO93814.1}; GN ORFNames=CRE_12651 {ECO:0000313|EMBL:EFO93814.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains 2 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268427; EFO93814.1; -; Genomic_DNA. DR RefSeq; XP_003107915.1; XM_003107867.1. DR ProteinModelPortal; E3M739; -. DR STRING; 31234.CRE12651; -. DR EnsemblMetazoa; CRE12651; CRE12651; CRE12651. DR GeneID; 9828540; -. DR CTD; 9828540; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; E3M739; -. DR OMA; NRQCIEG; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 2. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 2. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 2. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 1. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2775 AA; 306472 MW; 16DF7BBF70F025BA CRC64; MDGIDPETLL EWLQTGIGDE RDLQLMALEQ LCMLLLMADN IDRCFESCPP RTFIPALCKI FIDETAPDNV LEVTARAITY YLDVSNECTR RITQVEGAVK AICARLAAAE MSDRSSKDLA EQCVKLLEHV CQRETMAVYD AGGINAMLNL VRVHGAQVHK DTMHSAMSVV TRLCGKMEPT DLELAKCAES LGALLEHDDP KVSESALRCF AALTDRFVRK MMDPAELALH SNLVEHLISI MVSSNDENSP ATASANILSI VLSLIGNLCR GSSLITEKVL TSPNMIKGLR ATLTNKEERV VTDGLRLCDL LLVLLCEGRS ALPLTCAVSG DYAAGSGAER VHRQLIDAIR QKDLTALVEA IESGQVDVNF ADDVGQSLTN WASAFGSIEM VQYLCDKGAD VNKGHKSSSL HYAACFGRPD VSYDTDICCA WWFKLPHKIN AVKNVVTYSF SQHLLSQVVK LLLQRGANPD LRDEDGKTAL DKARERSDDD HNQVANILES PSAFMRNKEE QKTKASTSQQ PGTSAKPELP NPQLVRKVLH QLLPIFCEIF QRSLNGTVRR TSLSLMRKIV ENIGDLRQSA ASEDGVPAVS TNSARKMSAD VSAGAESLVA VVVSVMDQEE DHEGHEQVLL ILESLLEKDA ELWVTELVRL GVFERVEAMA KEPPKGLEEV LNAIRLEGRS RVTPMEIDFQ SQQPSSSATT SNDIMDTTTA TVPSTDNTEG EATQAPPAVE VRYAFFQIAD PEPPTPSTSQ QAAPKARSTA SSSASSAILQ VVSKLSGVAS LDKSAADKKP SKMILNQGTP YRWKEWRIVR GPTSLFIWSD VLLIELPFQS NGWFRYLADN DSHVQFVTGT ANVDQQMTEE EKDNFQKTER REMVSRWNAV KGVFDDDWNA VQVSVLQVPC SLKKLEVPAW ELWSTKVSEL QIKSVSSSTP CGQTNTMITT IKVQDDAGGF LFETGTGRKT NVMPEHALPP DFHTGWSLHG VTTRKMKFRQ DIQKRKVQEL AWKLWNDHLK EAHAKPREAL VRLENAAHAI EGAVRLMKTQ NNKHRSAKHA RIERVQEYTK AIKTVHESII DDRRLSTFEF SVSGIVPALY ALLSSMDKYP DCFTTKIFME VFAAGEALSQ LALKMVAVLE ANEKFPQYLY DSPGGSSFGL QLLSRRVRTK LEMIPRADGK ENNDENLVNK TGKTIKCEPL ASVGSIRTYL MKLVARQWHD RDRANYKYVK EIQDLKAKGQ SVELRHTGDF DENGVIYWIG TNGKTAPSWT NPATIKAVKI TCSDPRQPFG KPEDLLSRDQ NPINCHTSDD KNSHFTIDLG LFVIPTSYTL RHARGYGRSA LRNWALQGSN DTKSWDILIT HTDDKSLGDP GSTATWHLEK GTASYRYIRI AQNGKNSSGQ THYLSCSGFE IYGDIVDVVK EAICEDLPKK ESVAGSSGAS SSMSSLTKEQ VLEMLPAHDN NNRLKSGLSL DTVTAMMQRS RHRLRGTFKI SDSKSKVVRG KDWRWEEQDG GEGKFVSSLE YHTVTKRKFQ GRITSPPDNG WVDVTWENGY SNSYRFGANG NFDIERVNSS GHRYTMPSMH SSVPSSVMDA VRRNRAFYTP KTTGPPPSNF GASSSAGSSR GGENSSSSSS PFPNLPVPPW RSSKSSTSPA IASRLINSVT SSGASPPPPP SSSLSTFSSL ASGLGFGLNR HKQHNKPGPS TLSRFSSVKN PAPTGTPTSG VSSGGAIGKK SMSTTNLVDD RQKSSGPSVA STGQAASAES LQHQTPSLEN LLARAMPHTF GRIAENQEQE DEPMGGEESD SAASMRSAAS SNSQISMDSS QQPQQQQPDS ETTPRESAGT PSTPRDEKNQ TLSVSAPDLA AARQRQASAE VEGGDDLDET NSEDKTVGGE DAMEEDDEEE ETMEDEEDDD DDDDDESSNE NQEKLVELLA GERGLFDKLK EVITGESLSD ASSSAKDGNT NEAQKKGGSK KPKKWFKKMS SYTDVLKGLM QSRYPVSLLD PAAAGIEMDE MMDDDEYYDF SEEGADDGDS VEDEVAAHLG MPPESFASMV AARTPITWRQ FSELMSGSNR ERAAMARAVA SSRGSPWDDE SIVKCSFEAL IPAFDPRPGR SNVNQTLEVE LPQVVNEFGS SKSSSSAKKD KGDQVRFFLR GPNMTGVDNI TVEMDDDSSS LFRYMQIINN NANWATKSDR GRRIWEPTYF ISYCSADQTN SEVSKIPDEE SSTPAQVNQC LETIGLLSRI QESLPLAEIS PSVFISDKLT LKVTQVLSDA LVVAARALPE WCSRLVYKYP CLFTVETRNM YMQATAFGVS RTIVWLQQRR DAAVERARGS AQAGNSSAAR QHDRYHEYRV GRLRHERVKV TRAEDTLLDQ AIRLMKFHAD RKAVLEIEYT NEEGTGLGPT LEFYALVAAE LQRKSLALWV CDDDDTHASK SGEEREVDLG EGKKPAGYYV RRMGGLFPAP LPPGSEEAKK AADMFRVLGV FLAKVLLDGR LVDLPLSRPF LKLLVSPQVG DDAHGPNLHR VLTIDDFEEV NPAKGGFLKE LLALVQRKRL IENDNNIDQS AKRRKIAELK LHIKGSTCKV EDLALNFTVN PPSKVFQYAE MELVSGGSEI DVTLDNVEQY IEKCEQFYLN TGIAYQMRAF REGFDRVFPL STLRAYSPEE VQRLLSGEQC PEWSRDDILN FTEPKLGYTR ESPGFLRFVD VMEALTAQER KNFLQFATGC SSLPPGGLAN LHPRLTIVRK VESGDGSYPS VNTCVHYLKL PEYSSAEILR ERLLTAINEK GFHLN // ID E3ME00_CAERE Unreviewed; 1109 AA. AC E3ME00; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=CRE-UNC-84 protein {ECO:0000313|EMBL:EFO99395.1}; GN Name=Cre-unc-84 {ECO:0000313|EMBL:EFO99395.1}; GN ORFNames=CRE_22298 {ECO:0000313|EMBL:EFO99395.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268438; EFO99395.1; -; Genomic_DNA. DR RefSeq; XP_003105477.1; XM_003105429.1. DR STRING; 31234.CRE22298; -. DR EnsemblMetazoa; CRE22298; CRE22298; CRE22298. DR GeneID; 9817078; -. DR CTD; 9817078; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3ME00; -. DR OMA; WKSEFAS; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 104 123 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 129 148 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 381 403 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 441 463 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 502 522 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 1109 AA; 125550 MW; A4F066C30EB94617 CRC64; MPPSIDNDFD THEWKSEFAS TQSGRSSPNI FAKVRRKLLL TPPVRNARSP RLTEEELDAL TGDLKIYDPS LPDHWEVPNL AGGTSPGTLA QQEHYSAASL SRQLLYILRF PVYLILHVIT YILEAFYHVI KIATFTLWDY FLYLIKIVKL RYARYQENRY RTALIRNRQD PFRVKVANFF YRFCEIIWLI VTTPYRMLTN GNGGVGQYDY KSIKNQLETE RASRVTTRSQ ALEKSRTFAG LSRSPARRVT PITTTSTITR ITARVFSSSP FGAGESSSGT GTPTVITTKT IKERSVTPRF KSTRGILKVD GLQKAFDTPE IDTPLSQYGL RSRASHVHTP EPTFDIGDLA ATSTPLVPRG INKLDYIWES RDEERSTLQT LLSWIGYIIL FPFYAARHIW YTIFDYGKSA YMKVTNYQQM PMEAIHVRDI DEPAPSYVDN AGVLTTSWSA SIYNFFASFF SAIKESHQIV FAMLTGAVQD TTSYVGGLFS GLTNKNSSKF NWCQILGLLL ALLLALFLFG FLTSDNTAIR VQKLEKEAND SKSPDGELPA VPVWLNGVNH AKHYMWMAKE YVYDMAFDSY NVIKPIVGRT ATAPKYAWGL LASGCGAVTK FLGSVVTGAE RFAGSLWYFL TGNFASAYES IGGFANGVYN STSNGIGWIA KNTKNLVVNG ISGIYNFFSW MFTRLLNFST NSQTAVVSAF KSARDGSANF FYNYIYTPIA GCFTYLTGNY QNLLKPVWSA LRWTYDSTVF VIQKIVEWAC FLVTYPIGLI TRGWVKISQY APEDVVQVIP IPQAVTPTPE IEITKEQQEV KILKKKPEVE DEEQELVIIP APAPEPIPLP VPPRDPVVIH QTNVVETVDK EAIIKEVSDK LRAELSAQLS AQFQQDLTEK IEQNYNTIIN KLKVENNNMQ YDNNHLEAII RQLIYEYDTD KTGQVDYALE SSGGAVISTR CSETYKSYTR LEKFWNIPIY YHEYSPRVVI QRNSKSLFPG ECWCFKDGRG YIAVELSHYI DVSSISYEHI GKEVAPEGNR SSAPRGVLVW AYKQIDDLES RVLIGDYTYD LDGPPLQFFL AKHKPNFPVK FVELEVTSNY GAPFTCLYRL RVHGKMLKV // ID E3MGH5_CAERE Unreviewed; 844 AA. AC E3MGH5; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP01419.1}; GN ORFNames=CRE_23924 {ECO:0000313|EMBL:EFP01419.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268443; EFP01419.1; -; Genomic_DNA. DR RefSeq; XP_003104768.1; XM_003104720.1. DR STRING; 31234.CRE23924; -. DR EnsemblMetazoa; CRE23924; CRE23924; CRE23924. DR GeneID; 9809127; -. DR CTD; 9809127; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; E3MGH5; -. DR OMA; ERCEETQ; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 844 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003176850. FT TRANSMEM 688 710 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 844 AA; 95321 MW; 08BDB25C143B5DB4 CRC64; MKLKQLLIFC VLLVLPNIHA NQDVSFVKHW KDILLTDGDN GSMCYLSVDE CTRAAPYNIT KKVIKTSGNT PSKAIHFQII FSVNASEKES VPEKSIESFD EWTKKRRDAV ANQNGQHQKT VEPTPGTTIR HDEVVISLPS ISRPARNFAS RECGAKVIAA NLEAENAKAV VNEKDVDDYM RNPCQSAKEK FIVIELCEAI QIKKFAIGNF ELFASRPKTV HVFISERYPP LTSWVSLGTF NLQDHHKQLQ TFEVPNTNIY AKYIRINLED HYGKEHYCIV SVVNVMGSTL ADEYDKEEAA AHLLNVIDEK KDEPVTTPPP SEQKVQTQLP VPPKSSNQNN ASGVKAFNFR QLKSICSQCS AGKVSYLICH LLPRQSKPIK LNPTPKPFSA KPPVTENKNL TVELGLWAER SRQSNFEQSR RRNMATIQRL LEKKNALEPE TFPPSTMTFT EPSLSKIENG EKPAEKVAVP EEAKSSSQPA VQPPFQEQPP PKSKTEHILP AGGSTSQREM VLMKLSKRIA AVEMNLTLST EYLSELSKQY VSQMSGYQQE LKETRKASRK SSQTVEAVMH SKINNVKREL RDLRHSVYLL QQLENNRYKN AQNEMSRNVF MSSCHISSNV PPSPTLARLP LVIPSINSKF ENFTNFEERV KKIYQTAKSV MFGSITWNVS RWLKTTEVLR KKIQTDHLIV ALISFNVLAL SFLFAGVFYI HRRNKERCEE TQVIVKNELR ARIAKIGADN RKMISKGMRR AELAVTAAVS SALKVEKTSS NRKTMTELET ALANLFAAQQ IRIEEQFEQN QKILRDALTT GQRSSADDTL SMEGSESSSE TEQSKEDTPT FNQD // ID E3N2H8_CAERE Unreviewed; 1074 AA. AC E3N2H8; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84203.1}; GN ORFNames=CRE_16277 {ECO:0000313|EMBL:EFO84203.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268513; EFO84203.1; -; Genomic_DNA. DR RefSeq; XP_003097402.1; XM_003097354.1. DR STRING; 31234.CRE16277; -. DR EnsemblMetazoa; CRE16277; CRE16277; CRE16277. DR GeneID; 9806997; -. DR CTD; 9806997; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3N2H8; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 2. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 43 62 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 377 402 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 1030 1055 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 1074 AA; 123960 MW; BEB6F122B6B33A9B CRC64; MVQASFGKDL ESANGSKESS NAIEESTTKK EIWYEWIRNR LRYYMILELL FSICLVLILW KLCHISSQIS SIHSEFRDFK LDIESNRVSK PTDTINLNGG NKKIEEFVEQ VMKDIKNPSK EYPKQVILTR DNSSTNNSVF QINAASLVLG ASVDSSRSSN SDNNPLFGRD QSGYVLIDRS DPPSDKAWCS NEKNPILTID LAKYIRPISV SYQHSKWSGI VSDGAPRRYD VLACLDYYCN NLEPLVLNCE YRPTGDSKQE QFCSIPFNRN HSSIGKVQFH FRQNHGNVMK TCAHTIRVYG ETKEVPKVKE RTLKRAETCS KLTYDYHHNP WIYNMVCFLN IKLFDYKNCT LLYSNDCCTE CPECCDECVI KDTNFDTVAF CFVFMMLVPI FILLIVILPI FLTDYPTADI RLQFIICCDE GEVEPVLRTI HNFFLSWIGS NIKYELNSCY YIPRMTNITS SQIWLLDSKP AALQLTSFLS YSPVPEYLCL IEWRDILLAN FKGRELYVYK GELNDSTVIQ FLNDSKSSHG YLNLRVVHII VNESCELHHD IIISQCNFKT FDSMENLPIF HYKQRNNIHP IVFHSLKFSS QYYIVWESDG YVASFKVQSN SIFFTAWNMN EKDFLEHHVA NEYLLKISIK MVQASFGKDL ESANGSKESS NAKLPKDGFQ IEEPKTKKEM WYEWIRNRLR HYLILELVFS ICLVLILWKL CHISSQNDKT IELISSIHSE LRYLKLDIES NRASTPTDTV NLDGGNNKLE EFVEEVIKDI KNPTKEYPKQ IIPTEDNSST NNSVVQINAA SLILGATVDS SRSSNSDNNP LFGRDQSGYV LIDRSDPPSD KAWCSNEKNP ILTIDLAKYI RPISVSYQHS KWSGMVPDGA PSRYDVLACL DYYCNSLEPL VSNCEYRATG DNKQEQLCSI PFNRNHSSIG KVQFHFRRNH GNVIKTCAHT IRVYGETKEE VLKVKEMTLK RAETCSKLTY DYHHDPWTYN MFDSKNCKVL YSNECCTECP ECCDECDIKD INKETIVICY LLIIIFPSLI VILFLSITLI ILRLLSKRKL SKFRRVITQK NEYC // ID E3N2H9_CAERE Unreviewed; 370 AA. AC E3N2H9; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84178.1}; GN ORFNames=CRE_16246 {ECO:0000313|EMBL:EFO84178.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268513; EFO84178.1; -; Genomic_DNA. DR RefSeq; XP_003097377.1; XM_003097329.1. DR STRING; 31234.CRE16246; -. DR EnsemblMetazoa; CRE16246; CRE16246; CRE16246. DR GeneID; 9807029; -. DR CTD; 9807029; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3N2H9; -. DR OMA; RESEKIC; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 51 70 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 370 AA; 42407 MW; 5E3029375314CF59 CRC64; MVQASFGKDI ESANGSKESS NAKLQKDRFQ IEESTTKKEI WYKWIRSRLR YYMILELLFS ICLVLILWKL CYISSQISSI HSEFRDFRLD IESNRASKPT DTINLDGGNK KIEKFVEQVM KDIKNPSKEY PKQVILTRDN SSSNNSVSQI NAASLILGAK VDSSRSSNSD NNPLFGRDQS GYVLIDRSDP PSDKAWCSNE KNPILTIDLA KYIRPISVSY QHSKWSGIVS DGAPRRYDVL ACLDYYCNSL EPLVTNCEYR ATRDNKQEQF CSIPFDSNHS SIGKVQFHFR RNHGNVIKTC AHTKARRVGT RLLKLLRHQK QILKRCVMHP RVLNAGRHWV GRNDSSNRLV KLSVTVSVGK LSNVFRLHLE // ID E3N2I0_CAERE Unreviewed; 430 AA. AC E3N2I0; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84192.1}; GN ORFNames=CRE_16278 {ECO:0000313|EMBL:EFO84192.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268513; EFO84192.1; -; Genomic_DNA. DR RefSeq; XP_003097391.1; XM_003097343.1. DR STRING; 31234.CRE16278; -. DR EnsemblMetazoa; CRE16278; CRE16278; CRE16278. DR GeneID; 9807005; -. DR CTD; 9807005; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3N2I0; -. DR OMA; CKECHIS; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 389 422 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 430 AA; 49259 MW; C630A7EF81D6FBF3 CRC64; MVQASFGKDL ESANGFKESS NTKLQKYGFQ IEESTTKKEI WYEGIRNRLR HYMILELLFS ICLVLILWKL CHISSQNDKT LELISSINSE LRYLKLDIES NRASKPTMNS DGENKKLEEF VEEVIKDIKH PSIERNQKSK EYPKQVIPNE VNSSPNNSVF QINAASLILG ATVDSSRSSN SDNNPLFGRD QSGYVLIDRS DPPSDKAWCS NEKNPILTID LAKYIRPISV SYQHSKWSGM VPDGAPSRYD VLACLDYYCN NLEPLVTNCE YRATGDNKQE QFCSIPFNRN HSSIGKVQFH FRQNHGNVMK TCAHTIRVYG ETKEVSKVKE RTLKQAETCS KLTYDYHHKS WTYNMIDFKN CTELYSNDCC TECPECCNEC VIEDINSDTI FICFLLILCS PLFIGYLLIL ILLIGIPIAL IIECLFSKRK // ID E3N2I6_CAERE Unreviewed; 431 AA. AC E3N2I6; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84197.1}; GN ORFNames=CRE_16281 {ECO:0000313|EMBL:EFO84197.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268513; EFO84197.1; -; Genomic_DNA. DR RefSeq; XP_003097396.1; XM_003097348.1. DR STRING; 31234.CRE16281; -. DR EnsemblMetazoa; CRE16281; CRE16281; CRE16281. DR GeneID; 9806987; -. DR CTD; 9806987; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3N2I6; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 43 62 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 377 410 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 431 AA; 48989 MW; 0BE5A7F21D7F5AD5 CRC64; MVQASFGKDL ESANGSKESS NAIEESTTKK EIWYEWIRNR LRYYMILELL FSICLVLILW KLCHISSQIS SIHSEFQYFK LDIESNRVSK PTDTINLDGG NKKLEEFVEQ VIKDIKNPSI ENNEKSKEYP KQTTPTKDNS SSNNSVFHIN AASLILGATV DSSRSSNSDN NPSIGRDQSG YVLIDRSDPP SDKAWCSNEE NPILTIDLAK YITPISVSYQ HSKWSGIVSD GAPRRYDVLA CLDYYCNSLE PLVSNCEYRA TGDNKQEQSC SIPFNQNHSS IGKVQFHFRQ NHGNVMKTCA HTIRVYGETK EVPKVKEMTL KQAETCSELT YDYHHNPWTY NIFDYKNCTV LYSNDCCTEC PECCDECVIE DINTGTFAFW FGFMIVVPIL IFAIGFIFIT TVLLFIAAGI GLFEIAKLQI GCLFRKRKQS T // ID E3N2I7_CAERE Unreviewed; 423 AA. AC E3N2I7; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84193.1}; GN ORFNames=CRE_16250 {ECO:0000313|EMBL:EFO84193.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268513; EFO84193.1; -; Genomic_DNA. DR RefSeq; XP_003097392.1; XM_003097344.1. DR STRING; 31234.CRE16250; -. DR EnsemblMetazoa; CRE16250; CRE16250; CRE16250. DR GeneID; 9807024; -. DR CTD; 9807024; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3N2I7; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 370 391 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 423 AA; 48636 MW; B3EDD5BB3CDC9429 CRC64; MVPQQLEWSL LSLFHSLVAN EYLLTISIKM VQASFGKDIE SANGSKESTN AKLQKDRFQI EESKTKKEIW FEWIRNRLRH YLILEFLFSI CLVLILWKLC YISSQISSIH SEFRNFKLDI ESNRASKPTD TINLDGGNKK IEEFVEQVIK DIKNPSIENN QKSKEYPKQT TPTKDNSSSN NSVSQINAAS LILGATVDSS RSSKSDNNPL FGRDQSGYVL IDRSDPPSDK AWCSNEENPI LTIDLAKYIT PISVSYQHSK WSGIVSDGAP RRYDVLACLD YYCNNLEPLV SNCEYRATRD NKQEQFCSIP FNRNHSSIGK VQFHFRQNHG NVMKTCAHTI RVYGETKVKE MTRKQATCSE LTYDYHHNPW IYKIVCFLNI KPAIIYIIFS WTTRIARYFT RTTAVLNARN AAMNVILKIS TLT // ID E3NM26_CAERE Unreviewed; 751 AA. AC E3NM26; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP06587.1}; GN ORFNames=CRE_26855 {ECO:0000313|EMBL:EFP06587.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS268972; EFP06587.1; -; Genomic_DNA. DR RefSeq; XP_003090548.1; XM_003090500.1. DR STRING; 31234.CRE26855; -. DR EnsemblMetazoa; CRE26855; CRE26855; CRE26855. DR GeneID; 9823388; -. DR CTD; 9823388; -. DR eggNOG; KOG1476; Eukaryota. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410XP79; LUCA. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NM26; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR GO; GO:0015018; F:galactosylgalactosylxylosylprotein 3-beta-glucuronosyltransferase activity; IEA:InterPro. DR Gene3D; 3.90.550.10; -; 1. DR InterPro; IPR005027; Glyco_trans_43. DR InterPro; IPR029044; Nucleotide-diphossugar_trans. DR InterPro; IPR023796; Serpin_dom. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF03360; Glyco_transf_43; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR Pfam; PF00079; Serpin; 1. DR SUPFAM; SSF53448; SSF53448; 1. DR SUPFAM; SSF56574; SSF56574; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 54 73 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 409 439 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 751 AA; 87417 MW; 9F5AE6891565D597 CRC64; MDQTSFGRDC ESAYGSEVSS NATFKLQKDR FQIEESTTKK EIWYEWIRNR LRHYMILELL FSICLVLILW KQYHISSQSD KTLELISSIQ SEFRNFKIDI ESDRASKPTD TMNLDVGNEK LEEFVEEGMK DMKNPSIERN QKSKEYPKQV IPTQENSSPN NSVFQINAAS LVLGATVDSS RSSSSDNNPL FGRDQSGYVL IDRSDPPSDK AWCSNEENPI LTIDLAKYIR PISVSYQHSK WSGMVPDGAP SRYDVLACLD YYCNTLEPLV SNCEYRATRD NKQEQFCSIP LNRNHSSIGK VQFHFRQNHG NVMKTCAHSI RVYGETKEVP KVNERTLKQA ETCSKLTYDY HHKSWTYNIV CVLNIKSYNY FNYFQIDYKN CTVLYSNDCC NECPECCDEC VIKDINSETV FFCVFFIIIS PFIIGPILFF IALIIDYLLS VIKASRLRGP YIPDYYYSCA PNPYYWRFGI HTLDEKTIND FKTRKLQFVE FNSEEKEEMI INSIPYNPLL DDIIHDPIFR EQTFNFSENF FQTILFMEWN QHKHKYAENE TFQMVEYKMR TFVSFYVFLP KIRFGLQNAL KNLYHLINTA NEKYVDIRVP RFKIDTEADL GSFSNSIGIE KGLYEDVSNK VLGKTPRFVH KVQFENSMLD ISLSRTFAVD MAGFAVNLRV VMNSTAVFGL HCKERYAPET CLLEDMGLER KDIEPFGWEG EKDREILVWH TKTSTPNFPK AEKNATKPAP PPETYGYFVE V // ID E3NPT0_CAERE Unreviewed; 408 AA. AC E3NPT0; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO83226.1}; GN ORFNames=CRE_30297 {ECO:0000313|EMBL:EFO83226.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS269416; EFO83226.1; -; Genomic_DNA. DR RefSeq; XP_003089591.1; XM_003089543.1. DR STRING; 31234.CRE30297; -. DR EnsemblMetazoa; CRE30297; CRE30297; CRE30297. DR GeneID; 9812853; -. DR CTD; 9812853; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NPT0; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 30 48 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 370 395 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 55 82 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 408 AA; 47449 MW; 381C631ECFAEF65B CRC64; MNRLNIEEEE EENNTKSAPR MWYQWLEYRL RYYMVLEGII IIILFFSLSN YHDLASRNME LNARLEIQID NLEKRLDEIY ELLKTNSVPK PEKNEILKNI REESIRPAEV KTNEKLIEKS NSFPITSLNY SRFEMNAANI LMGASVDLAL SSSSVSSEDG FFNNFFYPFT RDQSGYILLD REELPPNKSW CSEEKQPVLA INLAKNTEVL YVSYQHSKWN GLIPDGAPKK YNVLACLDSK CEHLKHLATN CEYEKSVNGQ DIQEQFCRIS SDSVAPPVRK VQFHFLENHG NVKKTCIYSV RVFGIRRNLF RTELKKLQDK KKCEELAWNH KHSSLVYSWQ EKNCTLLYSM ECCSDCPECC SECKMKDFNY MFFGETTLAL IFLLISILAV IWAVVGMLKN LKASSVNV // ID E3NPT1_CAERE Unreviewed; 430 AA. AC E3NPT1; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO83223.1}; GN ORFNames=CRE_30298 {ECO:0000313|EMBL:EFO83223.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS269416; EFO83223.1; -; Genomic_DNA. DR RefSeq; XP_003089588.1; XM_003089540.1. DR STRING; 31234.CRE30298; -. DR EnsemblMetazoa; CRE30298; CRE30298; CRE30298. DR GeneID; 9812850; -. DR CTD; 9812850; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NPT1; -. DR OMA; TSIGRDS; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 51 70 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 396 423 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 430 AA; 49600 MW; 351FE30427D1F195 CRC64; MDKTSIGRDS ESAYGSEVSS NATFKLLKDR FQIEESTTKK EIWYEWIRNR LRYYMILELL FSICLVLILW KQYHISSQND KTLELISSIQ SEFRNFKLDI ESDRASKPTD TMNMDEGNKK LEEFVEEVMK DMKNPSIERN QKEYPKKVIP TQDNSSPNNS VFQINAASLI LGATVDSSRS SSSDNNPLIG RDQSGYVLID RSDPPSDKAW CSNEENPILT IDLAKYIRPI SVSYQHSKWS GMVPDGAPSR YDVLACLDYY CNNLEPLVSN CEYKATRDNE QEQFCSIPFN KNHSSIGKIQ FHFRQNHGNV IKTCAHLIRV YGETKEVPKV KERTLKQAET CSELTYDYHQ KPWTYNIFDF KNCKVLHSND CCTECPECCD ECLIEDTNSE TVFICFLYIL LSPVIIFLLL ILTALIIECF LCIRKTSAEV // ID E3NQ34_CAERE Unreviewed; 413 AA. AC E3NQ34; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84141.1}; GN ORFNames=CRE_07925 {ECO:0000313|EMBL:EFO84141.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS269462; EFO84141.1; -; Genomic_DNA. DR RefSeq; XP_003089486.1; XM_003089438.1. DR STRING; 31234.CRE07925; -. DR EnsemblMetazoa; CRE07925; CRE07925; CRE07925. DR GeneID; 9811857; -. DR CTD; 9811857; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NQ34; -. DR OMA; YNGLASK; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 58 77 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 381 407 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 76 96 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 413 AA; 47053 MW; 7EB119115DE93E69 CRC64; MKYKTGAENK PFLRDVESPS DNPSNPFHFR KNEYFGNEKP IAKKESWHQV LNNRLRHYTV LEAFLFVFLV ILLFKIYSLQ SQIDTLERKL DSKKNAESHL MKTKEILEEK KVIHEIVQNV INPSSPFPKE KEGKVKLNSE FNAASLVLGA SIETRQSSHS VSPGNSYFDI VSFALGSDQS AFSLLDRVEL PVDKAWCTDD RKPVLTVNLA DYIKPISVSY QHSKWNRTVP NGAPKLYDVV ACIDGDCNQP LVSNCEYSKS GNQEQKCLIS TGLPLVNKIQ FRFHENHGNL NKTCVYLVRV YGEPSGSKEV KIQVKNQKEE EETAKICSRL AWFHDNIPVF YNGLASKNCS TLYSNNCCHE CPNCCSECQI NDSTLLNNLQ FFIIFFVLFF ILFPMYIAGI SACCFGLKRF FGI // ID E3NQ35_CAERE Unreviewed; 406 AA. AC E3NQ35; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84142.1}; GN ORFNames=CRE_07926 {ECO:0000313|EMBL:EFO84142.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS269462; EFO84142.1; -; Genomic_DNA. DR RefSeq; XP_003089487.1; XM_003089439.1. DR STRING; 31234.CRE07926; -. DR EnsemblMetazoa; CRE07926; CRE07926; CRE07926. DR GeneID; 9811853; -. DR CTD; 9811853; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NQ35; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 30 49 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 368 395 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 60 80 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 406 AA; 47040 MW; 786CDCFAA231D79A CRC64; MNRLNIEEEE NNTKNAPGIW YQWLEYRLRY YMVLEGIIII ILFFSLSNYH DLASRNMELN AKLETQIDNL DKRLDEIYEL LKTNSVPKTE RNEMPKNIQE ESIRPVEVKT SEKLIEKSNS FPINSLNYSR FEMNAANILM GASVDLGLSS SSVSSEDGFF NNFFYPFTRD QSGYILLDRE ELPPNKPWCS DEEKPVLTIN LAKNTEILYV SYQHSKWNGV IPDGAPKIYN VLACLDSKCE NLEPLASNCE YEKSVNGQDI QEQMCQISSD SVAPPVRKVQ FHFLENHGNV EKTCIYSIRV FGIRRNLFKT EQKKLEDKKK CEELAWNHKH SSLTYSWQEK NCTLLYSMEC CSDCPECCSE CKMNDFNYVF VGNILLVLLF LLIFVCWIII AFAYCKKNIK AGSVNA // ID E3NQ36_CAERE Unreviewed; 443 AA. AC E3NQ36; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84139.1}; GN ORFNames=CRE_07927 {ECO:0000313|EMBL:EFO84139.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS269462; EFO84139.1; -; Genomic_DNA. DR RefSeq; XP_003089484.1; XM_003089436.1. DR STRING; 31234.CRE07927; -. DR EnsemblMetazoa; CRE07927; CRE07927; CRE07927. DR GeneID; 9811854; -. DR CTD; 9811854; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NQ36; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 410 436 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 443 AA; 50918 MW; 8A5CECBFA07A3787 CRC64; MVQTSFGRDS ESAYGSEVSS NATFKLQKDR FQIEESTAIK EIWYEWIRNR LRHYMILELF FSICLVLILW KQYHISSQND KTLELISSIQ SEFRNFKLDI ESNRAPKPAD PMNLDGGNKK FEELVEEVMK DINNPSIEIN QKSKEYPKQV IPNEVNSSPN NSVFQMNAAS LILGATVDSS RSSNSDNNPL FGRDQSGYVL IDRSDPPSDK AWCSNEENPI LTIDLAKYIR PISVSYQHSK WSGIVPDGAP SRYDVLACLD YYCDNLEPLV SNCEYKATSD NKQEQFCSIP FNKNHYSIGK IQFHFRQNHG NVMKTCAHTI RVYGETKEVP KVKEMTLKQA ETCSELTYDY HHKSWTYNIV CFLNIKPYNN LNYFQLDFKN CTVLYSNDCC TECPECCDEC VIKDINADKV AYSILFIIIS SPLIFIIGIL IILIIAKLFC KSK // ID E3NQ37_CAERE Unreviewed; 460 AA. AC E3NQ37; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84138.1}; GN ORFNames=CRE_07923 {ECO:0000313|EMBL:EFO84138.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS269462; EFO84138.1; -; Genomic_DNA. DR RefSeq; XP_003089483.1; XM_003089435.1. DR STRING; 31234.CRE07923; -. DR EnsemblMetazoa; CRE07923; CRE07923; CRE07923. DR GeneID; 9811855; -. DR CTD; 9811855; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NQ37; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 51 70 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 430 454 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 76 103 {ECO:0000256|SAM:Coils}. FT COILED 338 379 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 460 AA; 53148 MW; 2E799F5305FB93BC CRC64; MDNKPKEQTN LVGESMSSIN ISMNKGRFSP NKANSSNGLW SQWIRYQLKH YMILEGLFLI SILFLLINSY NVSTQNHQTN EMISKLQNRV EILEKQLNIS TNSEAFNEIY SEKKEEKPIE TKVVIEDIEE PETANESISV QENLSTSTHI PVISNDSVPF NAADIILGAS IDYDQSSQVI STREGFLGDV ENFFGTVQSD YVLLDRDELP LNKAWCSLEK YPILTVNLAK SIRLNSVSYQ HSKWNGTIPV DAPKLYEIMV SCLACLNSNC EKWELVASNC EYKMTDEENQ EQNCTIVEKF NWYPINKIRI RFVENQGNVN KTCAYLIRVY GEPIEYEKEE KKEKEEDSKR QMSDEERQIK QLEKIMNEKK KKKEKEDAIL QHCTQLKWFH DNARVLYNAK TEKNCVPLYS KNCCSVCPEC CLECEMSLGL YNTLLALSIL FGPTLIIIGL YFVLRSFQYM // ID E3NQ38_CAERE Unreviewed; 338 AA. AC E3NQ38; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO84140.1}; DE Flags: Fragment; GN ORFNames=CRE_07924 {ECO:0000313|EMBL:EFO84140.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS269462; EFO84140.1; -; Genomic_DNA. DR RefSeq; XP_003089485.1; XM_003089437.1. DR STRING; 31234.CRE07924; -. DR EnsemblMetazoa; CRE07924; CRE07924; CRE07924. DR GeneID; 9811856; -. DR CTD; 9811856; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NQ38; -. DR OMA; QRHMILE; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}. FT NON_TER 338 338 {ECO:0000313|EMBL:EFO84140.1}. SQ SEQUENCE 338 AA; 39581 MW; 718101F600AFA3BD CRC64; MRDHILLRGN SESEYGSEES SNATFNLQKD RFQIEESTSK QEIWYEWIKQ RVQRHMILEL FVLICIVLVI SKLHQSLSQN ERNHEFVRRN RAIYDLFFIF QISNIQSELK NFKLDIESKM HSNREDEKYD EEVIEDFESS SAGKRNKLKK IPKHPFRDQK NSLREFPGNQ MNAASLILGA TVDSSRSSSS DNNPFFGRDQ SGYVLIDRFD PPSDKAWCSN EENPILTIDL AKYIRPISVS YQHSKWSGIV PDGAPRRYDV FVSLFIISRV FNSFLFIQAC LDYYCNNLEP LVSNCEYRAT RDNEQEQFCS IPFNKNHSSI GKIQFHFRQN HGNVVKTC // ID E3NQP0_CAERE Unreviewed; 2639 AA. AC E3NQP0; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO85309.1}; GN ORFNames=CRE_21657 {ECO:0000313|EMBL:EFO85309.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS269558; EFO85309.1; -; Genomic_DNA. DR RefSeq; XP_003089282.1; XM_003089234.1. DR ProteinModelPortal; E3NQP0; -. DR STRING; 31234.CRE21657; -. DR EnsemblMetazoa; CRE21657; CRE21657; CRE21657. DR GeneID; 9806221; -. DR CTD; 9806221; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; E3NQP0; -. DR OMA; NILEATM; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 2. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 2. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 1. DR PROSITE; PS50237; HECT; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2639 AA; 291248 MW; D0129BF3A278106E CRC64; MDGIDPETLL EWLQTGIGDE RDLQLMALEQ LCMLLLMADN IDRCFESCPP RTFIPALCKI FIDETAPDNV LEVTARAITY YLDVSNECTR RITQVEGAVK AICARLAAAE MSDRSSKDLA EQCVKLLEHV CQRETMAVYD AGGINAMLNL VRVHGAQVHK DTMHSAMSVV TRLCGKMEPT DLELAKCAES LGALLEHDDP KVSESALRCF AALTDRFVRK MMDPAELALH SNLVEHLISI MVSSNDENSP ATASANILSI VLSLIGNLCR GSSLITEKVL TSPNMIKGLR ATLTNKEERV VTDGLRLCDL LLVLLCEGRS ALPLTCAVSG DYAAGSGAER VHRQLIDAIR QKDLTALVEA IESGQVDVNF ADDVGQSLTN WASAFGSIEM VQYLCDKGAD VNKGHKSSSL HYAACFGRPD VSYDTDICCV WFVRFSLNLF ENLKNVVTYS FSQHLLSQVV KLLLQRGANP DLRDEDGKTA LDKARERSDD DHNQVANILE SPSAFMRNKE EQKTKASTSQ QPGTSAKPEL PNPQLVRKVL HQLLPIFCEI FQRSLNGTVR RTSLSLMRKI VENIGDLRQS AASEDGVPAV STNSARKMSA DVSAGAESLV AVVVSVMDQE EDHEGHEQVL LILESLLEKD AELWVTELVR LGSFRTCRRL EEVLNAIRLE GRSRVTPMEI DFQSQQPSSS PTTSNDIMDT TTATVPSTDN TEGEATQAPP AVEVRIADPE PPTPSTSQQA APKARSTASS SASSAILQVV SKLSGVASLD KSAADKKPSK MILNQGTPYR WKEWRIVRGP TSLFIWSDVL LIELPFQSNG WFRYLADNDS HVQFVTGTAN VDQQMTEEEK DNFQKTERRE MVSRWNAVKG VFDDDWNAVQ VSVLQVPCSL KKLEVPAWEL WSTKVSELQI KSVSSSTPCG QTNTMITTIK VQDDAGGFLF ETGTGRKTNV MPEHALPPDF HTGWSLHGVT TRKMKFRQDI QKRKVQELAW KLWNDHLKEA HAKPREALVR LENAAHAIEG AVRLMKTQNN KHRSAKHARI ERVQEYTKAI KTVHESIIDD RRLSTFEFSV SGIVPALYAL LSSMDKYPDC FTTKIFMEVF AAGEALSQLA LKMVAVLEAN EKFPQYLYDS PGGSSFGLQL LSRRVRTKLE MIPRADGKEN NDENLVNKTG KTIKCEPLAS VGSIRTYLMK LVARQWHDRD RANYKYVKEI QDLKAKGQSV ELRHTGDFDE NGVIYWIGTN GKTAPSWTNP ATIKAVKITC SDPRQPFGKP EDLLSRDQNP INCHTSDDKN SHFTIDLGLF VIPTSYTLRH ARGYGRSALR NWALQGSNDT KSWDILITHT DDKSLGDPGS TATWHLEKGT ASYRYIRIAQ NGKNSSGQTH YLSCSGFEIY GDIVDVVKEA ICEDLPKKES VAGSSGASSS MSSLTKEQVL EMLPAHDNNN RLKSGLSLDT VTAMMQRSRH RLRGTFKISD SNQKWSLEKI GDGRPRRRRG KIRERNRAFY TPKTTGPPPS NFGASSSAGS SRGGENSSSS SSPFPNLPVP PWRSSKSSAS PAIASRLINS VTSSGASPPP PPSSSLSTFS SLASGLGFGL NRHKQHNKPG PSTLSRFSSV KNPAPTGTPT SGVSSGGAIG KKSMSTTNLV DDRQKSSGPS VASTGQAASA ESLQHQTPSL ENLLARAMPH TFGRIAENQE QEDEPMGGEE SDSAASMRSA ASSNSQISMD SSQQPQQQQP DSETTPRESA GTPSTPRDEK NQTLSVSAPD LAAARQRQAS AEVEGGDDLD ETNSEDKTVG GEDAMEEDDE EEETMEDEED DDDDDDDESS NENQEKLVEL LAGERGLFDK LKEVITGESL SDASSSAKDG NTNEAQKKGG SKKPKKWFKK MSSYTDVLKG LMQSRYPVSL LDPAAAGIEM DEMMDDDEYY DFSEEGADDG DSVEDEVAAH LGMPPESFAS MVAARTPITW RQFSELMSGS NRERAAMARA VASSRGSPWD DESIVKCSFE ALIPAFDPRP GRSNVNQTLE VELPQVVNEF GSSKSSSSAK KDKGDQVRFF LRGPNMTGVD NITVEMDDDS SSLFRYMQII NNNANWATKS DRGRRIWEPT YFISYCSADQ TNSEVSKIPD EESSTPGPSK PMSRDHRTSF TNSRIFTTGR DSPSVSSATA FGVSRTIVWL QQRRDAAVER ARGSAQAGNS SAARQHDRYH EYRVGRLRHE RVKVTRAEDT LLDQAIRLMK FHADRKAVLE IEYTNEEGTG LGPTLEFYAL VAAELQRKSL ALWVCDDDDT HASKSGEERE VDLGEGKKPA GYYVRRMGGL FPAPLPPGSE EAKKAADMFR VLGVFLAKVL LDGRLVDLPL SRPFLKLLVS PQVGDDAHGP NLHRVLTIDD FEEVNPAKGG FLKELLALVQ RKRLIENDNN IDQSAKRRKI AELKLHIKGS TCKVEDLALN FTVNPPSKVF QYAEMELVSG GSEIDVTLDN VEQYIEKCEQ FYLNTGIAYQ MRAFREGFDR VFPLSTLRAY SPEEVQRLLS GEQCPEWSRD DILNFTEPKL GYTRESPGFL RFVDVMEALT AQERKNFLQF ATGCSSLPPG GLANLHPRLT IVRKVESGDG SYPSVNTCVH YLKLPEYSSA EILRERLLTA INEKGFHLN // ID E3NR38_CAERE Unreviewed; 467 AA. AC E3NR38; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO87062.1}; GN ORFNames=CRE_24892 {ECO:0000313|EMBL:EFO87062.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS269641; EFO87062.1; -; Genomic_DNA. DR RefSeq; XP_003089135.1; XM_003089087.1. DR STRING; 31234.CRE24892; -. DR EnsemblMetazoa; CRE24892; CRE24892; CRE24892. DR GeneID; 9814332; -. DR CTD; 9814332; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NR38; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 48 66 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 432 454 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 467 AA; 53607 MW; 5F67DF14B6219F05 CRC64; MKIKIKDRDV ESAKDSDVAD ALIVSENNHS VKRATRFYAF PLLELQKWFI GTILLFIAII LFLNTWELTT LHSDIQEFSY NIASRLNKME SRAEFISMLK SKNEEDKEVE NPIFFHIETR LIQHDSGIQK NGEEEKEIGK HSFEDYFGFE NGTDSKEGDY YDDRIIFHKV LPSESEEEMM NTSTNSTNRI NAANSLFGAF IDERLSSPPV SPGDGFMDKV WDFFGAVDGG YVLLDREELP VNKSWCSDEE DSILTIQLSQ DISPISISYQ HSKWNGTVPN GAPKSYFVMG CLDTQCENRV VLGPRCEYKS DNQSTQEQEC QVKPQWRVSH IKAVQIQIRE NHGNVEKTCA YLFRVYGISD STQKELKPVS RIQDISIRDE MCSYAASEYY SLPSFFYNAM NFNCTKLYSN DCCSYCPECC TECNMSLTNE SVFVFAVIIF GFFGLVLLME FLLIRAAKFL WVSEDSH // ID E3NTN0_CAERE Unreviewed; 422 AA. AC E3NTN0; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92610.1}; GN ORFNames=CRE_29381 {ECO:0000313|EMBL:EFO92610.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS270246; EFO92610.1; -; Genomic_DNA. DR RefSeq; XP_003088242.1; XM_003088194.1. DR STRING; 31234.CRE29381; -. DR EnsemblMetazoa; CRE29381; CRE29381; CRE29381. DR GeneID; 9805648; -. DR CTD; 9805648; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NTN0; -. DR OMA; CNISSHL; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 49 68 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 388 418 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 422 AA; 48472 MW; 43692900B55C3033 CRC64; MDQTSFGKDC ESAYGSEVSS NATFKLQIEE STTKKEIWYK WIRNRLRHYM ILELLFSICL VLILWKQYHI SSQNDKTLEL ISSIQSEFRN FKIDIESDRA SKPTDTINLD GGNKKLEEFV EEVMEDMKNP STERNQKSKE YPKQVIPTQD NSPPNNSVFE INAASLILGA TVDSSRSSSS DNNPLIGRDQ SGYVLIDRRD PPSDKAWCSN EENPILTIDL AKYIRPISVS YQHSKWSGIV PDGAPSRYDV LACLDYYCNN LEPLVSNCEY KATRDNKQEQ FCSIPFNKNH SSIGKVQFHF RQNHGNVIKT CAHSIRVYGE TKEVPKVKER TLKQAETCSK LTYDYHHKSL SYNIFDFKNC TVLYSNDCCS ECPECCDECV IEDINRKTVL LCVLFIFVSP FIIGPILFFI ALIIDCLFCK RK // ID E3NTP3_CAERE Unreviewed; 439 AA. AC E3NTP3; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO92664.1}; GN ORFNames=CRE_09935 {ECO:0000313|EMBL:EFO92664.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS270256; EFO92664.1; -; Genomic_DNA. DR RefSeq; XP_003088229.1; XM_003088181.1. DR STRING; 31234.CRE09935; -. DR EnsemblMetazoa; CRE09935; CRE09935; CRE09935. DR GeneID; 9805634; -. DR CTD; 9805634; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NTP3; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 49 68 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 391 424 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 439 AA; 50406 MW; 6DA4F98AAAC83F3E CRC64; MVQPSIGKDL ESANGSEESS NTKLQKDGFQ IEEPKTKKEM WYEWIRSRLR YYMILELLFS ICLVLILWKQ YHISSQISSI HSEFRDFKLD IESNRISKPT DTINLDGGNK KLEEFVEQVM KDIKNPSIEN NQKSKEYPKQ IIPTEDNSSP NNSVSQINAA SLILGATVDS SRSSNSDNNP SIGRDQSGYV LIDRSDPPSD KAWCSNEENP ILTIDLAKYI RPISVSYQHS KWSGMVPDGA PRRYDVLACL DYYCNNLEPL VSNCEYRATG DNKQEQFCSI PFNRNHSSIG KIQFHFRQNH GNVMKTCAHT IRVFGETKEV PKVKEMTLKR AETCSKLTYD YHHHSWTYNM WDYKNCTVLY SNDCCTECPE CCDECLIEDT NFDTFGFCFG FMIVVPILIF AIGFILITTV LLFIAAVIGL FEIAKLQIDC LFRKRKQST // ID E3NU22_CAERE Unreviewed; 413 AA. AC E3NU22; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO93549.1}; GN ORFNames=CRE_27993 {ECO:0000313|EMBL:EFO93549.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS270392; EFO93549.1; -; Genomic_DNA. DR RefSeq; XP_003088099.1; XM_003088051.1. DR STRING; 31234.CRE27993; -. DR EnsemblMetazoa; CRE27993; CRE27993; CRE27993. DR GeneID; 9817490; -. DR CTD; 9817490; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NU22; -. DR OMA; NNTATEC; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}. SQ SEQUENCE 413 AA; 47464 MW; 7D03BA67ABC3588F CRC64; MVQASFGKDL ESANGFKESS NAKLQKDRFQ IEEPKTKKEM WYEWIRNRLR HYMILELLFS ICLVLILWKL CHISSQNDKT IELISSIHSE LRYLKLDIES NRASKPTDTI NLDGGNKKLE EFVEQVIKDI KNPSIENNQK SKEYPKQIIP TEDNSSPNNS VSQINAASLI LGATVDSSRS SNSDNNPSIG RDQSGYVLID RSDPPSDKAW CSNEENPILT IDLAKYIRPI SVSYQHSKWS GMVPDGAPRR YDVLACLDYY CNNLEPLVSN CEYRATGDNK QEQFCSIPFN SNHSSIGKVQ FHFRQNHGNV MKTCAHTIRV YGETKEEVLK VKEMTRKQET CSKLTYNYHH NPWTYKIVCF LNIKPAIIYI IYSMTPRIAR YFTRMTAVLN ARNAVMNVIL KTSTKKQLLS ATY // ID E3NUG8_CAERE Unreviewed; 351 AA. AC E3NUG8; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFO94729.1}; GN ORFNames=CRE_09941 {ECO:0000313|EMBL:EFO94729.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS270544; EFO94729.1; -; Genomic_DNA. DR RefSeq; XP_003087953.1; XM_003087905.1. DR STRING; 31234.CRE09941; -. DR EnsemblMetazoa; CRE09941; CRE09941; CRE09941. DR GeneID; 9817100; -. DR CTD; 9817100; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NUG8; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 326 346 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 30 57 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 351 AA; 40250 MW; CE470DD64C932528 CRC64; MYQMQLKIDS IENRLSSRHY MKTENTESIS GESQQEISRL REINEKLENE MKAASIASIM SHRQSALPSD NLNSIPPNST TSSREIFNAA SMVAGASIVD KLSSHTVSAQ TGGYIRSGEE TYVLLDRKEL PLYKAWCSDQ SKPRLTINLA KYIKPISVSY QHTKWNGLIP DDAPRIYDVV NCLDNNCKKW DVLVSNCEYK SSGYSISKQE QTCLIPSNRS MTSVKNVQFR FRENYGNKNR TCAYLVRVYG ERTEPPEDRK AIERKEEERE STCSWISWQY NNFRILYNAR NKTCPVLYEN NCCIECPECC MECTMTTSFS DSMQGFLLFI IVLGTLSLIF SGYVQLCKHL Q // ID E3NWV1_CAERE Unreviewed; 454 AA. AC E3NWV1; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP05271.1}; GN ORFNames=CRE_10800 {ECO:0000313|EMBL:EFP05271.1}; OS Caenorhabditis remanei (Caenorhabditis vulgaris). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281}; RN [1] {ECO:0000313|Proteomes:UP000008281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281}; RG Caenorhabditis remanei Sequencing Consortium; RA Wilson R.K.; RT "PCAP assembly of the Caenorhabditis remanei genome."; RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS271851; EFP05271.1; -; Genomic_DNA. DR RefSeq; XP_003087120.1; XM_003087072.1. DR STRING; 31234.CRE10800; -. DR EnsemblMetazoa; CRE10800; CRE10800; CRE10800. DR GeneID; 9828935; -. DR CTD; 9828935; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E3NWV1; -. DR Proteomes; UP000008281; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 412 436 Helical. FT COILED 436 454 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 454 AA; 52089 MW; 4C00D32401227377 CRC64; MDKTSFGRDS ESAYGSEVSS NATFKLQKDR FQIEESTSKN EIWQEWIRNR LRHYMILELL FSICLVLILL KQDHISSQNN KTLELISSIQ SEFRHYKLDI ESNRASKRTD SNDNTGNKKL EELVEEVIKD IKNPSIEINQ KSKEYPKQVI PNEVNSSPNN SVFQINAASL ILGATVDSSR SSNSDNNPFF GRDQSGYVLI DRSDPPSDKA WCSNEENPIL TIDLAKYIRP ISVSYQHSKW SGMVPDGAPS RYNVLVSLAC LDYYCNNLEP LATNCEYKAT SDNKKEQFCS IPFNKNHSSI GKIQFHFRQN HGNVMKTCAH SIRVYGETKE VPKVKERTLK QAETCSELTY DYHHKSWTYN MVCFLNIKSY NNLNCFQFDY KNCTVLYSND CCTECPECCA ECLIQDTNWS TIGYICLNLI GILMVLFVGL IIYCAVQEAI EDANEMIRLE NELK // ID E3NXE9_PUCGT Unreviewed; 135 AA. AC E3NXE9; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 29-APR-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFP94248.1}; DE Flags: Fragment; GN ORFNames=PGTG_20164 {ECO:0000313|EMBL:EFP94248.1}; OS Puccinia graminis f. sp. tritici (strain CRL 75-36-700-3 / race SCCL) OS (Black stem rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Pucciniaceae; Puccinia. OX NCBI_TaxID=418459 {ECO:0000313|EMBL:EFP94248.1, ECO:0000313|Proteomes:UP000008783}; RN [1] RP NUCLEOTIDE SEQUENCE. RC STRAIN=CRL 75-36-700-3; RG The Broad Institute Genome Sequencing Platform; RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Cuomo C., RA Jaffe D., Butler J., Alvarez P., Gnerre S., Grabherr M., Mauceli E., RA Brockman W., Young S., LaButti K., Sykes S., DeCaprio D., Crawford M., RA Koehrsen M., Engels R., Montgomery P., Pearson M., Howarth C., RA Larson L., White J., Zeng Q., Kodira C., Yandava C., Alvarado L., RA O'Leary S., Szabo L., Dean R., Schein J.; RT "The Genome Sequence of Puccinia graminis f. sp. tritici Strain CRL RT 75-36-700-3."; RL Submitted (JAN-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000008783} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CRL 75-36-700-3 / race SCCL RC {ECO:0000313|Proteomes:UP000008783}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS989933; EFP94248.1; -; Genomic_DNA. DR RefSeq; XP_003338667.1; XM_003338619.1. DR EnsemblFungi; EFP94248; EFP94248; PGTG_20164. DR GeneID; 10527335; -. DR KEGG; pgr:PGTG_20164; -. DR EuPathDB; FungiDB:PGTG_20164; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008783; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008783}. FT NON_TER 1 1 {ECO:0000313|EMBL:EFP94248.1}. SQ SEQUENCE 135 AA; 14777 MW; BB7CFA6236F94702 CRC64; TNPKLRDLTT NKPSHLPYLL TLRAQPLQVS SIEADKDAYS IDGPNNVQEF SVARGELQLY SKVLLKIISN HGNPDLTCLY RVQVHDQLIH VDVTQDAMIT AIIVVLNLIS LGASGPLPTD KPLGPRSFSS THISP // ID E3QMF1_COLGM Unreviewed; 1210 AA. AC E3QMF1; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Spindle pole body-associated protein sad1 {ECO:0000313|EMBL:EFQ32039.1}; GN ORFNames=GLRG_07183 {ECO:0000313|EMBL:EFQ32039.1}; OS Colletotrichum graminicola (strain M1.001 / M2 / FGSC 10212) (Maize OS anthracnose fungus) (Glomerella graminicola). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Glomerellales; Glomerellaceae; OC Colletotrichum. OX NCBI_TaxID=645133 {ECO:0000313|Proteomes:UP000008782}; RN [1] {ECO:0000313|Proteomes:UP000008782} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=M1.001 / M2 / FGSC 10212 {ECO:0000313|Proteomes:UP000008782}; RX PubMed=22885923; DOI=10.1038/ng.2372; RA O'Connell R.J., Thon M.R., Hacquard S., Amyotte S.G., Kleemann J., RA Torres M.F., Damm U., Buiate E.A., Epstein L., Alkan N., RA Altmueller J., Alvarado-Balderrama L., Bauser C.A., Becker C., RA Birren B.W., Chen Z., Choi J., Crouch J.A., Duvick J.P., Farman M.A., RA Gan P., Heiman D., Henrissat B., Howard R.J., Kabbage M., Koch C., RA Kracher B., Kubo Y., Law A.D., Lebrun M.-H., Lee Y.-H., Miyara I., RA Moore N., Neumann U., Nordstroem K., Panaccione D.G., Panstruga R., RA Place M., Proctor R.H., Prusky D., Rech G., Reinhardt R., RA Rollins J.A., Rounsley S., Schardl C.L., Schwartz D.C., Shenoy N., RA Shirasu K., Sikhakolli U.R., Stueber K., Sukno S.A., Sweigard J.A., RA Takano Y., Takahara H., Trail F., van der Does H.C., Voll L.M., RA Will I., Young S., Zeng Q., Zhang J., Zhou S., Dickman M.B., RA Schulze-Lefert P., Ver Loren van Themaat E., Ma L.-J., RA Vaillancourt L.J.; RT "Lifestyle transitions in plant pathogenic Colletotrichum fungi RT deciphered by genome and transcriptome analyses."; RL Nat. Genet. 44:1060-1065(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG697359; EFQ32039.1; -; Genomic_DNA. DR RefSeq; XP_008096059.1; XM_008097868.1. DR EnsemblFungi; EFQ32039; EFQ32039; GLRG_07183. DR GeneID; 24412548; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008782; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008782}; KW Reference proteome {ECO:0000313|Proteomes:UP000008782}. FT COILED 926 946 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1210 AA; 134082 MW; 900D36463B2A0259 CRC64; MPPKRRVRGE DDDMPAISRG GAAVTSSLPP LASRLDTSYG SAPSTALRNT RRGQRRDLQA IIKETLHDSD DPDDPDDDDD DGHDDDQDGH DGRNERSRSA KTDHLPLKKS PTPKPEPRPR PARKVQPAPS PDPDPDPDSE PDFGPGRDSS ESPEPPRDPS PDPPIAQRSL RQRLLPGRIG KYSLRRSATS LTVVSLLLLN PTPGPSGEYH QGATSPRRQV LPNVAPPRGP SPPPQQPSTQ GVASKGLAPP APTSRGLSPS KGNTLAPSSR SVFARAISEQ SAEPTHALDS IRSFGQESSL YGGAAIESTP SRLSLSPEES EEEEPPLLPR PPRSNHPDQF RGTVRDPIGE VVHRPEPLDS PRDKPARQLQ PEPPSTPPAQ RFRLQSESRP RESLDRSIAR FSVTRDSPRT NTVSRALSAE PAPSSPATRS SSMRPSSRAA DHLPTTNRFS TRLTNHVAEE APTVRQELKG PSNHAIEEPF HLDHVSRSQL SPRRPPQQAS PSPDRRNVWP PAVNEKPGAA RRLFDYPPPG NTFAPRPGRM GGRSAGPSTA KQPIEDRDGR SAIGPSRSGL FDSSMQEKIR QEREHEKAQQ HALRKERNAR LKAWPRIYNT VYSWTHAPRP DSPPPRDEDP YDYDSPIEPP QQVPWSESSW STWILFKLAD TFVDVLNFLF SPHSWHIKAL IGILLVSLLG WGAMTGLNSA PALGPGGIQW YGLSDISHNL GQFVPLWMSR PTTVFSDEDT REYIQQQRNH EYELSNLAKA SKLHEGSLSR LEEIVPKVVH MTLDKRGRPV VGDDFWHALR DLMKSDTEIL TMDRGSGGYY FISEEHWRAT RDRSLKDPVY QTGIEKIAQT KFDKSWEKWL KTNDKKVAKI LEPALAASVP DKTGKDLESK IEKIVKDRFK NTDAKDVVVT RNEFIRHLKG EFAAHRNEVK AEAQELQKKL EHYVNNAIKS ASEQTPPAGV SRAEMAHVVD GMIRQAIANA GLEALAQGKI GATWDRELRH QVNFLGRNTG VGIDPDQTTP DYNPPVQGKI WSPTWFGGTK RAPKAPTTPA SATLWNWEDE GDCWCGSVTV DKTSGHELGV SVSYLLAHYI IPQHVVVEHI LPGATLDPNA RPKQIEVLAY FTELNTRNRV MDFSTAHFPD GQKVPRDGWV QIGAFTYESS DALNGVYVHK LSPELVSIGA VTDQVILRAV SNYGSDHTCL YRVRLYGEKA // ID E3R0X0_COLGM Unreviewed; 890 AA. AC E3R0X0; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 14-OCT-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFQ36758.1}; GN ORFNames=GLRG_11908 {ECO:0000313|EMBL:EFQ36758.1}; OS Colletotrichum graminicola (strain M1.001 / M2 / FGSC 10212) (Maize OS anthracnose fungus) (Glomerella graminicola). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Glomerellales; Glomerellaceae; OC Colletotrichum. OX NCBI_TaxID=645133 {ECO:0000313|Proteomes:UP000008782}; RN [1] {ECO:0000313|Proteomes:UP000008782} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=M1.001 / M2 / FGSC 10212 {ECO:0000313|Proteomes:UP000008782}; RX PubMed=22885923; DOI=10.1038/ng.2372; RA O'Connell R.J., Thon M.R., Hacquard S., Amyotte S.G., Kleemann J., RA Torres M.F., Damm U., Buiate E.A., Epstein L., Alkan N., RA Altmueller J., Alvarado-Balderrama L., Bauser C.A., Becker C., RA Birren B.W., Chen Z., Choi J., Crouch J.A., Duvick J.P., Farman M.A., RA Gan P., Heiman D., Henrissat B., Howard R.J., Kabbage M., Koch C., RA Kracher B., Kubo Y., Law A.D., Lebrun M.-H., Lee Y.-H., Miyara I., RA Moore N., Neumann U., Nordstroem K., Panaccione D.G., Panstruga R., RA Place M., Proctor R.H., Prusky D., Rech G., Reinhardt R., RA Rollins J.A., Rounsley S., Schardl C.L., Schwartz D.C., Shenoy N., RA Shirasu K., Sikhakolli U.R., Stueber K., Sukno S.A., Sweigard J.A., RA Takano Y., Takahara H., Trail F., van der Does H.C., Voll L.M., RA Will I., Young S., Zeng Q., Zhang J., Zhou S., Dickman M.B., RA Schulze-Lefert P., Ver Loren van Themaat E., Ma L.-J., RA Vaillancourt L.J.; RT "Lifestyle transitions in plant pathogenic Colletotrichum fungi RT deciphered by genome and transcriptome analyses."; RL Nat. Genet. 44:1060-1065(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG697510; EFQ36758.1; -; Genomic_DNA. DR RefSeq; XP_008100778.1; XM_008102587.1. DR EnsemblFungi; EFQ36758; EFQ36758; GLRG_11908. DR GeneID; 24417271; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008782; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008782}; KW Reference proteome {ECO:0000313|Proteomes:UP000008782}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 32 {ECO:0000256|SAM:SignalP}. FT CHAIN 33 890 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003181068. SQ SEQUENCE 890 AA; 98147 MW; 069F64BA4FFA9528 CRC64; MTRICRRRYP SRRLPILLLA LVVHCAQVLC QASNILDGQA SVCEFKTINY ITHTLPQQCL KTSWPGQKGP NESVHPNIDD PRASPDDSTN PFGGHHNAPG PSPSTTGLST DATDDNDPTP RPFMSFEDWK EMMLRKAGQD PSDLKSRKSL DRTIEAERVA RSSGLDSIGD DGEIDLNFDV VSEKISNIAS APQATPTEIA ASELQQEPVL YDDGRTQYYR SKDAGKTCKE RFSYSSFDAG ATVLKTNKGA KNAKAILVEN KDSYMLLECS AENKFVIVEL SDDILVDTVV LANFEFFSSM IRHFRVSVSD RYPVKVDKWK DLGTFEAKNS RDIQPFLVQN PLIWAKYVRI EFLTHYGNEF YCPVSLLRVH GTRMLESWKD QETPAEDEDV DEPATEVIGP AQDNTIGDTV ERQDNTTTSP RGGAAIEVEP AVSVDPLMPI HIFRLSGDEN ATCLSSAAST YISNGSSASA GSLSKTQGDS DHATIQIRSG MERLFDQIPE EEFSASTTDG RHTEGNTVPL PVTSGSSATS QSSVNLDMKA GGASENSNGG ASATRVPSQA TTQARNKNNS TAAPASPTVQ ESFFKAVSKR LQYLESNVSL SLKYIEEQSR SLQATQLLAE RKQLSRIDLF LDSLNHTVLS ELRTVRQQYD QIWQSTVIAL ESQREQSQRE TVALSSRLNI LADEVVFQKR MAIVQAILLL SCLILVIFSR AMTQPILTGS FDLGRSPSRN NRLPSSSLDR GLGGAYIPRY DKFGRPLDND HTGEVELSDR NPDMHRPIPE ASTRLLTPAS EARYSHRPDT AMTPELLPVE EFMESDPDSS YRESSPDSDI RSPADETFEV NNSGQTDHEY PSETQGQTTA ALADSVSMYT RGSLSSTLRK PLPALPEYPH // ID E3RE76_PYRTT Unreviewed; 976 AA. AC E3RE76; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFQ95974.1}; GN ORFNames=PTT_03822 {ECO:0000313|EMBL:EFQ95974.1}; OS Pyrenophora teres f. teres (strain 0-1) (Barley net blotch fungus) OS (Drechslera teres f. teres). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Pleosporaceae; Pyrenophora. OX NCBI_TaxID=861557 {ECO:0000313|Proteomes:UP000001067}; RN [1] {ECO:0000313|EMBL:EFQ95974.1, ECO:0000313|Proteomes:UP000001067} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=0-1 {ECO:0000313|EMBL:EFQ95974.1, RC ECO:0000313|Proteomes:UP000001067}; RX PubMed=21067574; DOI=10.1186/gb-2010-11-11-r109; RA Ellwood S.R., Liu Z., Syme R.A., Lai Z., Hane J.K., Keiper F., RA Moffat C.S., Oliver R.P., Friesen T.L.; RT "A first genome assembly of the barley fungal pathogen Pyrenophora RT teres f. teres."; RL Genome Biol. 11:R109.1-R109.14(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL532365; EFQ95974.1; -; Genomic_DNA. DR RefSeq; XP_003295931.1; XM_003295883.1. DR STRING; 861557.XP_003295931.1; -. DR EnsemblFungi; EFQ95974; EFQ95974; PTT_03822. DR GeneID; 10515637; -. DR KEGG; pte:PTT_03822; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001067; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001067}; KW Reference proteome {ECO:0000313|Proteomes:UP000001067}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 976 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003181334. SQ SEQUENCE 976 AA; 106184 MW; 252E435BDB0B3674 CRC64; MITTGAPLRN WTILLLLCSL PTAVLAEVAN GTATDESSAT AASGATTQHT STLTLSPTSI RRYTDSETTS PYRTINYITH TLRQQCAKAT WSAPHEAPST NGTTVERGII GLQTPIPIRE GSEGTIKEGG HSASPDTVAE PGATSSGTSS EESGLELETD SPFDNANFLS FEEWKKKNLA EVGQSPENVG QGRAAAAANQ PARRRPVNVN ALDSLGDEGE ISIDFSGFGS PEDANVANSI QQGRQSAGAT KAPEGEGKVA PSAWSLSKDA GKTCKERFNY ASFDCAATVL KTNKQAKSSS SILVENKDSY MLNTCSSDNK FLIVELCDDI LVDTVVLANY EFFSSMFRHF RVSVSDRYPV KMEKWRTLGT FEARNSRDIQ PFLITEPQIW ARYLRIEFLT QYGNEYYCPL SLLRVHGTTM MEQFRREEEG ARGIDDDDDD LEAEGVDVKK PAEDSGPLPP EEIPIEAIKG SSFDSGGPAV AQPIGHQATS QDTAVKSAST IDPSSSSTST AAMEASVGKV TDTPQTRPIS DPPSESSPSP ATGNFVGSDT NITARRETQS SDRHGSMSKG VDKPQAHSSM SDASSKQSPS VSRDDGPSVS STNTAVSSLS NSAAKASTNN TVVSQQTQSQ GRGSATQPNA PTPSTQESFF KSIHKRLQYL EANSTLSLQY IEEQSRALRD AFVKVEKRQL AKTEKFLDHL NSTVMLELKS FRTMYDQLWQ STVIELESMK ERQKSEMGEI GTRLSLMADE LVWQKRMAVV QSTLLLLCLG LVLFVRSGTL GSNADVPIVQ QLGSKYTSFF ESSPPRSPPE SGMVRQRRAF KNMWRSESEQ QSDGQQAPSD TETEGLRSPV QTTYDPPTPD TLSNRHGREF SPERNGMKAA PHPTQENMSP TPSFADQAAR IQVLETQSGP ATPNGTRDSR PSWEEVDRAM DLLKAGEESH SPPRPKARDR GKKQKRSPLR RAQSNHESVT DEEPPP // ID E3S580_PYRTT Unreviewed; 812 AA. AC E3S580; DT 11-JAN-2011, integrated into UniProtKB/TrEMBL. DT 11-JAN-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFQ86959.1}; GN ORFNames=PTT_17756 {ECO:0000313|EMBL:EFQ86959.1}; OS Pyrenophora teres f. teres (strain 0-1) (Barley net blotch fungus) OS (Drechslera teres f. teres). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Pleosporaceae; Pyrenophora. OX NCBI_TaxID=861557 {ECO:0000313|Proteomes:UP000001067}; RN [1] {ECO:0000313|EMBL:EFQ86959.1, ECO:0000313|Proteomes:UP000001067} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=0-1 {ECO:0000313|EMBL:EFQ86959.1, RC ECO:0000313|Proteomes:UP000001067}; RX PubMed=21067574; DOI=10.1186/gb-2010-11-11-r109; RA Ellwood S.R., Liu Z., Syme R.A., Lai Z., Hane J.K., Keiper F., RA Moffat C.S., Oliver R.P., Friesen T.L.; RT "A first genome assembly of the barley fungal pathogen Pyrenophora RT teres f. teres."; RL Genome Biol. 11:R109.1-R109.14(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL537212; EFQ86959.1; -; Genomic_DNA. DR RefSeq; XP_003305022.1; XM_003304974.1. DR EnsemblFungi; EFQ86959; EFQ86959; PTT_17756. DR GeneID; 10513791; -. DR KEGG; pte:PTT_17756; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001067; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001067}; KW Reference proteome {ECO:0000313|Proteomes:UP000001067}. SQ SEQUENCE 812 AA; 89598 MW; 7F64C87C78147DA6 CRC64; MSSRVNADEP TPYRRSGRLS ARASSVAAES AITNVTSSGA KRSKTTLTKV SARRSNAYGA SGRVGNPDKL TAGPATGFAQ AFQNQRGQST DGEDDDSEEE EEKGDDTDEL AAGPQSAFIK QARHAGQFAP PSKSKAAPGY SFIDSDDLTP SEDDLAASSV GNTTKSFGPS HEAGMLASRD PFAGFQIPDE SPFAKPVGLI RKPASRTING PRTQAPTPVQ AQIPAPAKTS IQAKTPTQVK TPTQTQNPIK SFIRSQAHVK PAPAQVPARA TPTGLEQSVD EVVAEEQARL QRDGPPSSQP QSQSQSMRQP PRRRPHHKGV AELNAWIGDV EASDDEEDEP VWPWKKLSTW AFWGLALSLL LGWVLSSMMA TEHAESSPRT PGLLKAVGAR VVYTYDKVAE YISPPTGPSE IDQEIDRVKA YRANGEDHFL WARMSNMDTK NERRISELRT ALLELKDQLP DMMLMRREED GSLRISDEFW HALLSKARSS ENDSEWARFL ADSKGKLRDL FDPSVHHERG NTETWAEAVT RDEFVRHMEK QYHNITSRVD KKVEEAIRAQ SAQIKTTMQA EAKKMMMDQI HLHALAQANL VANYESHLTK PNYFSPGLGA IIDPDMSSTT FYNRPGRLAE VARRLSWLPS RNPPMAALTK WQEPGDCWCS AGQSEGPTGQ AQLAVKLARP VIPKQVTIEH IPMSMVPARN ISNAPRDIEL WVQTDAPINP YYSHRQVSCK GPPPESISPA ISWKCLGSFK YNIHASNHLQ TFDLAGEPSE PIRNTILRVT SNWGASHTCL YQVRLHGTDA HRDYEYPVGL MD // ID E4UTC6_ARTGP Unreviewed; 910 AA. AC E4UTC6; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EFR00687.1}; GN ORFNames=MGYG_03692 {ECO:0000313|EMBL:EFR00687.1}; OS Arthroderma gypseum (strain ATCC MYA-4604 / CBS 118893) (Microsporum OS gypseum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Microsporum. OX NCBI_TaxID=535722 {ECO:0000313|Proteomes:UP000002669}; RN [1] {ECO:0000313|Proteomes:UP000002669} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4604 / CBS 118893 {ECO:0000313|Proteomes:UP000002669}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS989824; EFR00687.1; -; Genomic_DNA. DR RefSeq; XP_003173517.1; XM_003173469.1. DR STRING; 535722.XP_003173517.1; -. DR EnsemblFungi; EFR00687; EFR00687; MGYG_03692. DR GeneID; 10028797; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; E4UTC6; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002669; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002669}; KW Reference proteome {ECO:0000313|Proteomes:UP000002669}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 37 {ECO:0000256|SAM:SignalP}. FT CHAIN 38 910 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5005344703. FT COILED 438 465 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 910 AA; 100464 MW; 3B72179B3C01C1B3 CRC64; MNRQLSPHWR KRSRQDRITS AFLAFLAVCS APGPAAAESV ADSSRLMAVD KSNTICEHVS GDLGAEYIRY PICLETRWNA AATTASSATA GGEGPPMTGS AGGVHADARD SPGSVIVPVR GTVAAGSSKS GESRSSGSGD DADVESPLDT SNFLSFEEWK NQNLAKAGQS AETMRRHRQD KGQQARRRHT RSSQINDPLD GLGEDSEIDL EFGGFSTDES GIASWERKDG GKAPADNMDA VATRGGAIGG KEGKNPSQPV FELDGQDAEN MPRKGIGRRK HAGTTCKERF NYASFDCAAT VLKTNRQCTG SSAVLNENKD SYMLNECRAK DKFLIMELCD DILVDTVVLA NYEFFSSIFR SFRVSVSDRY PIKADKWRVL GTYEAANARQ VQAFAVENPF IWARYLKIEF LSHYGNEFYC PVSLVRVHGT TMMEEYKNDG EAARADEEED ANAQEEAEQQ QQQQEQADVV VHEKAPIPTL DIDDQMVPLS NLSDHELNEL RCFVERNETE SILLGLVSGK MCAIKERAVH AESQPVMATR VKDETAAPAS GSITSINTLE QIRSVSSTRA STASDREETR RSSTGSSVTA NGSHTEPAKM NSAPYTPPPS SPPPNPSTQE SFFKSVNKRL QMLETNSTLS LLYIEEQSRI LRDAFNKVEK RQLAKTSTFL ENLNSTVLQE LKEFRQQYDH LWHSVFIEFE QQRQQYHREV YSIAAQLGVL ADELVFQKRV AVIQSIFVLV CFGLVLFSRS SGTPYFEFPR NIVSRTRSFR SSSVAYDSPA PSASPSPPPM SRMGSLLSRS ETDEHRLHHS RSRHLRSPSE QTDYEGENPT FTYSPPTPTS RATTPDRNRT GKRLFSPEPE PRSGLIASAT SSPASGSDPD LSLRQRPVKS VEVKHESESD AEQAEVDSFT // ID E4WS77_OIKDI Unreviewed; 244 AA. AC E4WS77; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY20610.1}; GN ORFNames=GSOID_T00000611001 {ECO:0000313|EMBL:CBY20610.1}; OS Oikopleura dioica (Tunicate). OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Oikopleuridae; OC Oikopleura. OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY20610.1, ECO:0000313|Proteomes:UP000001307}; RN [1] {ECO:0000313|EMBL:CBY20610.1, ECO:0000313|Proteomes:UP000001307} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21097902; DOI=10.1126/science.1194167; RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C., RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C., RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., RA Cross I., Yadetie F., Muffato M., Louis A., Butcher S., RA Tsagkogeorga G., Konrad A., Singh S., Jensen M.F., Cong E.H., RA Eikeseth-Otteraa H., Noel B., Anthouard V., Porcel B.M., RA Kachouri-Lafond R., Nishino A., Ugolini M., Chourrout P., Nishida H., RA Aasland R., Huzurbazar S., Westhof E., Delsuc F., Lehrach H., RA Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F., RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., RA Du Pasquier L., Boudinot P., Liberles D.A., Volff J.N., Philippe H., RA Lenhard B., Roest Crollius H., Wincker P., Chourrout D.; RT "Plasticity of animal genome architecture unmasked by rapid evolution RT of a pelagic tunicate."; RL Science 330:1381-1385(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN653015; CBY20610.1; -; Genomic_DNA. DR InParanoid; E4WS77; -. DR Proteomes; UP000001307; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001307}; KW Reference proteome {ECO:0000313|Proteomes:UP000001307}. SQ SEQUENCE 244 AA; 27598 MW; 1B1535D8FFB62AC7 CRC64; MSEDCNKMSK FDHWIKDEYK SEEFKSVIED QIEKRTKKGS IFDSLGSFGG SAKLQDIEAL VATRLREYHE DKTGKMDYAL SSNGGSILEH RCTASKADNS RYWKILGLTV MHFQNHPSRA IEDNTGPGNC WAFDGARGHL TIKLAKPVMV NSITVEHIPA NLSPTGSISA PRAFTVSAMN NENDLVGHEL GQWEYNMNDH PIQSFNIMRK MHFPSGIIDF RFESNWGGSY TCIYRVRVHG DPFD // ID E4X416_OIKDI Unreviewed; 612 AA. AC E4X416; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 9. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY23804.1}; GN ORFNames=GSOID_T00001128001 {ECO:0000313|EMBL:CBY23804.1}; OS Oikopleura dioica (Tunicate). OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Oikopleuridae; OC Oikopleura. OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY23804.1, ECO:0000313|Proteomes:UP000001307}; RN [1] {ECO:0000313|EMBL:CBY23804.1, ECO:0000313|Proteomes:UP000001307} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21097902; DOI=10.1126/science.1194167; RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C., RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C., RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., RA Cross I., Yadetie F., Muffato M., Louis A., Butcher S., RA Tsagkogeorga G., Konrad A., Singh S., Jensen M.F., Cong E.H., RA Eikeseth-Otteraa H., Noel B., Anthouard V., Porcel B.M., RA Kachouri-Lafond R., Nishino A., Ugolini M., Chourrout P., Nishida H., RA Aasland R., Huzurbazar S., Westhof E., Delsuc F., Lehrach H., RA Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F., RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., RA Du Pasquier L., Boudinot P., Liberles D.A., Volff J.N., Philippe H., RA Lenhard B., Roest Crollius H., Wincker P., Chourrout D.; RT "Plasticity of animal genome architecture unmasked by rapid evolution RT of a pelagic tunicate."; RL Science 330:1381-1385(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN653024; CBY23804.1; -; Genomic_DNA. DR InParanoid; E4X416; -. DR Proteomes; UP000001307; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001307}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001307}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 16 {ECO:0000256|SAM:SignalP}. FT CHAIN 17 612 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003189959. FT TRANSMEM 519 538 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 547 570 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 612 AA; 67486 MW; 2D36A7208DE264EF CRC64; MYLWSLLVIG IGLCSDFDQN SNIGESNEGK ENALNLNENH AELATEDREF ANAEPGEHAE DENTGHVVSD ELAPASAEDD SNIVPEQVDV ETFNNVVSRL ETDGQEAAAL VEEPKETWKL TSKTTKNFAS DSCGARIEKS APDVTGAKNV IKNDFDKYAK APIDKKFAFT VELCEEIQIQ RLFIGVNELF ASRPSKFSVQ AADKVKSEWH FLGVFNIEPS GKSNLLKENF NVTMTHQYFK FVKYEALEYS GDEPFSVLTS LQVFGFPIDA QKSNDYEDDL EDADQEGEQL GSGESGVKNI INGIKKILTG APAENNGQIE QEDLVASREL LAVFQRCEWN GLVHRSCNLG CNIYADAAFN FRLTQVNMRP RISQISPPVE AVQTEQPKVA TTTLDGPETS TKASSSSIAS NSTSSSSKPA SGQAPKGDTQ VQNAIPGTLK QIAKLEKNLT WMGSYLETLS QTYKKQMDDV RASFEKTSSR LTKSEVATQQ TQEQISRLLE LVKFLFTEID RLDTTLEKII IAASVVVALQ VLTVLLVCQL RRGQKKMRQL EENQVSMEML KELLEGQKKK RKHGGGAGDG FSRRSSETSG IPKPSSRGKR PSKAPKAGSR KK // ID E4XDW7_OIKDI Unreviewed; 448 AA. AC E4XDW7; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 9. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY19356.1}; GN ORFNames=GSOID_T00008366001 {ECO:0000313|EMBL:CBY19356.1}; OS Oikopleura dioica (Tunicate). OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Oikopleuridae; OC Oikopleura. OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY19356.1, ECO:0000313|Proteomes:UP000001307}; RN [1] {ECO:0000313|EMBL:CBY19356.1, ECO:0000313|Proteomes:UP000001307} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21097902; DOI=10.1126/science.1194167; RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C., RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C., RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., RA Cross I., Yadetie F., Muffato M., Louis A., Butcher S., RA Tsagkogeorga G., Konrad A., Singh S., Jensen M.F., Cong E.H., RA Eikeseth-Otteraa H., Noel B., Anthouard V., Porcel B.M., RA Kachouri-Lafond R., Nishino A., Ugolini M., Chourrout P., Nishida H., RA Aasland R., Huzurbazar S., Westhof E., Delsuc F., Lehrach H., RA Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F., RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., RA Du Pasquier L., Boudinot P., Liberles D.A., Volff J.N., Philippe H., RA Lenhard B., Roest Crollius H., Wincker P., Chourrout D.; RT "Plasticity of animal genome architecture unmasked by rapid evolution RT of a pelagic tunicate."; RL Science 330:1381-1385(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN653040; CBY19356.1; -; Genomic_DNA. DR InParanoid; E4XDW7; -. DR Proteomes; UP000001307; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001307}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001307}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 81 99 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 158 178 {ECO:0000256|SAM:Coils}. FT COILED 194 221 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 448 AA; 50185 MW; 385905D22F722E94 CRC64; MRRARDYNSQ HIVQPRRSMV TPAKAQPEKY HGVVRELPRV DLDRSGSSGS SSLNASTLLY HEEKVSNVTL LEHWIKANLK TVISLFACLL IICVAAQLGR FEGEASVEPH YENIMTNTIV PDFSPMQKQL EDLKLMVTSL NEKQEASEKL LGKLPEQLEL LQRLVNKQEA QMQTMQVRTT MTPPQRIIEP CDFKDSEERI LNSLKDELDN IGNSYEGLQS DFEAKYDLFF AQIGEVSKIL RDGSQVASKD SCDSQQAPSL DIGQIMEEKL WQFDADRTGM ADFALESAGA EIIPEHTSQG MKNYSPMLSI WKFPVFFQKM SPRIAITPGA SPGSCFAFEG GSGTLGIKLS QLIVIQNITL QHIPKETSPV GHIESAPRSF ELFSINGHSE EYLGVFEFSN EGKPIQTFQV KRNAIVSAVK FKFISNWGAA HTCIYRTRVH GSLSHNVV // ID E4XQH1_OIKDI Unreviewed; 403 AA. AC E4XQH1; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 9. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY12057.1}; GN ORFNames=GSOID_T00017924001 {ECO:0000313|EMBL:CBY12057.1}; OS Oikopleura dioica (Tunicate). OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Oikopleuridae; OC Oikopleura. OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY12057.1, ECO:0000313|Proteomes:UP000001307}; RN [1] {ECO:0000313|EMBL:CBY12057.1, ECO:0000313|Proteomes:UP000001307} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21097902; DOI=10.1126/science.1194167; RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C., RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C., RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., RA Cross I., Yadetie F., Muffato M., Louis A., Butcher S., RA Tsagkogeorga G., Konrad A., Singh S., Jensen M.F., Cong E.H., RA Eikeseth-Otteraa H., Noel B., Anthouard V., Porcel B.M., RA Kachouri-Lafond R., Nishino A., Ugolini M., Chourrout P., Nishida H., RA Aasland R., Huzurbazar S., Westhof E., Delsuc F., Lehrach H., RA Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F., RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., RA Du Pasquier L., Boudinot P., Liberles D.A., Volff J.N., Philippe H., RA Lenhard B., Roest Crollius H., Wincker P., Chourrout D.; RT "Plasticity of animal genome architecture unmasked by rapid evolution RT of a pelagic tunicate."; RL Science 330:1381-1385(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN653105; CBY12057.1; -; Genomic_DNA. DR InParanoid; E4XQH1; -. DR Proteomes; UP000001307; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001307}; KW Reference proteome {ECO:0000313|Proteomes:UP000001307}. FT COILED 186 206 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 403 AA; 45751 MW; BEE2F17BEA6E3126 CRC64; MTIKRKFITQ PRRADFKIPD LITAEMQLKK RNLFSEDKKI SNEEIPIIEK SEPELFKTAI SETSPQKSRF IDFTVVFEIV KLKLLVFANF MEQLFSKISG NMKNLKTKIT TKKLLPLLLI FFPFSLPFMK DVKFPELSRP NLDISLPSFS ALSSTYFPNI PSMYNIVPSP PVIFNANNKD DKDDFIHDLK SELRQLRSEI DELRRNPFRS EAVTEKLQEM IYTTLEDKTG MADYALESAG AEVIDKWTTP GIPTGNALMK IWNLPIFYHT MSPRLALQPN VHPGNCFAFA GQSGSLTVKL ARPIYPTNFT IEHIPKALSF GDVSSAPQNV TLETVNPITG KGSLIGSSVF NIDGIPVQTF RTSISESLLK PTQFIRLRIN SNWGNPNFTC LYRLRVHGSQ NDF // ID E4YH40_OIKDI Unreviewed; 173 AA. AC E4YH40; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY34814.1}; DE SubName: Full=Whole genome shotgun assembly, allelic scaffold set, scaffold scaffoldA_273; GN ORFNames=GSOID_T00024851001 {ECO:0000313|EMBL:CBY34814.1}; OS Oikopleura dioica (Tunicate). OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Oikopleuridae; OC Oikopleura. OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY34814.1}; RN [1] {ECO:0000313|EMBL:CBY34814.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21097902; DOI=10.1126/science.1194167; RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C., RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C., RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., RA Cross I., Yadetie F., Muffato M., Louis A., Butcher S., RA Tsagkogeorga G., Konrad A., Singh S., Jensen M.F., Cong E.H., RA Eikeseth-Otteraa H., Noel B., Anthouard V., Porcel B.M., RA Kachouri-Lafond R., Nishino A., Ugolini M., Chourrout P., Nishida H., RA Aasland R., Huzurbazar S., Westhof E., Delsuc F., Lehrach H., RA Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F., RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., RA Du Pasquier L., Boudinot P., Liberles D.A., Volff J.N., Philippe H., RA Lenhard B., Roest Crollius H., Wincker P., Chourrout D.; RT "Plasticity of animal genome architecture unmasked by rapid evolution RT of a pelagic tunicate."; RL Science 330:1381-1385(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN654547; CBY34814.1; -; Genomic_DNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; SQ SEQUENCE 173 AA; 18903 MW; 85BB60133C92073F CRC64; MADYALESAG AEVIDKWTTP GIPTGNALMK IWNLPIFYHT MSPRLALQPN VHPGNCFAFA GQSGSLTVKL ARPIYPNNFT IEHIPKALSF GDVSSAPQNV TLETVNPISG KGSIIGSSVF NIDGIPVQTF RTSISESLSK PTQFIRLRIN SNWGNPNFTC LYRLRVHGSQ NDL // ID E4YP64_OIKDI Unreviewed; 432 AA. AC E4YP64; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY37262.1}; DE SubName: Full=Whole genome shotgun assembly, allelic scaffold set, scaffold scaffoldA_657; DE Flags: Fragment; GN ORFNames=GSOID_T00030350001 {ECO:0000313|EMBL:CBY37262.1}; OS Oikopleura dioica (Tunicate). OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Oikopleuridae; OC Oikopleura. OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY37262.1}; RN [1] {ECO:0000313|EMBL:CBY37262.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21097902; DOI=10.1126/science.1194167; RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C., RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C., RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., RA Cross I., Yadetie F., Muffato M., Louis A., Butcher S., RA Tsagkogeorga G., Konrad A., Singh S., Jensen M.F., Cong E.H., RA Eikeseth-Otteraa H., Noel B., Anthouard V., Porcel B.M., RA Kachouri-Lafond R., Nishino A., Ugolini M., Chourrout P., Nishida H., RA Aasland R., Huzurbazar S., Westhof E., Delsuc F., Lehrach H., RA Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F., RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., RA Du Pasquier L., Boudinot P., Liberles D.A., Volff J.N., Philippe H., RA Lenhard B., Roest Crollius H., Wincker P., Chourrout D.; RT "Plasticity of animal genome architecture unmasked by rapid evolution RT of a pelagic tunicate."; RL Science 330:1381-1385(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN654931; CBY37262.1; -; Genomic_DNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 114 134 {ECO:0000256|SAM:Coils}. FT COILED 142 162 {ECO:0000256|SAM:Coils}. FT COILED 178 205 {ECO:0000256|SAM:Coils}. FT COILED 432 432 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:CBY37262.1}. SQ SEQUENCE 432 AA; 48004 MW; AC98B49CEC9C1791 CRC64; RSMVTPAKAQ PEKYHGVVRE LPRVDLDRSG SSGSSSLNAS TLLYHEEKVS NVTLLEHWIK ANLKTVISLF ACLLIICVAA QLGRFEGEAS EEPHYENIMT NTIVPDFSPM QKQLEELKLM VTSLNEKQEA SEKLLGKLPE QLELLQRLAN KQEAQMQTMK VRTTMTPPQR IIEPCDFKDS EERILNSLKD ELDNIGNSYE GLQSDFEAKY DLFFAQIGEV SKILRDGSQV ASKDSCDSQQ APSLDIGQIM EEKLWQFDAD RTGMADFALE SAGAEIIPEH TSQGMKNYSP MLSIWKFPVF FQKMSPRIAI TPGASPGSCF AFEGGSGTLG IKLSQLIVVQ NITLQHIPKE TSPVGHIESA PRSFELFSIN GHSEESLGVF EFSDEGKPIQ TFQVKGNAIV SAVKFKFISN WGAAHTCIYR TRVHGSLSHN VV // ID E4YYY5_OIKDI Unreviewed; 244 AA. AC E4YYY5; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY40663.1}; DE SubName: Full=Whole genome shotgun assembly, allelic scaffold set, scaffold scaffoldA_1761; GN ORFNames=GSOID_T00022684001 {ECO:0000313|EMBL:CBY40663.1}; OS Oikopleura dioica (Tunicate). OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Oikopleuridae; OC Oikopleura. OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY40663.1}; RN [1] {ECO:0000313|EMBL:CBY40663.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21097902; DOI=10.1126/science.1194167; RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C., RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C., RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., RA Cross I., Yadetie F., Muffato M., Louis A., Butcher S., RA Tsagkogeorga G., Konrad A., Singh S., Jensen M.F., Cong E.H., RA Eikeseth-Otteraa H., Noel B., Anthouard V., Porcel B.M., RA Kachouri-Lafond R., Nishino A., Ugolini M., Chourrout P., Nishida H., RA Aasland R., Huzurbazar S., Westhof E., Delsuc F., Lehrach H., RA Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F., RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., RA Du Pasquier L., Boudinot P., Liberles D.A., Volff J.N., Philippe H., RA Lenhard B., Roest Crollius H., Wincker P., Chourrout D.; RT "Plasticity of animal genome architecture unmasked by rapid evolution RT of a pelagic tunicate."; RL Science 330:1381-1385(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN656035; CBY40663.1; -; Genomic_DNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; SQ SEQUENCE 244 AA; 27571 MW; F45EC1731FAC4BAC CRC64; MSEDCNKMSK FDHWIKDEYK SEEFKSVIED QIEKRTKKGS IFDSLGSFGG SAKLQDIEAL VATRLREYHE DKTGKMDYAL SSNGGSILEH RCTASKADNS RYWKILGLTV MHFQNHPSRA IEDNTGPGNC WAFDGARGHL TIKLAKPVMV TSITVEHIPA NLSPTGSISA PRAFTVSAMN NENDLVGHEL GQWEYNMNDH PIQSFNIMRK MHFPSGIIDF RFDSNWGGSY TCIYRVRVHG DPFD // ID E4Z2H3_OIKDI Unreviewed; 403 AA. AC E4Z2H3; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 9. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY41901.1}; DE SubName: Full=Whole genome shotgun assembly, allelic scaffold set, scaffold scaffoldA_2445; GN ORFNames=GSOID_T00023997001 {ECO:0000313|EMBL:CBY41901.1}; OS Oikopleura dioica (Tunicate). OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Oikopleuridae; OC Oikopleura. OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY41901.1}; RN [1] {ECO:0000313|EMBL:CBY41901.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21097902; DOI=10.1126/science.1194167; RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C., RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C., RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., RA Cross I., Yadetie F., Muffato M., Louis A., Butcher S., RA Tsagkogeorga G., Konrad A., Singh S., Jensen M.F., Cong E.H., RA Eikeseth-Otteraa H., Noel B., Anthouard V., Porcel B.M., RA Kachouri-Lafond R., Nishino A., Ugolini M., Chourrout P., Nishida H., RA Aasland R., Huzurbazar S., Westhof E., Delsuc F., Lehrach H., RA Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F., RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., RA Du Pasquier L., Boudinot P., Liberles D.A., Volff J.N., Philippe H., RA Lenhard B., Roest Crollius H., Wincker P., Chourrout D.; RT "Plasticity of animal genome architecture unmasked by rapid evolution RT of a pelagic tunicate."; RL Science 330:1381-1385(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN656720; CBY41901.1; -; Genomic_DNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 186 206 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 403 AA; 45645 MW; 04ABDD8F832B5886 CRC64; MTIKRKFITQ PRRADFKIPD LITAEMQLKK RNLFSEDQKI GDEAIPIIEK SEPETIKTAI SETCPQKSRF TDFTVVFETV KLKFLVLVNF MEQLFSKISG NMKKLKTKIT TKKVLPLLLI FFPFSLPLMK DVKFPELSRL NLDISLPSFP ALPSTNFPNI PSIYNIIPSP PVTFNANNKD ENDDFIHDLK SELRQLRSEI EELRRNPFRS ETVTEKLQEM IYTILEDKTG MADYALESAG AEVIDKWTTS GIPTGNALMK IWNLPIFYHT MSPRFALQPN VHPGNCFAFS GQSGSLTVKL ARPIYPTNFT IEHIPKALSF GDVSSAPQNV TLETVNPITG KGSIIGSSVF NIDGIPVQTF RTSISESLSK PTQFIRLRIN SNWGNPNFTC LYRLRVHGSQ NDF // ID E4ZRK8_LEPMJ Unreviewed; 783 AA. AC E4ZRK8; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Similar to spindle pole body-associated protein sad1 {ECO:0000313|EMBL:CBX93855.1}; GN ORFNames=LEMA_P035290.1 {ECO:0000313|EMBL:CBX93855.1}; OS Leptosphaeria maculans (strain JN3 / isolate v23.1.3 / race OS Av1-4-5-6-7-8) (Blackleg fungus) (Phoma lingam). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Leptosphaeriaceae; Leptosphaeria; Leptosphaeria maculans complex. OX NCBI_TaxID=985895 {ECO:0000313|Proteomes:UP000002668}; RN [1] {ECO:0000313|Proteomes:UP000002668} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JN3 / isolate v23.1.3 / race Av1-4-5-6-7-8 RC {ECO:0000313|Proteomes:UP000002668}; RX PubMed=21326234; DOI=10.1038/ncomms1189; RA Rouxel T., Grandaubert J., Hane J.K., Hoede C., van de Wouw A.P., RA Couloux A., Dominguez V., Anthouard V., Bally P., Bourras S., RA Cozijnsen A.J., Ciuffetti L.M., Degrave A., Dilmaghani A., Duret L., RA Fudal I., Goodwin S.B., Gout L., Glaser N., Linglin J., Kema G.H.J., RA Lapalu N., Lawrence C.B., May K., Meyer M., Ollivier B., Poulain J., RA Schoch C.L., Simon A., Spatafora J.W., Stachowiak A., Turgeon B.G., RA Tyler B.M., Vincent D., Weissenbach J., Amselem J., Quesneville H., RA Oliver R.P., Wincker P., Balesdent M.-H., Howlett B.J.; RT "Effector diversification within compartments of the Leptosphaeria RT maculans genome affected by Repeat-Induced Point mutations."; RL Nat. Commun. 2:202-202(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FP929116; CBX93855.1; -; Genomic_DNA. DR RefSeq; XP_003837295.1; XM_003837247.1. DR EnsemblFungi; CBX93855; CBX93855; LEMA_P035290.1. DR GeneID; 13284670; -. DR InParanoid; E4ZRK8; -. DR OMA; VANYELH; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000002668; Genome. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002668}; KW Reference proteome {ECO:0000313|Proteomes:UP000002668}. SQ SEQUENCE 783 AA; 86062 MW; 7FDA935181EE5DD6 CRC64; MSQLASSGAP TPRRSGRLSN KASSIAETAV TTVTKAGTRA RRTGPLIEVK SRKSNAYGAS GRVGTAEELP VAATGFAQAF QNQRGNALVR EGPVEESEDG TDSADELAAE TPRLSGARNG HFPASSPPRS PTPTAGTAAT SVPGFSFLQS EDTPASEEDD AESVGNTSKS FGPLHEAGMI GQQDRPHMPY SSTQATPEPT PLVQKTNLRR SLQSQTTRMG ASQIKVPLQE QGAPPLRPSY LQTPAPGREN GTLAASAAAS AAHKKAIDES VDALLAKEQA RLHRDGAPQS QPKYQGRRRH ANSPKTVNEQ PGEVESPQKF QIDWPLKKHL SWVLGVLAAI TLVGWLGHSM MSSVASASDA NTTNNKPGLL SAVNARASYT MGKVAEFIQP PRGPTVEEEV AAFRAGDDNI MWHRMYKMSD KFETRINGVH ATIEELRKEL PDMLIVRRHE DGRSEISDDF WQALQAKLRS EEENPEWVQY LTQVKQKLDD IFDHSVDRDD TKVRPQAVSR QEFLELIDQR FRELSTRVNE NIEEAFKSQT EKFQSLVTAE AKKAMIESVR LQSLAQTNLV ANYELHLKSP NYFSPSLGAV VVPHLTSATR LDRARWFTTI AQKLALLPQR NPPQAALTEW RQPGDCWCSA PNVLGGAQTQ LTVSLALPMT PQKVTIEHVP MSMVPARDVS NAPRDVEIWV QTEKPVKSYY RYSGGTCGEG LPGWACLGAF KYNIHASNHV QTFDLVSETS EPIKRAMLRV KSNWGADHTC LYQVRLHGED ARADYEYQVR LND // ID E5ABM2_LEPMJ Unreviewed; 1043 AA. AC E5ABM2; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 14-OCT-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CBY01063.1}; GN ORFNames=LEMA_P021930.1 {ECO:0000313|EMBL:CBY01063.1}; OS Leptosphaeria maculans (strain JN3 / isolate v23.1.3 / race OS Av1-4-5-6-7-8) (Blackleg fungus) (Phoma lingam). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Leptosphaeriaceae; Leptosphaeria; Leptosphaeria maculans complex. OX NCBI_TaxID=985895 {ECO:0000313|Proteomes:UP000002668}; RN [1] {ECO:0000313|Proteomes:UP000002668} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JN3 / isolate v23.1.3 / race Av1-4-5-6-7-8 RC {ECO:0000313|Proteomes:UP000002668}; RX PubMed=21326234; DOI=10.1038/ncomms1189; RA Rouxel T., Grandaubert J., Hane J.K., Hoede C., van de Wouw A.P., RA Couloux A., Dominguez V., Anthouard V., Bally P., Bourras S., RA Cozijnsen A.J., Ciuffetti L.M., Degrave A., Dilmaghani A., Duret L., RA Fudal I., Goodwin S.B., Gout L., Glaser N., Linglin J., Kema G.H.J., RA Lapalu N., Lawrence C.B., May K., Meyer M., Ollivier B., Poulain J., RA Schoch C.L., Simon A., Spatafora J.W., Stachowiak A., Turgeon B.G., RA Tyler B.M., Vincent D., Weissenbach J., Amselem J., Quesneville H., RA Oliver R.P., Wincker P., Balesdent M.-H., Howlett B.J.; RT "Effector diversification within compartments of the Leptosphaeria RT maculans genome affected by Repeat-Induced Point mutations."; RL Nat. Commun. 2:202-202(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FP929138; CBY01063.1; -; Genomic_DNA. DR RefSeq; XP_003844542.1; XM_003844494.1. DR EnsemblFungi; CBY01063; CBY01063; LEMA_P021930.1. DR GeneID; 13291710; -. DR InParanoid; E5ABM2; -. DR OMA; AKTSWSA; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002668; Genome. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002668}; KW Reference proteome {ECO:0000313|Proteomes:UP000002668}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 1043 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003194939. SQ SEQUENCE 1043 AA; 112210 MW; A0E585AAFBB63945 CRC64; MPTIGAPLRT WTLLLLVCSS ATYVTGHAAN ETSNAISAAT AASSITTPSS PAATATVPLP SQAPSPAPRR YSNSDAQCPA RTINYITHTL PQQCAKTSWS APSETASTHI TTGEAQSQPT TNAPTNTGEP RPEAESQLDD KLSSAISLIA AHAHGTPSGT AGDSGVEVET DSPFDNASFL SFEEWKKKNL ERVGQSPENV GQGRTAPSGE QARRRPVNVN ALDSLGDEGE IELDFSGFGN PGDKKETSNQ SRQEGTQEPG MAKATGEEGI AAPSSWALSK DAGKTCKERF NYASFDCAAT VLKTNKQAKS ATSILVENKD SYMLNECSAD NKFLIVELCD DILVDTIVLA NYEFFSSMFR HFRVSVSDRY PVKLERWRTL GTFEARNSRD IQPFLITEPQ IWARYLRVEF LTQYGNEFYC PLSLIRVHGT TMMEQFRQEE EQARGIEDDG DLEAESRDVV KPAEDSGPLP PDQIPIQAVK DGNAGSQDTS SSESITTATP SSGPETVPSQ TPQQGQEHDM SSGSTTVASV TEFTEAASDT AAAVSSSPAV KQRGTSGESD PSASHGSSVP ATPSPQSSTG SEISTQSQSH ASDAATTVSS PPTSQSSSPS SSTGNGTGSP PSNSQPAKAA SDATGASPPI QPPPRGSSTQ PNGAVPSTQE SFFKSIHKRL LYLEANSTLS LQYIEEQSRS LRDAFLKVEK RQLAKTEKFL DHLNSTVMLE LKSFRNMYDQ LWQSTVIELE SMKERQRNEM GEIGARLSLL ADELVWQKRM AVVQSTLLLV CLGLVLFVRS GTLGSSTDVP IVQHLGSKYT SLFDTPSRPQ PESGLARRRR TFRDMWRSDT SAGLSDRDHQ SDAPHPLSDA DTDGARSPLH NEYSPGSPNP PTPATLGARD ARLHDRTLRA YADNDMDLSD PEPEPQLDPD DQAARIQVLE TQSGPATPNG TRDCRPSWEE VDRAMDELKA EGRGASGHSG GPVGGSQVGR GEAVGKDEDR ERERDRDKGK KKAKGKGKAM GNAQRLGQAQ ADKKSPLRRA YSSFDERNGD EDS // ID E5ESI2_9PHYC Unreviewed; 5688 AA. AC E5ESI2; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 16-SEP-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:ADQ91794.1}; GN ORFNames=BpV1_167 {ECO:0000313|EMBL:ADQ91794.1}; OS Bathycoccus sp. RCC1105 virus BpV1. OC Viruses; dsDNA viruses, no RNA stage; Phycodnaviridae; Prasinovirus; OC unclassified Prasinovirus. OX NCBI_TaxID=880159 {ECO:0000313|EMBL:ADQ91794.1}; RN [1] {ECO:0000313|EMBL:ADQ91794.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=20861243; DOI=10.1128/JVI.01123-10; RA Moreau H., Piganeau G., Desdevises Y., Cooke R., Derelle E., RA Grimsley N.; RT "Marine prasinovirus genomes show low evolutionary divergence and RT acquisition of protein metabolism genes by horizontal gene transfer."; RL J. Virol. 84:12555-12563(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HM004432; ADQ91794.1; -; Genomic_DNA. DR RefSeq; YP_004061597.1; NC_014765.1. DR GeneID; 10020077; -. DR KEGG; vg:10020077; -. DR Gene3D; 2.130.10.30; -; 1. DR Gene3D; 2.60.120.200; -; 2. DR Gene3D; 2.60.120.260; -; 2. DR InterPro; IPR013320; ConA-like_dom. DR InterPro; IPR000421; FA58C. DR InterPro; IPR013517; FG-GAP. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR013519; Int_alpha_beta-p. DR InterPro; IPR009091; RCC1/BLIP-II. DR InterPro; IPR000408; Reg_chr_condens. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00754; F5_F8_type_C; 1. DR Pfam; PF14312; FG-GAP_2; 12. DR Pfam; PF00415; RCC1; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PRINTS; PR00633; RCCNDNSATION. DR SMART; SM00191; Int_alpha; 9. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF49899; SSF49899; 2. DR SUPFAM; SSF50985; SSF50985; 1. DR PROSITE; PS50022; FA58C_3; 1. DR PROSITE; PS50012; RCC1_3; 4. PE 4: Predicted; SQ SEQUENCE 5688 AA; 603822 MW; 3FA1479BD5E6953F CRC64; MRFLEVPALI TEDIEDPDIV DITETETVTA SINGTTLTID GDSKVSGETT ITLTGDVTVS TVTTSNIRQP SIHKIIIKTN GAGNFTYMPN NTVTASSVST TTKTGSDLDI LLITTSGGDT YLDLDLADYV SPAANFSNVL TGFNVSELTS AYGTTAKLYK PDGTLLTTVN SAATGSGIGF GSGVESSPGD VIYKLEVTDD VGRKSKYNLP RKWINYAMET TSTPGFEKLA LEGNEYTSTV TLPNLTSSSK ARIHKTTDHG YMGIKFKIQG RNAPNDDWVD VDDTGENENA DESAEFLAFG DYLYQRVWVK PTAKATTTVS SSNVVTLTLS NVTEDVTTKT ITGMDPFEID ETSNVFVFEE PAKIGTVPYT VTVGTETYTN AVTATTEEVG STPNTDLTAA GPNMKHLIGG TFSAGYMCQA SADGKYIAYN KTNDSVLYVL KGDPVSGYTD YTTITYSQVS NANLVGSMSY DGKYIIIARG QSNSYTYEIW KNDESTSTFT KLAQSVSRIG GYNMVYKPSF VPRSGTYDFA ITGTDNNNNF DIKLYKHTTD TDTWVGQTTI NDPTNRPTGD GYRGYFSCQF TRDGKYLMMG AYSTYGGYQM YTIDWDANTA VFSGANWNQN GNIGHSGTVT LDGKYVLIFH NDSNSRKIFK NNDAGDWSSA TDVTTDFTFN NDDNDQGHDI SFFGDNSEYF VSKMNGGYVR VYSWYAKKEV TLTYNGKDML TLAHKGGLTT TSVKLYKDDI LYHTFGASET SVVIAEAGVY QAIADEKYYS AKVTVTTVTE TQADLYLSYR TCFLLKKDGK VWYWGESTNG ANATGNNTTV NVATLNDNLN ALPSGIKQIG RAAVDAHCRC AITNDGKLYT WGYNGHGGLG RGNTTDYTNQ GPWLVSTQSS NTFTFCHSSY YSNFALQDNG YIWSCGYNGH YLIGNGNISN ISTFFRINLP NVIDFHANHN IALALTSDNE VYIWGTEYNS SMGGASGTTG TPTHMTALDG KNIVKVRTGG YQGYAISSDG KLYSWGYNNN YQLPKGATGN ISTPEEMTWF SRKNIKVVDV APGHNETAHL ALDDQGNMYV WGIDNHGQLG SGAGGVQSGW PFLLKENIAS ISCGYNECGC IDKFGQVYTW GDAAQGQQNW IHGTNSDTDI THPTSNNLSV GSSVVYDGFD KYLLGKDTTV TSNVTFGSNT VSLGTKSEVF INDPHTYKFK IMDSGKTTYT STVISSTPTR PTGRVYPPKS GTHKNLTTSS TANVDNTWTI DEALYGRGDY KASMDQTGNH VSQDAYYGFN GLIDNSCLHT RTSPTVAIQL HMPQKIKLTK YAMYSRNFTG EYTYAPRDWK VYGSNDNGTN WTELDSQTNQ TVSAWGSQLD FRDTKREYTV TGNTKYFSSY KLDVTANGGD GNYVVLSQIE YYGDEEGFLS DDGFGKLTLD VKGDTSATSN ITFHSNTFVM GAARDLYIKD VGEYTADIYG SSKAFLGSKT HTVSTVDTVP GFTAAFHHGA FSASDYSSAY SSVAAAATAG FVYSDTPAGE YTWGTLAHVK ERENPDFTAN STLHTDYSST YGWTTNSGWE VSAKDEFSTT YEAWKAFSKT YAYNNNWLSG SAPSASSPQW LKIKYPSVQV IKSYVIRARD NSSPRFPTAW KLQGSNNDSD WTDIGTEQTQ TEWVPGQNKS FDIPTNTTAY QYYRLRITGA RESASLSNSD YVAIGHWFLL TETSGSTVHT KYTRYTYTPP ASTITANVLR VAGGGGGSGT RGGGGGAGGL LYSENVSLSG TKSIVVGNGG IGYEGTQQSI ANGYDTVFTG LTTSIGGGHG GGNDAIIPTP GGSGGGGSAR IAAVGTGAAG TSGQGNAGGN GSSGDNHNGG GGGGAGGVGG NSGGVGGVGL DYSSVFGTTY GVSGFFAGGG GGGNSSGGNT GGNGGGGGGS NGHGTKHTGG GGAGIGNASS TSGDGGSGIV IIKKLGAANP PALNFDGYNK LSIDNVSSDA FTNFPTSIPI TAGNWTGDNF VKTSTSTSTV LYYDYVGGSY ATGTTIAGIE FRANSTGFTV HVLTDDNTDP NQLTVNGGSS TTSSVCAVGD TVRLYRNGSP HLATITVPDF YASSVESTYT IKKDGAAFAT TTSNTVYIRE AGTYTAEVKG SGAYVTEVSK VVSGSVTTNQ EWKANEDQIL YASDGSANNN FGRSCAMWGS YMVVGSFEGE SNTQGAIYIY KKESGTWTFK QKISGEGAND YFSGYGAVDI HEDTIIVGAY KHGTNNNTGK AYVYTRSGET WSLQSSIVSD DLATDDKFGI SVSVHGDYAV VGAQLDDGPN NSGAGYIFYR NGTSWSQQQK LNPGADDENV GTHVSIYGDY VVLSGKRSGS DLRGRAHVYK RSGTTWGNHT VISNPNPASN DAFSERLSIT DGYVAIGSDA DDPGGTSDAG QVYVFKHSGT NDWTLEATLE ASDKTSSMRF ASGISMTPNV LVVGAHQEST GGANAGAVYV FERSGTIWTE VKKVVASDAA AGDRLGIYVG VNSDGLSFVA GAYGESTNGA SAGAAYVYEK GPVGPNITYD NYNKITLNQL NYEDTATVTD PNGSTHDIGT AKTMYVKDAG EYVFKISGTD KYVESNVHVS SVDLAGAPTK PIDFDGYNKL TLIDAGSNVS ANVTLGATKY DLGSASTFYI KDTGTYDLEM SGSNVFALSS NVVSSISSLP DTLLHLDFES GGLVSTGLNT TQYKYGSSSL YRNNSSVTVS NDGSYNIGNT SSAKATIMLW VRPESYPSYS GNANRIIVAA DGRSESSPSY KKWAIVLREH SSIDEEWQFH YPNSSNTNRV ATYKPADIPV GAWYHVCVTI ESGSLKFYID GVDVTSSLTG SKSDIESAWA GFNDDEGFKF CKTDANYLKG WVDDFKLFNS VLTLSEITAH MNASKLSSPS LTFDNYNKLT VSNFTTTDIE WPPASFTSPS VSDGNPASIN GITSTNSAKD AEWVISGASY GNGTYKASTT VAVHETHAVH GPFQMFNKVN GSLDTKTTTT NTSGEWTIEL PSSILLYKYN LHHGNAVSTS ANYPKDWTIE GSNDGTTWVV IDTRSNETYA SSVTGFDVSK REYTVSNNTT KYKHYKLNVS AINSGSYILI GAWRLFEMNT RTATLTDPNG STYALGQTQD TIYIRDTGDY TLDVTNNDQK AIVAKTVSGT LSANPGTPSI TSILVTRNNG SDYTEASPST ERNGAWESRF VDGNAGTVEM DGSSGSGVGF SMYLNFDQTF TVNTAVFKMI STYQTYKDVS FRCGVSDSDY ITVDPGGLNF ASTQTITFTF SSPITVNKLY FAVVQDNGND LYNIRMSSGD IIINDVAYAL SDSDKVLPSP ALNFDTYNKL TFANVDSDAT SNIDFFSNTY EMGSRKELII NDTGTYYANI YSSNTLALVK KEIITESGAK KVAFHHGAFS ASDYSSAYST VEAAATAGHV YSDTSTTPTY TWGTLGTISS TSTNTTYTWT PASTGFNKAD ILMVAGGGGG SKSSHSYAAG AGGGAGGLVY KENQSISGVQ TIIVGNGGAS GNGGTDGQNT TALGFTALGG GGGATSGQDG NNGGSGGGGD GSSTTNSSSA PSVSNHLSSW VSSTGDYSVS NYRGSLRNPS HDTSAKTYSY YYYWEYNSGN GNYDSRRDNV LVYNYGDRKW YDGRNDEQPV YVMNGDATTY SGTYPTSSDP NQQTISLWSS SVRLGVFNNP YYDSSWVAPN AGGSASQPSH ASGGFGNSGG QGGGAQTGGG GGGGAGSVGQ DGNTATGNGG NGGDGKMYSI SGSNVYYAGG GGAGTGTGGS VAGTGGQGGG GNGGQGNGGN GGNGSSHTGG GGGGSNKNST AGNGGSGVVI IKTGGIVTIG PETTLSETSI VKQFIYDENG YTDAYFSGGG EQAQEIRLSD DGLAMAIGDN MYNSRVGRAY YYERSSISGT FTKTHTFDAV QSGSTYDFGN GIAMNEAKTR IAISAPDSSS GGTSVAKIYV YDRASTSASW PGTPTFTIDY PNSSVIRFAR GIEMSGDGNT ILAGSDSHAT NSNSEGGMFV FEYASSSWSK TFEVTNSTHR FGMRIHMSKD GTRIIGGGTP SGMLVYHKVS GTWSSTIQLT TESAHTCGIS PDGNTVAVGI LGYSSNLGRV AVYKYSASSW GSVVYVDTTV GSGTYQFGTT PRFNNDGTLL VTGCSAYNSY MGCFEMWKYE SGSWVFKKQF LNPTVKTGHG GDNGEFFGEY MAMDNAGTSL IVSNQGNDVQ GADYGRVVLY GAGSPPSLTF DGYNKLTPPV LEASFTATRL SDWDNNNQSK SPNSGKGDWP GTSFTGESDA HWWTLHNNDK VYTSSYNYGT TRDAAQLFTT VTSSDWNHGY HTQSGWNATK ILKLGYKFTV GSKTLGSMKL WQAPASYPTG DVTIKYWDGS SLKTVTNQSP SGFPSSISYY TEQEFTFNSA NAQYWLIECK THASSPSTNY IGLAGWQLLS GRDPVSTVLT KGSDSYDIGT ASSIYIDATG TYDAQAKNSN TFVIKTSNVV SGSITRSQVW KADGTEDQIL YGSDPGADDN FGHAVAVDGN YAVVGSRYND TGGTDRGAAF IFHKSGGTWT EQAMVQPSDT ANDDWFGRGV AISGDYVIVN AYAKTTNNTG QGAAYIFYRS GTSWAQQAKL NASDSEASDA YSYSVDIDGD YAIVGSLNED PGGTSNAGSA YIYVRSGTSW SQQAKIQASD KEASDTFGRG VTISGDYAAV GADNEDTSGS NAGSVYIFKK VDIFTSYTVR ITRNGDTSST GTGNFDITPS DGNVTLRNWG NSPSSGNLGA ALWDNASGTS LRVYDDGTII LAGETLTTYT VGGSPATFYE CGSGLGGTNF HVAITIVSRT TGPGWSQQQK IQSSDIQADD YFGGGAGQGV SLSGDYLAVG ARKEDTGGSE AGSVYIFKKA SGSETWSQEA KLQASDVSAA AQFGMSVSLS GDVLAVGANA EDTSASDAGA AYIFERSGTT WTEVKKITAS DTQASDYFGE SISTDGTTTF VGAFGEDTKG SNAGAAYVYE KQYVGPTLTY DNSNKLSLTG VTTPSSNLTV GTNTYDIGSA KDVYIKDQGT YTFHTNDGDQ ALILNKTVSS TPSGTTYNYT TGTAYTITTD SLFFNYEAWN YSGSGNWLNQ VNTANNPGVF PSSITYNSTS PKSFVFNNSN TERINIGNVD MQQDWTLECW AKLATVSTAG LFGHGVHSTQ QGLHININNS GGKSRMGFFS NDLDATFSFQ VDTWYHLVYV YDRTNSHNKK IYINGVKEAD ANGSSYNRGT DEFSIGNTYS NGTNGDPMRG EIAVARMYTK CLTAAEIGVN YAAGYLGSST TTTLSGTLPS QVYDNTKTIT VSNIPSGTST VGKIYKGATA YTIHATEPTS NVIIKNTGSY VSVFTTSNIA YLTNTVNVNA TPTTTSDDNT IEDAAPIVTI TTTTVGEPLS YKAYLKTGIN VNASTSWTAY DIFGSGYSEE INQGSFTLAN SNLVAPETGY YRLSFNIYIV SVSGSGERTH VGVKPTVEGT DLNEISASNY IRVASGHNEA STSMSTIIHL TGNKEVNLKF ARLSTVGRNN QINPGSLITL EKVTTSTLYK AYLKTGINVN ASTSWTAYDI FGSGYSEEIN QGSFTLANSN LVAPETGYYR LSLNIYLYST GDRTNVGVKP VVEGTDLNEI SASDYIRNYE GHNEASTNMS TIVYLSGNKE VNLKFARLTD ITTSVTIQSG SFITLEKI // ID E5R2R0_ARTGP Unreviewed; 576 AA. AC E5R2R0; DT 08-FEB-2011, integrated into UniProtKB/TrEMBL. DT 08-FEB-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFQ97044.1}; GN ORFNames=MGYG_00088 {ECO:0000313|EMBL:EFQ97044.1}; OS Arthroderma gypseum (strain ATCC MYA-4604 / CBS 118893) (Microsporum OS gypseum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Microsporum. OX NCBI_TaxID=535722 {ECO:0000313|Proteomes:UP000002669}; RN [1] {ECO:0000313|Proteomes:UP000002669} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4604 / CBS 118893 {ECO:0000313|Proteomes:UP000002669}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS989822; EFQ97044.1; -; Genomic_DNA. DR RefSeq; XP_003175996.1; XM_003175948.1. DR EnsemblFungi; EFQ97044; EFQ97044; MGYG_00088. DR GeneID; 10031307; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR InParanoid; E5R2R0; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000002669; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002669}; KW Reference proteome {ECO:0000313|Proteomes:UP000002669}. SQ SEQUENCE 576 AA; 63938 MW; 3B22787709A62D47 CRC64; MAPQRRTRRA TPTAAAPAAP EAENPFLPSI ETQQSFAYGS STPALPRRLG SHPSANNAEE VAATLSMSAG FHQIEDEARR SPEKQRRAES VLSGRETSLS PAPRETIRRL TPDIQLMGSL REASGEPEDH QDEHRHQHQH QHQQHHQQHH QQDDDAEADL LADAIDGSSI SWNTERHLLA TEQQQVSRRP ANLGFPNWPA RATAATAARR TAGPPMSPSQ ASSTSIQLQH LQRQQHLQHH QLRGQPQRSR ATAERIERGT AIGRPVTTTT TAAGIATTTS LSPSGRRESA SDGERVDTPQ SEHTPSSSRP PSAQDILTPI SPSTSSSSGL RLGFSHVITI LLTVMVALNG YLLRDEIASA AKSIYLPGRG SSSLTGSNCT ENISQMMTAV EQRLTTMTKD ITLLKQEVSK VPEVNQNPHQ SDKIKSQNKI QLAVELGQAI VPEEVIVEHM PREATLDNGA AAPQLMELWG EYEEAEKDEP LKERLARVWP GEPQSAYVNE PSLGPSFVRL GRWRYDIHHP HSQHQHDHQY HHIQRFPVQA SSVIERKTKR VVVVARTNWG QREYTCIYRV RLHGRP // ID E5SG90_TRISP Unreviewed; 546 AA. AC E5SG90; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EFV56143.1}; GN ORFNames=Tsp_06489 {ECO:0000313|EMBL:EFV56143.1}; OS Trichinella spiralis (Trichina worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia; OC Trichocephalida; Trichinellidae; Trichinella. OX NCBI_TaxID=6334 {ECO:0000313|EMBL:EFV56143.1, ECO:0000313|Proteomes:UP000006823}; RN [1] {ECO:0000313|EMBL:EFV56143.1, ECO:0000313|Proteomes:UP000006823} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ISS 195 {ECO:0000313|EMBL:EFV56143.1, RC ECO:0000313|Proteomes:UP000006823}; RX PubMed=21336279; DOI=10.1038/ng.769; RA Mitreva M., Jasmer D.P., Zarlenga D.S., Wang Z., Abubucker S., RA Martin J., Taylor C.M., Yin Y., Fulton L., Minx P., Yang S.P., RA Warren W.C., Fulton R.S., Bhonagiri V., Zhang X., Hallsworth-Pepin K., RA Clifton S.W., McCarter J.P., Appleton J., Mardis E.R., Wilson R.K.; RT "The draft genome of the parasitic nematode Trichinella spiralis."; RL Nat. Genet. 43:228-235(2011). CC -!- SIMILARITY: Belongs to the mitochondrial carrier (TC 2.A.29) CC family. {ECO:0000256|RuleBase:RU000488}. CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFV56143.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABIR02000949; EFV56143.1; -; Genomic_DNA. DR RefSeq; XP_003378612.1; XM_003378564.1. DR STRING; 6334.EFV56143; -. DR EnsemblMetazoa; EFV56143; EFV56143; EFV56143. DR GeneID; 10909509; -. DR KEGG; tsp:Tsp_06489; -. DR CTD; 10909509; -. DR eggNOG; KOG0753; Eukaryota. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR KO; K19347; -. DR Proteomes; UP000006823; Unassembled WGS sequence. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0006810; P:transport; IEA:UniProtKB-KW. DR Gene3D; 1.50.40.10; -; 2. DR InterPro; IPR018108; Mitochondrial_sb/sol_carrier. DR InterPro; IPR023395; Mt_carrier_dom. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00153; Mito_carr; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF103506; SSF103506; 1. DR PROSITE; PS50920; SOLCAR; 2. DR PROSITE; PS51469; SUN; 1. PE 3: Inferred from homology; KW Complete proteome {ECO:0000313|Proteomes:UP000006823}; KW Membrane {ECO:0000256|SAAS:SAAS00013441}; KW Reference proteome {ECO:0000313|Proteomes:UP000006823}; KW Transmembrane {ECO:0000256|SAAS:SAAS00013441}; KW Transmembrane helix {ECO:0000256|SAAS:SAAS00013441}; KW Transport {ECO:0000256|RuleBase:RU000488}. SQ SEQUENCE 546 AA; 62011 MW; B0557B237B2C46C8 CRC64; MRFVNCTTFS YRDEMLSTRE QLMQLHQDLK ECDRRQVQLQ SLINNHEALF NSLDLKLREL SSVCQREQTS VEVSKISGVS VSVSDVDARI NLALKRYDAD RTMMPDFALE SSGGSVLSIR CTETYDQRVR VVTLFGIPLY YKAFSPRIVI QPGIVPGECW AFKGSVGSLV IKLSGVINVT SFSYEHVSKF IATDGNIESA PREFEVYGLM SKHDENPQLL GQYTYDDMGD PLQHFPVTAA NITPVPIVEF KIIRNYGHPK YTCLYRFRVH GERIYIIYWQ SNFMDKTQTS RESIGLKYVL SCAAATLAET ATYPLDLLKT RLQIQGEHGK LNSQFMTTPK QGMFTIFSNI VRKEGFFGLW NGITPAVTRH YDVYFFSVYT GVRVIFYETF REKLFHRNAD GTFDLWKAMC SSMASGAIGQ FLASPTDLVK VQMQMEGRRR LDGLPPSAFS GLAAAITSTP VDVVKTRMMN QTAANIAVGE RFYKSSIDCL LKTISNEGFF ALYKGFVPIW ARMAPWSLTF WKIVQCRLFS FNLIFNIQNA KAIVNN // ID E5SGH7_TRISP Unreviewed; 1051 AA. AC E5SGH7; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EFV56063.1}; GN ORFNames=Tsp_06578 {ECO:0000313|EMBL:EFV56063.1}; OS Trichinella spiralis (Trichina worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia; OC Trichocephalida; Trichinellidae; Trichinella. OX NCBI_TaxID=6334 {ECO:0000313|EMBL:EFV56063.1, ECO:0000313|Proteomes:UP000006823}; RN [1] {ECO:0000313|EMBL:EFV56063.1, ECO:0000313|Proteomes:UP000006823} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ISS 195 {ECO:0000313|EMBL:EFV56063.1, RC ECO:0000313|Proteomes:UP000006823}; RX PubMed=21336279; DOI=10.1038/ng.769; RA Mitreva M., Jasmer D.P., Zarlenga D.S., Wang Z., Abubucker S., RA Martin J., Taylor C.M., Yin Y., Fulton L., Minx P., Yang S.P., RA Warren W.C., Fulton R.S., Bhonagiri V., Zhang X., Hallsworth-Pepin K., RA Clifton S.W., McCarter J.P., Appleton J., Mardis E.R., Wilson R.K.; RT "The draft genome of the parasitic nematode Trichinella spiralis."; RL Nat. Genet. 43:228-235(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFV56063.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABIR02000950; EFV56063.1; -; Genomic_DNA. DR RefSeq; XP_003378692.1; XM_003378644.1. DR EnsemblMetazoa; EFV56063; EFV56063; EFV56063. DR GeneID; 10909832; -. DR KEGG; tsp:Tsp_06578; -. DR CTD; 10909832; -. DR eggNOG; ENOG410IIUZ; Eukaryota. DR eggNOG; ENOG41107GE; LUCA. DR InParanoid; E5SGH7; -. DR Proteomes; UP000006823; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006823}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006823}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 22 39 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 45 67 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 88 109 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 129 151 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 190 211 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 269 290 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 302 324 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 344 366 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 387 409 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 429 453 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 812 830 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 1051 AA; 119129 MW; 5BD99A000944172C CRC64; MDNVHHTTFA DSNKYSLTMG RINIPFVVFT CTIFISPSLH GNIDVILPIG QFSLLTLSSV TIYNEFIRVD ILTSNKMGCL YRAELNSIAS IWYCFFFVFL QGYLLYVGIE RYSTLAESKL NENSSSFLPL YIGLHGLAVT LLPFFIVSAI FKVGNLANDC VRIGEKRPPI IHTAGNWCCR KLKACWLHGL PVTQTLHLII ASTFLLCGAL VEAEFQGDES QLRIELKFVY RNQRFRIPSY APYENDSRIE QKLHQTVIPV RTFGISLPFL NYACALAAFA IVYPSVFWRA SRSFSLVFSM HLMIHAISAL FCFVAYSILR QVYIAGLYDP AHQTPFLLQE PFLVIIYLAT TAVMLFASMV VYGYGYNKYC LCVLSARGHC RLLHKVYSVY CEGYSAHIAA IVMLVMMVLC KAPILYDLMV LYQHQPQPML LSCIISDVCY LFLWILLWLG LTLKRDWTFT VRHTLTELGN LLDMRTLEGQ ACNESTLILL SDELAFATNN ECKKLALLQA AEKCNNASKG AGEVYWLKSH HGTEDSTKSQ KLAPLNNPEL NWSSRLALCE DNVSTFGTLP RAADRQKQNS HTTLYHHSHG WESPKVGEST VHSLDRAPYA TISRSQRANR PSAFEPRRTP SNNAFQSYGS RQSSYANLSN STTLNKQQQQ WSLTKPPPSS RSRNIPATSV QSVDAEIRST GNDPTFKPPI NSNSVEKRTT QLSWNVNPLY DANKATTKTS TISSLPDATT TTSAHFTTSI KRNVIYALLV SSYDIYRSFN SSISLNDSVP LSNSRSSKRT ATSNVSSAVR FEAELIASEE RFYVLWMKLL FYIIILLSAV TCGTGNQTIT HNDKVLSSLL NNFLSNRFKW INFADEYNGA KILDIPETEP YPCGSFLSYW TGYRYLTFYQ SARKVITHDT KSDECWTFKG NKGNLIIGLK SNVLITGFSY EHSASSNETV NYRKLTAPKE INFYAERIQM TSIIHVEIIG NYGNMPYTCI HRFRVYGILP SEFPKNSITN IREIMDQFTE QDPEDERSYK ARQDVLEMIS CYQEILHERK L // ID E5SIG9_TRISP Unreviewed; 1196 AA. AC E5SIG9; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Peptidase, S54 family {ECO:0000313|EMBL:EFV55435.1}; GN ORFNames=Tsp_04235 {ECO:0000313|EMBL:EFV55435.1}; OS Trichinella spiralis (Trichina worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia; OC Trichocephalida; Trichinellidae; Trichinella. OX NCBI_TaxID=6334 {ECO:0000313|EMBL:EFV55435.1, ECO:0000313|Proteomes:UP000006823}; RN [1] {ECO:0000313|EMBL:EFV55435.1, ECO:0000313|Proteomes:UP000006823} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ISS 195 {ECO:0000313|EMBL:EFV55435.1, RC ECO:0000313|Proteomes:UP000006823}; RX PubMed=21336279; DOI=10.1038/ng.769; RA Mitreva M., Jasmer D.P., Zarlenga D.S., Wang Z., Abubucker S., RA Martin J., Taylor C.M., Yin Y., Fulton L., Minx P., Yang S.P., RA Warren W.C., Fulton R.S., Bhonagiri V., Zhang X., Hallsworth-Pepin K., RA Clifton S.W., McCarter J.P., Appleton J., Mardis E.R., Wilson R.K.; RT "The draft genome of the parasitic nematode Trichinella spiralis."; RL Nat. Genet. 43:228-235(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFV55435.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABIR02001018; EFV55435.1; -; Genomic_DNA. DR RefSeq; XP_003375049.1; XM_003375001.1. DR STRING; 6334.EFV55435; -. DR EnsemblMetazoa; EFV55435; EFV55435; EFV55435. DR GeneID; 10904983; -. DR KEGG; tsp:Tsp_04235; -. DR CTD; 10904983; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; KOG2980; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; E5SIG9; -. DR Proteomes; UP000006823; Unassembled WGS sequence. DR GO; GO:0016021; C:integral component of membrane; IEA:InterPro. DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro. DR Gene3D; 1.20.1540.10; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR022764; Peptidase_S54_rhomboid_dom. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF01694; Rhomboid; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006823}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006823}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 93 112 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 152 171 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 183 202 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 234 254 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 260 279 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 291 312 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 951 971 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 877 901 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1196 AA; 134035 MW; 7E3B8B30C924B1A7 CRC64; MSNYLMNKIL NSNCFFLNRF MLRLCRLCYS LPTLSVPLAR NIHRFRRVEK KIIPRNSTDI SEKVVNGRNF RRSVEQFVVP VTERSLRYLC KPILFTAAFG CSCFLAATVA EYERYQSEKE YLLDRRNWRY EYGRKVGTLR QQLNDIWGKI TIGQKAIGGI IALNVAVYLA WKVPKLHPLL KRYFMCGYFG ASLCTPMIYS VFSHINFLHM AVNMYVLWSF GPTLVRLTGL EQFVALYLTS GAVSSMCSLI IKAINGNKKV SVGAAIIMVV GFDLAGLLFR WRLFDHAAHL GGSFFGILFY IGLTLFFILV VLRVKVTSEI NKSATFMLSY RHGSIEVRMS FFSCDVSCMP PSADGEVCFK KVRTSYGLLI NLIDDIAFDR TVRSVETPNE TISDEEPLQS FDEWTKKKLE EQQNLKPIGK GRGFVLFCSC CVRLNCVVCT VFQKLVATPS GSTATPYTNR NYASKDCGAK IMQANAGAQN VAALLKDKER DEYMLNACQT EVPKWFIVEL CETVQISAVE IANFELFSSS PKQFRLWVSE RYPTVEWNLV GEWTAQDARE VQRFQIPLKQ YAKYIRVELL SHYGSEHFCP VSLFRVFGIS MVEEYEAEAM HLEDEPNVPT SASPGEGVDP SSPSLVAVDQ DSAKSNDSSK MDLLNTAKDA VVNMAKEMLG KAKGVLRVVQ IAADKETATL EDSVPQRASC WTCTEQAVQS STKYCYFIRL VAAKASSMTT PSDLAEAEDV VSVSDRPTLL SLWNETEESG THESGSSFDQ LTTTADSDGK QWNGVAERQE QQQEEEKSNH HNGAADERLM CYLDNSSSVK NLSSLTKNPF DERLSPVLPN ELALPGSSTS SKESVFIRLN NRLKTLEMNV SLSSQYLSEL SRNYKRQIEE IQRLLNKTMQ VAGDVEMRLM MLIKSQDQTL GVLQSKVDNL TKRISLPVDR WVDQEAKWLQ YHYWLFFAQL GFCLLTLVLL MRSLRGALLK RDDVVRIVLE VLNMRAAQHA NSGAFTAFNN SPYGSRNTST SSAVRLGAGL VSHRKRSCSV NQHDKDLKLN RSNNNPLSVS SLTRAKLAHL TPNGSGSYHT HKSNGNSGGG DRHSRYQHLA GVLFSADSSK VPRNAELVRW LEMPIRLCTV KNVGSKKSQG EATSSEVNSS LQYGFGRLGF IGSNKRGSFK NRASLTKTGI SPALLAVAIA SCKRRC // ID E6R0R7_CRYGW Unreviewed; 899 AA. AC E6R0R7; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ADV20429.1}; GN OrderedLocusNames=CGB_B4380C {ECO:0000313|EMBL:ADV20429.1}; OS Cryptococcus gattii serotype B (strain WM276 / ATCC MYA-4071) OS (Filobasidiella gattii) (Cryptococcus bacillisporus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. OX NCBI_TaxID=367775 {ECO:0000313|EMBL:ADV20429.1, ECO:0000313|Proteomes:UP000007805}; RN [1] {ECO:0000313|EMBL:ADV20429.1, ECO:0000313|Proteomes:UP000007805} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=WM276 / ATCC MYA-4071 {ECO:0000313|Proteomes:UP000007805}; RX PubMed=21304167; DOI=10.1128/mBio.00342-10; RA D'Souza C.A., Kronstad J.W., Taylor G., Warren R., Yuen M., Hu G., RA Jung W.H., Sham A., Kidd S.E., Tangen K., Lee N., Zeilmaker T., RA Sawkins J., McVicker G., Shah S., Gnerre S., Griggs A., Zeng Q., RA Bartlett K., Li W., Wang X., Heitman J., Stajich J.E., Fraser J.A., RA Meyer W., Carter D., Schein J., Krzywinski M., Kwon-Chung K.J., RA Varma A., Wang J., Brunham R., Fyfe M., Ouellette B.F., Siddiqui A., RA Marra M., Jones S., Holt R., Birren B.W., Galagan J.E., Cuomo C.A.; RT "Genome variation in Cryptococcus gattii, an emerging pathogen of RT immunocompetent hosts."; RL MBio 2:E342-E342(2011). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=WM276; RA D'Souza C.A., Kronstad J.W., Taylor G., Warren R., Yuen M., Hu G., RA Jung W.H., Sham A., Kidd S.E., Tangen K., Lee N., Zeilmaker T., RA Sawkins J., McVicker G., Shah S., Gnerre S., Griggs A., Zeng Q., RA Bartlett K., Li W., Wang X., Heitman J., Stajich J.E., Fraser J.A., RA Meyer W., Carter D., Schein J., Krzywinski M., Kwong-Chung K.J., RA Varma A., Wang J., Brunham R., Fyfe M., Ouellette B.F.F., Siddiqui A., RA Marra M., Jones S., Holt R., Birren B.W., Galagan J.E., Cuomo C.A.; RT "Genome variation in Cryptococcus gattii, an emerging pathogen of RT immunocompetent hosts."; RL MBio 0:0-0(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP000287; ADV20429.1; -; Genomic_DNA. DR RefSeq; XP_003192216.1; XM_003192168.1. DR STRING; 367775.XP_003192216.1; -. DR EnsemblFungi; ADV20429; ADV20429; CGB_B4380C. DR GeneID; 10187392; -. DR KEGG; cgi:CGB_B4380C; -. DR EuPathDB; FungiDB:CGB_B4380C; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000007805; Chromosome B. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 3. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007805}. FT COILED 459 504 {ECO:0000256|SAM:Coils}. FT COILED 514 548 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 899 AA; 99338 MW; 9BC0FDE042EBE08D CRC64; MPPRRAAPQA SPARSTRSAQ SLTRDHAREE DDWEAAESMT GRSFKVPESR SKKPGTAIGL KDTSVNIAAA FHAAQTGHLP PPPHPNTSIS SNASYSRALQ VPRAISPAEQ LAQSARALSP VRFFLRPTEE DGDDYTSFSS VGIENTGESN TSGEGESYDY RQEEEYVRLV QQQKIAKSRA ETSTHVKNRR IKAMDEDMPY RPAEEDSVSL ASSDSGGGGE GIVKSGALYG RAGTRGKRLE RGEGYLGMGL GIQPRRRRKS RKGGMDGNES EEEGSLRTGR AWTPTVEVDG HRGSPTPLQL LRGRSPMMDR KSPVPLGAYQ QRRRPSDIRT IITNVLHGVV MGLQFVVELG TTVLYRIIIR PIEKAFGSGK GFVRRAKTDW WKWLGILLGI SLALRFLDNA FRTKGIYTAP DAPPSTIDEM SIRLTSLEHA TATLSDLLRA ISEGDNELHQ SAVIMKSKID EIEDAVSAER KRIEGVRGEL KKEKVTMQSE IDKLRGEIHV LSNQVGRHEN SLSSDRSTKS LQAVEREITQ LKSRMEQVER DVHAALEDGR LVAALERILP QWMPIRTDSQ GKFVVEPAFW TEMKKVMVGK GEVERIVRRL IGEAGVSGTR IKESPVDEQK VVEWMDKSFD RHLQGGVWIT REEFISTLNE KLQELARETP EKPTSKRPAA SSTVIIKSSK GEDLTSLLNS LIDTALLKYS KDTIARADYA LFTAGARVIP HLTSDTFTLQ KASTFGKLLW ASKDVQGRPP ATALHPDTSV GSCWPIKGSE GSLGVMLVDR VIVSDITIEH APQELALDIA TAPKAVKVLG LVDYAEGLEK LAEYRATHQM DLNHQEDTNY LPLGTFTYDP SSYSHIQTFP VSPDIVELGI RIGVVVFKIE SNWGGDLTCL YRVRVHGKA // ID E6R5V9_CRYGW Unreviewed; 713 AA. AC E6R5V9; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ADV21651.1}; GN OrderedLocusNames=CGB_D2700C {ECO:0000313|EMBL:ADV21651.1}; OS Cryptococcus gattii serotype B (strain WM276 / ATCC MYA-4071) OS (Filobasidiella gattii) (Cryptococcus bacillisporus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. OX NCBI_TaxID=367775 {ECO:0000313|EMBL:ADV21651.1, ECO:0000313|Proteomes:UP000007805}; RN [1] {ECO:0000313|EMBL:ADV21651.1, ECO:0000313|Proteomes:UP000007805} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=WM276 / ATCC MYA-4071 {ECO:0000313|Proteomes:UP000007805}; RX PubMed=21304167; DOI=10.1128/mBio.00342-10; RA D'Souza C.A., Kronstad J.W., Taylor G., Warren R., Yuen M., Hu G., RA Jung W.H., Sham A., Kidd S.E., Tangen K., Lee N., Zeilmaker T., RA Sawkins J., McVicker G., Shah S., Gnerre S., Griggs A., Zeng Q., RA Bartlett K., Li W., Wang X., Heitman J., Stajich J.E., Fraser J.A., RA Meyer W., Carter D., Schein J., Krzywinski M., Kwon-Chung K.J., RA Varma A., Wang J., Brunham R., Fyfe M., Ouellette B.F., Siddiqui A., RA Marra M., Jones S., Holt R., Birren B.W., Galagan J.E., Cuomo C.A.; RT "Genome variation in Cryptococcus gattii, an emerging pathogen of RT immunocompetent hosts."; RL MBio 2:E342-E342(2011). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=WM276; RA D'Souza C.A., Kronstad J.W., Taylor G., Warren R., Yuen M., Hu G., RA Jung W.H., Sham A., Kidd S.E., Tangen K., Lee N., Zeilmaker T., RA Sawkins J., McVicker G., Shah S., Gnerre S., Griggs A., Zeng Q., RA Bartlett K., Li W., Wang X., Heitman J., Stajich J.E., Fraser J.A., RA Meyer W., Carter D., Schein J., Krzywinski M., Kwong-Chung K.J., RA Varma A., Wang J., Brunham R., Fyfe M., Ouellette B.F.F., Siddiqui A., RA Marra M., Jones S., Holt R., Birren B.W., Galagan J.E., Cuomo C.A.; RT "Genome variation in Cryptococcus gattii, an emerging pathogen of RT immunocompetent hosts."; RL MBio 0:0-0(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP000289; ADV21651.1; -; Genomic_DNA. DR RefSeq; XP_003193438.1; XM_003193390.1. DR STRING; 367775.XP_003193438.1; -. DR EnsemblFungi; ADV21651; ADV21651; CGB_D2700C. DR GeneID; 10191134; -. DR KEGG; cgi:CGB_D2700C; -. DR EuPathDB; FungiDB:CGB_D2700C; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007805; Chromosome D. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007805}. FT COILED 143 190 {ECO:0000256|SAM:Coils}. FT COILED 328 348 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 713 AA; 79899 MW; 2543529A26F960D4 CRC64; MLTPCRAPEH WVVVELCDEI RIDALEIAVW EFFSGVVREV QVSVGGEDDD GHGDGADGGA AGRGHMWRQV GSFVGKNVRG PQTFTLSQPT AFHRFIRLDF PSYYGSEYYC PVSSLKVYGM NQMEAFKWEQ KQLHHSREKE MFREEEQVRT ANETQEREKK ERERDERDKQ QQREKELDEL EKLLHEQAGR LVPELLADAF EEATPVASTT VPTEPTLISK SEDERDDSSL PPTNDSSSST TSSTPVYTRP RSDSSESIYA FIVRRLNALE GNSSLVARYM EEQAKVMRSM LKRVEVGWDE WKGEWEDEDR GRWQQERMRQ EDRLGRVLSQ LEQQRIAFDA ERKSIETQLR VLADQLGYER RRGIAQLIIM VIIILLGAAS RSSTINAILT PLVTEARRRQ SDYYHRKNRS GPLAGLHIDM GAGRPPAVIG QARPRSPRSP SAHTHTHTQT QTPTHRRVPS STPTPRLKTS LSRTGSAYTS LKRRGVGVVP QVSIPVSIPV PSYYRSIPSS EFTSSPPNPA SLASPPLVNV WTPRTRGSVR LSPPPPPPPP PTATRKAARS AHLHMMETGD RIRPGERINQ DVGPSPNPNP SPNPNPKKTA PTITSLLLDL ENGGMNAKRR RRLRSVLNSV DQTRKDDDDY DDDDDDEGEK AGAGDTSQGE WGTDFDTEPS SAAASQSPSP SGSASEVEDQ VRDDTDTNEE MDGDTEQRVR EKI // ID E6ZPR9_SPORE Unreviewed; 1421 AA. AC E6ZPR9; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBQ69226.1}; GN ORFNames=sr15295 {ECO:0000313|EMBL:CBQ69226.1}; OS Sporisorium reilianum (strain SRZ2) (Maize head smut fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Sporisorium. OX NCBI_TaxID=999809 {ECO:0000313|EMBL:CBQ69226.1, ECO:0000313|Proteomes:UP000008867}; RN [1] {ECO:0000313|EMBL:CBQ69226.1, ECO:0000313|Proteomes:UP000008867} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=SRZ2 {ECO:0000313|Proteomes:UP000008867}; RX PubMed=21148393; DOI=10.1126/science.1195330; RA Schirawski J., Mannhaupt G., Muench K., Brefort T., Schipper K., RA Doehlemann G., Di Stasio M., Roessel N., Mendoza-Mendoza A., RA Pester D., Mueller O., Winterberg B., Meyer E., Ghareeb H., RA Wollenberg T., Muensterkoetter M., Wong P., Walter M., Stukenbrock E., RA Gueldener U., Kahmann R.; RT "Pathogenicity determinants in smut fungi revealed by genome RT comparison."; RL Science 330:1546-1548(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FQ311435; CBQ69226.1; -; Genomic_DNA. DR EnsemblFungi; CBQ69226; CBQ69226; sr15295. DR InParanoid; E6ZPR9; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008867; Chromosome 14. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008867}; KW Reference proteome {ECO:0000313|Proteomes:UP000008867}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 1421 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003216860. FT COILED 892 912 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1421 AA; 154526 MW; 74E4C7A2FB4BD236 CRC64; MRPSWFWPAV ILVSAQLIKP CYTLLDSGAS PDAEGSTTGT AEQAGHARVA SPQTASAGSL VVQDTAQLSV RPGCRRSTDF FDQVCSIPSK GSTSGDHDSP SLLSPAVTAD LLVCDLPEVE PLIRPDLRGQ AAEKGPSATY AVSGTIVVHG DGSKLHSNGD RLPELGGSSL PRDATADARD AVTDPATSQP DASYPGPDDE GFWRQRDGQH EWHPASKQGG DQSQPTRDAK PHDAQPNQAE NWLPRHQDPS GLSEYDAWSA EQEQGAFLSF NEWKERHAQD HSGHARGSSS NSARSAAQGK TTQARSASVD SSAPPQQTAG HSDAPLSKDD AAQHQGSISQ TTRAGAANTP STSSTPPAPS PGQTESTQAS EHAVPAESRP PIEVADPPSR SISEPVKTGF SVAAGDTGSQ LSKLKHRWNF ASLDCAAVVH RTNPEAKFAS SILSEKKDRY MLSPCPHPGG KLKGGSQFVI VELCEEIKID TVVLANYEFF SNMFKKFTVT AARQLTGREN DWTQLGTFRA RNVRGQQVFR IPSAPRSEAF FRYVRIDFLE HFGSEYYCPV SLLRVYGITE MEEYKREFEE HDAELDPAAE VGESEQESVQ ATLPAPAESA PPEVPMVRDA SSNLDNANLT TQDEDIWRKH EQAFQRKLQN QSLTDEAAEI DMGDPMVSHS DNASAHTPAT QLEQPAKSAS SPRQPPIHEA VREYDVAQCV WEDDPTIKAR FGDVCIRPRG EAKPFHHLSS LSRRRAETKT IAADTASATR SNPASSSIAA KSRPASAAGD TAENDTDRGR RSQRPSGAVT SQQGNPGSES VYRAIHRRLN ALESNATLSH SYIEHSGQML REVFARMEKR QEARMSDMLR ALNASNWHQI ESLKRRQHVD LQRAIFEFDV HRQQADNERR ALLREVQLLA EEVLLEKRLS IAQLVMLLVV FVFVGMTRGS RAVPLFHTGF TKIGRSSKRR EAKSDGQAVA PSSPSRASAR IQSNSEAKQS NISLSRESEP TQPRTDARHE TITGSGVLPQ QSRSIVESTT PAATKVKPLP FGSTLPENTA NSKAAGQSHQ RNISKTLANG VAGRAIPRGE TLTGMLSHPR NRGRLILFLE TLDALDRAAD RRRLPKSLTR QRPDARRSKV GDKVIDADGP LRPRTEPNVL LPSDVHGRIA RSHRPLLTHK QEHGLTNGRR HATPGRDRKA ETFGFDDVAG MSSDWTERSE NGDASENAAY LSEDEDTAIH AAKSDLISLE PVEDLRINVL AERPDGTRTH HEPSHGHPAT AAADSLPPAL TAERLVATSE PDKHLALPDL PSIARADGHL SSSDSESEGG AWHRVLPRRI GNSNSLRRAR QGTIDGKSYK VDSETKLSTK PEASDRRGSP RPSSAQGTGF KSQKPALSRL FRTGTPEVGG PTSPQNRRSG TPDTLRDVRP A // ID E6ZWV0_SPORE Unreviewed; 788 AA. AC E6ZWV0; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBQ71707.1}; GN ORFNames=sr12561 {ECO:0000313|EMBL:CBQ71707.1}; OS Sporisorium reilianum (strain SRZ2) (Maize head smut fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Sporisorium. OX NCBI_TaxID=999809 {ECO:0000313|EMBL:CBQ71707.1, ECO:0000313|Proteomes:UP000008867}; RN [1] {ECO:0000313|EMBL:CBQ71707.1, ECO:0000313|Proteomes:UP000008867} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=SRZ2 {ECO:0000313|Proteomes:UP000008867}; RX PubMed=21148393; DOI=10.1126/science.1195330; RA Schirawski J., Mannhaupt G., Muench K., Brefort T., Schipper K., RA Doehlemann G., Di Stasio M., Roessel N., Mendoza-Mendoza A., RA Pester D., Mueller O., Winterberg B., Meyer E., Ghareeb H., RA Wollenberg T., Muensterkoetter M., Wong P., Walter M., Stukenbrock E., RA Gueldener U., Kahmann R.; RT "Pathogenicity determinants in smut fungi revealed by genome RT comparison."; RL Science 330:1546-1548(2010). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FQ311452; CBQ71707.1; -; Genomic_DNA. DR EnsemblFungi; CBQ71707; CBQ71707; sr12561. DR InParanoid; E6ZWV0; -. DR OrthoDB; EOG7JQBZB; -. DR Proteomes; UP000008867; Chromosome 3. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008867}; KW Reference proteome {ECO:0000313|Proteomes:UP000008867}. FT COILED 123 150 {ECO:0000256|SAM:Coils}. FT COILED 407 434 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 788 AA; 87302 MW; 9E9AEF1A1D6A665D CRC64; MSTRNLGRGT PRRSPRISGR SETPDAFRGA TPSRRPLSPE LVEQMTPPAA SFRPRQLLPN LQPVASGSNT NSISIKQEDT STKYYVREPS YQPTLLRGDQ RSFNDARDAN HSHSASLSLF SSDSDLSDNY QQVEQEMAQI QQAQQQQASA SRGAVTRRPS GQRKARTSKD NLPYRPQSDD EDDDDDDIAS PTRRVRRGRN SNSNSLGAYE VGRIDNVRWM NAKSKKGRRR KTVNGVPVDA EEDDSRETNA DEDAGAASGK ETRFDEASAS DSADSDQEDD QVNVYVDDAV AQDPPAAPAK QPTQSDLPAA PSVVSRLVRG LASFLWRLVV LAVTYLLELP KLAWNKLGFE PPAITTNRAF AVLAGLLFAA AAFQASQLFG HSSSLSERFP MLLDDDVPGS SSSSTLALSL NRENQRLRAE LTRLTARLDT LSASIESQIS SSLSSAAAKI QAEAESRQST EISRITASTK RTVARLAQDE LKSIQDSVSS SVELMLRDLD KKINMQLKQR ADDTEGKFFH KLEKEVASIA KYANDEVNAR LGQAFDQTFL SALIDDKLEQ YSRDRTGRVD WAAVTSGAWV AEEGTVHRGY RFNSVWNVGQ FLAQGRKVPI GDPVKAITPG AGLGADNCWM TGWNSLLQVQ LAEPKVVDQV VVEHPLPGMT RTAPRRIIVW AHVDDSDRQY YLQYRRSKAT TQHDYLRTLL PDPFFDAIPP EYRAEDSAPL ILAHFEFKAN GSTLQTFNLT DEAQVYLFGV HAVRWQFVDG WAKTPPICVH RVRVHGSEWP VFGDKLAH // ID E7EZV6_DANRE Unreviewed; 987 AA. AC E7EZV6; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSDARP00000111505}; GN Name=sun1 {ECO:0000313|ZFIN:ZDB-GENE-050522-551}; GN ORFNames=zgc:92151 {ECO:0000313|Ensembl:ENSDARP00000111505}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000111505, ECO:0000313|Proteomes:UP000000437}; RN [1] {ECO:0000313|Ensembl:ENSDARP00000111505} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000111505}; RG The Danio rerio Sequencing Project at the Sanger Institute; RT "The genomic sequence of Danio rerio."; RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSDARP00000111505} RP IDENTIFICATION. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000111505}; RG Ensembl; RL Submitted (APR-2011) to UniProtKB. RN [3] {ECO:0000313|Proteomes:UP000000437} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen {ECO:0000313|Proteomes:UP000000437}; RX PubMed=23594743; DOI=10.1038/nature12111; RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., RA Muffato M., Collins J.E., Humphray S., McLaren K., Matthews L., RA McLaren S., Sealy I., Caccamo M., Churcher C., Scott C., Barrett J.C., RA Koch R., Rauch G.J., White S., Chow W., Kilian B., Quintais L.T., RA Guerra-Assuncao J.A., Zhou Y., Gu Y., Yen J., Vogel J.H., Eyre T., RA Redmond S., Banerjee R., Chi J., Fu B., Langley E., Maguire S.F., RA Laird G.K., Lloyd D., Kenyon E., Donaldson S., Sehra H., RA Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M., RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J., RA Clee C., Oliver K., Clark R., Riddle C., Eliott D., Threadgold G., RA Harden G., Ware D., Mortimer B., Kerry G., Heath P., Phillimore B., RA Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S., Pelan S., RA Griffiths G., Smith M., Glithero R., Howden P., Barker N., Stevens C., RA Harley J., Holt K., Panagiotidis G., Lovell J., Beasley H., RA Henderson C., Gordon D., Auger K., Wright D., Collins J., Raisen C., RA Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D., RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S., RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E., RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., RA Babbage A., Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., RA Wray P., Ellington A., Matthews N., Ellwood M., Woodmansey R., RA Clark G., Cooper J., Tromans A., Grafham D., Skuce C., Pandian R., RA Andrews R., Harrison E., Kimberley A., Garnett J., Fosker N., Hall R., RA Garner P., Kelly D., Bird C., Palmer S., Gehring I., Berger A., RA Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M., RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M., RA Rudolph-Geiger S., Teucke M., Osoegawa K., Zhu B., Rapp A., Widaa S., RA Langford C., Yang F., Carter N.P., Harrow J., Ning Z., Herrero J., RA Searle S.M., Enright A., Geisler R., Plasterk R.H., Lee C., RA Westerfield M., de Jong P.J., Zon L.I., Postlethwait J.H., RA Nusslein-Volhard C., Hubbard T.J., Roest Crollius H., Rogers J., RA Stemple D.L.; RT "The zebrafish reference genome sequence and its relationship to the RT human genome."; RL Nature 496:498-503(2013). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSDARP00000111505}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CABZ01055403; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CABZ01055404; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CABZ01055405; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CU571260; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7955.ENSDARP00000104532; -. DR PaxDb; E7EZV6; -. DR Ensembl; ENSDART00000130261; ENSDARP00000111505; ENSDARG00000055350. DR ZFIN; ZDB-GENE-050522-551; sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E7EZV6; -. DR Proteomes; UP000000437; Chromosome 3. DR Bgee; E7EZV6; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000437}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:E7EZV6}; KW Reference proteome {ECO:0000313|Proteomes:UP000000437}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 358 378 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 422 440 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 447 471 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 621 641 {ECO:0000256|SAM:Coils}. FT COILED 659 686 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 987 AA; 109721 MW; 402931ED4EC98C06 CRC64; MDFSRLHTYT PPHCTPDNTG YTYSLSSSYS TAALEFEKEH KINPVYDSPK MSRRSLRLQT SSGLYDNSFT EVAGNHSVGS YKRTNTSTTT TTSSSSQPSL SVRGRRQQQD SSIYESQSVT GTPQSTSDLS FTSTDASLIS NLLDQSTLRQ SSTTETYSAT RRRRAVNRSL LENGNVSKTE AHANLANGYF CKDCSFHAEG NEKETSYSVP YSTSESAAYQ TTEAADATMT TMTTSLNSVD GAAHDSYCGS VNVRDVVTAD HLNLNGSLCD DCKGKQHMEM NTERKHYSYI HRVLTVLWAV VTYTGNVLHR VCQGFGSAGA FVSRKMKSVV GLAVCSPGDI CKEKQHMEMN TERKHYSYIH RMLTVLWAVV SYTGYGLLRV CRGFGSAGAF VSRKLKSILW FAVCSPGKAA TGAFWWLGTG WYQLVALMSL INVFLLTRCL PKLLKLLLFL LPFLLLFGLW YLGLPIALSF LPAVNLTEWK TSVTSFASLP ALPSFPSFPS LPALPSFTKE PLLKEQDVPP LVVAQAASDS INSERLALLE QRVSALWESV RQGELKAKQQ HEEALGLTQS LQEQIKTQTD RENLGLWVTE LLQPKFTALE GDMKTETLSR AETEEQHIQH QNILEARLAE LEVLLQNLNS RTEDIHLSQQ TPVQAPVSVG VSQEKHEALL SEVQRLEAEL GRIRGDLQGV MGCQGKCDRL DTIHETVSAQ VKEQLYALLY GRDRGEAVIP EPLLPWLASQ YTSNSDLTAT LVTLERSILG NLSLQLQESK QQQASAETVT QTVAHTAEAA GMSEEQVQLI VQRALKLYSE DRTGQVDYAL ESGGGSVLST RCSETYETKT ALMSLFGIPL WYFSQSPRVV IQPDMYPGNC WAFKGSQGYL VIRLSLRVIP NGFCLEHIPK SLSPSGNISS APRRFSVYGL DDEYQDEGKL LGDYTYQEDG DSLQNFPVME ENDKAFQIIE MRVLSNWGHP EYTCLYRFRV HGKPHAQ // ID E7F184_DANRE Unreviewed; 577 AA. AC E7F184; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 11-NOV-2015, entry version 39. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSDARP00000107022}; GN Name=si:dkey-92f12.2 {ECO:0000313|Ensembl:ENSDARP00000107022, GN ECO:0000313|ZFIN:ZDB-GENE-091204-379}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000107022, ECO:0000313|Proteomes:UP000000437}; RN [1] {ECO:0000313|Ensembl:ENSDARP00000107022} RP IDENTIFICATION. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000107022}; RG Ensembl; RL Submitted (APR-2011) to UniProtKB. RN [2] {ECO:0000313|Ensembl:ENSDARP00000116173} RP IDENTIFICATION. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000116173}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. RN [3] {ECO:0000313|Ensembl:ENSDARP00000107022, ECO:0000313|Proteomes:UP000000437} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000107022, RC ECO:0000313|Proteomes:UP000000437}; RX PubMed=23594743; DOI=10.1038/nature12111; RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., RA Muffato M., Collins J.E., Humphray S., McLaren K., Matthews L., RA McLaren S., Sealy I., Caccamo M., Churcher C., Scott C., Barrett J.C., RA Koch R., Rauch G.J., White S., Chow W., Kilian B., Quintais L.T., RA Guerra-Assuncao J.A., Zhou Y., Gu Y., Yen J., Vogel J.H., Eyre T., RA Redmond S., Banerjee R., Chi J., Fu B., Langley E., Maguire S.F., RA Laird G.K., Lloyd D., Kenyon E., Donaldson S., Sehra H., RA Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M., RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J., RA Clee C., Oliver K., Clark R., Riddle C., Eliott D., Threadgold G., RA Harden G., Ware D., Mortimer B., Kerry G., Heath P., Phillimore B., RA Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S., Pelan S., RA Griffiths G., Smith M., Glithero R., Howden P., Barker N., Stevens C., RA Harley J., Holt K., Panagiotidis G., Lovell J., Beasley H., RA Henderson C., Gordon D., Auger K., Wright D., Collins J., Raisen C., RA Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D., RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S., RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E., RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., RA Babbage A., Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., RA Wray P., Ellington A., Matthews N., Ellwood M., Woodmansey R., RA Clark G., Cooper J., Tromans A., Grafham D., Skuce C., Pandian R., RA Andrews R., Harrison E., Kimberley A., Garnett J., Fosker N., Hall R., RA Garner P., Kelly D., Bird C., Palmer S., Gehring I., Berger A., RA Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M., RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M., RA Rudolph-Geiger S., Teucke M., Osoegawa K., Zhu B., Rapp A., Widaa S., RA Langford C., Yang F., Carter N.P., Harrow J., Ning Z., Herrero J., RA Searle S.M., Enright A., Geisler R., Plasterk R.H., Lee C., RA Westerfield M., de Jong P.J., Zon L.I., Postlethwait J.H., RA Nusslein-Volhard C., Hubbard T.J., Roest Crollius H., Rogers J., RA Stemple D.L.; RT "The zebrafish reference genome sequence and its relationship to the RT human genome."; RL Nature 496:498-503(2013). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSDARP00000107022}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR788236; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_001919691.1; XM_001919656.5. DR STRING; 7955.ENSDARP00000116173; -. DR Ensembl; ENSDART00000129701; ENSDARP00000107022; ENSDARG00000086490. DR Ensembl; ENSDART00000140086; ENSDARP00000116173; ENSDARG00000086490. DR GeneID; 796761; -. DR KEGG; dre:796761; -. DR ZFIN; ZDB-GENE-091204-379; si:dkey-92f12.2. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR KO; K19347; -. DR OMA; ATERNEW; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 20932730; -. DR PRO; PR:E7F184; -. DR Proteomes; UP000000437; Chromosome 6. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000437}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:E7F184}; KW Reference proteome {ECO:0000313|Proteomes:UP000000437}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 167 185 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 224 251 {ECO:0000256|SAM:Coils}. FT COILED 298 325 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 577 AA; 64634 MW; 32421DF8F541F8BF CRC64; MSRRSTRSVT KPTSFPDDDA ASTSSTGSTG HISYKESPTR IFQKRTSRKG TGSVSRNSSR ASSVSLILPP RIDFNPTENE NGIPTQGFSS GYSSAEDHYE QPIKSNPSST QSAAEPGFGV WDVFQSPAQA LVLLYWWLGT AWYSLTSRLS FINVFLLSRC TADMKKAVLL VLLLIFLIFG IWYWFPFSSR PAPHVVVVTS TPPVKHTDRP VKAVFDDHLQ HASLSNLRDE IANLHKREAN LMRDIELLKE ESVRQKAKSE MMQTDVRTMN DHMRNAESER GQQISELKSS ISNLHSTQDL LTRRVDALEA HNNNLRAELS DWLIKHLKDP SSLDSSIVLR PELQRALQDL EKQILEKLAH EKGSSRDVWR TVGETLQQEG AGAATIQDVK EIVHRAISLY RADGIGLADY ALESSGASVL NTRCSETYKT RSACLSLFGI PLWYHSESPR TVIQPELYPG KCWAFRGSQG FLVISLSYPV AITHVTLEHI PKDLSPTGRL DSAPKDFSVY GVSNETEDGK LLGTFIYDQD GEPIQTFKLP EVSDVYSMVE LRVLSNWGHL EYTCVYRFRV HGEPALA // ID E7FB83_DANRE Unreviewed; 723 AA. AC E7FB83; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 11-NOV-2015, entry version 37. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSDARP00000106535}; GN ORFNames=zgc:152977 {ECO:0000313|Ensembl:ENSDARP00000106535, GN ECO:0000313|ZFIN:ZDB-GENE-060825-353}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000106535, ECO:0000313|Proteomes:UP000000437}; RN [1] {ECO:0000313|Ensembl:ENSDARP00000106535} RP IDENTIFICATION. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000106535}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [2] {ECO:0000313|Ensembl:ENSDARP00000106535, ECO:0000313|Proteomes:UP000000437} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000106535, RC ECO:0000313|Proteomes:UP000000437}; RX PubMed=23594743; DOI=10.1038/nature12111; RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., RA Muffato M., Collins J.E., Humphray S., McLaren K., Matthews L., RA McLaren S., Sealy I., Caccamo M., Churcher C., Scott C., Barrett J.C., RA Koch R., Rauch G.J., White S., Chow W., Kilian B., Quintais L.T., RA Guerra-Assuncao J.A., Zhou Y., Gu Y., Yen J., Vogel J.H., Eyre T., RA Redmond S., Banerjee R., Chi J., Fu B., Langley E., Maguire S.F., RA Laird G.K., Lloyd D., Kenyon E., Donaldson S., Sehra H., RA Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M., RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J., RA Clee C., Oliver K., Clark R., Riddle C., Eliott D., Threadgold G., RA Harden G., Ware D., Mortimer B., Kerry G., Heath P., Phillimore B., RA Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S., Pelan S., RA Griffiths G., Smith M., Glithero R., Howden P., Barker N., Stevens C., RA Harley J., Holt K., Panagiotidis G., Lovell J., Beasley H., RA Henderson C., Gordon D., Auger K., Wright D., Collins J., Raisen C., RA Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D., RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S., RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E., RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., RA Babbage A., Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., RA Wray P., Ellington A., Matthews N., Ellwood M., Woodmansey R., RA Clark G., Cooper J., Tromans A., Grafham D., Skuce C., Pandian R., RA Andrews R., Harrison E., Kimberley A., Garnett J., Fosker N., Hall R., RA Garner P., Kelly D., Bird C., Palmer S., Gehring I., Berger A., RA Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M., RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M., RA Rudolph-Geiger S., Teucke M., Osoegawa K., Zhu B., Rapp A., Widaa S., RA Langford C., Yang F., Carter N.P., Harrow J., Ning Z., Herrero J., RA Searle S.M., Enright A., Geisler R., Plasterk R.H., Lee C., RA Westerfield M., de Jong P.J., Zon L.I., Postlethwait J.H., RA Nusslein-Volhard C., Hubbard T.J., Roest Crollius H., Rogers J., RA Stemple D.L.; RT "The zebrafish reference genome sequence and its relationship to the RT human genome."; RL Nature 496:498-503(2013). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSDARP00000106535}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR388227; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CU633193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7955.ENSDARP00000106535; -. DR PaxDb; E7FB83; -. DR Ensembl; ENSDART00000124562; ENSDARP00000106535; ENSDARG00000077178. DR ZFIN; ZDB-GENE-060825-353; zgc:152977. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E7FB83; -. DR Proteomes; UP000000437; Chromosome 12. DR Bgee; E7FB83; -. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000437}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000437}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 75 93 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 99 122 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 196 217 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 349 376 {ECO:0000256|SAM:Coils}. FT COILED 404 431 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 723 AA; 80541 MW; 7C4CDBAE81E4478A CRC64; MEAKMQTSTT QNGYISKDHP IPSPTKNDSS LNNTAVHQAS LTEAPSTATT VYCRDKNRKH KPGVLRRLSE MCLHFSRRII ALVVYICTLL IQVTLLKGLV LSVSLWCVRV AKGIAHSAAT FLRKTLMSVR QMKAACWCAK RGNQKYISTM SVKEVVLQKE QPKFSGVLWK AASNSILWLV NLFTTCLPML SKVLFLVPLL LFLALYYWGP SGLIAMLPAT SMMRDNSHLS PPTSSMGGLD EGQTTADTDS QRPSVVSSVE AERLMRLEES VSQLWDRVAG GLLRQEEQHT EMLSLYNTLR SEIHGYTDRE SLGKWIGDML EERFSLLKGE VQKEVKYTQQ RQEKHAEERQ SENTRLAEAE ALLQTLARKT EELQRKQEPT LRKVPEAEAP SPDPQSEEAE SSAAEALLVE VRRLEDALER IRDDVQGLMK CRDKCEQLDS FSGSVSAQVK EEIKSLFYGN DVGAAELELP ESLLQWISDH FVSTSELQTS LSALESSILG NLSLQVEEGQ LPSKETITKT VLNAAGEAGL SEEHVQLIVT NAIKLYSEDR IGMVDYALES GGGSIISTRC SESFNTKTAL LSLFGLPLWY FSQSPRVVIQ PEVHPGNCWA FKGSTGYIVI GLSMKIVPTA FTLEHVAKSL SPTGNISSAP REFNVYGLDD EQQEEGQLLG QYVYEEDGDS LQTFLVSDEV SSGFQIIEMR VLSNWGNPEY TCVYRFRVHG KPS // ID E7FGG7_DANRE Unreviewed; 849 AA. AC E7FGG7; DT 08-MAR-2011, integrated into UniProtKB/TrEMBL. DT 08-MAR-2011, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSDARP00000099282}; GN Name=sun1 {ECO:0000313|Ensembl:ENSDARP00000099282, GN ECO:0000313|ZFIN:ZDB-GENE-050522-551}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000099282, ECO:0000313|Proteomes:UP000000437}; RN [1] {ECO:0000313|Ensembl:ENSDARP00000099282} RP IDENTIFICATION. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000099282}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [2] {ECO:0000313|Ensembl:ENSDARP00000099282, ECO:0000313|Proteomes:UP000000437} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000099282, RC ECO:0000313|Proteomes:UP000000437}; RX PubMed=23594743; DOI=10.1038/nature12111; RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., RA Muffato M., Collins J.E., Humphray S., McLaren K., Matthews L., RA McLaren S., Sealy I., Caccamo M., Churcher C., Scott C., Barrett J.C., RA Koch R., Rauch G.J., White S., Chow W., Kilian B., Quintais L.T., RA Guerra-Assuncao J.A., Zhou Y., Gu Y., Yen J., Vogel J.H., Eyre T., RA Redmond S., Banerjee R., Chi J., Fu B., Langley E., Maguire S.F., RA Laird G.K., Lloyd D., Kenyon E., Donaldson S., Sehra H., RA Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M., RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J., RA Clee C., Oliver K., Clark R., Riddle C., Eliott D., Threadgold G., RA Harden G., Ware D., Mortimer B., Kerry G., Heath P., Phillimore B., RA Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S., Pelan S., RA Griffiths G., Smith M., Glithero R., Howden P., Barker N., Stevens C., RA Harley J., Holt K., Panagiotidis G., Lovell J., Beasley H., RA Henderson C., Gordon D., Auger K., Wright D., Collins J., Raisen C., RA Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D., RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S., RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E., RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., RA Babbage A., Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., RA Wray P., Ellington A., Matthews N., Ellwood M., Woodmansey R., RA Clark G., Cooper J., Tromans A., Grafham D., Skuce C., Pandian R., RA Andrews R., Harrison E., Kimberley A., Garnett J., Fosker N., Hall R., RA Garner P., Kelly D., Bird C., Palmer S., Gehring I., Berger A., RA Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M., RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M., RA Rudolph-Geiger S., Teucke M., Osoegawa K., Zhu B., Rapp A., Widaa S., RA Langford C., Yang F., Carter N.P., Harrow J., Ning Z., Herrero J., RA Searle S.M., Enright A., Geisler R., Plasterk R.H., Lee C., RA Westerfield M., de Jong P.J., Zon L.I., Postlethwait J.H., RA Nusslein-Volhard C., Hubbard T.J., Roest Crollius H., Rogers J., RA Stemple D.L.; RT "The zebrafish reference genome sequence and its relationship to the RT human genome."; RL Nature 496:498-503(2013). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSDARP00000099282}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CABZ01055403; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CABZ01055404; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CABZ01055405; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CU571260; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7955.ENSDARP00000104532; -. DR PaxDb; E7FGG7; -. DR Ensembl; ENSDART00000109876; ENSDARP00000099282; ENSDARG00000055350. DR ZFIN; ZDB-GENE-050522-551; sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR Proteomes; UP000000437; Chromosome 3. DR Bgee; E7FGG7; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000437}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:E7FGG7}; KW Reference proteome {ECO:0000313|Proteomes:UP000000437}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 275 297 Helical. FT TRANSMEM 309 333 Helical. FT COILED 483 503 {ECO:0000256|SAM:Coils}. FT COILED 521 548 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 849 AA; 94256 MW; BABB4F24C074DE65 CRC64; MDFSRLHTYT PPHCTPDNTG YTYSLSSSYS TAALEFEKEH KINPVYDSPK MSRRSLRLQT SSGLYDNSFT EVAGNHSVGS YKRTNTSTTT TTSSSSQPSL SVRGRRQQQD SSIYESQSVT GTPQSTSDLS FTSTDASLIS NLLDQSTLRQ SSTTETYSAT RRRRAVNRSL LENGNVSKTE AHANLANGYF CKDCSFHAEG NEKETSYSVP YSTSESAAYQ TTEAADATMT TMTTSLNSVD GAAHDSYCGS VNVRDVVTAD HLNLNGSLWK AATGAFWWLG TGWYQLVALM SLINVFLLTR CLPKLLKLLL FLLPFLLLFG LWYLGLPIAL SFLPAVNLTE WKTSVTSFAS LPALPSFPSF PSLPALPSFT KEPLLKEQDV PPLVVAQAAS DSINSERLAL LEQRVSALWE SVRQGELKAK QQHEEALGLT QSLQEQIKTQ TDRENLGLWV TELLQPKFTA LEGDMKTETL SRAETEEQHI QHQNILEARL AELEVLLQNL NSRTEDIHLS QQTPVQAPVS VGVSQEKHEA LLSEVQRLEA ELGRIRGDLQ GVMGCQGKCD RLDTIHETVS AQVKEQLYAL LYGRDRGEAV IPEPLLPWLA SQYTSNSDLT ATLVTLERSI LGNLSLQLQE SKQQQASAET VTQTVAHTAE AAGMSEEQVQ LIVQRALKLY SEDRTGQVDY ALESGGGSVL STRCSETYET KTALMSLFGI PLWYFSQSPR VVIQPDMYPG NCWAFKGSQG YLVIRLSLRV IPNGFCLEHI PKSLSPSGNI SSAPRRFSVY GLDDEYQDEG KLLGDYTYQE DGDSLQNFPV MEENDKAFQI IEMRVLSNWG HPEYTCLYRF RVHGKPHAQ // ID E7M0F0_YEASV Unreviewed; 587 AA. AC E7M0F0; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Slp1p {ECO:0000313|EMBL:EGA76829.1}; GN ORFNames=VIN13_4506 {ECO:0000313|EMBL:EGA76829.1}; OS Saccharomyces cerevisiae (strain VIN 13) (Baker's yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=764099 {ECO:0000313|EMBL:EGA76829.1, ECO:0000313|Proteomes:UP000000307}; RN [1] {ECO:0000313|EMBL:EGA76829.1, ECO:0000313|Proteomes:UP000000307} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=VIN 13 {ECO:0000313|Proteomes:UP000000307}; RX PubMed=21304888; DOI=10.1371/journal.pgen.1001287; RA Borneman A.R., Desany B.A., Riches D., Affourtit J.P., Forgan A.H., RA Pretorius I.S., Egholm M., Chambers P.J.; RT "Whole-genome comparison reveals novel genetic elements that RT characterize the genome of industrial strains of Saccharomyces RT cerevisiae."; RL PLoS Genet. 7:E1001287-E1001287(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EGA76829.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADXC01000074; EGA76829.1; -; Genomic_DNA. DR EnsemblFungi; EGA76829; EGA76829; VIN13_4506. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000307; Chromosome XV. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000307}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000307}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 587 AA; 67353 MW; 4A43AF40124071AD CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGLHDTTV ITTGSTTNVQ KEHSSPLSTG SLRTHDFRQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGPERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNEWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWTILG EFEAGNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKQMISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IVQQAIR // ID E7Q9R0_YEASB Unreviewed; 587 AA. AC E7Q9R0; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Slp1p {ECO:0000313|EMBL:EGA56712.1}; GN ORFNames=FOSTERSB_4456 {ECO:0000313|EMBL:EGA56712.1}; OS Saccharomyces cerevisiae (strain FostersB) (Baker's yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=764102 {ECO:0000313|EMBL:EGA56712.1, ECO:0000313|Proteomes:UP000000309}; RN [1] {ECO:0000313|EMBL:EGA56712.1, ECO:0000313|Proteomes:UP000000309} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=FostersB {ECO:0000313|EMBL:EGA56712.1, RC ECO:0000313|Proteomes:UP000000309}; RX PubMed=21304888; DOI=10.1371/journal.pgen.1001287; RA Borneman A.R., Desany B.A., Riches D., Affourtit J.P., Forgan A.H., RA Pretorius I.S., Egholm M., Chambers P.J.; RT "Whole-genome comparison reveals novel genetic elements that RT characterize the genome of industrial strains of Saccharomyces RT cerevisiae."; RL PLoS Genet. 7:E1001287-E1001287(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EGA56712.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEHH01000073; EGA56712.1; -; Genomic_DNA. DR EnsemblFungi; EGA56712; EGA56712; FOSTERSB_4456. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000309; Chromosome XV. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000309}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000309}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 587 AA; 67304 MW; 0F91FD79B543DE6B CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGLHDTTV ITTGSTTNVQ KEHSSPLSTG SLRTHDFXQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGPERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNXWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWTILG EFEAGNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKQMISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IIQQAIR // ID E9B0F6_LEIMU Unreviewed; 589 AA. AC E9B0F6; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBZ28710.1}; GN ORFNames=LMXM_29_0320 {ECO:0000313|EMBL:CBZ28710.1}; OS Leishmania mexicana (strain MHOM/GT/2001/U1103). OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmaniinae; Leishmania. OX NCBI_TaxID=929439 {ECO:0000313|EMBL:CBZ28710.1, ECO:0000313|Proteomes:UP000007259}; RN [1] {ECO:0000313|EMBL:CBZ28710.1, ECO:0000313|Proteomes:UP000007259} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MHOM/GT/2001/U1103 {ECO:0000313|EMBL:CBZ28710.1, RC ECO:0000313|Proteomes:UP000007259}; RX PubMed=22038252; DOI=10.1101/gr.122945.111; RA Rogers M.B., Hilley J.D., Dickens N.J., Wilkes J., Bates P.A., RA Depledge D.P., Harris D., Her Y., Herzyk P., Imamura H., Otto T.D., RA Sanders M., Seeger K., Dujardin J.C., Berriman M., Smith D.F., RA Hertz-Fowler C., Mottram J.C.; RT "Chromosome and gene copy number variation allow major structural RT change between species and strains of Leishmania."; RL Genome Res. 21:2129-2142(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR799582; CBZ28710.1; -; Genomic_DNA. DR RefSeq; XP_003877178.1; XM_003877129.1. DR STRING; 929439.XP_003877178.1; -. DR EnsemblProtists; CBZ28710; CBZ28710; LMXM_29_0320. DR GeneID; 13451653; -. DR KEGG; lmi:LMXM_29_0320; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000258883; -. DR Proteomes; UP000007259; Chromosome 29. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007259}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 521 541 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 466 493 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 589 AA; 62789 MW; 7D27D0858080D6FC CRC64; MRPQERLTVL CTLLLLLVYM LFSAPLEFLT VFWRPTTSAA VGVSSPSPLK RLSTGSAPGL STNYASLYLG AAVVSMEPSS CHGGVALISE SVDKYVLCPC DAPRKQFVVQ VIRDVQVRSV MVRNAEHFSS GVRNFTLLGS LQYPTPTWLV LGHFEAEQRR GRQYFDVTPR SRVRFIKLQW ATSYGPEPWC TITSFQVYGI DVLETLTRYD GGDDLVAGED AAGPSGGLRD TPDMNRLHLP ALPPTLEEVT APSLAGSSAA PSRNGATSAK DARMPTLSIE ELAAGMRDSA AATVGASRGV DADDLLLAPV DVGVSAETGP LSQPDADAKL PSPTNSIALA ATAPVFPSPN CSSAQPIGWN TSLKCAITDL AALWGSCAAT TPGASDFTAV TTPTSAPTLP VSTSSRKGLS ASGSIYRSPA GSLLTNLLRQ QRSTHHELTL LMQRERHLAQ ELNRTRILLS DFYAKYKAAE RESSEYRDRL HGLQLNLQLL QERFLLREHS NCGGEGGGAG RSGGSIMRSD TAMAVTSFVL LALTGILMLM YSSSSSRSVA GSPSGWGRYY NIGRGGGEVG SGGGNGPPLW PRHQRGRAR // ID E9BLC7_LEIDB Unreviewed; 586 AA. AC E9BLC7; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 14-OCT-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBZ36055.1}; GN ORFNames=LDBPK_300320 {ECO:0000313|EMBL:CBZ36055.1}; OS Leishmania donovani (strain BPK282A1). OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmaniinae; Leishmania. OX NCBI_TaxID=981087 {ECO:0000313|Proteomes:UP000008980}; RN [1] {ECO:0000313|Proteomes:UP000008980} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=BPK282A1 {ECO:0000313|Proteomes:UP000008980}; RA Downing T., Imamura H., Sanders M., Decuypere S., Hertz-Fowler C., RA Clark T.G., Rijal S., Sundar S., Quail M.A., De Doncker S., Maes I., RA Vanaerschot M., Stark O., Schonian G., Dujardin J.C., Berriman M.; RT "Whole genome sequencing of Leishmania donovani clinical lines reveals RT dynamic variation related to drug resistance."; RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR799617; CBZ36055.1; -; Genomic_DNA. DR RefSeq; XP_003862747.1; XM_003862699.1. DR EnsemblProtists; CBZ36055; CBZ36055; LDBPK_300320. DR GeneID; 13385454; -. DR KEGG; ldo:LDBPK_300320; -. DR HOGENOM; HOG000258883; -. DR Proteomes; UP000008980; Chromosome 30. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008980}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 518 538 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 457 491 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 586 AA; 61966 MW; 3A41ABFAC7B14CB4 CRC64; MRPQERLTVL CTLLLLYVLF SAPVELLTVF WHRTTSAAVG ISSPSPLKRL STGSAPGLST NYASLYLGAA VVSMEPSSCH GGVALISESV DKYVLCPCDA PRKQFVVQLI RDVQVRSVMV RNAEHFSSGV RNFTLLGSLQ YPTSTWLVLG HFEAEQRRGR QYFDVAPRSR VRFIKLQWAT SYGPEPWCTI TSFQVYGIDV LETLTRYDGG DDLVAGEDAA GASGGLRGTP DMHRFHLPAL PPTPGEVAAP PLAGSSAVPS RNGATSANDA PAPAVFIDEL AAGMWAGAAA TVGASRGADA DDLLLAPVDV GASAETGPLS QADADAKRSS PTNSIALAAT APALQSVNCS AAQPIGRNAS VKCTITDLTA LWGPCAVATS GASDFTAVTT PTSAPALSVS TPSSKGLSAS GSIYQSAAGS LLTNLLRQQR STHHELTLLM QRERHLAQEL NRTRILLSDF YARYKATERE ADEYRDRLHG LQSKLQLLQE RFLREHSSCC GEGGGAGRSG GSIMRSDTAM AVASFALLAL AVILMLMYSS SSSRSVVGPP SGWGRYYNIG RGSGGVASGG GNRPPLWPRP QRGRAR // ID E9DB15_COCPS Unreviewed; 856 AA. AC E9DB15; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 14-OCT-2015, entry version 7. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFW16501.1}; GN ORFNames=CPSG_07017 {ECO:0000313|EMBL:EFW16501.1}; OS Coccidioides posadasii (strain RMSCC 757 / Silveira) (Valley fever OS fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; mitosporic Onygenales; Coccidioides. OX NCBI_TaxID=443226 {ECO:0000313|Proteomes:UP000002497}; RN [1] {ECO:0000313|Proteomes:UP000002497} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=RMSCC 757 / Silveira {ECO:0000313|Proteomes:UP000002497}; RG The Broad Institute Genome Sequencing Center for Infectious Disease; RA Neafsey D., Orbach M., Henn M.R., Cole G.T., Galgiani J., RA Gardner M.J., Kirkland T.N., Taylor J.W., Young S.K., Zeng Q., RA Koehrsen M., Alvarado L., Berlin A., Borenstein D., Chapman S.B., RA Chen Z., Engels R., Freedman E., Gellesch M., Goldberg J., Griggs A., RA Gujja S., Heilman E., Heiman D., Howarth C., Jen D., Larson L., RA Mehta T., Neiman D., Park D., Pearson M., Richards J., Roberts A., RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Walk T., RA White J., Yandava C., Haas B., Nusbaum C., Birren B.; RT "The genome sequence of Coccidioides posadasii strain Silveira."; RL Submitted (MAR-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL636497; EFW16501.1; -; Genomic_DNA. DR EnsemblFungi; EFW16501; EFW16501; CPSG_07017. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002497; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002497}; KW Reference proteome {ECO:0000313|Proteomes:UP000002497}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 31 {ECO:0000256|SAM:SignalP}. FT CHAIN 32 856 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003234379. SQ SEQUENCE 856 AA; 95268 MW; A98D96BFA5213EE5 CRC64; MRGRWSFVCF DIDFSVLNVL ILLFLLPLLV AENGDFQGRR HQPQPGGDVW SMDHGYAASC PVRDFVEVQA EYVRYPVCLG SRRASSAPDG MTESASVATE RTSEAPTASS AETVSTPKVE SELDTESPLD NEKFLSFEEW KKKNLAKIGQ SVDNVRGNRQ AVGSTEMRKR SRPGEISNAL DSLGEEGEIE LGFGGFGPGD SDIPPVEKKD AQSASSSVNG EKHVTKGTEG ESQSDGVPRR GIARRKDAGV TCKERFNYAS FDCAATVLKT NRECTGSSSI LIENKDSYML NECRAKDKFI ILELCDDILV DTVVLANYEF FSSIFRTFRV SVADRYPAKP DKWKELGTYE AANTREIQAF AVENPLIWAR YLKIEFFSHY GNEFYCPLSL VRVHGTTMME EYKNYGDSAR AEEEAVEAVV QAQQNPDSVP TMKNSNQTQR EIRDQNVNIS ITQTGSGTLP DEEALGASCF PQINEIERLL LGMSSDNMSS IYDMALDPDY QSEAHESAES ETWASNATGS IGLEDTSVSD TPPTMVGGSD HQRATPGSRM VSTSGSSRSK NETSADNQRT PVVSQPPPPN PTTQESFFKS VHKRLQMLET NSTLSLLYIE EQSRILRDAF NKVEKRQLAK TSSFLENLNS TVLHELRQFR QQYDHIWHSV VIEFEQQRQQ YHHELFAVTS QLAILADEVV FQKRVSIIQS VFVLLSFGLV LFSRSAVGSY LEFPKMQSRV SRSHSFRSAS PPYETPSPSP NSPMQSPTYQ EGNLHRRNPS DDQTDCEICN HTFPYSPPPS SDTLSPSEEE EKGLHDVHLE YSRSTASNLV PEENPAGIKR QRSSPADLCG HDEGDSAEFK LPQAPS // ID E9DIN8_COCPS Unreviewed; 628 AA. AC E9DIN8; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFW13642.1}; GN ORFNames=CPSG_09681 {ECO:0000313|EMBL:EFW13642.1}; OS Coccidioides posadasii (strain RMSCC 757 / Silveira) (Valley fever OS fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; mitosporic Onygenales; Coccidioides. OX NCBI_TaxID=443226 {ECO:0000313|Proteomes:UP000002497}; RN [1] {ECO:0000313|Proteomes:UP000002497} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=RMSCC 757 / Silveira {ECO:0000313|Proteomes:UP000002497}; RG The Broad Institute Genome Sequencing Center for Infectious Disease; RA Neafsey D., Orbach M., Henn M.R., Cole G.T., Galgiani J., RA Gardner M.J., Kirkland T.N., Taylor J.W., Young S.K., Zeng Q., RA Koehrsen M., Alvarado L., Berlin A., Borenstein D., Chapman S.B., RA Chen Z., Engels R., Freedman E., Gellesch M., Goldberg J., Griggs A., RA Gujja S., Heilman E., Heiman D., Howarth C., Jen D., Larson L., RA Mehta T., Neiman D., Park D., Pearson M., Richards J., Roberts A., RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Walk T., RA White J., Yandava C., Haas B., Nusbaum C., Birren B.; RT "The genome sequence of Coccidioides posadasii strain Silveira."; RL Submitted (MAR-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL636512; EFW13642.1; -; Genomic_DNA. DR EnsemblFungi; EFW13642; EFW13642; CPSG_09681. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000002497; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002497}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002497}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 298 320 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 148 168 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 628 AA; 68769 MW; B4659DE7CA2A2E82 CRC64; MTGARRTRSG RSISQEPTGQ GHRTRKTPGP VDSGDSAAIP TASFGNPSLQ ALQAQHSFAY GATGSPALPR QLRMCPPTGA MEMAANIEGR YLETHANDFE RIEEEARANP GTRRSTRSGA NTGSARQSVS PVRRTPGQRA RNREPTPDDQ LLESLREASE EAEETKETIL PSIEDSSVSW NTERHILGDP RSVPTATSSG GSLSQSQRQR EAAHPMAGPP LRPYTRSQPR TTFQASQAVR SGSSAASSAQ PAPRLGAAPV LSPPQAEDNA YATPNPRRQT RSVSQQTMSS TASQRQRGFS VISLGLMITI FMLAVAGMLF RFDDIEMIGK NILQNGIGKE FSLPSSFCGA QPPTSQYIEA FDKLSAGVDR RLADMARDVA TLKDEWNRRL PHLKQAIWPE MEDPLLPRKI NWFSVGMGAF VDPYLTTKHR SGLPPREIQL AVLLGRPLVP EEVVIEHIQK EATLDPESAP REMELWVEYV ARSHAAAPST TLPGFRATGA PGRSDQTATS SPVSTRRPEL LESSAEARAA FAGPLSPSQH EDIISTLRMA YPDEPETAYS HDTMLGSSFY RIGKFQYDIN GKHNIQKFHL DAVIDLPNIR TKKAVLRVKS NWGSVNTCVY RVRLHGHM // ID E9E9B9_METAQ Unreviewed; 902 AA. AC E9E9B9; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFY87490.1}; GN ORFNames=MAC_06467 {ECO:0000313|EMBL:EFY87490.1}; OS Metarhizium acridum (strain CQMa 102). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Clavicipitaceae; OC Metarhizium. OX NCBI_TaxID=655827 {ECO:0000313|Proteomes:UP000002499}; RN [1] {ECO:0000313|EMBL:EFY87490.1, ECO:0000313|Proteomes:UP000002499} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CQMa 102 {ECO:0000313|EMBL:EFY87490.1, RC ECO:0000313|Proteomes:UP000002499}; RX PubMed=21253567; DOI=10.1371/journal.pgen.1001264; RA Gao Q., Jin K., Ying S.-H., Zhang Y., Xiao G., Shang Y., Duan Z., RA Hu X., Xie X.-Q., Zhou G., Peng G., Luo Z., Huang W., Wang B., RA Fang W., Wang S., Zhong Y., Ma L.-J., St Leger R.J., Zhao G.-P., RA Pei Y., Feng M.-G., Xia Y., Wang C.; RT "Genome sequencing and comparative transcriptomics of the model RT entomopathogenic fungi Metarhizium anisopliae and M. acridum."; RL PLoS Genet. 7:E1001264-E1001264(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL698526; EFY87490.1; -; Genomic_DNA. DR RefSeq; XP_007812807.1; XM_007814616.1. DR EnsemblFungi; EFY87490; EFY87490; MAC_06467. DR GeneID; 19250778; -. DR KEGG; maw:MAC_06467; -. DR InParanoid; E9E9B9; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002499; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002499}; KW Reference proteome {ECO:0000313|Proteomes:UP000002499}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 902 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003238646. SQ SEQUENCE 902 AA; 98307 MW; 0BAE139D83230000 CRC64; MVRAAIFPRS LAAAILLAFG ITSAIDEPQA PRGANVTGSV PLSVDPRSTL EVCEARTINY ITHALPQSCL TSSWRSPAPT STAPAVSQHG RGNGSDHGAE NQPSRPPTPT LDHNVTEVQV TEPAATTFMS FEDWKEMMLR RAGQDPQDLR SRKPSEHNTD DRYSPESGHA GLGEEGEISL NFDDYGKGDH QKLASHRSSR GDGVDEQPAA GDALLYEEGK AATVHLSKDA GKTCKERFSY SSFDAGATIL KTSPGAKNAR AILVENKDTY MLLECDAASK YVIIELSDDI SVDTVVLANF EFFSSMVRHF RVSVSDRYPV KMDKWRELGT FEARNSRDIQ PFLVQNPQIW AKYVRVEFLT HFGNEYYCPI SLLRIHGSRM LDSWKDSEGG RDEEALIDGD ESGGTDTRQD EHQVAAAAAS RDPNVFMTVS NGTSARSLSY ALDMFTNVDA TCPASSSAGA NSSITDPKLS SVGLGTSQES DLVSSDVPQA RSASLDNSGD SSIAQAKTSL TTASMNYSTL ISISQDDRMR NHTGALNGAA SNVNTTTIDR EHNAATPTAK LSASSGQNGK PRSSGTTGAS AASPTMQEGF FNAITKRLQH VESNLTLSMK YVEDQSKHIQ EALQSREQRQ HAKINYFLDE LNKTVLAELH TVREQYDQIW QSTVLALESQ RDRSERDMMA LSSRLNLLAD EVVFQKRMAI IQAIILLSCL FLIIFSRGVS LPSLAPLLDQ PSNSRCATPA SPATPRQSSY QLSKGHHREG QSSFKPSDPP CQVQMLHPGE ARNSTSAEAL PFEAVSFQGR DEPKSECSAF QRLSPPPSPN LLGESSLSSD ASSAVGSSHT LPRRLGGTSH ISSRKPLPSL PEHPRSSYGD LSVMDLHLVR KHTQRDGINF VIRHSQNDTM WQ // ID E9EUD9_METRA Unreviewed; 870 AA. AC E9EUD9; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 14-OCT-2015, entry version 18. DE SubName: Full=Sad1/UNC-lik {ECO:0000313|EMBL:EFZ01042.1}; GN ORFNames=MAA_03638 {ECO:0000313|EMBL:EFZ01042.1}; OS Metarhizium robertsii (strain ARSEF 23 / ATCC MYA-3075) (Metarhizium OS anisopliae (strain ARSEF 23)). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Clavicipitaceae; OC Metarhizium. OX NCBI_TaxID=655844 {ECO:0000313|EMBL:EFZ01042.1, ECO:0000313|Proteomes:UP000002498}; RN [1] {ECO:0000313|EMBL:EFZ01042.1, ECO:0000313|Proteomes:UP000002498} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ARSEF 23 / ATCC MYA-3075 {ECO:0000313|Proteomes:UP000002498}; RX PubMed=21253567; DOI=10.1371/journal.pgen.1001264; RA Gao Q., Jin K., Ying S.-H., Zhang Y., Xiao G., Shang Y., Duan Z., RA Hu X., Xie X.-Q., Zhou G., Peng G., Luo Z., Huang W., Wang B., RA Fang W., Wang S., Zhong Y., Ma L.-J., St Leger R.J., Zhao G.-P., RA Pei Y., Feng M.-G., Xia Y., Wang C.; RT "Genome sequencing and comparative transcriptomics of the model RT entomopathogenic fungi Metarhizium anisopliae and M. acridum."; RL PLoS Genet. 7:E1001264-E1001264(2011). RN [2] {ECO:0000313|EMBL:EFZ01042.1, ECO:0000313|Proteomes:UP000002498} RP GENOME REANNOTATION. RC STRAIN=ARSEF 23; RX PubMed=25368161; DOI=10.1073/pnas.1412662111; RA Hu X., Xiao G., Zheng P., Shang Y., Su Y., Zhang X., Liu X., Zhan S., RA St Leger R.J., Wang C.; RT "Trajectory and genomic determinants of fungal-pathogen speciation and RT host adaptation."; RL Proc. Natl. Acad. Sci. U.S.A. 111:16796-16801(2014). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EFZ01042.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADNJ02000003; EFZ01042.1; -; Genomic_DNA. DR RefSeq; XP_007819827.1; XM_007821636.1. DR EnsemblFungi; EFZ01042; EFZ01042; MAA_03638. DR GeneID; 19257924; -. DR KEGG; maj:MAA_03638; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002498; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002498}; KW Reference proteome {ECO:0000313|Proteomes:UP000002498}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 870 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003239306. SQ SEQUENCE 870 AA; 95028 MW; C1143F78C8E075F3 CRC64; MVRAALFARG LAATILLTFK TANAINEPHT PRGANITNSV PLSVDPRSTL EVCEARTINY ITHALPQSCL TSSWRSPAPT STASAVSQHG RDDGSDHGTD NQPSWPPTPT LDHNATEVQA TEPAATTFMS FEDWKEMMLR RAGQDPQDLR SRRPSEHNTD DRYSPESGHA GLGEEGEISL NFEDYGNGDH QKLTSPRSTR GDGVDEQPAP ADALLYEEGK AATVHLSKDA GKTCKERFSY SSFDAGATIL KTSPGAKNAR AILVENKDTY MLLECDAASK YVIVELSDDI SVDTVVLANF EFFSSMVRHF RVSVSDRYPV KMDKWRELGT FEARNSRDIQ PFLVQNPQIW AKYVRIEFLT HFGNEYYCPV SLLRIHGSRM LDSWKDSEGG RDEEALIDGD ESAGADSHQD ENHVADAAAS RDTNVHITAS NGTSARSLSY ALDIFTNMDA TCPAPSSAGA NPPTRDPKLS SVSLGTSQES DLVSSDVSQA RSASPENSGN SSIAQAKSIL ATGSINYSTI MPTSQDDRLR NHTGALNGAA LNVNTTTIDR ENNAATPTAK LSASSGQNGK PRSSGTTGAS AASPTMQEGF FNAITKRLQH VESNLTLSMK YVEDQSKHIQ EALQSREQRQ HAKINYFLDE LNKTVLAELH TVREQYDQIW QSTVLALESQ RDRSERDMMA LSSRLNLLAD EVVFQKRMAI IQAIILLSCL FLIIFSRGVS LPSLAPLLDQ PSNSPCATPT SPATPRQSSY QLSKRHHREG QSSFQPPDPS CQVQMLYPDE ARSDTSAEAL PLEAVSFQGT DEPKSECSAF QRLSPPPTPN LLGEMSLSSD PNSATGSNHT LRRRLGGTSH ISSRKPLPSL PEHPRSLDEE // ID E9GVR0_DAPPU Unreviewed; 431 AA. AC E9GVR0; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFX76452.1}; DE Flags: Fragment; GN ORFNames=DAPPUDRAFT_213964 {ECO:0000313|EMBL:EFX76452.1}; OS Daphnia pulex (Water flea). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. OX NCBI_TaxID=6669 {ECO:0000313|Proteomes:UP000000305}; RN [1] {ECO:0000313|EMBL:EFX76452.1, ECO:0000313|Proteomes:UP000000305} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21292972; DOI=10.1126/science.1197761; RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A., Arnold G.J., Basu M.K., RA Bauer D.J., Caceres C.E., Carmel L., Casola C., Choi J.H., RA Detter J.C., Dong Q., Dusheyko S., Eads B.D., Frohlich T., RA Geiler-Samerotte K.A., Gerlach D., Hatcher P., Jogdeo S., RA Krijgsveld J., Kriventseva E.V., Kultz D., Laforsch C., Lindquist E., RA Lopez J., Manak J.R., Muller J., Pangilinan J., Patwardhan R.P., RA Pitluck S., Pritham E.J., Rechtsteiner A., Rho M., Rogozin I.B., RA Sakarya O., Salamov A., Schaack S., Shapiro H., Shiga Y., RA Skalitzky C., Smith Z., Souvorov A., Sung W., Tang Z., Tsuchiya D., RA Tu H., Vos H., Wang M., Wolf Y.I., Yamagata H., Yamada T., Ye Y., RA Shaw J.R., Andrews J., Crease T.J., Tang H., Lucas S.M., RA Robertson H.M., Bork P., Koonin E.V., Zdobnov E.M., Grigoriev I.V., RA Lynch M., Boore J.L.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331:555-561(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL732568; EFX76452.1; -; Genomic_DNA. DR STRING; 6669.DappuP213964; -. DR EnsemblMetazoa; EFX76452; EFX76452; DAPPUDRAFT_213964. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; E9GVR0; -. DR Proteomes; UP000000305; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000305}; KW Reference proteome {ECO:0000313|Proteomes:UP000000305}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 431 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003237497. FT NON_TER 431 431 {ECO:0000313|EMBL:EFX76452.1}. SQ SEQUENCE 431 AA; 47771 MW; 894C631A25E1780A CRC64; MRSRISPRPT VWCLIYCLVS LSFSFNQTDS VNTHSIENEY QPVEEAIIYP KDEEVDVKEY PKLIKNSLEI SPVVAGQPPH DGILLTTESP AILDDNLTSS EVTVPVDNLD ADLSVPSIEV LSSTEILEES AYPEISASEV ISADAEPNVK LENVLATNNV LNVTNNVTRN NESGSSEDDI PSFSEWTLKV LAEEEKSGAN GSTGVQTPAK LSATPKLRQK NYASPDCGAK ILDANSEAEH TSAILDPSRD EYFLSICSAK IWFVIELCEA IQAQRVGIAN FELFSSSPKD FRVYISDRYP TRDWALIGLF TAADERSIQN FTLERQLFGK FVKVELVSHY GKEHFCPIGL SLFHVYGNSE YEVLDNEEDS RTGSISHRQE EDSTEEEMLF DLQKNISNTD VVVEGRNLFG SAKDAVINIV KKAAEVLTKS P // ID E9HFW9_DAPPU Unreviewed; 217 AA. AC E9HFW9; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFX69393.1}; GN ORFNames=DAPPUDRAFT_62314 {ECO:0000313|EMBL:EFX69393.1}; OS Daphnia pulex (Water flea). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. OX NCBI_TaxID=6669 {ECO:0000313|Proteomes:UP000000305}; RN [1] {ECO:0000313|EMBL:EFX69393.1, ECO:0000313|Proteomes:UP000000305} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21292972; DOI=10.1126/science.1197761; RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A., Arnold G.J., Basu M.K., RA Bauer D.J., Caceres C.E., Carmel L., Casola C., Choi J.H., RA Detter J.C., Dong Q., Dusheyko S., Eads B.D., Frohlich T., RA Geiler-Samerotte K.A., Gerlach D., Hatcher P., Jogdeo S., RA Krijgsveld J., Kriventseva E.V., Kultz D., Laforsch C., Lindquist E., RA Lopez J., Manak J.R., Muller J., Pangilinan J., Patwardhan R.P., RA Pitluck S., Pritham E.J., Rechtsteiner A., Rho M., Rogozin I.B., RA Sakarya O., Salamov A., Schaack S., Shapiro H., Shiga Y., RA Skalitzky C., Smith Z., Souvorov A., Sung W., Tang Z., Tsuchiya D., RA Tu H., Vos H., Wang M., Wolf Y.I., Yamagata H., Yamada T., Ye Y., RA Shaw J.R., Andrews J., Crease T.J., Tang H., Lucas S.M., RA Robertson H.M., Bork P., Koonin E.V., Zdobnov E.M., Grigoriev I.V., RA Lynch M., Boore J.L.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331:555-561(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL732637; EFX69393.1; -; Genomic_DNA. DR STRING; 6669.DappuP62314; -. DR EnsemblMetazoa; EFX69393; EFX69393; DAPPUDRAFT_62314. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; E9HFW9; -. DR OMA; FPLWYFS; -. DR PhylomeDB; E9HFW9; -. DR Proteomes; UP000000305; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000305}; KW Reference proteome {ECO:0000313|Proteomes:UP000000305}. SQ SEQUENCE 217 AA; 24085 MW; EF788992267CFDC9 CRC64; MKEDLIAEIR SQIKLVNANS GRGLSKDEVE QMIHSALGVY DSDKTGLADF ALEPAGGVIL STRCTETYNS HRPRLSIWGF SLWSEPNNPR VVIQPGIVPG QCWSFRGFDG YLVIQLSRKI VPTAFTLEHI PRSLAPDGQI DSAPRNFTVY GLTREVDIAG VVLGQYTFDN LGVPLQSFPV QAHEPGAFSI VELRIHSNYG NLNYTCLYRF RVHGHVA // ID E9HLA9_DAPPU Unreviewed; 2758 AA. AC E9HLA9; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFX67482.1}; GN ORFNames=DAPPUDRAFT_63883 {ECO:0000313|EMBL:EFX67482.1}; OS Daphnia pulex (Water flea). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. OX NCBI_TaxID=6669 {ECO:0000313|Proteomes:UP000000305}; RN [1] {ECO:0000313|EMBL:EFX67482.1, ECO:0000313|Proteomes:UP000000305} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21292972; DOI=10.1126/science.1197761; RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A., Arnold G.J., Basu M.K., RA Bauer D.J., Caceres C.E., Carmel L., Casola C., Choi J.H., RA Detter J.C., Dong Q., Dusheyko S., Eads B.D., Frohlich T., RA Geiler-Samerotte K.A., Gerlach D., Hatcher P., Jogdeo S., RA Krijgsveld J., Kriventseva E.V., Kultz D., Laforsch C., Lindquist E., RA Lopez J., Manak J.R., Muller J., Pangilinan J., Patwardhan R.P., RA Pitluck S., Pritham E.J., Rechtsteiner A., Rho M., Rogozin I.B., RA Sakarya O., Salamov A., Schaack S., Shapiro H., Shiga Y., RA Skalitzky C., Smith Z., Souvorov A., Sung W., Tang Z., Tsuchiya D., RA Tu H., Vos H., Wang M., Wolf Y.I., Yamagata H., Yamada T., Ye Y., RA Shaw J.R., Andrews J., Crease T.J., Tang H., Lucas S.M., RA Robertson H.M., Bork P., Koonin E.V., Zdobnov E.M., Grigoriev I.V., RA Lynch M., Boore J.L.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331:555-561(2011). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL732677; EFX67482.1; -; Genomic_DNA. DR STRING; 6669.DappuP63883; -. DR EnsemblMetazoa; EFX67482; EFX67482; DAPPUDRAFT_63883. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; E9HLA9; -. DR OMA; NRQCIEG; -. DR Proteomes; UP000000305; Unassembled WGS sequence. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 1. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000305}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000000305}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1325 1345 {ECO:0000256|SAM:Coils}. FT COILED 1655 1675 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2758 AA; 296294 MW; CCB44128D31F11C1 CRC64; MADVDPETLL EWLTMGLGQE ELAERDMQLV ALEQLCMLLL MSDNVDRCFE SCPPRTFLPA LCRILLDKTA PDSVLEVTAR AITYYLDVSA ECTRRVVAVD GAVKALCGRL GTADLQQRTS RDLAEQCVKV MELICTREAG AVFEAGGLQC VLSFIRDAGT LVHKDTLHSA MVVVSRLCSK VEPHDQGLAA CVDALSTLLQ HEDGHVADGA LRCFASLADR FTRRGIDPGP LAEHGLVSAL LFRLSNAAGT AATTTTTSSS SSQQSTNSPA AAAAGIAGSA TASASATTSN PVEATGTKSA TCSSSISTVI SLLSTLCRGS PGITHDLLRS ELPDAIERAV NGEERCILDT MRLVDLLLVL LFEGRKALPR SGGGGGLANA ASRERTHRQL IDCIRSKDTD ALIDAIDSGG IEVNFMDDVG QTLLNWASAF GTQEMVEFLC ERCADVNKGQ RSSSLHYAAC FGRPAIAKVL LRYGANPDLR DEDGKTPLDK ARERNDEGHR EVAAILQSPG EWMITPTSST AVAEDSSILL DGTTTSTEPK GDPDMAPVYL RRLLPVFCHT FQSSMIVSVR KASLTIIKKM VHYTQPALLH SLCSHESNVF ISTLVEVVAA VLDNEEDEDG HLVCMHIIQD LMNKSADTFL DHFARLGIFS KVQMLANGET TATATTATTA TPTASTTSAQ VDAVKELMAG RPYHWRDWCI ARGRDCLYIW SDAAALELSN GSNGWFRFIL DGKLATMYSS GSPEGGSDSS ENRGEFLDKL QRVRTQIKPG TMSQAVFPPG VKIPDALRLT VGNWSLSSAK DGELIIQNSD GQQQATILRE DLPGFLFESN RGTKHTFTAE TALGPELAAG WAPVHCPGSG GRRSATKQKV RAQAQEIYQR YFQVAQAQPR GVVARLAAIV VHIERGCAAQ ENNREQQSRD HSSSQGGGAA WRDLLRSSLN DFRSLLEGED TTLSAFELQS SGLVQALHRL LSPAGLDDYL QGTRRGNRLL RQRVAIFREC FQSADYLYQN GPKGPATCLV RKLVSVLESI EKLPVYLYDA AGGSTNGLQT LTRRLRFRLE RSPGETGLTD RTGRALKTEP LTTVGQLEKY LLKMVAKQWY DYERSTLAFV RKIQAATAPA PASAPASAPA PASAVTGCPS ASPLVFRHTR DFDEGGIIYW LGTNGKTVPD WVNPAQVGLV VVTCSEGRNL PYGHLEDILS RDSAALNCHT NDDKRSWFAL DLGLHLLPTA YTLRHARGYG KSALRHWLLQ VSKDGLSWTT LFTHVDDCSL NEPGSTATWP LEPPGGETQG WRHIRIQQMG KNASGQTHYL SVSGLELYGT VTGVCQDLGR AAREAEANLR RQRRMVRHQM LRHLVPGARV VRGLDWKWRD QDGSTTTTAA VAAAAASAGG LAQPAEGLVT GELHNGWIDV QWDHGGSNSY RMGAEGKYDL RLAPSYDPDA AQQQQQPPTA AASSSTAGKT SSALKTINLD KSRKSSSTPS LPEATGVVKP SVASTEQATS VDNLLTSSSS STAAASAVAS RTGTQSTDDD EATSTAVAAL VGALSLMDPS GSSAEGAAGE SQVSDSHKNA KNTYLAGQST ATATGAAATQ LMDHSSLLAS SGAATGSSAA SISLSAASQA AAAAAAATLD GIVVDDSLMA DVTEEMLEIN AFDLLRSLER QASQLQAEAE TMAAASSALS HVELAGQHQQ QQQQQQADDE QDDMSSPPSP PSQSVKVASG AASSVRTSGS MSVSVPNLTT TSSEPEREVG GGPTPAAFLE SFANVARRRH QAGNAANSGS GGASGGNASG SPVDLSNANR SSGSNSGAGA MAGVGGHHGA SGTNAVNVSS SSSSNASSSL LFPRGPNSVS SLVRLALSSN FPGGLLSTAQ SYPSLSNTLS SLSGGGGAGA GHQGHQQTSS SSSHGATLSQ ALSMSLTGSS DSEQVSLEDF LESCRAGTLL AELEDDDELP EPDEDDNEDD DENEDDEDFE ERRSWDDEFV LKRQFSALIP AFDPRPGRTN VHQTSDLEIP PPGAAEGAQG AEAQSVIQSM DVEADADLVP QPKLHLVLRG PCLPNIPDVE IELTDSDWTV FKAVQKLIQA SALGTRQEKL RRIWEPTYTL VYKELKDEAG AVVVVVGPST TTGGAMGDAA GQRRLMDDEL FLPDFDVNTQ TSNTQLQHQS HHSPTIFIPL EEFSSKKVTN KLMQQLSDPL VVSSGALPAW CEQLLTVCPI LIPFETRQMY FHANAFGTSR SIVWLQSQRD SAVERQRNTG AVPRRDDPHE FRVGRLKHER VRVPRGERLL DWAQQVMKVH ADRKAILEVE FQDEEGTGLG PSLEFYALVA AESQRRDLAL WICDDEDENE LTASETAASV ADSGVKPPGY YVVRPSGLFP APLPQDSPIC DHAEQLYWFL GVFLAKALQD GRLVDLPLST PFLKLLCQGD QQQQQQQQQQ QQQEAEDPMT SSVFSQDGDV LGGNGKPLTG SSSPRDPSLP WFSGILSHED LAVVDPTRGR FLLQLRALAT RRRQILTDPT LSHEDRCRQA DNLALAQPNL LNSNTTTTGG GGGVRLEDLA LTFQFAPSSK VFGFTDVELR SGGADFEVTL ENVEDYVEQM AEFCLERGIR RQMEALRQGF DRVFPMSRLA GFSPSELRLL LCGDQSPSWT REDILNYTEP KLGYTRDSPG FQRFVNVLTG MNAEERKAFL QFTTGCSSLP PGGLANLYPR LTVVRKVDGV SDRSGCVNGS YPSVNTCVHY LKLPEYDSEE ILRERLLAAT REKGFHLN // ID E9HRU4_DAPPU Unreviewed; 324 AA. AC E9HRU4; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFX65508.1}; GN ORFNames=DAPPUDRAFT_264556 {ECO:0000313|EMBL:EFX65508.1}; OS Daphnia pulex (Water flea). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. OX NCBI_TaxID=6669 {ECO:0000313|Proteomes:UP000000305}; RN [1] {ECO:0000313|EMBL:EFX65508.1, ECO:0000313|Proteomes:UP000000305} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21292972; DOI=10.1126/science.1197761; RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A., Arnold G.J., Basu M.K., RA Bauer D.J., Caceres C.E., Carmel L., Casola C., Choi J.H., RA Detter J.C., Dong Q., Dusheyko S., Eads B.D., Frohlich T., RA Geiler-Samerotte K.A., Gerlach D., Hatcher P., Jogdeo S., RA Krijgsveld J., Kriventseva E.V., Kultz D., Laforsch C., Lindquist E., RA Lopez J., Manak J.R., Muller J., Pangilinan J., Patwardhan R.P., RA Pitluck S., Pritham E.J., Rechtsteiner A., Rho M., Rogozin I.B., RA Sakarya O., Salamov A., Schaack S., Shapiro H., Shiga Y., RA Skalitzky C., Smith Z., Souvorov A., Sung W., Tang Z., Tsuchiya D., RA Tu H., Vos H., Wang M., Wolf Y.I., Yamagata H., Yamada T., Ye Y., RA Shaw J.R., Andrews J., Crease T.J., Tang H., Lucas S.M., RA Robertson H.M., Bork P., Koonin E.V., Zdobnov E.M., Grigoriev I.V., RA Lynch M., Boore J.L.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331:555-561(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL732741; EFX65508.1; -; Genomic_DNA. DR STRING; 6669.DappuP264556; -. DR EnsemblMetazoa; EFX65508; EFX65508; DAPPUDRAFT_264556. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR PhylomeDB; E9HRU4; -. DR Proteomes; UP000000305; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000305}; KW Reference proteome {ECO:0000313|Proteomes:UP000000305}. SQ SEQUENCE 324 AA; 36462 MW; A27F3AF84FBF550D CRC64; MPRQIAEQKY WMPTAKLNTP IQAQRVGIAN FELFSSSPKD FRVYISDRYP TRDWALIGLF TAADERSIQS FTLERQLWQV ELVSHYGKEH FCPIGLSLFH VYGNSEYEVL DNEEDSRTGS ISHRQEEDST EEEMLFDLQK NISNTDVDVE GRNLFGSAKD AVINIVKKAA EVLTKSPAPQ SVMVTINETD NKEAIHPIVC YTAVEPVNVS RAINGSLSCE WKELKFLASL PWLHSSLLLQ CQKESFDQTF SSVELAFGES IWTRSTIQAL CHWILPEEPH IPASLESVSP SIVELDVPEN VNPCTEAQQV GEVESIAAEH AAQC // ID E9HY63_DAPPU Unreviewed; 656 AA. AC E9HY63; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFX63319.1}; GN ORFNames=DAPPUDRAFT_335634 {ECO:0000313|EMBL:EFX63319.1}; OS Daphnia pulex (Water flea). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. OX NCBI_TaxID=6669 {ECO:0000313|Proteomes:UP000000305}; RN [1] {ECO:0000313|EMBL:EFX63319.1, ECO:0000313|Proteomes:UP000000305} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21292972; DOI=10.1126/science.1197761; RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A., Arnold G.J., Basu M.K., RA Bauer D.J., Caceres C.E., Carmel L., Casola C., Choi J.H., RA Detter J.C., Dong Q., Dusheyko S., Eads B.D., Frohlich T., RA Geiler-Samerotte K.A., Gerlach D., Hatcher P., Jogdeo S., RA Krijgsveld J., Kriventseva E.V., Kultz D., Laforsch C., Lindquist E., RA Lopez J., Manak J.R., Muller J., Pangilinan J., Patwardhan R.P., RA Pitluck S., Pritham E.J., Rechtsteiner A., Rho M., Rogozin I.B., RA Sakarya O., Salamov A., Schaack S., Shapiro H., Shiga Y., RA Skalitzky C., Smith Z., Souvorov A., Sung W., Tang Z., Tsuchiya D., RA Tu H., Vos H., Wang M., Wolf Y.I., Yamagata H., Yamada T., Ye Y., RA Shaw J.R., Andrews J., Crease T.J., Tang H., Lucas S.M., RA Robertson H.M., Bork P., Koonin E.V., Zdobnov E.M., Grigoriev I.V., RA Lynch M., Boore J.L.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331:555-561(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL733115; EFX63319.1; -; Genomic_DNA. DR STRING; 6669.DappuP335634; -. DR EnsemblMetazoa; EFX63319; EFX63319; DAPPUDRAFT_335634. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; E9HY63; -. DR OMA; GRNARAN; -. DR PhylomeDB; E9HY63; -. DR Proteomes; UP000000305; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000305}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000305}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 481 499 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 405 440 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 656 AA; 73692 MW; 616826E5A912BCBC CRC64; MCGRNARANY GHQQGGRDSD GHQCRALHKV AEPAVVGVAI ERVGIANFEL FSSSPKDFRV YISDRYPTRD WALIGLFTAA DERSIQSFTL ERQLWQSLWQ GTFLSHRLSL FHVYGNSEYE VLDNEEDSRT GSISHRQEED STEEEMLFDL QKNISNTDVV VEGRNLFGSA KDAVINIVKK AAEVLTKSPA PQSVMVTVNE TDNKETIHPT VCYTAVEPVN VSRAINGSLS CEWKELKFLA SLPWLHSSLL LQCQKESFDQ TFSSVELAFA ESIWTRSTIQ ALCHWILPED PHIPASLESV SPSIVELDVS ENVNPCTEAQ QVGEVESIAA EQPSVEQTEL ENFEQLFSEE MGTVATSSDD LPIGMETIIP NDGFLIEPSK PIGVQQQQQE IKVLERNMSL SGQYLEELSR RYKRQVDDMQ KSLNRTLQTL NDTLQRMSTQ EERYQVVLSQ LHGDMAELKT SAYKLAEENL VLRAQTIEQH LFLMVVEVIV IAVLVVFLLR NFTRPQFNQP PDPRHSPTHR QKEHQPAPLD FRLEVGQKVE EVEMVKTTLP VSLNMSEKAL QPYKTKTQVS PTPSLADSSS SGCESAFSFP NSNCTTTSRD PSPVDGNRAR GKSSKSGSKK QQKKRNKVLI MSSPTTKHPS TTRKMPGQLY TPQKFA // ID E9I182_DAPPU Unreviewed; 228 AA. AC E9I182; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFX62246.1}; GN ORFNames=DAPPUDRAFT_301689 {ECO:0000313|EMBL:EFX62246.1}; OS Daphnia pulex (Water flea). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Branchiopoda; OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia. OX NCBI_TaxID=6669 {ECO:0000313|Proteomes:UP000000305}; RN [1] {ECO:0000313|EMBL:EFX62246.1, ECO:0000313|Proteomes:UP000000305} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21292972; DOI=10.1126/science.1197761; RA Colbourne J.K., Pfrender M.E., Gilbert D., Thomas W.K., Tucker A., RA Oakley T.H., Tokishita S., Aerts A., Arnold G.J., Basu M.K., RA Bauer D.J., Caceres C.E., Carmel L., Casola C., Choi J.H., RA Detter J.C., Dong Q., Dusheyko S., Eads B.D., Frohlich T., RA Geiler-Samerotte K.A., Gerlach D., Hatcher P., Jogdeo S., RA Krijgsveld J., Kriventseva E.V., Kultz D., Laforsch C., Lindquist E., RA Lopez J., Manak J.R., Muller J., Pangilinan J., Patwardhan R.P., RA Pitluck S., Pritham E.J., Rechtsteiner A., Rho M., Rogozin I.B., RA Sakarya O., Salamov A., Schaack S., Shapiro H., Shiga Y., RA Skalitzky C., Smith Z., Souvorov A., Sung W., Tang Z., Tsuchiya D., RA Tu H., Vos H., Wang M., Wolf Y.I., Yamagata H., Yamada T., Ye Y., RA Shaw J.R., Andrews J., Crease T.J., Tang H., Lucas S.M., RA Robertson H.M., Bork P., Koonin E.V., Zdobnov E.M., Grigoriev I.V., RA Lynch M., Boore J.L.; RT "The ecoresponsive genome of Daphnia pulex."; RL Science 331:555-561(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL733724; EFX62246.1; -; Genomic_DNA. DR STRING; 6669.DappuP301689; -. DR EnsemblMetazoa; EFX62246; EFX62246; DAPPUDRAFT_301689. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR PhylomeDB; E9I182; -. DR Proteomes; UP000000305; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000305}; KW Reference proteome {ECO:0000313|Proteomes:UP000000305}. SQ SEQUENCE 228 AA; 24421 MW; C3EB4A16EA7BA5FA CRC64; MYTPALISAA PLVRATSRNL TSSEVTVPVD NLDADLSVPS IEVLSSTEIL EESAYPEISA SEVISADAEP NVKLENVLAT NNVLNVTNNV TRNNESGSSE DDIPSFSEWT LKVLAEEEKS GANGSTGVQT PAKLSATPKL RQKNYASPDC GAKILDANSE AEHTSAILDP SRDEYFLSIC SAKIWFVIEL CEAIQAQRVG IANFELFSSS PKDFRVTLAT VIQLGIGL // ID E9IHR6_SOLIN Unreviewed; 295 AA. AC E9IHR6; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFZ19830.1}; DE Flags: Fragment; GN ORFNames=SINV_00618 {ECO:0000313|EMBL:EFZ19830.1}; OS Solenopsis invicta (Red imported fire ant) (Solenopsis wagneri). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. OX NCBI_TaxID=13686 {ECO:0000313|Proteomes:UP000006539}; RN [1] {ECO:0000313|EMBL:EFZ19830.1, ECO:0000313|Proteomes:UP000006539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21282665; DOI=10.1073/pnas.1009690108; RA Wurm Y., Wang J., Riba-Grognuz O., Corona M., Nygaard S., Hunt B.G., RA Ingram K.K., Falquet L., Nipitwattanaphon M., Gotzek D., RA Dijkstra M.B., Oettler J., Comtesse F., Shih C.J., Wu W.J., Yang C.C., RA Thomas J., Beaudoing E., Pradervand S., Flegel V., Cook E.D., RA Fabbretti R., Stockinger H., Long L., Farmerie W.G., Oakey J., RA Boomsma J.J., Pamilo P., Yi S.V., Heinze J., Goodisman M.A., RA Farinelli L., Harshman K., Hulo N., Cerutti L., Xenarios I., RA Shoemaker D., Keller L.; RT "The genome of the fire ant Solenopsis invicta."; RL Proc. Natl. Acad. Sci. U.S.A. 108:5679-5684(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL763294; EFZ19830.1; -; Genomic_DNA. DR EnsemblMetazoa; SINV23224-RA; SINV23224-PA; SINV23224. DR InParanoid; E9IHR6; -. DR OMA; NYESAYS; -. DR Proteomes; UP000006539; Unassembled WGS sequence. DR GO; GO:0016747; F:transferase activity, transferring acyl groups other than amino-acyl groups; IEA:InterPro. DR Gene3D; 3.40.47.10; -; 1. DR InterPro; IPR012919; SUN_dom. DR InterPro; IPR016039; Thiolase-like. DR InterPro; IPR020616; Thiolase_N. DR Pfam; PF07738; Sad1_UNC; 1. DR Pfam; PF00108; Thiolase_N; 1. DR SUPFAM; SSF53901; SSF53901; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006539}; KW Reference proteome {ECO:0000313|Proteomes:UP000006539}. FT COILED 52 72 {ECO:0000256|SAM:Coils}. FT NON_TER 295 295 {ECO:0000313|EMBL:EFZ19830.1}. SQ SEQUENCE 295 AA; 32601 MW; 5F88B61C102E511A CRC64; MSFVKFNTNL IYVSIKRSDI LAMSVSHLCD KYSASASFKM IKADLGNLRS HLDTLSLEVK NVIEMRDELK SKLKEVGTVI PKMSEAILNL RNEVSEEMNL HTKNLLKALS LDTVRNMVRN ELQTSNQAIT TGMGLIACGV YDAIVAGGVE FMSDIPIRHS RSMRSLMLQA NKAKTLQNKL ALLASIRPGH FIPDTSVLPG ECWAFKGSSG SVVIRLLGRV HVSGVSLEHI SSLISPTGET ATAPKDFSIW GLTDLDDKKP FSFGSFIYDN TGPPLQYFEV QVKYPIQYAP VQLTF // ID E9IZL6_SOLIN Unreviewed; 934 AA. AC E9IZL6; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFZ13931.1}; DE Flags: Fragment; GN ORFNames=SINV_03207 {ECO:0000313|EMBL:EFZ13931.1}; OS Solenopsis invicta (Red imported fire ant) (Solenopsis wagneri). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. OX NCBI_TaxID=13686 {ECO:0000313|Proteomes:UP000006539}; RN [1] {ECO:0000313|EMBL:EFZ13931.1, ECO:0000313|Proteomes:UP000006539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21282665; DOI=10.1073/pnas.1009690108; RA Wurm Y., Wang J., Riba-Grognuz O., Corona M., Nygaard S., Hunt B.G., RA Ingram K.K., Falquet L., Nipitwattanaphon M., Gotzek D., RA Dijkstra M.B., Oettler J., Comtesse F., Shih C.J., Wu W.J., Yang C.C., RA Thomas J., Beaudoing E., Pradervand S., Flegel V., Cook E.D., RA Fabbretti R., Stockinger H., Long L., Farmerie W.G., Oakey J., RA Boomsma J.J., Pamilo P., Yi S.V., Heinze J., Goodisman M.A., RA Farinelli L., Harshman K., Hulo N., Cerutti L., Xenarios I., RA Shoemaker D., Keller L.; RT "The genome of the fire ant Solenopsis invicta."; RL Proc. Natl. Acad. Sci. U.S.A. 108:5679-5684(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL767232; EFZ13931.1; -; Genomic_DNA. DR EnsemblMetazoa; SINV11778-RA; SINV11778-PA; SINV11778. DR InParanoid; E9IZL6; -. DR OMA; TRCTQRY; -. DR Proteomes; UP000006539; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006539}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006539}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 350 369 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 440 457 Helical. {ECO:0000256|SAM:Phobius}. FT NON_TER 934 934 {ECO:0000313|EMBL:EFZ13931.1}. SQ SEQUENCE 934 AA; 109083 MW; 09144F89A97ACF98 CRC64; MESEQHHYEL RSRSRSRSHT PMIPNRPLPE QDERHYDLRN WSRERSHTPG EVTSSRRSAS RSTVSMKTHE KTMEPIEESK EGSVTESLTD NQSTKSESSV VVTKKAERRS ERQRAKRQIF ANGQSESKDD VSDQKAERKR KSVTPRRVLT SDYSSEEGER EDPLSRSASS AYEIYKQAGD WWNVFPKTDY TYSPASECRY EIAPGILAMP NMSRRSIHVN DNGSTVISRR NVSQASRGTT ESGISDSDTI DLKETASLIN SSENNYGMSS TRMSDSKGKA TLYKKTRVEQ YTSHKETIYS EPRHSSDFRS RYLSYMPSFR EYDSSRVDSD TELDETFAKS TVKTKQNWKI VQWFTYFVTF IVAYFRKTVQ FFRMDRKRQY YGAQAYRGSN VSKLNTLWQT LDRYTHNMYF FFVRMLVFDA WLLSQFTGVR KWLQEKGPRI LWITLLPLLL LLGAYIAQRH YFTFADSLRN VQYGVTETTY HIFKDIINFM SLTYELFIKY FIDLKSYLVH LLPSVNVSLP ELPPIGWNIK WKFVSGGWCI AQYLSLVSDV ETAPQRVIER SLSDVQWEDR KSVIEELLAN NEIIKDLVNK ADLVNRIEML ENKQAHQMEY LMNISRTLED RKKSDADFRK EYDDKIIDVE NKLDVSSELK NMAYSELKVI KHEFEELRKL YSELKSCCNA NAEFVANQNI EKRVEKILFG YFPSGISKED LVKNMQTLFT FHNKEDENIL SGDDANDHVS DEHIRKIVKE VLRIYDADKT GQVDYALETA GGQIISTRCT QRYDIKTRAY SLFGFPLYYE SNDPRTVIQG NPIQPGVCWA FQDFPGYLLI QLRGIIYVTG FTLEHVPRLI LPNENMSSAP RKFNVWGLLS ENDQEPVMFG EYEFTISDEN LQYFPVQNTE IKKPYEYIEL RIHSNHGQLE YTCLYRFRVH GRPA // ID E9JB26_SOLIN Unreviewed; 2000 AA. AC E9JB26; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 14-OCT-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EFZ09977.1}; DE Flags: Fragment; GN ORFNames=SINV_12881 {ECO:0000313|EMBL:EFZ09977.1}; OS Solenopsis invicta (Red imported fire ant) (Solenopsis wagneri). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Solenopsis. OX NCBI_TaxID=13686 {ECO:0000313|Proteomes:UP000006539}; RN [1] {ECO:0000313|EMBL:EFZ09977.1, ECO:0000313|Proteomes:UP000006539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21282665; DOI=10.1073/pnas.1009690108; RA Wurm Y., Wang J., Riba-Grognuz O., Corona M., Nygaard S., Hunt B.G., RA Ingram K.K., Falquet L., Nipitwattanaphon M., Gotzek D., RA Dijkstra M.B., Oettler J., Comtesse F., Shih C.J., Wu W.J., Yang C.C., RA Thomas J., Beaudoing E., Pradervand S., Flegel V., Cook E.D., RA Fabbretti R., Stockinger H., Long L., Farmerie W.G., Oakey J., RA Boomsma J.J., Pamilo P., Yi S.V., Heinze J., Goodisman M.A., RA Farinelli L., Harshman K., Hulo N., Cerutti L., Xenarios I., RA Shoemaker D., Keller L.; RT "The genome of the fire ant Solenopsis invicta."; RL Proc. Natl. Acad. Sci. U.S.A. 108:5679-5684(2011). CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL770834; EFZ09977.1; -; Genomic_DNA. DR EnsemblMetazoa; SINV25456-RA; SINV25456-PA; SINV25456. DR InParanoid; E9JB26; -. DR OMA; NRQCIEG; -. DR Proteomes; UP000006539; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006539}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000006539}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1208 1228 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EFZ09977.1}. FT NON_TER 2000 2000 {ECO:0000313|EMBL:EFZ09977.1}. SQ SEQUENCE 2000 AA; 220131 MW; 9E09EA222A69B344 CRC64; TMLPSVRKAS LSLIRKMVHY IQPELLVETC GSDRTGGCGA MLVEVIANVL DNEEDEDGHL VVLQMIQDLM IKGKDEFLEH FARLGVFSKV AALAGPQEST PEPEAESNQS GEEQRMEDAK ELLIGRAYHW RDWCICRGRD CLYIWSDAAA LELSNGSNGW FRFILDGKLA TMYSSGSPEG GTDTSENRGE FLEKLQRARS QLKVNFVSQP VLSRPGTTRL VVGNWALSSR KESELCIHNS DGQQQATILR EDLPGFIFES NRGTKHSFTA ETSLGPEFAA GWTGKRGKRL RSKIEAIKQK VKVQAQEIYE RYFKAAQAQP RGIVAKLGAI VNQIEKASQK QQSGSREWRD LLQTALEQLK VLLNEEGRVS AYELHSSGLI QALLALLAAP PGPSPTTLRA TKLRMQRITT FKSCFQTKDT NKEPNSAKIL VHKLVSVLES IEKLPVYLYD TPGSGYGLQI LTRRLRFRLE KASSESALID RSGRSLKMEP LSTIQQLENH LLKMVAKQWH DHDRSTFAFV KRLKEENKIL FKYQHDFDEN GLLYWIGTNA KTCSEWVNPG QYGLVVVTSS DGRNLPYGHL EDILSRDPSA LNCHTNDDKR AWFSIDLGVW IIPSAYTLRH ARGYGRSALR NWMFQASKDG VTWITLYAHV DDCSLNEPGS TSTWTLEPPS EETQGWRHLR LQQIGKNASG QTHYLSVSGF EVYGEVTGVC EDLGRAAKEA EAGVRKQRRF IKTQVLKHLV AGVRVSRGLD WKWRDQDGVP PGEGTVTGEL HNGWIDVTWD HGGSNSYRMG AEGKYDLRLV GAGLDTDSTA KCKSGGGVLT GRKSNSTPSL PDCTDTAMRG SVASTDQAAS ADNLAAKQAA ESIAESVLSV ARAEAVVAVT GESGANSTSE LSVVLHPRPD TTVTSDLATI VESLTLNTDC PVNSTSNRAS SSKPLFATVR GNKASGGLLS LETAEVLDRM REGADRLRNN TNSFLSGELL GLVPVRISVS GESDENSLRI KSVPRHHPTG ITDVAKDCTR EKEASSSTQN TTGGCPVVVT NPMSVSVPNL ACSDANNTLE STAATGLLET FAAMARRRTL GPAGGQHLAS NSNTSCNPIR GPNSVSSLVR LALSPNFPGG LLSTAQSYPS LTSSGQVAGS GVTTTTGPGL GQALTMSLTS TSSDSEQVSL EDFLESCGGV ATSSTGGGRT TGGPTLLTEL EDDEDGVLEE EEDNEENDQE VSREYEEVMV SRNLLAAFME EEAPQSSKRR AWDDEFVLKR QFSALIPAFD PRPGRTNINQ TTDLEVPPPG SEAQVNSRIG SLPMPRLSLS LKGPGFPGIP DVEISLSDSH ASIFQAVQEL MQLTELGSRQ EKLKRIWEPT YTIIYKEARD EESSGRATPI VTLYSRNPTQ NANACTVEDV LQLLRHVFVL CTIRDESALV EQDESNDTTY WLHPDDFTSK KITNKIVQQI QDPLALAAGA LPNWCEELAR SCPFLLPFET RRLYFSCTAF GASRSIVWLQ TQRDAILERQ RAPGLSPRRD DSHEFRVGRL KHERVSVPRG EKLLDWAEQV LKVHASRKSI LEVAFVGEEG TGLGPTLEFF ALVAAELQRK DLGLWLCDDT ADDNTATRIL NEEQTCISGE KIRPAGYYVT RASGLFPAPL PQDSACCDRA VRYFWFLGVF LAKVLQDNRL VDLPLSRPFL KLMCRGDISN NVNEKIGLTT GVTQESMSSS MSSSFISEEE ENDAAYSSLE SCPWYAGLLD IEDLVEVDPV RGEFLREIQN AIAKRDRTFS DGPSSPDHEE TSLHITHPSG TSVAIEDLAL TMTYSPSSKV FQHDQVELIE GGSDIVVTIE NAREYANLTI NYCLNQGIYR QLEAFKSGFS KVFPMEKLHV FSPDEMRAML CGEQNPQWTR EDLLNYTEPK LGYTKESPGF QRFVNVLLSL TGPERKAFLQ FATGCSALPP GGLCNLHPRL TVVRKVDAGS GGYPSVNTCV HYLKLPEYPT EEILRERLLA ATRERGFHLN // ID E9LV81_MOUSE Unreviewed; 320 AA. AC E9LV81; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Sad1 and UNC84 domain-containing protein 3 {ECO:0000313|EMBL:ADW08709.1}; GN Name=Sun3 {ECO:0000313|EMBL:ADW08709.1, ECO:0000313|MGI:MGI:3041199}; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090; RN [1] {ECO:0000313|EMBL:ADW08709.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=C57BL/6J {ECO:0000313|EMBL:ADW08709.1}; RC TISSUE=Testis {ECO:0000313|EMBL:ADW08709.1}; RX PubMed=20711465; DOI=10.1371/journal.pone.0012072; RA Gob E., Schmitt J., Benavente R., Alsheimer M.; RT "Mammalian sperm head formation involves different polarization of two RT novel LINC complexes."; RL PLoS ONE 5:E12072-E12072(2010). RN [2] {ECO:0000313|EMBL:ADW08709.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=C57BL/6J {ECO:0000313|EMBL:ADW08709.1}; RC TISSUE=Testis {ECO:0000313|EMBL:ADW08709.1}; RA Goeb E., Schmitt J., Benavente R., Alsheimer M.; RL Submitted (AUG-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HQ141857; ADW08709.1; -; mRNA. DR RefSeq; NP_001277448.1; NM_001290519.1. DR UniGene; Mm.79210; -. DR ProteinModelPortal; E9LV81; -. DR SMR; E9LV81; 124-317. DR STRING; 10090.ENSMUSP00000099973; -. DR PaxDb; E9LV81; -. DR PRIDE; E9LV81; -. DR GeneID; 194974; -. DR KEGG; mmu:194974; -. DR CTD; 256979; -. DR MGI; MGI:3041199; Sun3. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR OMA; CVKLNIF; -. DR PhylomeDB; E9LV81; -. DR NextBio; 35553709; -. DR Bgee; E9LV81; -. DR ExpressionAtlas; E9LV81; baseline and differential. DR GO; GO:0034993; C:LINC complex; IDA:MGI. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 24 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 320 AA; 36758 MW; 8DEC1E6B0D0BC1FB CRC64; MLTRSWKIIL STVFISTFLL VGLLNHQWLK ETEFPQKPRQ LYTVIAEYGS RLYNYQARLR MPKEQQELLK KESQTLENNF REILFLIEQI DVLKALLKDM KDGVHNHSLP VHRDAVQDQA TTDVLDEEMS NLVHYVLKKF RGDQIQLADY ALKSAGASVI EAGTSESYKN NKAKLYWHGI GFLNYEMPPD MILQPDVHPG KCWAFPGSQG HILIKLARKI IPTAVTMEHI SEKVSPSGNI SSAPKEFSVY GVMKKCEGEE IFLGQFIYNK MEATIQTFEL QNEASESLLC VKLQILSNWG HPKYTCLYRF RVHGIPSDYT // ID E9PHI4_HUMAN Unreviewed; 822 AA. AC E9PHI4; DT 05-APR-2011, integrated into UniProtKB/TrEMBL. DT 05-APR-2011, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=SUN domain-containing protein 1 {ECO:0000313|Ensembl:ENSP00000384116}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSP00000384116}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000384116, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000384116, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12853948; DOI=10.1038/nature01782; RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A., Pepin K.H., RA Wagner-McPherson C., Layman D., Maas J., Jaeger S., Walker R., RA Wylie K., Sekhon M., Becker M.C., O'Laughlin M.D., Schaller M.E., RA Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., RA Sun H., Edwards J., Bradshaw-Cordum H., Ali J., Andrews S., Isak A., RA Vanbrunt A., Nguyen C., Du F., Lamar B., Courtney L., Kalicki J., RA Ozersky P., Bielicki L., Scott K., Holmes A., Harkins R., Harris A., RA Strong C.M., Hou S., Tomlinson C., Dauphin-Kohlberg S., RA Kozlowicz-Reilly A., Leonard S., Rohlfing T., Rock S.M., RA Tin-Wollam A.-M., Abbott A., Minx P., Maupin R., Strowmatt C., RA Latreille P., Miller N., Johnson D., Murray J., Woessner J.P., RA Wendl M.C., Yang S.-P., Schultz B.R., Wallis J.W., Spieth J., RA Bieri T.A., Nelson J.O., Berkowicz N., Wohldmann P.E., Cook L.L., RA Hickenbotham M.T., Eldred J., Williams D., Bedell J.A., Mardis E.R., RA Clifton S.W., Chissoe S.L., Marra M.A., Raymond C., Haugen E., RA Gillett W., Zhou Y., James R., Phelps K., Iadanoto S., Bubb K., RA Simms E., Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., RA Baertsch R.A., Brent M.R., Keibler E., Flicek P., Bork P., Suyama M., RA Bailey J.A., Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., RA Eddy S.R., McPherson J.D., Olson M.V., Eichler E.E., Green E.D., RA Waterston R.H., Wilson R.K.; RT "The DNA sequence of human chromosome 7."; RL Nature 424:157-164(2003). RN [2] {ECO:0000213|PubMed:17081983} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=17081983; DOI=10.1016/j.cell.2006.09.026; RA Olsen J.V., Blagoev B., Gnad F., Macek B., Kumar C., Mortensen P., RA Mann M.; RT "Global, in vivo, and site-specific phosphorylation dynamics in RT signaling networks."; RL Cell 127:635-648(2006). RN [3] {ECO:0000213|PubMed:18669648} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=18669648; DOI=10.1073/pnas.0805139105; RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., RA Elledge S.J., Gygi S.P.; RT "A quantitative atlas of mitotic phosphorylation."; RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008). RN [4] {ECO:0000213|PubMed:20068231} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=20068231; DOI=10.1126/scisignal.2000475; RA Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L., RA Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., RA Mann M.; RT "Quantitative phosphoproteomics reveals widespread full RT phosphorylation site occupancy during mitosis."; RL Sci. Signal. 3:RA3-RA3(2010). RN [5] {ECO:0000313|Ensembl:ENSP00000384116} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [6] {ECO:0000213|PubMed:24275569} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., RA Wang L., Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human RT liver phosphoproteome."; RL J. Proteomics 96:253-262(2014). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000384116}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC073957; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AC099731; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; KF458356; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; E9PHI4; -. DR SMR; E9PHI4; 626-820. DR STRING; 9606.ENSP00000384015; -. DR PaxDb; E9PHI4; -. DR PRIDE; E9PHI4; -. DR Ensembl; ENST00000405266; ENSP00000384116; ENSG00000164828. DR HGNC; HGNC:18587; SUN1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; CEEITTH; -. DR ChiTaRS; SUN1; human. DR NextBio; 35502057; -. DR Proteomes; UP000005640; Chromosome 7. DR Bgee; E9PHI4; -. DR ExpressionAtlas; E9PHI4; baseline and differential. DR GO; GO:0043231; C:intracellular membrane-bounded organelle; IDA:HPA. DR GO; GO:0031965; C:nuclear membrane; IDA:HPA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|MaxQB:E9PHI4, KW ECO:0000213|PeptideAtlas:E9PHI4}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 296 319 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 326 345 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 420 440 {ECO:0000256|SAM:Coils}. FT COILED 465 499 {ECO:0000256|SAM:Coils}. FT COILED 512 532 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 822 AA; 91113 MW; 7F85C7F05CAE1448 CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADSGTSSA VSLKNRAART TKQRRSTNKS AFSINHVSRQ VTSSGVSHGG TVSLQDAVTR RPPVLDESWI REQTTVDHFW GLDDDGDLKG GNKAAIQGNG DVGAAAATAH NGFSCSNCSM LSERKDVLTA HPAAPGPVSR VYSRDRNQKC DDCKGKRHLD AHTAAHSQSP RLPGRAGTLW HIWACAGYFL LQILRRIGAV GQAVSRTAWS ALWLAVVAPG KAASGVFWWL GIGWYQFVTL ISWLNVFLLT RCLRNICKFL VLLIPLFLLL AGLSLRGQGN FFSFLPVLNW ASMHRTQRVD DPQDVFKPTT SRLKQPLQGD SEAFPWHWMS GVEQQVASLS GQCHHHGENL RELTTLLQKL QARVDQMEGG AAGPSASVRD AVGQPPRETD FMAFHQEHEV RMSHLEDILG KLREKSEAIQ KELEQTKQKT ISAVGEQLLP TVEHLQLELD QLKSELSSWR HVKTGCETVD AVQERVDVQV REMVKLLFSE DQQGGSLEQL LQRFSSQFVS KGDLQTMLRD LQLQILRNVT HHVSVTKQLP TSEAVVSAVS EAGASGITEA QARAIVNSAL KLYSQDKTGM VDFALESGGG SILSTRCSET YETKTALMSL FGIPLWYFSQ SPRVVIQPDI YPGNCWAFKG SQGYLVVRLS MMIHPAAFTL EHIPKTLSPT GNISSAPKDF AVYGLENEYQ EEGQLLGQFT YDQDGESLQM FQALKRPDDT AFQIVELRIF SNWGHPEYTC LYRFRVHGEP VK // ID F0UNQ5_AJEC8 Unreviewed; 729 AA. AC F0UNQ5; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGC46817.1}; GN ORFNames=HCEG_06032 {ECO:0000313|EMBL:EGC46817.1}; OS Ajellomyces capsulatus (strain H88) (Darling's disease fungus) OS (Histoplasma capsulatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Histoplasma. OX NCBI_TaxID=544711 {ECO:0000313|Proteomes:UP000008142}; RN [1] {ECO:0000313|Proteomes:UP000008142} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H88 {ECO:0000313|Proteomes:UP000008142}; RA Champion M., Cuomo C., Ma L.-J., Henn M.R., Sil A., Goldman B., RA Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L., Berlin A., RA Borenstein D., Chen Z., Engels R., Freedman E., Gellesch M., RA Goldberg J., Griggs A., Gujja S., Heiman D., Hepburn T., Howarth C., RA Jen D., Larson L., Lewis B., Mehta T., Park D., Pearson M., RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., RA Walk T., White J., Yandava C., Klein B., McEwen J.G., Puccia R., RA Goldman G.H., Felipe M.S., Nino-Vega G., San-Blas G., Taylor J., RA Mendoza L., Galagan J., Nusbaum C., Birren B.; RT "Annotation of Ajellomyces capsulatus strain H88."; RL Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS990640; EGC46817.1; -; Genomic_DNA. DR EnsemblFungi; EGC46817; EGC46817; HCEG_06032. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008142; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008142}; KW Reference proteome {ECO:0000313|Proteomes:UP000008142}. FT COILED 161 181 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 729 AA; 78871 MW; BD4E7B4A4A8DFCD5 CRC64; MTGRKTASVR SGSRAQNTRG TRAAPTENPT VATEGNQSNP DLGNPSLPDV RTQQSFAYGS TKTPALPRQL EVDPSMGLSE MIDTLDDGLR QAQDRELARV DVEDPTHPVP ERRQTRSMSA SVRSSISPAP GPVSRRASSR NATTRSRAGP RRAASRQATP EEQLLETLRE VSEETEGVKR EEDPSISVLH DTPSFNGSAS VSWTTERAIH GILPRETNAG TRPNYYLHDP YGSRPSSSQE PSGLRLPPTR RPIFEEAFRA NPPLPGPIDV PNVSTSAAAR RTLPPVPAFN QLRNKSASKS SASSASSASI HTPGSSKHSS PVLVAAAPAG VHITSKQRLS GIAKTPSALL VTIGLILMTF LTYFCRDHAC MFPQSLQTTM SHYLCSPVST FATDNSTSMY ADAFHKLSSR LDQRLSDMAK EVTILKNEWN RRLPHLKEAL SGSPAAAMDP LMPPKVNYAS IGMGAVVDPY LTSPTMATSA GLVSRIGQYL AKVPRGSPPV AALQPWDGVG ECWCAATRSN VSQLTILLGR AIVPEEVVIE HIPKGATLDP GSAPREMELW VQYMARPPTA AAAYPPGSGS SNPSPPPSSA SSPHAPSPFP PFSAPSQPLP PPPATPHLRT PPFSHLRPSY YPHHLLPSWL RDAILTTLRQ VYPNEPTTAY SDDALLGPSF FRVGRWQYNI HGGHHVQRFD LDAVIDMPAA RVEKVVFRVK SNWGAAHTCL YRVRLHGHL // ID F0USK0_AJEC8 Unreviewed; 865 AA. AC F0USK0; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 16-SEP-2015, entry version 10. DE SubName: Full=Sad1 domain-containing protein {ECO:0000313|EMBL:EGC48877.1}; GN ORFNames=HCEG_08092 {ECO:0000313|EMBL:EGC48877.1}; OS Ajellomyces capsulatus (strain H88) (Darling's disease fungus) OS (Histoplasma capsulatum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Histoplasma. OX NCBI_TaxID=544711 {ECO:0000313|Proteomes:UP000008142}; RN [1] {ECO:0000313|Proteomes:UP000008142} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=H88 {ECO:0000313|Proteomes:UP000008142}; RA Champion M., Cuomo C., Ma L.-J., Henn M.R., Sil A., Goldman B., RA Young S.K., Kodira C.D., Zeng Q., Koehrsen M., Alvarado L., Berlin A., RA Borenstein D., Chen Z., Engels R., Freedman E., Gellesch M., RA Goldberg J., Griggs A., Gujja S., Heiman D., Hepburn T., Howarth C., RA Jen D., Larson L., Lewis B., Mehta T., Park D., Pearson M., RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., RA Walk T., White J., Yandava C., Klein B., McEwen J.G., Puccia R., RA Goldman G.H., Felipe M.S., Nino-Vega G., San-Blas G., Taylor J., RA Mendoza L., Galagan J., Nusbaum C., Birren B.; RT "Annotation of Ajellomyces capsulatus strain H88."; RL Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS990642; EGC48877.1; -; Genomic_DNA. DR EnsemblFungi; EGC48877; EGC48877; HCEG_08092. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008142; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008142}; KW Reference proteome {ECO:0000313|Proteomes:UP000008142}. SQ SEQUENCE 865 AA; 95513 MW; FEC0745C86414FA6 CRC64; MAWHFPLFRQ HVHTECRATD NLLYFWTIAL LAAVRAGGDM DSKNHNISPL SLDATCPPRA FSGIQHPVCL EPRWVGIGKI ENYTSNSSGE TDFYASITSA ASPSLSPTIT VTGSGSGSSG VDQELDTESP LDNANFLSFE EWKKQNLAKV GQSVENVRGD RQSAGSSGDG KRQRPTGIDN SLDSLGEDGE IALEFGGFGP EDSGPASWER KVGKDQPPDV DGAGSVTKGA EGETQIEATT RGGASRRKDA GTTCKERFNY ASFDCAATVL KTNPQCTGAS SVLIENKDSY MLNECKAKEK FLILELCDDI LIDTIVLANY EFFSSIFRTF RVSVSDRYPP KQPDMWKELG TYEAVNSREV QAFAVENPLI WARYVKIEFL THYGNEFYCP VSLIRVHGTT MLEEYKNDGE ANRLEDHNSH QIQGSRTLES GPDNSTTDPS KIAEDSEGPA EAGRFDMQPT RVLEDICLLK DAEVGGILLR SVVRAEDRMC AVHETPRAYN RTDDAVQPDL VQSHGPAQAV DNATPTTPSV EPSSNAVTPP TPVSTPTLTD TRAQKPTENE TSSNTHKTEY NGSSESPKPS TTVQYHQPNP TTQESFFKSV NKRLHMLETN SSLSLQYIEE QSRILRDAFN KVEKRQLAKT TTFLENLNTS VLQELREFRH QYDQVWHSVA VEFEQQRLQY RQEVFAMSSQ LGVLADELVF QKRISIIQSV FVLICFGLVL FSSSPIGSYL ELPRVHNMVS RSQSFRSSTH SFETPSASPL SRPNSPYQDN KRVSSSHTRT HSMESREDDL AVNPTICYSP PTPTSDSGGH ELRRRLSEQT NSTSSSVVVA PQARFLRSES SPPDLCESYE GSDGGSSSEA PQLST // ID F0VLC5_NEOCL Unreviewed; 3522 AA. AC F0VLC5; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBZ54877.1}; GN ORFNames=NCLIV_053020 {ECO:0000313|EMBL:CBZ54877.1}; OS Neospora caninum (strain Liverpool). OC Eukaryota; Alveolata; Apicomplexa; Conoidasida; Coccidia; OC Eucoccidiorida; Eimeriorina; Sarcocystidae; Neospora. OX NCBI_TaxID=572307 {ECO:0000313|EMBL:CBZ54877.1, ECO:0000313|Proteomes:UP000007494}; RN [1] {ECO:0000313|Proteomes:UP000007494} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Liverpool {ECO:0000313|Proteomes:UP000007494}; RX PubMed=22457617; DOI=10.1371/journal.ppat.1002567; RA Reid A.J., Vermont S.J., Cotton J.A., Harris D., Hill-Cawthorne G.A., RA Konen-Waisman S., Latham S.M., Mourier T., Norton R., Quail M.A., RA Sanders M., Shanmugam D., Sohal A., Wasmuth J.D., Brunk B., RA Grigg M.E., Howard J.C., Parkinson J., Roos D.S., Trees A.J., RA Berriman M., Pain A., Wastling J.M.; RT "Comparative genomics of the apicomplexan parasites Toxoplasma gondii RT and Neospora caninum: Coccidia differing in host range and RT transmission strategy."; RL PLoS Pathog. 8:e1002567-e1002567(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR823391; CBZ54877.1; -; Genomic_DNA. DR RefSeq; XP_003884905.1; XM_003884856.1. DR STRING; 572307.XP_003884905.1; -. DR GeneID; 13446581; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; F0VLC5; -. DR Proteomes; UP000007494; Chromosome X. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007494}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007494}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 64 82 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 103 121 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 3522 AA; 369834 MW; 9AE90C274D203D91 CRC64; MPISTVSSHA SGSSKKNTTA ENGHTEVSLC SWLATPKSTA CVVRVFVFLP LPIFSYAFSS QWLFVVDFGS AVSSLAVFIV SFPPVNPRSQ SRFARSKRVP RSSFVWFPSF LVCTFLLFLS APSTRVPLLS PGRLCPVQAA SPLEPKEAQA PSSSPTAVEA TASPAEGDTP QALHVSPAVA TERTRDGTFP SFSGGQEIGS EEGEEAETES LDTGEKGNAK PIQGQTSAIH LGMKAKPSQE TRDASPSLGA GDGAFSHSFP AQTGDRPGER NVKENEGETG GRKEDGRSVN ADDRMQKGEI HEKTAGSEDR QQKKKEANEE GDSEGFDGGK AGQGGGDGNV SSSSASREAN GEGGHEQDGS IGAAEFNGVG DRVKADAPTE PAGHARGEGR HGWQAETQGE ETEKETEPQY GNDGERVEEE EVVSISSPSL EHQHEARDGE REETDGVQIS EVSLKESETQ QSEKAEGQLL GHTAGRGSVR ERFSSPVSAI QEPGEARRET GPQDTEQANK EETACEVVNA LLSAVSPSPS SPYSSSSSPS SLSFSLSSSP SPSPSSSPSP SPSSSPSPSP SSPASSSPAF SSRGTIREGN ATRSFSHSPW PSAPSSSVSA LGCQPADVCF VSFHGVFSLS SPCSLRDVAA AHEQPEGEPA ERRQAYERRQ KGEEGREEGR GARKIFSRAL GLISSLTNLL SAGRSSLAAS PSPPSPAGSS ASPSSTRAPE ASLSSVASPF SPRLPCSLPD VGAGASPAAP RCSCDPAPGR IPQRSPVDSQ SHQGACSLSS PFLCADTRTS TAGVGDSLQA TAASRAFFLP GRPLTTELRC CRDAADSRPV SACASPFVCL PVPRSPVRER EIDACPVFAS GSQALFGSAS SCAAPSEQRE EEERRIGEGG LVSECSSFRS DSSFPCSRLA DSFARDGEKN SAPRTVEADA KGTEDPSRRS PSDHATAGVS PAGECRRRRG ADFSFFSACM CDTSEEGGAQ NGDSCRLAGI ASRDAEPQRE ATECAEGDGE MEGRGGRKRT PEERGGNEGD RFPVQARRGR GGLDWALTSS TPSTQEQMRD SPQDGKEAVG ENEEDGSRAS QGREAVATEG CGESVRGEAA GAENATTQGG DSVPGEEGGT QLHEVVAGEK WKSVRGLWSP SALLLRRCLP LSVRRLVLAP LKALRRRMQR PLRFPLSRGL AGAGAPREPQ RGDGSQPLRE TGEELEKAEE DQETPAASPG DSDTGPEGQL EGDGESAGES ADASGGEGTF PKPAAPRLDS AGASLSDTRS EAENAPESEA VPRSRPTPCA SAACAPEVRS RPAKAELSLN GRPFLRGRAY EAEAQKLKFD FASVDAGARI VASSRGVTNI KSVQRNDLDS YMLVPCQLQP KFFVISFTEP IHVEQVAVAS MEIYASAFRH IQLLGSDVYP TKQWRLLANL ETNPREAQEI FDVKRECAAL HGGQACWAKY LKVRLLSYHV VEAQYYYCSL TSFHVFGSTG FQMLESHIHN EGSPDAEASH GAAEAEGGDS GDSGDSGDAG SSGEARGGEG REGGGVGASS SGERQTEAGG RSRDAEKEPD DGHGEGADEE RGRGESGEGF MRTRDGVAGN EGDREEKRRG SGDEGGRIGT SSEDLETGIA AASPELLRDR GEAAPSPQER KPTPEEDDEV EEERSRGWAE QTPHGREKVD SSSAAAEVLG SAETGLSPSA ASRPSVLSPT SVEEQEADAP KTPKATQEEA PSVGRAEGIY SQGQGKNGEE EAGAVRRLPD SQGGDMGGEG GNEGTRSEAS GKAGNRPSSG DMRAEREEEV EGERGHRREQ DGEGTEGRER EPSRKQDQEE HVNKERRRPS FSPASPEEGT SGASRTGTEA SAPPSSSTAE FSPAAASEEQ REAERTAHSG PYLGHLGTVD KKDGKQDEKE PGPVGIDGHR QSETRGSGAG AHEETADARR DKGEQRGHER GEEQRPADQM HADPLGPSHA PKTPEPRAVP CSNCEEEQSS RPRAPRASAG ADRGGDQGSR LRSETPRADE GTPASRKKPS PAPPPSPQGL AARESEAETE REGLLRQLHR WRGIARLPLL FSPSPSFPLG GTGGGPAGGA SLLRSVLNYF FPSWNPVPPS RASSVVYESH SLSPSFSLFS SVPSFSSPVS PLKRPERPLD GDRQEAQEWR TRAQRAHEGD RGGTGSKDGE SPGRPPGAAN WESPDRVVAH PEGGNGEELA EGGSSPQRRS EATAKHGSEG EDGVPRCEED SKRGQRACGT RSPSSHTPRM PPPRGAAVSA DPKEPRPRPS GPFPGLERLF SRRRPSPFRP DPRLPRSPAP SPRGAAPEER RSTVEPLSPS ASSFPSPSAR STFPASSFPF PAFLPQNTIL EDLLAWADRG GGAQGLAAGE HVPGASGTGD NFEGTASFLA SSSLSPILPL LAQRRNGEAP SDARDGAKAL APVVAATFAP ERSAAAKLAS EADTASLATR GSALSQQQQQ DLLQLLLQNL PVAAATASAS LFPSGFPASR GVVESLAADL SAPSPAPASQ SKSGASGSVS TSSSSAPKGA AKTAAASASQ GGDKSSSSSA PKESAPAASV SSPKGSGHVL LTLVDRMKAA ESQGAHLKEK LNEVVLSLHS NQQQTLQQAV LLQLLLELVS FLYMRLSKFD VIVPRLPFLL DFSDSGAAAA ASFAAKRTTA LKSVSSASGA VVGDGDTPAG QTDTGPQQRA CSGNARDFSQ AKKTPGLDSD LLSFKGEGPS DGGGRDFFAS VSSLFGGNEE ADAARPSPCG GRRRWGQRKG TRPWSPKKRV RTTRKEGARC GGTRNGEGRE ESGGENACVE MDAESKTKDC VGFEEGEGDS TFGSDESEES FSTTALRFLT SYLVTAFSYL AILTEEACFV VVAPMLDAFA SSASTPQSLS TSQGRSFCLH MTETVTRFLG AAATLHQNIS SSLIECWRQA IKWDHAGGLS PGDGDPTEKG PNPLSLFLLL LFLHLAYGVF ILRLFSKRTN EAAARAEAAV RECEALRLSL ASLLRAPTGP EGHLGRGVTA GSLEMESSGR NTGQSSLRQE TGGEPGRFAG QENGAVERAL GLGDRKDAEN AEKRGIVEGE AEERTPLKTE TLPALAASDR PQEDHVSRSH PVSARRESRG YERFRLASLA LKRDTMKWER HRNDALEQRR QSFDLSYAAG SARSSLIHPL WQRKEKLTLV PLKNFPASPP VDGEKDPEPG VADLRQQRRT PARPSPGVDL SPAVPFEDQR NLSRRSGSSC SSRSGGAASF GDGRAWAPGG TEGRVAGPSS VSHGSQAPGN GTRLPLSDAT HRDATWREDA AVASPSEVKS GRVASLSSRE SWTGSVHSSV RVSPSPALPF FKHKRRTNTS GNLDGSRQSG WGLRAAWSPA EGSRGGDRGG TEEKKGEERK ASHRKTYGGQ SHGNRKAFKL QRHTVKGLSG SCGCVAAPGA TGLSLVSSDF AAEVSPAGGS LQAVAEATPA GEATRPVPVG SLCAAPPGDD AARRGDTGAD RTAGRVQPAW GVRGVAKSAN SESSDPRSEV KTERAEHANG LL // ID F0VR38_NEOCL Unreviewed; 824 AA. AC F0VR38; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBZ56185.1}; GN ORFNames=NCLIV_066110 {ECO:0000313|EMBL:CBZ56185.1}; OS Neospora caninum (strain Liverpool). OC Eukaryota; Alveolata; Apicomplexa; Conoidasida; Coccidia; OC Eucoccidiorida; Eimeriorina; Sarcocystidae; Neospora. OX NCBI_TaxID=572307 {ECO:0000313|EMBL:CBZ56185.1, ECO:0000313|Proteomes:UP000007494}; RN [1] {ECO:0000313|Proteomes:UP000007494} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Liverpool {ECO:0000313|Proteomes:UP000007494}; RX PubMed=22457617; DOI=10.1371/journal.ppat.1002567; RA Reid A.J., Vermont S.J., Cotton J.A., Harris D., Hill-Cawthorne G.A., RA Konen-Waisman S., Latham S.M., Mourier T., Norton R., Quail M.A., RA Sanders M., Shanmugam D., Sohal A., Wasmuth J.D., Brunk B., RA Grigg M.E., Howard J.C., Parkinson J., Roos D.S., Trees A.J., RA Berriman M., Pain A., Wastling J.M.; RT "Comparative genomics of the apicomplexan parasites Toxoplasma gondii RT and Neospora caninum: Coccidia differing in host range and RT transmission strategy."; RL PLoS Pathog. 8:e1002567-e1002567(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR823393; CBZ56185.1; -; Genomic_DNA. DR RefSeq; XP_003886211.1; XM_003886162.1. DR GeneID; 13445408; -. DR eggNOG; ENOG410J8NG; Eukaryota. DR eggNOG; ENOG410Y04X; LUCA. DR InParanoid; F0VR38; -. DR Proteomes; UP000007494; Chromosome XII. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007494}; KW Reference proteome {ECO:0000313|Proteomes:UP000007494}. FT COILED 149 169 {ECO:0000256|SAM:Coils}. FT COILED 443 477 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 824 AA; 91171 MW; 6075BAF427860A33 CRC64; MRGRGELLPV PSVSSAFLFD VSAMTEDLER EDPRLRAGGE SAQAGLSFSV HRVYEHRRLT RPVLLERESP LYRERGTTPL SSQDLTGSNK DDRKRGTEAQ TGTGSRRVSS QYGRSTDTST DAGRPSAHGV RAQTLSSNSA AHSTFDDAET REEDEEADSQ EEYEEDEDEE EDEDEYEEDD PSPLRSLPQA RRSPRPSSSL RFERQRLPSE GDPANPAPPA PRRVSLWGSL LMHARRSLPF LGCDGLDGPL PPSPQPPAYD SEGLHQIGAS LLSSLAAPHA LTKKLGKSVL EGQLHAFWQE DGGSGGTGTR RLFRDAEREV SESGNGLAVS AGKSSSSFSR FPASPADSRC RLRSPGEAPK SPFGPLKIRA RRVGAVDAAD VVSLLWTPLR RRAFQVLLFL SFLLAFLGIY RQFAWTAHLT ESDVYLSKEA SRGPDGRASS TSAWLLEKEI QKLRREVEEE RDALSDYRER MQMQLKKQIA HFAQQIQQEL QSQQRHDIEQ VLLKLREEGS LRQGAESQSE PQLLADVDGL QVRKRTVDKS CSHRTLLTQA FARKVEDLHT HLVIEEEQML DWALESLGGR IVVSETAPPL MKPPSWASSV SAAMWSLVHG EEAEEAAGAA GFWAHKKPAV MLQPDRHAGS CYAFRGERGN VAIQLPTAVH VTSVAIDNVA SDLFTASAPR RFRLWGYEDP RAASLDLSRT SFSRNSSEGA SVPAGRKCGL LGAGRLCDWL SVSGFAEGET RQELPGGEHA SSGVKESPLR VFLGEYEVTE RKKLVLFELK GKPARPLRKI VFEFLDNFGH PYTCIYRLRV HGEKAVLQPR SKKD // ID F0WKK0_9STRA Unreviewed; 596 AA. AC F0WKK0; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein AlNc14C134G7052 {ECO:0000313|EMBL:CCA21806.1}; GN Name=AlNc14C134G7052 {ECO:0000313|EMBL:CCA21806.1}; GN ORFNames=ALNC14_079490 {ECO:0000313|EMBL:CCA21806.1}; OS Albugo laibachii Nc14. OC Eukaryota; Stramenopiles; Oomycetes; Albuginales; Albuginaceae; OC Albugo. OX NCBI_TaxID=890382 {ECO:0000313|EMBL:CCA21806.1}; RN [1] {ECO:0000313|EMBL:CCA21806.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21750662; DOI=10.1371/journal.pbio.1001094; RA Kemen E., Gardiner A., Schultz-Larsen T., Kemen A.C., Balmuth A.L., RA Robert-Seilaniantz A., Bailey K., Holub E., Studholme D.J., RA Maclean D., Jones J.D.; RT "Gene gain and loss during evolution of obligate parasitism in the RT white rust pathogen of Arabidopsis thaliana."; RL PLoS Biol. 9:e1001094-e1001094(2011). RN [2] {ECO:0000313|EMBL:CCA21806.1} RP NUCLEOTIDE SEQUENCE. RA MacLean D.; RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR824179; CCA21806.1; -; Genomic_DNA. DR EnsemblProtists; CCA21806; CCA21806; ALNC14_079490. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 133 160 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 596 AA; 67290 MW; 61531DB242BF23AD CRC64; MSKVNESVLH TKQKWIAADT SCAPQERALV QGFGGLLGNL NQVQREYKES DAEEEVDEMG DDKEDDVTFF SSQQSSASKY QDFASLARDP NQSSVYERDD INGRNRTFGS ARVSFHSHYM KSLSKWIKWS LSVVVRLILV VLNVTWLFLP LICCAIAIFA PTYLTTAIRY ASRIPYLDQT TSAMSLQERG AMRSIMEEVL DSKLALIKSE IDAIRMSIDQ QDQTIASIRS VQEYMQSAQL EQQKQTNIMD KSSTLSTYVE RRISEGIQQV AAQLTHLEKT QDHFQAGMTS IVEETTLLVS EQKTMSEKLQ DWKSETEQDL LATIQSQFHK AEQQQFEQKL QDNTISSSST MYGSKVDSSS QNELISVIEQ TVQRIMEYKE DVDYASIANG ALVIYQERDV FPRKQSSVIS IFEALIHPFT KQPLYDQSFT TPSIFSPSLF SARDLLTEVV TPPWLSRHNG RPETALSETM EIGSCWAIGG HQANLSIRLS ERIIPTSVSI HHIMRQVARD VTAAPNRFRL WGIVGNLMTY AIDHIPLGSY AYNRSASSSQ TFKIVPAEEL TIALDGITLS IESNHGNQDY TCLYRFQVHG KPNPSQ // ID F0WKK1_9STRA Unreviewed; 597 AA. AC F0WKK1; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein AlNc14C134G7052 {ECO:0000313|EMBL:CCA21807.1}; GN Name=AlNc14C134G7052 {ECO:0000313|EMBL:CCA21807.1}; GN ORFNames=ALNC14_079500 {ECO:0000313|EMBL:CCA21807.1}; OS Albugo laibachii Nc14. OC Eukaryota; Stramenopiles; Oomycetes; Albuginales; Albuginaceae; OC Albugo. OX NCBI_TaxID=890382 {ECO:0000313|EMBL:CCA21807.1}; RN [1] {ECO:0000313|EMBL:CCA21807.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21750662; DOI=10.1371/journal.pbio.1001094; RA Kemen E., Gardiner A., Schultz-Larsen T., Kemen A.C., Balmuth A.L., RA Robert-Seilaniantz A., Bailey K., Holub E., Studholme D.J., RA Maclean D., Jones J.D.; RT "Gene gain and loss during evolution of obligate parasitism in the RT white rust pathogen of Arabidopsis thaliana."; RL PLoS Biol. 9:e1001094-e1001094(2011). RN [2] {ECO:0000313|EMBL:CCA21807.1} RP NUCLEOTIDE SEQUENCE. RA MacLean D.; RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR824179; CCA21807.1; -; Genomic_DNA. DR EnsemblProtists; CCA21807; CCA21807; ALNC14_079500. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 129 156 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 29 49 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 597 AA; 67868 MW; 5E00B63F810138A4 CRC64; MSKVNERTRS SPRFRRSSTL QRTNTTPVQR QLLGNLNQVQ REYKESDAEE EVDEMGDDKE DDVTFFSSQQ SSASKYQDFA SLARDPNQSS VYERDDINGR NRTFGSARVS FHSHYMKSLS KWIKWSLSVV VRLILVVLNV TWLFLPLICC AIAIFAPTYL TTAIRYASRI PYLDQTTSAM SLQERGAMRS IMEEVLDSKL ALIKSEIDAI RMSIDQQDQT IASIRSVQEY MQSAQLEQQK QTNIMDKSST LSTYVERRIS EGIQQVAAQL THLEKTQDHF QAGMTSIVRY LEVEETTLLV SEQKTMSEKL QDWKSETEQD LLATIQSQFH KAEQQQFEQK LQDNTISSSS TMYGSKVDSS SQNELISVIE QTVQRIMEYK EDVDYASIAN GALVIYQERD VFPRKQSSVI SIFEALIHPF TKQPLYDQSF TTPSIFSPSL FSARDLLTEV VTPPWLSRHN GRPETALSET MEIGSCWAIG GHQANLSIRL SERIIPTSVS IHHIMRQVAR DVTAAPNRFR LWGIVGNLMT YAIDHIPLGS YAYNRSASSS QTFKIVPAEE LTIALDGITL SIESNHGNQD YTCLYRFQVH GKPNPSQ // ID F0WKK2_9STRA Unreviewed; 592 AA. AC F0WKK2; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein AlNc14C134G7052 {ECO:0000313|EMBL:CCA21808.1}; GN Name=AlNc14C134G7052 {ECO:0000313|EMBL:CCA21808.1}; GN ORFNames=ALNC14_079510 {ECO:0000313|EMBL:CCA21808.1}; OS Albugo laibachii Nc14. OC Eukaryota; Stramenopiles; Oomycetes; Albuginales; Albuginaceae; OC Albugo. OX NCBI_TaxID=890382 {ECO:0000313|EMBL:CCA21808.1}; RN [1] {ECO:0000313|EMBL:CCA21808.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21750662; DOI=10.1371/journal.pbio.1001094; RA Kemen E., Gardiner A., Schultz-Larsen T., Kemen A.C., Balmuth A.L., RA Robert-Seilaniantz A., Bailey K., Holub E., Studholme D.J., RA Maclean D., Jones J.D.; RT "Gene gain and loss during evolution of obligate parasitism in the RT white rust pathogen of Arabidopsis thaliana."; RL PLoS Biol. 9:e1001094-e1001094(2011). RN [2] {ECO:0000313|EMBL:CCA21808.1} RP NUCLEOTIDE SEQUENCE. RA MacLean D.; RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR824179; CCA21808.1; -; Genomic_DNA. DR EnsemblProtists; CCA21808; CCA21808; ALNC14_079510. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 129 156 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 29 49 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 592 AA; 67208 MW; E7E5E2520FB574ED CRC64; MSKVNERTRS SPRFRRSSTL QRTNTTPVQR QLLGNLNQVQ REYKESDAEE EVDEMGDDKE DDVTFFSSQQ SSASKYQDFA SLARDPNQSS VYERDDINGR NRTFGSARVS FHSHYMKSLS KWIKWSLSVV VRLILVVLNV TWLFLPLICC AIAIFAPTYL TTAIRYASRI PYLDQTTSAM SLQERGAMRS IMEEVLDSKL ALIKSEIDAI RMSIDQQDQT IASIRSVQEY MQSAQLEQQK QTNIMDKSST LSTYVERRIS EGIQQVAAQL THLEKTQDHF QAGMTSIVEE TTLLVSEQKT MSEKLQDWKS ETEQDLLATI QSQFHKAEQQ QFEQKLQDNT ISSSSTMYGS KVDSSSQNEL ISVIEQTVQR IMEYKEDVDY ASIANGALVI YQERDVFPRK QSSVISIFEA LIHPFTKQPL YDQSFTTPSI FSPSLFSARD LLTEVVTPPW LSRHNGRPET ALSETMEIGS CWAIGGHQAN LSIRLSERII PTSVSIHHIM RQVARDVTAA PNRFRLWGIV GNLMTYAIDH IPLGSYAYNR SASSSQTFKI VPAEELTIAL DGITLSIESN HGNQDYTCLY RFQVHGKPNP SQ // ID F0WKK3_9STRA Unreviewed; 601 AA. AC F0WKK3; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein AlNc14C134G7052 {ECO:0000313|EMBL:CCA21809.1}; GN Name=AlNc14C134G7052 {ECO:0000313|EMBL:CCA21809.1}; GN ORFNames=ALNC14_079520 {ECO:0000313|EMBL:CCA21809.1}; OS Albugo laibachii Nc14. OC Eukaryota; Stramenopiles; Oomycetes; Albuginales; Albuginaceae; OC Albugo. OX NCBI_TaxID=890382 {ECO:0000313|EMBL:CCA21809.1}; RN [1] {ECO:0000313|EMBL:CCA21809.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21750662; DOI=10.1371/journal.pbio.1001094; RA Kemen E., Gardiner A., Schultz-Larsen T., Kemen A.C., Balmuth A.L., RA Robert-Seilaniantz A., Bailey K., Holub E., Studholme D.J., RA Maclean D., Jones J.D.; RT "Gene gain and loss during evolution of obligate parasitism in the RT white rust pathogen of Arabidopsis thaliana."; RL PLoS Biol. 9:e1001094-e1001094(2011). RN [2] {ECO:0000313|EMBL:CCA21809.1} RP NUCLEOTIDE SEQUENCE. RA MacLean D.; RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR824179; CCA21809.1; -; Genomic_DNA. DR EnsemblProtists; CCA21809; CCA21809; ALNC14_079520. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 133 160 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 601 AA; 67951 MW; DC6648ACFD878E5B CRC64; MSKVNESVLH TKQKWIAADT SCAPQERALV QGFGGLLGNL NQVQREYKES DAEEEVDEMG DDKEDDVTFF SSQQSSASKY QDFASLARDP NQSSVYERDD INGRNRTFGS ARVSFHSHYM KSLSKWIKWS LSVVVRLILV VLNVTWLFLP LICCAIAIFA PTYLTTAIRY ASRIPYLDQT TSAMSLQERG AMRSIMEEVL DSKLALIKSE IDAIRMSIDQ QDQTIASIRS VQEYMQSAQL EQQKQTNIMD KSSTLSTYVE RRISEGIQQV AAQLTHLEKT QDHFQAGMTS IVRYLEVEET TLLVSEQKTM SEKLQDWKSE TEQDLLATIQ SQFHKAEQQQ FEQKLQDNTI SSSSTMYGSK VDSSSQNELI SVIEQTVQRI MEYKEDVDYA SIANGALVIY QERDVFPRKQ SSVISIFEAL IHPFTKQPLY DQSFTTPSIF SPSLFSARDL LTEVVTPPWL SRHNGRPETA LSETMEIGSC WAIGGHQANL SIRLSERIIP TSVSIHHIMR QVARDVTAAP NRFRLWGIVG NLMTYAIDHI PLGSYAYNRS ASSSQTFKIV PAEELTIALD GITLSIESNH GNQDYTCLYR FQVHGKPNPS Q // ID F0WX93_9STRA Unreviewed; 639 AA. AC F0WX93; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein AlNc14C345G10848 {ECO:0000313|EMBL:CCA26085.1}; GN Name=AlNc14C345G10848 {ECO:0000313|EMBL:CCA26085.1}; GN ORFNames=ALNC14_122290 {ECO:0000313|EMBL:CCA26085.1}; OS Albugo laibachii Nc14. OC Eukaryota; Stramenopiles; Oomycetes; Albuginales; Albuginaceae; OC Albugo. OX NCBI_TaxID=890382 {ECO:0000313|EMBL:CCA26085.1}; RN [1] {ECO:0000313|EMBL:CCA26085.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21750662; DOI=10.1371/journal.pbio.1001094; RA Kemen E., Gardiner A., Schultz-Larsen T., Kemen A.C., Balmuth A.L., RA Robert-Seilaniantz A., Bailey K., Holub E., Studholme D.J., RA Maclean D., Jones J.D.; RT "Gene gain and loss during evolution of obligate parasitism in the RT white rust pathogen of Arabidopsis thaliana."; RL PLoS Biol. 9:e1001094-e1001094(2011). RN [2] {ECO:0000313|EMBL:CCA26085.1} RP NUCLEOTIDE SEQUENCE. RA MacLean D.; RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR824390; CCA26085.1; -; Genomic_DNA. DR EnsemblProtists; CCA26085; CCA26085; ALNC14_122290. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 22 {ECO:0000256|SAM:SignalP}. FT CHAIN 23 639 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003261802. FT TRANSMEM 513 535 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 292 312 {ECO:0000256|SAM:Coils}. FT COILED 444 475 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 639 AA; 72935 MW; DCFCF7BDE47AC2FF CRC64; MNKSWRLWFS IILCLCMRNV RTILTPDTEP ERLALSDSDK EGSDAKRVED DAQIDRSNDD SEPSSAYEIS ERGFKAVEMD EEEVVEISDT MNAEHNIGSE IYDQDTGSKR QNYASLDAGA TILDAANDVK SPTNLLVPDK DRYMLFPCEK SARWVLISLS EDVHAEAITI ANFEQFSSPV KDFLILGSLN YPTETWYVLG NFTAQQIQGE QDFPLNSKQH VRYIKIRLLN HYGAEYYCTL SQLKIYGKSF SQVISQLEKR IEEDVDLVPR THASEKDATE SFVNKGEKED EAELTQCTLS DKKREIEAAQ KESKLSLGNA TCAANEWIVP DIWNEETDFE SGKSVKLETP SGLAMDQDSA QENVQGSGAM DPSSLSYRKP NTTTLESSAT LTAVPTHQGQ GFGRLESIFI RITKKLHMLE MNQSVLVRSM EHFQQESVAF SELLQKHEEM LTSSLNELKA LIRAMNLKME QDENRSQFER IESQLAIDDL RNDVASLWSD IMLLREIIVT MKAGILCAIV LSVFVIGFFL LRILFRCLKK CKRRADLRDL FRRLNRGDYD AEELEYDVAS FDALRDVLFD GRGPRYVNRK QRFGTSWDDL AIERKTLRQK ILAEPTLYRF LYTRQANKSS STITDARIK // ID F0X8B2_GROCL Unreviewed; 1036 AA. AC F0X8B2; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Spindle pole body-associated protein sad1 {ECO:0000313|EMBL:EFX05689.1}; GN ORFNames=CMQ_3758 {ECO:0000313|EMBL:EFX05689.1}; OS Grosmannia clavigera (strain kw1407 / UAMH 11150) (Blue stain fungus) OS (Graphiocladiella clavigera). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Ophiostomatales; Ophiostomataceae; OC Grosmannia. OX NCBI_TaxID=655863 {ECO:0000313|Proteomes:UP000007796}; RN [1] {ECO:0000313|EMBL:EFX05689.1, ECO:0000313|Proteomes:UP000007796} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=kw1407 / UAMH 11150 {ECO:0000313|Proteomes:UP000007796}; RX PubMed=21262841; DOI=10.1073/pnas.1011289108; RA DiGuistini S., Wang Y., Liao N.Y., Taylor G., Tanguay P., Feau N., RA Henrissat B., Chan S.K., Hesse-Orce U., Alamouti S.M., Tsui C.K.M., RA Docking R.T., Levasseur A., Haridas S., Robertson G., Birol I., RA Holt R.A., Marra M.A., Hamelin R.C., Hirst M., Jones S.J.M., RA Bohlmann J., Breuil C.; RT "Genome and transcriptome analyses of the mountain pine beetle-fungal RT symbiont Grosmannia clavigera, a lodgepole pine pathogen."; RL Proc. Natl. Acad. Sci. U.S.A. 108:2504-2509(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL629735; EFX05689.1; -; Genomic_DNA. DR EnsemblFungi; EFX05689; EFX05689; CMQ_3758. DR InParanoid; F0X8B2; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000007796; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 3. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007796}; KW Reference proteome {ECO:0000313|Proteomes:UP000007796}. FT COILED 85 105 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1036 AA; 113876 MW; 2767F60BB776EF65 CRC64; MPPRRSNRMA SSRLFDGSSE AGGPVDGMTS PMRSAVTGRG MSPGPAKYSS AYGSPPTVMP TRQNAARQRY SLTSALSRVI DTVDDDNEND SRERAQREAE EQRRRQPSMP VPEPEPERER ERERERERER EPDTGPEPVA VPEWHTDPDE DDGYVAIDNS ERREETDNSV DFVSMPGDRG RPKDGHKSMT PERQGDTATH NFAEEDSIFA QARIFTPVLP SRTGVVARTS HALQSLLNSA RRLSIRRDES DDEAVRQAYP HLNFIHPRSD QEPEPEPTLS EKPVDAAVAS DPISTDSPGD PEENQQQPQP ISRTDIVRDL RPRTGTPGPI VRTRKTVPRN MRGSTSDELG DYEVEATAAA RSNMARRRQG RKQTDPQAAN QDDDDGDKSR QAWWQRMVAA MWHFTMHVGS ISKAMSLAIT LLVLGAVAVR VLVISTASPE LYEGYSMDKR LHWYGSDWRS NLGQLVPYAL VHPLGALSDE EFARMNGLML GHASELAQMR HAQSLQGDAL DRIGRVLPAV VHMELDKRGR PVIAQEFWHA LKDSIQADQD IFTLIQREDG APVISDLQWT ALKHQLEASS LLPDPKAGSM ALSVHDVEEI AETTMAKSWE AWLRRNQNKV RDILTGMGGS AWSPDAQQQQ QQQQQQQQTD LSLERLTDKQ VAKLAQKMMG SSAAREVIVS RQEFLQILQN NFVEHRVEIK GELAELEARV LEVAKAAAAA VPSSSGADAG TAPSSRESLP GGSMSRKEVT QLVDQLVRKG ISDAQLEAMA RGKIRADWNA NLINQVNFFS RSTGATFNHH FSSPSYQPAL PSGWKRLWGG GRSDGGGSGG SGGGGGSSGG SDSSDDSSMA GRPMPGSQRG SVVFEPWEQD GDCWCAAQGK SRQGEGSGSR PARTQAASVA VKLGLRVVPQ HVVVEHISAG ATLDPGATPK AIEVWAQITD YAQQRPLQDW SLARFPDTDE RSPLFGWGFV KIGAFSYEQA ATAGDGSVQV FRLPPELETL DAVTDHVIVR ALSNYGADHT CFYRLRLYGQ VRPDMV // ID F0Y686_AURAN Unreviewed; 114 AA. AC F0Y686; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGB09619.1}; DE Flags: Fragment; GN ORFNames=AURANDRAFT_24706 {ECO:0000313|EMBL:EGB09619.1}; OS Aureococcus anophagefferens (Harmful bloom alga). OC Eukaryota; Stramenopiles; Pelagophyceae; Pelagomonadales; Aureococcus. OX NCBI_TaxID=44056 {ECO:0000313|Proteomes:UP000002729}; RN [1] {ECO:0000313|EMBL:EGB09619.1, ECO:0000313|Proteomes:UP000002729} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP 1984 {ECO:0000313|Proteomes:UP000002729}; RX PubMed=21368207; DOI=10.1073/pnas.1016106108; RA Gobler C.J., Berry D.L., Dyhrman S.T., Wilhelm S.W., Salamov A., RA Lobanov A.V., Zhang Y., Collier J.L., Wurch L.L., Kustka A.B., RA Dill B.D., Shah M., VerBerkmoes N.C., Kuo A., Terry A., Pangilinan J., RA Lindquist E.A., Lucas S., Paulsen I.T., Hattenrath-Lehmann T.K., RA Talmage S.C., Walker E.A., Koch F., Burson A.M., Marcoval M.A., RA Tang Y.Z., Lecleir G.R., Coyne K.J., Berg G.M., Bertrand E.M., RA Saito M.A., Gladyshev V.N., Grigoriev I.V.; RT "Niche of harmful alga Aureococcus anophagefferens revealed through RT ecogenomics."; RL Proc. Natl. Acad. Sci. U.S.A. 108:4352-4357(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL833125; EGB09619.1; -; Genomic_DNA. DR RefSeq; XP_009035670.1; XM_009037422.1. DR EnsemblProtists; EGB09619; EGB09619; AURANDRAFT_24706. DR GeneID; 20219949; -. DR InParanoid; F0Y686; -. DR Proteomes; UP000002729; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002729}; KW Reference proteome {ECO:0000313|Proteomes:UP000002729}. FT NON_TER 1 1 {ECO:0000313|EMBL:EGB09619.1}. SQ SEQUENCE 114 AA; 12008 MW; 0753499FCCA77425 CRC64; PGRCWAFAGS AGEVTIALAA PAAPATFVVE HVPRLLTSRA SAAPKAFSVT GYEALDGKHA IDLGSFVFDL DGPPLQSFPA TRPPHAPSVN YVKLAIESNH GFGPYTCLYR FAVH // ID F0Y6N0_AURAN Unreviewed; 466 AA. AC F0Y6N0; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGB09046.1}; GN ORFNames=AURANDRAFT_63651 {ECO:0000313|EMBL:EGB09046.1}; OS Aureococcus anophagefferens (Harmful bloom alga). OC Eukaryota; Stramenopiles; Pelagophyceae; Pelagomonadales; Aureococcus. OX NCBI_TaxID=44056 {ECO:0000313|Proteomes:UP000002729}; RN [1] {ECO:0000313|EMBL:EGB09046.1, ECO:0000313|Proteomes:UP000002729} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CCMP 1984 {ECO:0000313|Proteomes:UP000002729}; RX PubMed=21368207; DOI=10.1073/pnas.1016106108; RA Gobler C.J., Berry D.L., Dyhrman S.T., Wilhelm S.W., Salamov A., RA Lobanov A.V., Zhang Y., Collier J.L., Wurch L.L., Kustka A.B., RA Dill B.D., Shah M., VerBerkmoes N.C., Kuo A., Terry A., Pangilinan J., RA Lindquist E.A., Lucas S., Paulsen I.T., Hattenrath-Lehmann T.K., RA Talmage S.C., Walker E.A., Koch F., Burson A.M., Marcoval M.A., RA Tang Y.Z., Lecleir G.R., Coyne K.J., Berg G.M., Bertrand E.M., RA Saito M.A., Gladyshev V.N., Grigoriev I.V.; RT "Niche of harmful alga Aureococcus anophagefferens revealed through RT ecogenomics."; RL Proc. Natl. Acad. Sci. U.S.A. 108:4352-4357(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL833126; EGB09046.1; -; Genomic_DNA. DR RefSeq; XP_009036172.1; XM_009037924.1. DR EnsemblProtists; EGB09046; EGB09046; AURANDRAFT_63651. DR GeneID; 20224429; -. DR InParanoid; F0Y6N0; -. DR Proteomes; UP000002729; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002729}; KW Reference proteome {ECO:0000313|Proteomes:UP000002729}. FT COILED 71 98 {ECO:0000256|SAM:Coils}. FT COILED 183 217 {ECO:0000256|SAM:Coils}. FT COILED 228 248 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 466 AA; 48054 MW; 353C63916E80293A CRC64; MLSNPAGREA QLKAQVRQSW AVCGILLVVL CGRELSGSGR TCSLSEADLE RGEALVKSEV AWLQQHTARA SEEADETLAA LAEELAQVEA LSNSLVARGE ADAKTLAEAL ERAPPGPPPD ARTCAAAAKD AAAASKASEK ARLDADKAVE TAAAATATAT AKTAQAEAVA QRDETLAALE LSLKALGESA EASKARLAEA QAALAASTAQ AREARDLLKA APGAQLASQQ LERAVAAVEK DVAAALAAAD AARSSSGAGC TAAAVERAVD TAVAVFFEAD RVAKFDYALR ATGASVVQGL TSEPYTPPGS VVPTKVWHAL GRDAGVGRAE DAISQRVGFG SCFAFEGSRG SLTVQLSSRV VPTAFTLEHI HGALCNPLHN ANCSSAPRTF TVLGRRAGAP DATPVDLGSF EYDAADAAKT VQTFAALNDD RQAFDLATLQ VTSNYGHPDY TCVYRFRVHG DAPETV // ID F0ZA59_DICPU Unreviewed; 1068 AA. AC F0ZA59; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGC39142.1}; GN ORFNames=DICPUDRAFT_148071 {ECO:0000313|EMBL:EGC39142.1}; OS Dictyostelium purpureum (Slime mold). OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. OX NCBI_TaxID=5786 {ECO:0000313|Proteomes:UP000001064}; RN [1] {ECO:0000313|Proteomes:UP000001064} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=QSDP1 {ECO:0000313|Proteomes:UP000001064}; RX PubMed=21356102; DOI=10.1186/gb-2011-12-2-r20; RG US DOE Joint Genome Institute (JGI-PGF); RA Sucgang R., Kuo A., Tian X., Salerno W., Parikh A., Feasley C.L., RA Dalin E., Tu H., Huang E., Barry K., Lindquist E., Shapiro H., RA Bruce D., Schmutz J., Salamov A., Fey P., Gaudet P., Anjard C., RA Babu M.M., Basu S., Bushmanova Y., van der Wel H., Katoh-Kurasawa M., RA Dinh C., Coutinho P.M., Saito T., Elias M., Schaap P., Kay R.R., RA Henrissat B., Eichinger L., Rivero F., Putnam N.H., West C.M., RA Loomis W.F., Chisholm R.L., Shaulsky G., Strassmann J.E., RA Queller D.C., Kuspa A., Grigoriev I.V.; RT "Comparative genomics of the social amoebae Dictyostelium discoideum RT and Dictyostelium purpureum."; RL Genome Biol. 12:R20.1-R20.23(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL870962; EGC39142.1; -; Genomic_DNA. DR RefSeq; XP_003284288.1; XM_003284240.1. DR STRING; 5786.XP_003284288.1; -. DR EnsemblProtists; EGC39142; EGC39142; DICPUDRAFT_148071. DR GeneID; 10510232; -. DR KEGG; dpp:DICPUDRAFT_148071; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; F0ZA59; -. DR OMA; SHYGDQL; -. DR Proteomes; UP000001064; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001064}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001064}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 1068 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003264976. FT TRANSMEM 798 819 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 148 357 {ECO:0000256|SAM:Coils}. FT COILED 362 409 {ECO:0000256|SAM:Coils}. FT COILED 771 791 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1068 AA; 125429 MW; 1C8D1C31D2A636EA CRC64; MCTIESILTL HLILGSLLSN NLSVLANNNN INDNRNNNDN INNNNNNNDI NSNNGNINEN NIQFINKNNN NILYENDNII NEQPLQQQQQ QQQQQQQQQQ ISLQSQDLIQ QNEPVLTPKE EKEQVEEIVV EKDKNNDDKE IHNQSPIYQT KENNVEIDEK EIEEKKRLEV EENEKLEELK KEKERIEIEE RLEIERQKQI EIEKEKERIE KEEKEEKEKE RQENEKREGE EKERLEKEKQ EKERLEKERL EIEKQEKERL EIEKQEKERL EKEKQEKERL EIEKAEKEDN EKQEKEKLEK ERVEKERIEK ENIEKEKIEK ENIEKENTEK ENIEIKKLEE EEQIKNDKLK LEEKERLERE ELFKLAVERE RLEKEEKKNE QESFEKERLE HQQENNNNNN NNNNNNNNNN NNNNINYNNN DGNGQFIKQP IYPQKEINKP IVLTPNDLPD KFNYASSDCG ANVVASNKEA REISSILQSS KDRYLLSVCD TKKWFVVELC EEIGIQIIEM ANFEFFSSMF KDFSVYGTNR FPSDEWNFLG NFTGENIRKA QYFVLKEKSW YKYVKISITS HYGDQLYCPV STLKVYGSTM VDDLKNEIYV GVNEGNVNNN NNNNNNNNNY NNNNKEKMYR SPSLPSEKKT IKNHNSIYEE IMGKKISINN FVVPEFHYIP VSGDNGEEIN IDSNFIFQQL QQQQQQQQQQ SQHRSQQNIF KILVDRIKNL ESNSFTLKQL FESLNQHYSK ILLDLEDDTK KVFQSLLSKS EKAQKDLDIF KKQSMNDIIF LKNKIKEMEE TRKTENTLFI SMLIGVIALL FFIFLFKIFS SFSRNGNFQT TSPSLVNSPL FNGANNNDGN SGYLQQQQQL LQYHQQLQQQ RDMEQQLKHY QQSPVGKQNP SPFNHQRRNS SPGAQLLNFS PIVFPTLVPQ DDFNYNYSSN NSNIKIENDI DEASSNSGNS GNSSPNQGIN GTNHSPNSPI HNIKNPPPLQ PQQHSPHHGL SRSFTFSTNH LKKLLVKQQS ESVLNLVNNN NSNGAVNTTN QNHHNSSIPA SNSSDNLHGM GTRMKNSKKQ RRKSNIYN // ID F1A1L1_DICPU Unreviewed; 855 AA. AC F1A1L1; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGC29917.1}; GN ORFNames=DICPUDRAFT_158427 {ECO:0000313|EMBL:EGC29917.1}; OS Dictyostelium purpureum (Slime mold). OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. OX NCBI_TaxID=5786 {ECO:0000313|Proteomes:UP000001064}; RN [1] {ECO:0000313|Proteomes:UP000001064} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=QSDP1 {ECO:0000313|Proteomes:UP000001064}; RX PubMed=21356102; DOI=10.1186/gb-2011-12-2-r20; RG US DOE Joint Genome Institute (JGI-PGF); RA Sucgang R., Kuo A., Tian X., Salerno W., Parikh A., Feasley C.L., RA Dalin E., Tu H., Huang E., Barry K., Lindquist E., Shapiro H., RA Bruce D., Schmutz J., Salamov A., Fey P., Gaudet P., Anjard C., RA Babu M.M., Basu S., Bushmanova Y., van der Wel H., Katoh-Kurasawa M., RA Dinh C., Coutinho P.M., Saito T., Elias M., Schaap P., Kay R.R., RA Henrissat B., Eichinger L., Rivero F., Putnam N.H., West C.M., RA Loomis W.F., Chisholm R.L., Shaulsky G., Strassmann J.E., RA Queller D.C., Kuspa A., Grigoriev I.V.; RT "Comparative genomics of the social amoebae Dictyostelium discoideum RT and Dictyostelium purpureum."; RL Genome Biol. 12:R20.1-R20.23(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL871377; EGC29917.1; -; Genomic_DNA. DR RefSeq; XP_003293558.1; XM_003293510.1. DR EnsemblProtists; EGC29917; EGC29917; DICPUDRAFT_158427. DR GeneID; 10504879; -. DR KEGG; dpp:DICPUDRAFT_158427; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; F1A1L1; -. DR KO; K19347; -. DR OMA; VYSADKI; -. DR Proteomes; UP000001064; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 3. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001064}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001064}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 255 274 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 131 162 {ECO:0000256|SAM:Coils}. FT COILED 182 210 {ECO:0000256|SAM:Coils}. FT COILED 317 344 {ECO:0000256|SAM:Coils}. FT COILED 370 404 {ECO:0000256|SAM:Coils}. FT COILED 458 478 {ECO:0000256|SAM:Coils}. FT COILED 480 500 {ECO:0000256|SAM:Coils}. FT COILED 502 522 {ECO:0000256|SAM:Coils}. FT COILED 835 855 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 855 AA; 98981 MW; 9A0A0586F9BC43F8 CRC64; MSGDYKPNYK SSPGRKRAVI ANKDQPSIYR YQTPSQVNYS NTSGIANNHN SNILNDVTKY YNNQSNIRNR TTTTTTTTQQ TNVYNNRNTN NNIHMNDSDI EDDEDITYSD NSRNLDYSNE DYNSGNNPRK LKESLLLLQQ QQQQQQQLLQ QQQQQQQQLN HQQQGQNLLQ NFQAPQSQTT TQQQQQQQLQ QQQQNKANTQ QRNLTQQQQN RNQSKNIFQK LHDLYYNSIN TISTVWPFDT TESNRNNPRT KIKTFVWIVI IASMAVLGLI GFFATRYHGV NIYLPNSTSP SKTIENTITK EQLIPMLEDY FKQSKIFSKL ELKINELSNT NNKENQEIIR ELKDDINLIK LSSMDEDRVN KLVVKMIDHY NDNENNKKEL RQVLEKAINE FNQLEKEKLE RYENKASESL GIYESKASES LHIYESKAGE SLNKLSTQSN DQIKLFTTQL EGTVGKKLEE LEKKSKQQLS QLGQESKALI DQQSEQLKQY QLELDTNGKE KVKQLLNEYQ SLEKSLKEFT GKLENELSSS IQTLISDQKK SITSEFQKQT SDQSNMINSQ SNHLTAQYTQ ITNQFSKIQN FIESNPSIES IHKTISTLEG IRELIDDILE VYSADKIAKV DYALLESGSS IEYFATHYKV SKSYPTNRQE QHQQQQQEKT KNIINELSNT ATDLFNIASN WILPKTKVNE AHTILEPTVN TGSCWAFYGQ QGTVVIRLSK RIKVNEVSME HINPLISHHI ESAPKQFQVI GLVNSTDIGT DLGTFTYNTT INRHIQTFKV VETEEEFSHI VLNVLSNYGY KYTCIYRFRV HGIQVEHPEL EQLVDIEHQH SDQLIKDIEK SIEIENQQKQ NSQDL // ID F1KPL0_ASCSU Unreviewed; 2973 AA. AC F1KPL0; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:ADY39814.1}; OS Ascaris suum (Pig roundworm) (Ascaris lumbricoides). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. OX NCBI_TaxID=6253 {ECO:0000313|EMBL:ADY39814.1}; RN [1] {ECO:0000313|EMBL:ADY39814.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21685128; DOI=10.1101/gr.121426.111; RA Wang J., Czech B., Crunk A., Wallace A., Mitreva M., Hannon G.J., RA Davis R.E.; RT "Deep small RNA sequencing from the nematode Ascaris reveals RT conservation, functional diversification, and novel developmental RT profiles."; RL Genome Res. 21:1462-1477(2011). CC -!- SIMILARITY: Contains 2 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JI163839; ADY39814.1; -; mRNA. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 2. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 2: Evidence at transcript level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:ADY39814.1}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2973 AA; 325234 MW; F6AADA71A809D6EB CRC64; MECSRRWWWW RRGLRWSQFF IFVTSLRSTR CVQLLQTSVM DGVDPDTLLE WLQTGVGDER DIQMMALEQL CMLLLMSDNI DRCFESCPPR TFLPALCKIF LDESATENVL EVTARAITYY LDVSNECTRR ITQVDGAVKA ICNRLAAAEM TNRTSKDLAE QCVKLLEHIC QRETSAVYDA GGLQCMLSLV TQYGQSVHKD TMHSAMSVVT RLCSKMEPAD AIMPECSASL GALLGHDDPK VSECALRCFA ALTDRFIRKS LDPVEMARHG NLVEHLLSAL VPLPPVSSSS SISANAQLPA VASSASILSA DSAASSTPSA GHRSASFTSI VISLLSNLCR GSSAVTEQVI SSSLLIPALR AVLASKDERC VMDTLRLCDL LVVLLCEGRH ALPRSTACST VVSRSENSGN VFERSHRHLI DAIRQRDTDA LIDAVESGQV DANFTDDVGQ TLLNWSSAFG TVDMVTYLCD KGADVNKGQR SSSLHYAACF GRPDVVKILL RNGANPDLRD EEGKTALDKA RERSEEGHQQ VAAILESPSS YMHGDEQAKC DRGRPEESRS TANEAMDPML VRSLLEQLLP LLCDVFQRSL GASVRRSTVS LLRKTTQYMS PESLSSLVTS SADGDDISME CVPATGPKLA ENIVNVIVAV LEHEDDIEGH EHVLHLMKAL FAKEVDFWLE QLIRVGVFEK VEAIATQPSQ QPKGGMVTTM VVDESGNVSS TSSMNVMQSS LVKAEHRPQD EASANRASEI GNGSDETVQS QSSATWGTNR SNSSTPSLAA GSLSVSDDSA PAERDNSVML NVPIHVEEEE ESCTAPSLAK PETAAVQAVV DTQRTPVSSK TANATGVARK NVRGAQMPTT DEMPTALEDR WEIVEGRSYR WKDWRLVKSR DSLFIWCDAV AMEFSDGSNG WFRFMLDGQL STMYSSGSPE SGADSAETRG EFVDKLTKAR SAVPPGSPLN PIFTLPNSSK SIDVGNWVLA SPKVGELTVT NRDGNQQRML IMEDLPGFIF ESNRQTKHCF QAESTLGLDF VTGWAARGGG RRLRFRAEAQ KAKLQELAKE LWENYMKEAR SRPRDALVEL QKASSSLQEY CASSKGGNVC SNAIRDQLQV CLRCMHEAVT NERVLSTFEL SISGLVSALL SLLEIVRDND AHCDVAVAFR KVFADGRSLS ALVRKMVLVL ETVEKFPQYL YDTPGGSSFG LQLLSRRIKL KLEHLNAKSS DQKQLLDRTG RTMKTEPLTT VGQLKNYILR MVAKQWYDRD RETFAFVRQI KDARKNSSKI SFVYSSDFDD QGLIYWLGTN GKTFSHWTNP ASVNVVYVTS SDGARQPYGH PEDILSRDVS ALNCHTSDDK NAHFTIDLGV YIYPTSYTLR HARGYGRSAL RNWLFQGSRN GRTWDVLLVH ENDAALSDPG STATWPVVCA EGKGPYRYLR IAQNGKNASN QTYYLSLSGF EVYGNVVDVV VDDLKPCEEK QSAAVLKGRI KRNSAGGKEK ESGASEGSGC TPSESAYSHL PGGAANNKVT SGAVLSDMTS ASSALKTRIL RYRRGSRGSR LLVGGRGLPT NPPSDAATMC SIGCRVTRGP DWKWENQGSG TLGTVISPVE DGWVDVQWDD KTSNSYRFGA DGKFDVEVKP GDAAAVSAVQ VYALRRVPRG LVASRREHTG NVAQISSQNP HQFAVFGGFT PFAGRQLPPA PRNSAEEGCP QRGKWTPSGL PFSRFHQNAT SRTNVPFADA AGTSREAKAA SASPAASSTI AQKSMSTTNL LDGSEGEKRP SVASTNQAAS AESLQHQTPS LENLLARSRI FDERIPEVVA DDASLQETPL GSSRAMDTDQ ESVSTNNNPG LDTSVESFDA AITLGDQSAG TNVSVGARNC STRAGGSQPS LSGRDGTRAG GSQPSLSGRD GTPFEGIEQR DTRSSPRESL GASPSVLIGP DGEPLVSIEQ VYANNLPNLS VSAPDLVLLR RRQQGAAAKD SDPRARAAIE QPDGEEQVAN INDDSIAAHP RTAKRSVASR NTAINEITNA VDDAIRQYLM GEDHSESLAA AVAQAVHVAE QGDDSTERLL ALAPTPGAYN EFLDAYADIL GAGEEESGAA GNEKKSTAQS TGAPASSDAS VPGGANSGAA SAVHRNVQSA GGGIRSRLGS YADVLRTMMQ QVIDSGASLN ALELEELEDD IYEEDMTEEG NEEDCDDEYV NGLSVEALAQ AAAALRRQSS TGSTSSNGEL KLNWKQIVMG EAGRLLGERT LRSSNSTEQK SPNGGLNRNW DDEFVLKRQF AALIPAFDPR PGRTNVNQTQ DVELPPPTSD AQRPESSSSR ASSEQPRRDS SDSYVEHNLR LHIRGPNLAN INNVTVELDD DDASVFYFLQ QLGQNVDWGQ KTERTRRVWE PTYTLIYEEA SGEKSSLETV NDCIDDANHS PRIVLDTLAV LANLHKMGEA IGELEMSAEM FVSEKLTQKL MQELADPLVV AARALPRWCD HLIYEYPCLF SVDTRTHYLH ATAFGTSRAI VWLQTRRDQI LEQSRGATSA AAMSNLAGAR RDDHYPEFRV GRIKHERIKV PRNGEQLFEY ANRVMNFHAS RKSVLELEYV GEEGTGLGPT LEFYALVAAE FQRKSLGMWV CDDADEDQLK LEEGELDLGE GVKPPGYYVR RAGGLFPAPL PRFTDESRKA AELFRFLGIF LAKVLQDGRL VDLPLSRPFL KLLVAPSVVE NKRAELKDVL TLDDLEEVSP MKGRVLKELS SLVHRKRAVE SDEHTDRDAK RRRIENLTLN INGNECRVED LSLSFSVNPP SSVFTYGEME LIEGGSNMDV TNDNVDLYID KCTEFYLNTG ICDQVNAFRE GFDLVFPLRS LRMFAPEEVQ TLLSGEQCPE WTREDVINYT EPKLGYTRES AGFLRFVDVL VGMSSSERKS FLQFTTGCSS LPPGGLANLH PRLTIVRKVD SGDGSYPSVN TCVHYLKLPD YSSTEIMRER LLMATNEKGF YLN // ID F1KPL4_ASCSU Unreviewed; 2569 AA. AC F1KPL4; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:ADY39818.1}; OS Ascaris suum (Pig roundworm) (Ascaris lumbricoides). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. OX NCBI_TaxID=6253 {ECO:0000313|EMBL:ADY39818.1}; RN [1] {ECO:0000313|EMBL:ADY39818.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21685128; DOI=10.1101/gr.121426.111; RA Wang J., Czech B., Crunk A., Wallace A., Mitreva M., Hannon G.J., RA Davis R.E.; RT "Deep small RNA sequencing from the nematode Ascaris reveals RT conservation, functional diversification, and novel developmental RT profiles."; RL Genome Res. 21:1462-1477(2011). CC -!- SIMILARITY: Contains 2 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JI163844; ADY39818.1; -; mRNA. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 2. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 2: Evidence at transcript level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:ADY39818.1}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2569 AA; 281311 MW; B2854D8D285A5C79 CRC64; MHCRGVLLAR LVVSRNENSG NVFERSHRHL IDAIRQRDTD ALIDAVESGQ VDANFTDDVG QTLLNWSSAF GTVDMVTYLC DKGADVNKGQ RSSSLHYAAC FGRPDVVKIL LRNGANPDLR DEEGKTALDK ARERSEEGHQ QVAAILESPS SYMHGDEQAK CDRGRPEESR STANEAMDPM LVRSLLEQLL PLLCDVFQRS LGASVRRSTV SLLRKTTQYM SPESLSSLVT SSADGDDISM ECVPATGPKL AENIVNVIVA VLEHEDDIEG HEHVLHLMKA LFAKEVDFWL EQLIRVGVFE KVEAIATQPL QQPKGGMVTT MVVDESGNVS STSSMNVMQS SLVKAEHRPQ DEASANRASE IGNGSDETVQ SQSSATWGTN RSNSSTPSLA AGSLSVSDDS APAERDNSVM LNVPIHVEEE EESCTAPSLA KPETAAVQAV ADTQRTPVSS KTANATGVAR KNVRGAQMPT TDEMPTALED RWEIVEGRSY RWKDWRLVKS RDSLFIWCDA VAMEFSDGSN GWFRFMLDGQ LSTMYSSGSP ESGADSAETR GEFVDKLTKA RSAVPPGSPL NPIFTLPNSS KSIDVGNWVL ASPKVGELTV TNRDGNQQRM LIMEDLPGFI FESNRQTKHC FQAESTLGLD FVTGWAARGG GRRLRFRAEA QKAKLQELAK ELWENYMKEA RSRPRDALVE LQKASSSLQE YCASSKGGNV CSNAIRDQLQ VCLRCMHEAV TNERVLSTFE LSISGLVSAL LSLLEIVRDN DAHCDVAVAF RKVFADGRSL SALVRKMVLV LETVEKFPQY LYDTPGGSSF GLQLLSRRIK LKLEHLNAKS SDQKQLLDRT GRTMKTEPLT TVGQLKNYIL RMVAKQWYDR DRETFAFVRQ IKDARKNSSK ISFVYSSDFD DQGLIYWLGT NGKTFSHWTN PASVNVVYVT SSDGARQPYG HPEDILSRDV SALNCHTSDD KNAHFTIDLG VYIYPTSYTL RHARGYGRSA LRNWLFQGSR NGRTWDVLLV HENDAALSDP GSTATWPVVC AEGKGPYRYL RIAQNGKNAS NQTYYLSLSG FEVYGNVVDV VVDDLKPCEE KQSAAVLKGR IKRNSAGGKE KESGASEGSG CTPSESAYSH LPGGAANNKV TSGAVLSDMT SASSALKTRI LRYRRGSRGS RLLVGGRGLP TNPPSDAATM CSIGCRVTRG PDWKWENQGS GTLGTVISPV EDGWVDVQWD DKTSNSYRFG ADGKFDVEVK PGDAAAVSAV QVYALRRVPR GLVASRREHT GNVAQISSQN PHQFAVFGGF TPFAGRQLPP APRNSAEEGC PQRGKWTPSG LPFSRFHQNA TSRTNVPFAD AAGTSREAKA ASASPAASST IAQKSMSTTN LLDGSEGEKR PSVASTNQAA SAESLQHQTP SLENLLARSR IFDERIPEVV ADDASLQETP LGSSRAMDTD QESVSTNNNP GLDTSVESFD AAITLGDQSA GTNVSVGARN CSTRAGGSQP SLSGRDGTPF EGIEQRDTRS SPRESLGASP SVLIGPDGEP LVSIEQVYAN NLPNLSVSAP DLVLLRRRQQ GAAAKDSDPR ARAAIEQPDG EEQVANINDD SIAAHPRTAK RSVASRNTAI NEITNAVDDA IRQYLMGEDH SESLAAAVAQ AVHVAEQGDD STERLLALAP TPGAYNEFLD AYADILGAGE EESGAAGNEK KSTAQSTGAP ASSDASVPGG ANSGAASAVH RNVQSAGGGI RSRLGSYADV LRTMMQQVID SGASLNALEL EELEDDIYEE DMTEEGNEED CDDEYVNGLS VEALAQAAAA LRRQSSTGST SSNGELKLNW KQIVMGEAGR LLGERTLRSS NSTEQKSPNG GLNRNWDDEF VLKRQFAALI PAFDPRPGRT NVNQTQDVEL PPPTSDAQRP ESSSSRASSE QPRRDSSDSY VEHNLRLHIR GPNLANINNV TVELDDDDAS VFYFLQQLGQ NVDWGQKTER TRRVWEPTYT LIYEEASGEK SSLETVNDCI DDANHSPRIV LDTLAVLANL HKMGEAIGEL EMSAEMFVSE KLTQKLMQEL ADPLVVAARA LPRWCDHLIY EYPCLFSVDT RTHYLHATAF GTSRAIVWLQ TRRDQILEQS RGATSAAAMS NLAGARRDDH YPEFRVGRIK HERIKVPRNG EQLFEYANRV MNFHASRKSV LELEYVGEEG TGLGPTLEFY ALVAAEFQRK SLGMWVCDDA DEDQLKLEEG ELDLGEGVKP PGYYVRRAGG LFPAPLPRFT DESRKAAELF RFLGIFLAKV LQDGRLVDLP LSRPFLKLLV APSVVENKRA ELKDVLTLDD LEEVSPMKGR VLKELSSLVH RKRAVESDEH TDRDAKRRRI ENLTLNINGN ECRVEDLSLS FSVNPPSSVF TYGEMELIEG GSNMDVTNDN VDLYIDKCTE FYLNTGICDQ VNAFREGFDL VFPLRSLRMF APEEVQTLLS GEQCPEWTRE DVINYTEPKL GYTRESAGFL RFVDVLVGMS SSERKSFLQF TTGCSSLPPG GLANLHPRLT IVRKVDSGDG SYPSVNTCVH YLKLPDYSST EIMRERLLMA TNEKGFYLN // ID F1KS51_ASCSU Unreviewed; 1269 AA. AC F1KS51; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Nuclear migration and anchoring protein unc-84 {ECO:0000313|EMBL:ADY40705.1}; OS Ascaris suum (Pig roundworm) (Ascaris lumbricoides). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. OX NCBI_TaxID=6253 {ECO:0000313|EMBL:ADY40705.1}; RN [1] {ECO:0000313|EMBL:ADY40705.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21685128; DOI=10.1101/gr.121426.111; RA Wang J., Czech B., Crunk A., Wallace A., Mitreva M., Hannon G.J., RA Davis R.E.; RT "Deep small RNA sequencing from the nematode Ascaris reveals RT conservation, functional diversification, and novel developmental RT profiles."; RL Genome Res. 21:1462-1477(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JI164978; ADY40705.1; -; mRNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 355 377 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 419 445 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 472 497 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 562 579 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 779 813 {ECO:0000256|SAM:Coils}. FT COILED 906 926 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1269 AA; 144105 MW; DA5D0DC4EEF2B255 CRC64; MGLFRREPTI GRSNLTKSDL ERLLDLNSEY RTDYTYVSSK SYRLDGSVRY ENPRMFRLPM DSFRRKRAFE KAWKSGYIHY FSLCLSDFME RLFANPAKVA GNIAGSVVYF IMQIFWTIAS LLSLNIHKEH HDQQYSGYGS KVSHRHSFEH ERRFSWSSDE DARLGEIVRR RSPALHTPTF ANENYIESPT VYPQKILSKP SIGWRALRRI ARLITYIVTI GFYPTDFGEE SGRQKRSTYV QMLSEGGFLN APRTGSVGLS REDEKTPWID DDEEEALSDN FAIRESTQIS SAASPPTYKE PWDEESREGE VASPITTPFG PSEVRHGMGR MVARSSQPVV ESTKGILETV AVETFAFIFA VCTLPVTLVR AAIGFFSPRR QVVYNLRSRS VRSHDGESAP IPFVERTIHR FSKACTATMM TLVSIASSVL LAPVMATTYL VSIALGKTST STSPIARATT PSSKSDYMVG RAYGIVAECL YAFITALLSF FVFLFSLPIR MAHYIRENAP LFFATLMSRS PLSQQRSVHA MTTRAMSRGT VMPSESQRPS SRIHKRQYKR RLIWLIPLIA FLLLAGIIGK RCYEGGDFGG RIELLQYPSV LLGRTYEAAS SLSHSIWTKI CRLVTATGAT AYAVKERIIM AFASQPWTSQ TKVSSAIEST SRSVYDSIHS GFSYVHLCLL DLWNHLWNIA EKASILPGYA VDAGTSLLNG VYEFGKAIVD GFFIIIRSIW EILLPLFIFM RNKVENVTTR WSEAIQQSET VTPMSMSFDS DRSAWELGLE ELRREKSIMS EQLEQLRRER ELDEERYREL RKAIEIAHVP IEKPNLDEEI MKKSSAMNEE ILTKKIETHI YNYITKLNLL DDASISKRLS EMEARLLARL EQLSLRIEAD FDAKLGSVKR ESGNAQNSLS NEISDLASAI ANQRNEFDTF RRERDAKLAQ LAHAVMLVET EQAEDIKRIK AEIDQFIKDE VAKGISERLM SLTKKSEEET AALRDEIHKQ IESRVMTLFA KEFAKAAGTH EGRIHGREAS AKGKGVMNRF AESDLLSIKK LIAEALRLYD ADKTGKVDYA LESSGGSVIS TRCTETYKEK SRLESIFGIP LWYSSYSPRT VIQHHANALS SGECWAFHGV GYLTIKLALP IHVTEVSYEH LRRELHPDGV IRSAPRRFQI WAFKELNDLE SKVLLGEYEF DREGDSLQYF EAQHRPSAPT PILELVVLSN WGAEYTCLYR LRVHGQKPTN TTMEVRDDAP VVDHRVDDEF ARGTRPFEQ // ID F1KSQ3_ASCSU Unreviewed; 1033 AA. AC F1KSQ3; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 7. DE SubName: Full=Protein osteopotentia {ECO:0000313|EMBL:ADY40907.1}; OS Ascaris suum (Pig roundworm) (Ascaris lumbricoides). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. OX NCBI_TaxID=6253 {ECO:0000313|EMBL:ADY40907.1}; RN [1] {ECO:0000313|EMBL:ADY40907.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21685128; DOI=10.1101/gr.121426.111; RA Wang J., Czech B., Crunk A., Wallace A., Mitreva M., Hannon G.J., RA Davis R.E.; RT "Deep small RNA sequencing from the nematode Ascaris reveals RT conservation, functional diversification, and novel developmental RT profiles."; RL Genome Res. 21:1462-1477(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JI165243; ADY40907.1; -; mRNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 791 813 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 684 723 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1033 AA; 116268 MW; 9EC101D1FD7A8C95 CRC64; MNQHSTIGMI YDSTHLVMIL RCIYLMLLLC LQDGLTNAHY ESEWISSRNL SPFYRLLSNN SELEEMCPLV LTDRCVAPPR SRHSAGVLSD GTSNANNVSI AVENALPDME DAISFSASKR VFREENLVVS EEAEKQPMTE SSRSEEQPPI ATFDEWTKEK LKQEGHRKNH LQSQQQSGGG RLLSAVNGAD TGANVKTASE GNVPSVPSVP SSVIIPQQAA QRNYASRECG AKVLFSNDEA ENKNAVLNEK ERDDYMRNPC ERAQHKWLII ELCETVLPKA IELANFELFS SGPQKVRVSA SERYPSNEWI TLEEFVAEDS RNVQRFPITT DVNLYVKFIR VELLTHYGNE HYCTLSMIRV LGISMVDEYE AEAEAASASA VSPTTVDSVE KSSSDVEKQS EVEVKREVNI SEIKPALQQM NDSKAKEVPL VNAVVNAVGN IAIKNLKDAF ESAFLGKWKS GESPLSRKAT IMPMACSSCP VVDSFTIPVL FCRAFFPQLV SELPKQGVTV DSNHVKERVD QSGLHDDKAE NKILKEMDTR KGLKQKRAFK RSNEKRFLKR VMRRIICLPS EHPNEEMLGE SHSSTKISDN DRVTDMSSKS CSSIEETCVH LQHQQPTANV LPTRGTNLGH QTLPGASLSH KESVFLKLNK RINALELNMS LSSEYLSELS RRYVEQTNDS RRHAEKVVKL AEEAAQNAAR ATQQKLAKQI EGMGRELKEL SRMVRSLWSR SAAVESSLKM ASVEQQQQQP EVENDRRTMK THSQSEEESV LLHPREHLLY SNDYVWTTEQ LVYMVVAAQA CTVVLMLLVQ CFYDRNRRMT EQLTIQDIVE ERIRTLVQPI SQTQNDESKK GVGGEKSRRS RHRKNNVQLK NNSCEQEDNW DSAVESSARS VTPTRSDLSS ELEVEGRLNV ATNLADDCAA VLSQLEWAKV TAMERGNSGK HKQHEHEKTP KFPDKQASRR LNQDSSLTLT GMKVAAVEEE ETLLDEVNSV LTAEPVKTPT RPTVQSSPPT SGRSWHLVKP SRRRNRKQQM LYE // ID F1KWL9_ASCSU Unreviewed; 442 AA. AC F1KWL9; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 07-JAN-2015, entry version 6. DE SubName: Full=Nuclear migration and anchoring protein unc-84 {ECO:0000313|EMBL:ADY42273.1}; OS Ascaris suum (Pig roundworm) (Ascaris lumbricoides). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Ascaridida; OC Ascaridoidea; Ascarididae; Ascaris. OX NCBI_TaxID=6253 {ECO:0000313|EMBL:ADY42273.1}; RN [1] {ECO:0000313|EMBL:ADY42273.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=21685128; DOI=10.1101/gr.121426.111; RA Wang J., Czech B., Crunk A., Wallace A., Mitreva M., Hannon G.J., RA Davis R.E.; RT "Deep small RNA sequencing from the nematode Ascaris reveals RT conservation, functional diversification, and novel developmental RT profiles."; RL Genome Res. 21:1462-1477(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JI167106; ADY42273.1; -; mRNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; SQ SEQUENCE 442 AA; 49443 MW; 6B804DA140CE88FC CRC64; MDSRDVVRPF GHQITEQITR RLRHVDGTVK VTEVRKRTVD AGTAPTTDKL TFGEAEEACF KGFQKPLALA GLEGESNLSM PRVYNNWTVR VGLILSLLML LLMAWQAHSP EEVFPPNVEG EVEDTLSGMR KSLDRVSLDR VVHRLEDRIK KLQHLCFENT NAINRVSADL DRRISDLDFE LSKTFAGISG GNMSANYESL VKRVGVVEGF LAKTRNEISN LKSTISDRTD GNGKLNTTNG ISAEDVKEWI RLAIDTYDAD KTNEFDFALE SAGAVVLVDR CSRTYSGISS WRTVIPFMRS SHRGPEVVIQ RRLGPSTGDC WPFEGGSGIL TVKLAEHANI TAVSYEHLPA SLSIDRSLKS APKDFQIWGY DDDDEQSTRR MLGEYHYSNE GPALQFFKTQ VTLSSIPLRV VELRITSNYG SHYTCLYRFR VHGHQATGGP SH // ID F1LY77_RAT Unreviewed; 319 AA. AC F1LY77; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 37. DE SubName: Full=Protein Sun3 {ECO:0000313|Ensembl:ENSRNOP00000006818}; GN Name=Sun3 {ECO:0000313|Ensembl:ENSRNOP00000006818, GN ECO:0000313|RGD:1591975}; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000006818, ECO:0000313|Proteomes:UP000002494}; RN [1] {ECO:0000313|Ensembl:ENSRNOP00000006818, ECO:0000313|Proteomes:UP000002494} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000006818, RC ECO:0000313|Proteomes:UP000002494}; RX PubMed=15057822; DOI=10.1038/nature02426; RG Rat Genome Sequencing Project Consortium; RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J., RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G., RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G., RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G., RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S., RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T., RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., RA Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A., RA Doucette-Stamm L., Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., RA Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., RA Zhu B., Marra M., Schein J., Bosdet I., Fjell C., Jones S., RA Krzywinski M., Mathewson C., Siddiqui A., Wye N., McPherson J., RA Zhao S., Fraser C.M., Shetty J., Shatsman S., Geer K., Chen Y., RA Abramzon S., Nierman W.C., Havlak P.H., Chen R., Durbin K.J., Egan A., RA Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J., RA D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R., RA Kalafus K.J., McLeod M.P., Milosavljevic A., Virk D., Volkov A., RA Wheeler D.A., Zhang Z., Bailey J.A., Eichler E.E., Tuzun E., RA Birney E., Mongin E., Ureta-Vidal A., Woodwark C., Zdobnov E., RA Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J., RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., RA Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., RA Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., RA Poliakov A., Huebner N., Ganten D., Goesele C., Hummel O., RA Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H., RA Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S., RA Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J., RA Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M., RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., RA Webber C., Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., RA Elnitski L., Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., RA Miller W., Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., RA Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., RA Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper G.M., RA Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A., RA Bourque G., Lopez-Otin C., Puente X.S., Chakrabarti K., Chatterji S., RA Dewey C., Pachter L., Bray N., Yap V.B., Caspi A., Tesler G., RA Pevzner P.A., Haussler D., Roskin K.M., Baertsch R., Clawson H., RA Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R., RA Trumbower H., Weirauch M., Cooper D.N., Stenson P.D., Ma B., Brent M., RA Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H., RA Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S., RA Collins F.S.; RT "Genome sequence of the Brown Norway rat yields insights into RT mammalian evolution."; RL Nature 428:493-521(2004). RN [2] {ECO:0000313|Ensembl:ENSRNOP00000006818} RP IDENTIFICATION. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000006818}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSRNOP00000006818}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AABR07015984; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10116.ENSRNOP00000006818; -. DR PaxDb; F1LY77; -. DR Ensembl; ENSRNOT00000006818; ENSRNOP00000006818; ENSRNOG00000005105. DR RGD; 1591975; Sun3. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F1LY77; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; F1LY77; -. DR TreeFam; TF323915; -. DR NextBio; 699721; -. DR Proteomes; UP000002494; Chromosome 14. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002494}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002494}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 7 28 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 319 AA; 36512 MW; 678763A272A0C644 CRC64; LTGSWKIILS TVFASTFLLV GLLNHQWLKE TEFPQKPRQL YTVIAEYGSR LHNYQARLRM PKEQQELLKK ESQTLENNFR EILFLIEQID LLKALLRDMK DGVHNHSWPA QREAVQDQAT AEVLDEEMSN LVHYVLKKFR GDQIQLADYA LKSAGASVIE AGTSESYKNN KAKLYWHGIG FLNYEMPPDM ILQPDVHPGK CWAFPGSQGH ILIKLARKII PTAVTMEHIS EKVSPSGNIS SAPKEFSVYG VTKKCEGEEM FLGQFIYKKM EATIQTFELQ NEASESLLCV KLQILSNWGH PKYTCLYRFR VHGIPSDHT // ID F1M9X3_RAT Unreviewed; 1251 AA. AC F1M9X3; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=SUN domain-containing ossification factor {ECO:0000313|Ensembl:ENSRNOP00000033216}; GN Name=Suco {ECO:0000313|Ensembl:ENSRNOP00000033216, GN ECO:0000313|RGD:735185}; Synonyms=Dd25 {ECO:0000313|RGD:735185}; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000033216, ECO:0000313|Proteomes:UP000002494}; RN [1] {ECO:0000313|Ensembl:ENSRNOP00000033216, ECO:0000313|Proteomes:UP000002494} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000033216, RC ECO:0000313|Proteomes:UP000002494}; RX PubMed=15057822; DOI=10.1038/nature02426; RG Rat Genome Sequencing Project Consortium; RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J., RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G., RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G., RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G., RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S., RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T., RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., RA Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A., RA Doucette-Stamm L., Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., RA Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., RA Zhu B., Marra M., Schein J., Bosdet I., Fjell C., Jones S., RA Krzywinski M., Mathewson C., Siddiqui A., Wye N., McPherson J., RA Zhao S., Fraser C.M., Shetty J., Shatsman S., Geer K., Chen Y., RA Abramzon S., Nierman W.C., Havlak P.H., Chen R., Durbin K.J., Egan A., RA Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J., RA D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R., RA Kalafus K.J., McLeod M.P., Milosavljevic A., Virk D., Volkov A., RA Wheeler D.A., Zhang Z., Bailey J.A., Eichler E.E., Tuzun E., RA Birney E., Mongin E., Ureta-Vidal A., Woodwark C., Zdobnov E., RA Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J., RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., RA Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., RA Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., RA Poliakov A., Huebner N., Ganten D., Goesele C., Hummel O., RA Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H., RA Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S., RA Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J., RA Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M., RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., RA Webber C., Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., RA Elnitski L., Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., RA Miller W., Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., RA Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., RA Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper G.M., RA Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A., RA Bourque G., Lopez-Otin C., Puente X.S., Chakrabarti K., Chatterji S., RA Dewey C., Pachter L., Bray N., Yap V.B., Caspi A., Tesler G., RA Pevzner P.A., Haussler D., Roskin K.M., Baertsch R., Clawson H., RA Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R., RA Trumbower H., Weirauch M., Cooper D.N., Stenson P.D., Ma B., Brent M., RA Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H., RA Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S., RA Collins F.S.; RT "Genome sequence of the Brown Norway rat yields insights into RT mammalian evolution."; RL Nature 428:493-521(2004). RN [2] {ECO:0000313|Ensembl:ENSRNOP00000033216} RP IDENTIFICATION. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000033216}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSRNOP00000033216}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC144674; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10116.ENSRNOP00000033216; -. DR PaxDb; F1M9X3; -. DR Ensembl; ENSRNOT00000036483; ENSRNOP00000033216; ENSRNOG00000026542. DR RGD; 735185; Suco. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR NextBio; 35583062; -. DR Proteomes; UP000002494; Chromosome 13. DR ExpressionAtlas; F1M9X3; baseline and differential. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002494}; KW Proteomics identification {ECO:0000213|PeptideAtlas:F1M9X3}; KW Reference proteome {ECO:0000313|Proteomes:UP000002494}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1251 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003265871. FT COILED 935 955 {ECO:0000256|SAM:Coils}. FT COILED 985 1005 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1251 AA; 139144 MW; C77922CBF3D5711D CRC64; MKKYRRALAL VSCLSLCSLV WLPSWHVCCK ESSSASTSYY SQDDNCAVGS EDIQFQKKNE REEPSNAKVS EKSNSYLTIS PEENKLKDDY TVDENRIWKQ RSLPVVEALP TVDSHEESSS VVVGSENIEN SSSSSTSETT PISKLDEIEN SGTLSVAKPG DTEQPEADCD AGEAADADAS VEQPAFVSAP ESLVGQHIEN VSSSHGKEKV TKSEFESKVS VSEQDGGDPK SALNASDTLK NESSDYTKPR ETDPTSVTSP KDPEDIPTFD EWKKKVMEVE KEKSQSLHPS SNGGPHATKK VQKNRNNYAS VECGAKILAA NPEAKSTSAI LIENMDLYML NPCSTKIWFV IELCEPIQVK QFDIANYELF SSTPKDFLVS ISDRYPTNKW IKLGTFHGRD ERTVQSFPLD EQMYAKYVKM FIKYIKVELL SHFGSEHFCP LSLIRVFGTS MVEEYEEIAD SQYQSERQEL FDEDYDYPLD YNTVEDKSSK NLLGSATNAI LNMVNIAANI LGAKTEDLTE GNKSISENAT ATTEPKMPES TGVSTPVPSP EYIIKEVHTH DTEPPTSDPP KESPIVQLVQ EEEEEASPST VTLLGSGEQE DESSSWFESE TQILCSELTS ICCISSFSEY LYKWCSVRIA LYRQHSRTVS KGKDVSPQPS LLPPVDSVEV SVLQPPSGDV DKEDMERELE TVALDDLSSV HQAHVRNHTV DTVELEPSYP QTLSQSLPLD VTPEMDSLST VEGSESVKSE GGHKPSQVMP QESSVEFDDE TEKKPESFSS VAKLSVIYET SKVNEVMDGT VKEDIVSTHV VTKFPETKFP ETVAPPPINT AAVPESEGME TKPSLADTLK HVVTPVTDPS LPEVKEDEQS PDDALLRGLQ RTATDFYAEL QNSTDLGYGN GNLVHGSNQK ESVFMRLNNR IKALEVNMSL SGRYLEELSQ RYRKQMEEMQ KAFNKTIVKL QNTSRIAEEQ DQRQTEAIHL LQAQLTNMTQ IVSNLSATVA ELKREVSDRQ SYLVMSLVLC VVLGLMLCMQ RCRNTSQFDG DYTSKLPKSN QYPSPKRCFS SYDDMNLKRR TSFPLIRSKS LQFTGKEDPN DLYIVEPLKF SPEKKKKRCK YKTEKIETIK PADPLHPIAN GDIKGRKPFT NQRDFSSMGE VYHSSYKGPP SEGSSETSSQ SEESYFCGIS ACTSLCNGQT QKTKLRRGLK RRRSKVQDQG KLIKALIQTK SGSLPSLHDI IKGNKEITVG AFGVTAVSGH I // ID F1MH03_BOVIN Unreviewed; 359 AA. AC F1MH03; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 2. DT 11-NOV-2015, entry version 22. DE SubName: Full=SUN domain-containing protein 3 {ECO:0000313|Ensembl:ENSBTAP00000010721}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSBTAP00000010721}; OS Bos taurus (Bovine). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; OC Pecora; Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913 {ECO:0000313|Ensembl:ENSBTAP00000010721, ECO:0000313|Proteomes:UP000009136}; RN [1] {ECO:0000313|Ensembl:ENSBTAP00000010721, ECO:0000313|Proteomes:UP000009136} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000010721, RC ECO:0000313|Proteomes:UP000009136}; RX PubMed=19393038; DOI=10.1186/gb-2009-10-4-r42; RA Zimin A.V., Delcher A.L., Florea L., Kelley D.R., Schatz M.C., RA Puiu D., Hanrahan F., Pertea G., Van Tassell C.P., Sonstegard T.S., RA Marcais G., Roberts M., Subramanian P., Yorke J.A., Salzberg S.L.; RT "A whole-genome assembly of the domestic cow, Bos taurus."; RL Genome Biol. 10:R42.01-R42.10(2009). RN [2] {ECO:0000313|Ensembl:ENSBTAP00000010721} RP IDENTIFICATION. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000010721}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSBTAP00000010721}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DAAA02009729; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; DAAA02009730; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9913.ENSBTAP00000010721; -. DR PaxDb; F1MH03; -. DR Ensembl; ENSBTAT00000010721; ENSBTAP00000010721; ENSBTAG00000008155. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000009136; Chromosome 4. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009136}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009136}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 45 64 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 359 AA; 40490 MW; 7B78179CBCF00542 CRC64; MSGRPNSRGS SRLFRAPSED ASSGSSGSAV LPQQENPNAS GLTRSWKAVM GMVFILTLLL LGFINHMKLK EKAFPQKSRQ IYAVIAEYGS RLYNYQARLR MPKEQLELLK KESQTLENNF REILFLIEQI DVLKALLRDM QDGLHNYSWN ADIDPAEGWN HTEVIDEEMS NLVNYILKLR EDQVQMADYA LKSAGASVVE AGTSESYKNN KAKLYWHGIG FLNYEMPPDI ILQPDVHPGK CWAFPGSQGH ALIKLARKII PTAVTMEHIS EKVSPSGNIS SAPKEFSVYG VLKQCEGEEI FLGQFVYNKT GTTVQTFALQ HEVPEFLLCV KLKILSNWGH PNYTCLYRFR VHGTPKDDS // ID F1MS76_BOVIN Unreviewed; 1252 AA. AC F1MS76; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSBTAP00000043864}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSBTAP00000043864}; OS Bos taurus (Bovine). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; OC Pecora; Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913 {ECO:0000313|Ensembl:ENSBTAP00000043864, ECO:0000313|Proteomes:UP000009136}; RN [1] {ECO:0000313|Ensembl:ENSBTAP00000043864, ECO:0000313|Proteomes:UP000009136} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000043864, RC ECO:0000313|Proteomes:UP000009136}; RX PubMed=19393038; DOI=10.1186/gb-2009-10-4-r42; RA Zimin A.V., Delcher A.L., Florea L., Kelley D.R., Schatz M.C., RA Puiu D., Hanrahan F., Pertea G., Van Tassell C.P., Sonstegard T.S., RA Marcais G., Roberts M., Subramanian P., Yorke J.A., Salzberg S.L.; RT "A whole-genome assembly of the domestic cow, Bos taurus."; RL Genome Biol. 10:R42.01-R42.10(2009). RN [2] {ECO:0000313|Ensembl:ENSBTAP00000043864} RP IDENTIFICATION. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000043864}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSBTAP00000043864}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DAAA02042901; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; DAAA02042902; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; DAAA02042903; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9913.ENSBTAP00000043864; -. DR PaxDb; F1MS76; -. DR Ensembl; ENSBTAT00000046578; ENSBTAP00000043864; ENSBTAG00000010169. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F1MS76; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000009136; Chromosome 16. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009136}; KW Reference proteome {ECO:0000313|Proteomes:UP000009136}. FT COILED 934 954 {ECO:0000256|SAM:Coils}. FT COILED 1190 1210 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSBTAP00000043864}. SQ SEQUENCE 1252 AA; 139151 MW; 554651396D3A20C9 CRC64; SKFRRLRVLK SCYVLCSFIR LPSWHVCCKE SSSASSYYSQ DDNCALENED VQSQKKDERG GPVNAELSGK VGSSLPVPPE ENKLKDDYIV NVQKQDTESK KLSPSVTETL PTVDVHEDSS SVVVDSENTE NISSSSTSEI TPISKLDEIE DSGTIPIAKP GETEHSETDC DVGEALDANA PVDQPSFVSP PESLVGQHIE NVSSSHGKGK ITKSEFESKV SSSDQGGSDP KSALNASSNN LKNESSDYTK PGEIDPTPVT NSKDPEDIPT FDEWKKKVME VEKEKSQSMH PSSNGGLHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQFDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LVSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYQSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKTISEN ASATAAPKMP ESAPVSAPVP SHEFETTEGH VHDIESLSPD TTKESPIVQL VQEEEEEASP STVTLLGSGE QEDETSPWFE SETQIFCSEL TTICCISSFS EYIYKWCSVR VARYRQRSRT ALSKQKEYLM SAEPPLVLPE EPVDVSLLQP PGGEPDSNKE KDAETSVLDD LSGVLQEDLI NHTLDAIELE PSHPQTLSQS VLLDVTPEIN SLSKIEVSEP VKYEAGHSPS QVIPQDSSVE VDNETEKRSE SFSSIEKPTV ISETKILDKV MDNVVKEDIN SMRITTKLSE TIVPPVNTAT MPDIEAGEAK MNIADTPKQI STPVVDSSSL PEVKDDEQSP EDVLLRGLQR TATDFYAELQ NSSDLGYANG NLVHGSNQKE SVFMRLNNRI KALEVNMSLS GRYLEELSQR YRKQMEEMQK AFNKTIVKLQ NTSRIAEEQD QRQTEAIQLL QAQLTNMTQL VSNLSTTVAE LKHEVSDRQS YLVISLVLCV VLGLMLCMQR CRNTSQFDGD YISKLPKSNQ YPSPKRCFSS YDDMSLKRRT SFPLIRSKSL QLTGKEVDPS DLYIVEPLKF SPEKKKKRCK YKTEKIETIK PADPLHPVAN GDIKGRKPFT NQRDFSNMGE VYHSSYKGPP SEGSSETSSQ SEESYFCGIS ACTSLCNGQS QKTKTEKRAL KRRRSKVQDQ GKLIKTLIQT KSGSLPSLHD IIKGNKELTV GTFGVTAVSG HI // ID F1MZZ8_BOVIN Unreviewed; 433 AA. AC F1MZZ8; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 2. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSBTAP00000009127}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSBTAP00000009127}; OS Bos taurus (Bovine). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; OC Pecora; Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913 {ECO:0000313|Ensembl:ENSBTAP00000009127, ECO:0000313|Proteomes:UP000009136}; RN [1] {ECO:0000313|Ensembl:ENSBTAP00000009127, ECO:0000313|Proteomes:UP000009136} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000009127, RC ECO:0000313|Proteomes:UP000009136}; RX PubMed=19393038; DOI=10.1186/gb-2009-10-4-r42; RA Zimin A.V., Delcher A.L., Florea L., Kelley D.R., Schatz M.C., RA Puiu D., Hanrahan F., Pertea G., Van Tassell C.P., Sonstegard T.S., RA Marcais G., Roberts M., Subramanian P., Yorke J.A., Salzberg S.L.; RT "A whole-genome assembly of the domestic cow, Bos taurus."; RL Genome Biol. 10:R42.01-R42.10(2009). RN [2] {ECO:0000313|Ensembl:ENSBTAP00000009127} RP IDENTIFICATION. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000009127}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSBTAP00000009127}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DAAA02036525; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9913.ENSBTAP00000009127; -. DR PaxDb; F1MZZ8; -. DR Ensembl; ENSBTAT00000009127; ENSBTAP00000009127; ENSBTAG00000006949. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F1MZZ8; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000009136; Chromosome 13. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009136}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009136}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 130 153 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 159 184 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 197 231 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 433 AA; 47303 MW; 9082BFB2BC6A5AF3 CRC64; MRRSPRPGSA ASQHKHTPNF YSDNSNSSVS VTSGDSCGHR SAGPGPGEPE GRRARGSSCG EPALSAGVPG GTTWAGSSRQ KPAPRSHNGQ TACGAATVRG GASVSEEQLD LLPTLDLRQE MPSPQVSKSF LSLLFQVLSV LLSLVGDVLV SVYREVCSIR FLLTAVSLLS LFLAALWWGL LYLAPPLENE PKEMLTLSEY HERVRSQGQQ LQQLQAELVK LHKEMSSVRA ANSERVAQLV FQRLSEDFVQ KPDYALSSVG ASIDLEKTSQ DYEDANTAYF WNRFSFWNYA RPPTVILEPD VFPGNCWAFE GDQGQVVIRL PGRVQLSDIT LQHPPPTVAH TRGANSAPRD FAVYGLQVDG ETEVFLGKFT FDVEKSEIQT FHLQNDPPAA FPKVKIQILS NWGHPRFTCL YRVRAHGIRT SEGAGDSATG GAH // ID F1NP61_CHICK Unreviewed; 1251 AA. AC F1NP61; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000000625}; DE Flags: Fragment; GN Name=C8H1ORF9 {ECO:0000313|Ensembl:ENSGALP00000000625}; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000000625, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Ensembl:ENSGALP00000000625, ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000000625, RC ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000000625} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000000625}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000000625}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AADN03005669; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9031.ENSGALP00000000625; -. DR PaxDb; F1NP61; -. DR Ensembl; ENSGALT00000000626; ENSGALP00000000625; ENSGALG00000003081. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F1NP61; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000000539; Chromosome 8. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}. FT COILED 932 952 {ECO:0000256|SAM:Coils}. FT COILED 982 1002 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSGALP00000000625}. SQ SEQUENCE 1251 AA; 138545 MW; 562CF1187C902F7B CRC64; LPAARSYSAL YSCVYLSNLI RLPVWHVFCK DSLLSSVQYA SSDACDLAND GENIQEKRRE REDLSTLLEP EHTDSSTQTY STEELLDDFI KSEQAAKVSE TSQPEAQTPP SVDVNEAPSN IVASTENNSS SPTSEISTVS QPDAIENTRA DIPVVSSIEA EQSEPDCDIG GTLEADPQSE PSSFVSPQES LAGQHIENVS SSHGKGKKTK SEFESKVEAA EKGADQKSAL NASENLKREK DYKKTGEIDP TSVITPKDPG DIPTFDEWKK KVMEVEKEKS QSMHPSAAGG QHSTKKVQKN RNNYASVECG AKILAANPEA KSTSAILMEN MDLYMLNPCS TKIWFVIELC EPVQVKQLDI ANHELFSSTP KDFLVSISDR YPTNKWIKLG TFHARDERNV QSFPLDEQMY AKYVKMFIKY IKVELISHFG SEHFCPLSLI RVFGTSMVEE YEEIADSQYQ SERQELFDED YDYPLDYNTG EEKSSKNLLG SATNAILSMV NIAANMLGAK TEETSEGNKS ISENVTVTTP ASSTAAPRLP EPTPVPSPEL TTTDIPQIDK EQLRVDLTKE SPIVQLVQEY EEDASQSTVT LLSSDDQEEE KSSWFELETD MYCYDLATVC CISTFTEYLF KWCSVTVAIH RQHSKTEGKQ EQDESTRAQP PQVVLPQSVP VSVDEPLPEQ LDSKVDKVPG STVAVDFSSV VHEIISNETT ATIELEPSHP QTVSQSLLLE VTSEVKPLPT TEMVLEPSQE DAGQEVPGIT PQADSAEISA VTERAESSVA EEAVVVSETS VITEVKETST KETTATSMIS KPTETVLQPE YTVGILASDT GEGKESTPEV QKPVLSPVES SVSVETKEDD QATEEAFMSI PVSGGPQRTA TDFYAELQNS TDLGYANGNL VHGSNQKESV FMRLNNRIKA LEVNMSLSSR YLEELSQRYR KQMEEMQKAF NKTIIKLQNT SRIAEEQDQR QTEAIQLLQA QLTNMTQLVS NLSSTVAELK REVSDRQTYL VISLVLCVIL GLVLCVQRCR STSQFCEGYL SKIPKSNHYP SPKRCFSSYD DMNLKRRTSL PLVRSQSFQL SGKEVDPEDL YIVEPLKFSP EKKKKRCKYK SEKIETIKPT AEPPHPIANG EIKGRKPFTN QRDFSNIGEV YHSSYKGPPS EGSSETSSQS EESYFCGISA CTSLCNGQTQ KTKTEKRAIK RRRSKVSDQG KLIKTLIQTK SGSMPSLHDI IKGNKDITVG TLGVTAVSGH I // ID F1NSE9_CHICK Unreviewed; 201 AA. AC F1NSE9; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000021370}; DE Flags: Fragment; GN Name=Gga.38038 {ECO:0000313|Ensembl:ENSGALP00000021370}; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000021370, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Ensembl:ENSGALP00000021370, ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000021370, RC ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000021370} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000021370}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000021370}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AADN03002038; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AADN03002758; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR Ensembl; ENSGALT00000021401; ENSGALP00000021370; ENSGALG00000017691. DR GeneTree; ENSGT00390000011587; -. DR OMA; SSASIHM; -. DR Proteomes; UP000000539; Chromosome 2. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSGALP00000021370}. SQ SEQUENCE 201 AA; 22124 MW; 0AC8C211177813D8 CRC64; AVQKIIDQVL EKLEESPFQM TNYASKTSGA AIVRSKTSPS WIGSGRVFWQ SLPLVAYMRP PEVILEPDNH PGNCWPFPGS QGHVFIKLPV AVFPTAVTIN HGVPAAAYHA DSISSAPKDF AVYGLQAEDD EKGTLLGEFI FTPGQAPGQT FQLKNEHSGF IKYVRLQVLS NWGHRDYTCV YQFRLHGDPA HDGDTRGKLS A // ID F1P7X2_CANFA Unreviewed; 1392 AA. AC F1P7X2; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-OCT-2012, sequence version 2. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCAFP00000021658}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSCAFP00000021658}; OS Canis familiaris (Dog) (Canis lupus familiaris). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Canidae; OC Canis. OX NCBI_TaxID=9615 {ECO:0000313|Ensembl:ENSCAFP00000021658, ECO:0000313|Proteomes:UP000002254}; RN [1] {ECO:0000313|Ensembl:ENSCAFP00000021658, ECO:0000313|Proteomes:UP000002254} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000021658, RC ECO:0000313|Proteomes:UP000002254}; RX PubMed=16341006; DOI=10.1038/nature04338; RG Broad Sequencing Platform; RA Lindblad-Toh K., Wade C.M., Mikkelsen T.S., Karlsson E.K., Jaffe D.B., RA Kamal M., Clamp M., Chang J.L., Kulbokas E.J. III, Zody M.C., RA Mauceli E., Xie X., Breen M., Wayne R.K., Ostrander E.A., RA Ponting C.P., Galibert F., Smith D.R., deJong P.J., Kirkness E.F., RA Alvarez P., Biagi T., Brockman W., Butler J., Chin C.-W., Cook A., RA Cuff J., Daly M.J., DeCaprio D., Gnerre S., Grabherr M., Kellis M., RA Kleber M., Bardeleben C., Goodstadt L., Heger A., Hitte C., Kim L., RA Koepfli K.-P., Parker H.G., Pollinger J.P., Searle S.M.J., RA Sutter N.B., Thomas R., Webber C., Baldwin J., Abebe A., RA Abouelleil A., Aftuck L., Ait-Zahra M., Aldredge T., Allen N., An P., RA Anderson S., Antoine C., Arachchi H., Aslam A., Ayotte L., RA Bachantsang P., Barry A., Bayul T., Benamara M., Berlin A., RA Bessette D., Blitshteyn B., Bloom T., Blye J., Boguslavskiy L., RA Bonnet C., Boukhgalter B., Brown A., Cahill P., Calixte N., RA Camarata J., Cheshatsang Y., Chu J., Citroen M., Collymore A., RA Cooke P., Dawoe T., Daza R., Decktor K., DeGray S., Dhargay N., RA Dooley K., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., RA Dupes A., Egbiremolen O., Elong R., Falk J., Farina A., Faro S., RA Ferguson D., Ferreira P., Fisher S., FitzGerald M., Foley K., RA Foley C., Franke A., Friedrich D., Gage D., Garber M., Gearin G., RA Giannoukos G., Goode T., Goyette A., Graham J., Grandbois E., RA Gyaltsen K., Hafez N., Hagopian D., Hagos B., Hall J., Healy C., RA Hegarty R., Honan T., Horn A., Houde N., Hughes L., Hunnicutt L., RA Husby M., Jester B., Jones C., Kamat A., Kanga B., Kells C., RA Khazanovich D., Kieu A.C., Kisner P., Kumar M., Lance K., Landers T., RA Lara M., Lee W., Leger J.-P., Lennon N., Leuper L., LeVine S., Liu J., RA Liu X., Lokyitsang Y., Lokyitsang T., Lui A., Macdonald J., Major J., RA Marabella R., Maru K., Matthews C., McDonough S., Mehta T., RA Meldrim J., Melnikov A., Meneus L., Mihalev A., Mihova T., Miller K., RA Mittelman R., Mlenga V., Mulrain L., Munson G., Navidi A., Naylor J., RA Nguyen T., Nguyen N., Nguyen C., Nguyen T., Nicol R., Norbu N., RA Norbu C., Novod N., Nyima T., Olandt P., O'Neill B., O'Neill K., RA Osman S., Oyono L., Patti C., Perrin D., Phunkhang P., Pierre F., RA Priest M., Rachupka A., Raghuraman S., Rameau R., Ray V., Raymond C., RA Rege F., Rise C., Rogers J., Rogov P., Sahalie J., Settipalli S., RA Sharpe T., Shea T., Sheehan M., Sherpa N., Shi J., Shih D., Sloan J., RA Smith C., Sparrow T., Stalker J., Stange-Thomann N., Stavropoulos S., RA Stone C., Stone S., Sykes S., Tchuinga P., Tenzing P., Tesfaye S., RA Thoulutsang D., Thoulutsang Y., Topham K., Topping I., Tsamla T., RA Vassiliev H., Venkataraman V., Vo A., Wangchuk T., Wangdi T., RA Weiand M., Wilkinson J., Wilson A., Yadav S., Yang S., Yang X., RA Young G., Yu Q., Zainoun J., Zembek L., Zimmer A., Lander E.S.; RT "Genome sequence, comparative analysis and haplotype structure of the RT domestic dog."; RL Nature 438:803-819(2005). RN [2] {ECO:0000313|Ensembl:ENSCAFP00000021658} RP IDENTIFICATION. RC STRAIN=Boxer {ECO:0000313|Ensembl:ENSCAFP00000021658}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCAFP00000021658}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEX03005210; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9615.ENSCAFP00000021658; -. DR PaxDb; F1P7X2; -. DR Ensembl; ENSCAFT00000023323; ENSCAFP00000021658; ENSCAFG00000014685. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F1P7X2; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000002254; Chromosome 7. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002254}; KW Reference proteome {ECO:0000313|Proteomes:UP000002254}. FT COILED 1074 1094 {ECO:0000256|SAM:Coils}. FT COILED 1124 1144 {ECO:0000256|SAM:Coils}. FT COILED 1330 1350 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSCAFP00000021658}. SQ SEQUENCE 1392 AA; 154546 MW; E0B2BF56E98F42E1 CRC64; EEILAHPSPP TNQHLPPRAG PIGTGKTPSF FPHRHSRPSI PRTVLKRGKL SKTIPKLFIL ARRRGTLHLS IKTVIPGGIK EKDKLQLPRG LARTRLAPGE GRRARTSRRL RQSQETSCEA RLWAPPWAPR GGRPVREPLR SRSPSAAALR TLGPILSLLL RLLHLGLGSG GCREDVPPSG RGKKEEKMKK YRRALALVSC LSLCSLVWLP SWHVCCKESS SASSYYSQDD NCALENEDVQ FQKKNTESKL SPSVIETLPT VDLQEDSSSI VVGSENIENI SSSSTSEITP ISKLDEIEKS GTIPIAKPSE TEQSETDCDV GDTLETNAPV DQPSFVSPPE SLVGQHIENV SSSHGKGKIT KSEFESKVSA SDQKSGDSKS ALNTSDNLKN ESSDSSKPGE IDHTSVTSPK DPEDIPTFDE WKKKVMEVEK EKSQSMHPSS NGGLHATKKV QKNRNNYASV ECGAKILAAN PEAKSTSAIL IENMDLYMLN PCSTKIWFVI ELCEPIQVKQ LDIANYELFS STPKDFLVSI SDRYPTNKWI KLGTFHGRDE RNVQSFPLDE QMYAKYVKVE LVSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYQSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKSISEN ATATAAPKMP DLAPVSTPVP SPEFITTEGH IHDTELSSPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWFE SETQIFCSEL TTICCISSFS EYVYKWCSVR VALYRQRSRT AVGKGKDRLV LAQPPLLLPA ESVDVLVLQP PSGDLDSKRK EKDAETIDLG DLSSMNQGDL INHSADAIEL EPSHPQTLSQ SLILDVTPEI HSLSKIEVSE PIKYETGPTP SQVIPQESSV EVDNETEKKS ESFTSIEKPP VIYDTNLNEV MDNTVKEDMN SLHIITKLSE TIVPPVNTAT MPDSEDGEAK LNIADTPKQI LTPIMDSSSL PEVREEEQSP EDALLRGLQR TATDFYAELQ NSTDLGYSNG NLVHGSNQKE SVFMRLNNRI KALEVNMSLS GRYLEELSQR YRKQMEEMQK AFNKTIVKLQ NTSRIAEEQD QRQTEAIQLL QAQLTNMTQL VSNLSTTVAE LKREVSDRQS YLVISLVLCV VLGLMLCMQR CRNNSQFDGD YISKLPKSNQ YPSPKRCFSS YDDMNLKRRT SFPLIRSKSL QLTGKEVDPN DLYIVEPLKF SPEKKKKRCK YKTEKIETIK PADPLHPVAN GDLKGRKPFM NQRDFSNMGE VYHSSYKGPP SEGSSETSSQ SEESYFCGIS ACTSLCNGQS QKTKTEKRAL KRRRSKVQDQ GKLIKTLIQT KSGSLPSLHD IIKGNKEITV GTFGVTAVSG HI // ID F1Q595_DANRE Unreviewed; 1314 AA. AC F1Q595; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSDARP00000015683}; GN Name=suco {ECO:0000313|Ensembl:ENSDARP00000015683, GN ECO:0000313|ZFIN:ZDB-GENE-030131-2941}; GN Synonyms=si:ch211-184m19.1 {ECO:0000313|ZFIN:ZDB-GENE-030131-2941}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000015683, ECO:0000313|Proteomes:UP000000437}; RN [1] {ECO:0000313|Ensembl:ENSDARP00000015683} RP IDENTIFICATION. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000015683}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [2] {ECO:0000313|Ensembl:ENSDARP00000015683, ECO:0000313|Proteomes:UP000000437} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000015683, RC ECO:0000313|Proteomes:UP000000437}; RX PubMed=23594743; DOI=10.1038/nature12111; RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., RA Muffato M., Collins J.E., Humphray S., McLaren K., Matthews L., RA McLaren S., Sealy I., Caccamo M., Churcher C., Scott C., Barrett J.C., RA Koch R., Rauch G.J., White S., Chow W., Kilian B., Quintais L.T., RA Guerra-Assuncao J.A., Zhou Y., Gu Y., Yen J., Vogel J.H., Eyre T., RA Redmond S., Banerjee R., Chi J., Fu B., Langley E., Maguire S.F., RA Laird G.K., Lloyd D., Kenyon E., Donaldson S., Sehra H., RA Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M., RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J., RA Clee C., Oliver K., Clark R., Riddle C., Eliott D., Threadgold G., RA Harden G., Ware D., Mortimer B., Kerry G., Heath P., Phillimore B., RA Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S., Pelan S., RA Griffiths G., Smith M., Glithero R., Howden P., Barker N., Stevens C., RA Harley J., Holt K., Panagiotidis G., Lovell J., Beasley H., RA Henderson C., Gordon D., Auger K., Wright D., Collins J., Raisen C., RA Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D., RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S., RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E., RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., RA Babbage A., Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., RA Wray P., Ellington A., Matthews N., Ellwood M., Woodmansey R., RA Clark G., Cooper J., Tromans A., Grafham D., Skuce C., Pandian R., RA Andrews R., Harrison E., Kimberley A., Garnett J., Fosker N., Hall R., RA Garner P., Kelly D., Bird C., Palmer S., Gehring I., Berger A., RA Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M., RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M., RA Rudolph-Geiger S., Teucke M., Osoegawa K., Zhu B., Rapp A., Widaa S., RA Langford C., Yang F., Carter N.P., Harrow J., Ning Z., Herrero J., RA Searle S.M., Enright A., Geisler R., Plasterk R.H., Lee C., RA Westerfield M., de Jong P.J., Zon L.I., Postlethwait J.H., RA Nusslein-Volhard C., Hubbard T.J., Roest Crollius H., Rogers J., RA Stemple D.L.; RT "The zebrafish reference genome sequence and its relationship to the RT human genome."; RL Nature 496:498-503(2013). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSDARP00000015683}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BX470128; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BX901920; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_005160502.1; XM_005160445.2. DR UniGene; Dr.9374; -. DR STRING; 7955.ENSDARP00000015683; -. DR PaxDb; F1Q595; -. DR Ensembl; ENSDART00000026969; ENSDARP00000015683; ENSDARG00000016532. DR GeneID; 558218; -. DR CTD; 51430; -. DR ZFIN; ZDB-GENE-030131-2941; suco. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F1Q595; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR PRO; PR:F1Q595; -. DR Proteomes; UP000000437; Chromosome 20. DR Bgee; F1Q595; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000437}; KW Reference proteome {ECO:0000313|Proteomes:UP000000437}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 1314 FT /FTId=PRO_5003267820. FT COILED 994 1014 {ECO:0000256|SAM:Coils}. FT COILED 1037 1064 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1314 AA; 145104 MW; C36801F368C296E4 CRC64; MKKLRVLFLC SVLALLCWHS CHDVYCSEEV SSGPGLPAQD DGSYTMQEPG AGEKVEEERA THTQSSYDVG LETERAALKE TSLHSQETPH EEKHKQTVNL EEVLEEPQIQ HDSALPELEQ QQQQEQHPEG QSSQHPDQQD SAQPPTFAQP ETAPPLSQPD PEVPQATATE DQAAQPSTDE TPPHPDPHAE EDASAPADEL QNSPESTSPT SVTSSESLSS VVEEQVETQA ENQSTVLEQS TETSTDPADS LDPSELDLTE TGQTDRIPPP TESQQVADHE GATDPPVPSK EDIPTFDEWK KKVMEVEEKK SQSLHTSSNG SPHPVKKVQK NFKNNYASVE CGAKILSANN EAKSTSAILM ENMDLYMLNP CSTKIWFVIE LCEPIQVKQL DIANFELFSS TPRDFLVSIS DRYPTNKWIK LGTFHARDER TVQSFPLDEQ LYAKYVKMFI RYIKVELLSH FGSEHFCPLS LIRVFGTSMV EEYDEIADSQ YTSERAEYLD EDYDYPPGYL PSEDKASKNL LGSATNAILN MVNNIAANVL GGKPELEDGA ELEGNVSSGT ENVTQASTET TLTPDPTPTE QPHTLDVLEL DPTFVKEDME APIPEAPSQA PTVAPAEESR IVILIEEDEE STQPTVTLLE EEGEEERRLE ELWESESTTY CGHLSTLSCL ASLHEHFYCY CSAALALQRQ RKDRRHKAQH TQAHTKTQTT SQQTLTDPTP SQEPPAPSEK LSEAERPSQT QSERISPTET AAAPSESSNW ESSSTELELP PETILLEPSR TSTLPPHSFS ETPTQNIPES LPTGEPVDQR PQYLVDIPET HAPGSVCSST SSVPSPSAVS STPLSSTTEP HSLETDPPKL ELPVPTKEQT AQPLPTHTRP AEIPPPTEIP ILIPEASENG KVSLVEDQHH NAVVSQPELS EAPHEDTVDD ILLPHRTAGE FYAEQQQTAE MGHSNGNGNG NQVHGSNQKE SVFMRLNNRI KALEMNMSLS SRYLEELSQR YRKQMEEMQR AFNKTIIKLQ NTSRIAEEQD QKQTESIQML QSQLENATRI MLNLSATVAQ LQREVSDRQI YLVLSLILCL MLGLLLCTQC CRSSTNHNTS PTIPMSNNYP SPKRCFSSYD DMSLKRRVSC PLVRSKSFQL PTSEVGPDDL YIVEPLRFSP EKKKKRCKTK AGERVETLRA SATAPNGLLK LNGESSPPLQ FLPSTCNGYT STCSSRDCAS EGSSESSVST LSEESFGSAP SVNGHELVLY NGEPPPACPP AKSRTEKKRR RSRQGEQDSM LSASLPLPAL EELIKGKTEL GVASFGRTAV TGRV // ID F1R1L9_DANRE Unreviewed; 987 AA. AC F1R1L9; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 03-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSDARP00000104532}; GN Name=sun1 {ECO:0000313|Ensembl:ENSDARP00000104532, GN ECO:0000313|ZFIN:ZDB-GENE-050522-551}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000104532, ECO:0000313|Proteomes:UP000000437}; RN [1] {ECO:0000313|Ensembl:ENSDARP00000104532} RP IDENTIFICATION. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000104532}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [2] {ECO:0000313|Ensembl:ENSDARP00000104532, ECO:0000313|Proteomes:UP000000437} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000104532, RC ECO:0000313|Proteomes:UP000000437}; RX PubMed=23594743; DOI=10.1038/nature12111; RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., RA Muffato M., Collins J.E., Humphray S., McLaren K., Matthews L., RA McLaren S., Sealy I., Caccamo M., Churcher C., Scott C., Barrett J.C., RA Koch R., Rauch G.J., White S., Chow W., Kilian B., Quintais L.T., RA Guerra-Assuncao J.A., Zhou Y., Gu Y., Yen J., Vogel J.H., Eyre T., RA Redmond S., Banerjee R., Chi J., Fu B., Langley E., Maguire S.F., RA Laird G.K., Lloyd D., Kenyon E., Donaldson S., Sehra H., RA Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M., RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J., RA Clee C., Oliver K., Clark R., Riddle C., Eliott D., Threadgold G., RA Harden G., Ware D., Mortimer B., Kerry G., Heath P., Phillimore B., RA Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S., Pelan S., RA Griffiths G., Smith M., Glithero R., Howden P., Barker N., Stevens C., RA Harley J., Holt K., Panagiotidis G., Lovell J., Beasley H., RA Henderson C., Gordon D., Auger K., Wright D., Collins J., Raisen C., RA Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D., RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S., RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E., RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., RA Babbage A., Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., RA Wray P., Ellington A., Matthews N., Ellwood M., Woodmansey R., RA Clark G., Cooper J., Tromans A., Grafham D., Skuce C., Pandian R., RA Andrews R., Harrison E., Kimberley A., Garnett J., Fosker N., Hall R., RA Garner P., Kelly D., Bird C., Palmer S., Gehring I., Berger A., RA Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M., RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M., RA Rudolph-Geiger S., Teucke M., Osoegawa K., Zhu B., Rapp A., Widaa S., RA Langford C., Yang F., Carter N.P., Harrow J., Ning Z., Herrero J., RA Searle S.M., Enright A., Geisler R., Plasterk R.H., Lee C., RA Westerfield M., de Jong P.J., Zon L.I., Postlethwait J.H., RA Nusslein-Volhard C., Hubbard T.J., Roest Crollius H., Rogers J., RA Stemple D.L.; RT "The zebrafish reference genome sequence and its relationship to the RT human genome."; RL Nature 496:498-503(2013). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSDARP00000104532}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CABZ01055403; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CABZ01055404; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CABZ01055405; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CU571260; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7955.ENSDARP00000104532; -. DR PaxDb; F1R1L9; -. DR Ensembl; ENSDART00000124824; ENSDARP00000104532; ENSDARG00000055350. DR ZFIN; ZDB-GENE-050522-551; sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000000437; Chromosome 3. DR Bgee; F1R1L9; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000437}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:F1R1L9}; KW Reference proteome {ECO:0000313|Proteomes:UP000000437}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 358 378 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 422 440 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 447 471 Helical. FT COILED 621 641 {ECO:0000256|SAM:Coils}. FT COILED 659 686 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 987 AA; 109725 MW; FA421BD3611FF346 CRC64; MDFSRLHTYT PPHCTPDNTG YTYSLSSSYS TAALEFEKEH KINPVYDSPK MSRRSLRLQT SSGLYDNSFT EVAGNHSVGS YKRTNTSTTT TTSSSSSVSR SVRGRRQQQD SSIYESQSVT GTPQSTSDLS FTSTDASLIS NLLDQSTLRQ SSTTETYSAT RRRRAVNRSL LENGNVSKTE AHANLANGYF CKDCSFHAEG NEKETSYSVP YSTSESAAYQ TTEAADATMT TMTTSLNSVD GAAHDSYCGS VNVRDVVTAD HLNLNGSLCD DCKGKQHMEM NTERKHYSYI HRVLTVLWAV VTYTGNVLHR VCQGFGSAGA FVSRKMKSVV GLAVCSPGDI CKEKQHMEMN TERKHYSYIH RMLTVLWAVV SYTGYGLLRV CRGFGSAGAF VSRKLKSILW FAVCSPGKAA TGAFWWLGTG WYQLVALMSL INVFLLTRCL PKLLKLLLFL LPFLLLFGLW YLGLPIALSF LPAVNLTEWK TSVTSFASLP ALPSFPSFPS LPALPSFTKE PLLKEQDVPP LVVAQAASDS INSERLALLE QRVSALWESV RQGELKAKQQ HEEALGLTQS LQEQIKTQTD RENLGLWVTE LLQPKFTALE GDMKTETLSR AETEEQHIQH QNILEARLAE LEVLLQNLNS RTEDIHLSQQ TPVQAPVSVG VSQEKHEALL SEVQRLEAEL GRIRGDLQGV MGCQGKCDRL DTIHETVSAQ VKEQLYALLY GRDRGEAVIP EPLLPWLASQ YTSNSDLTAT LVTLERSILG NLSLQLQESK QQQASAETVT QTVAHTAEAA GMSEEQVQLI VQRALKLYSE DRTGQVDYAL ESGGGSVLST RCSETYETKT ALMSLFGIPL WYFSQSPRVV IQPDMYPGNC WAFKGSQGYL VIRLSLRVIP NGFCLEHIPK SLSPSGNISS APRRFSVYGL DDEYQDEGKL LGDYTYQEDG DSLQNFPVME ENDKAFQIIE MRVLSNWGHP EYTCLYRFRV HGKPHAQ // ID F1RIY7_PIG Unreviewed; 908 AA. AC F1RIY7; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 11-JUL-2012, sequence version 2. DT 11-NOV-2015, entry version 28. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSSCP00000008057}; GN Name=LOC100737805 {ECO:0000313|Ensembl:ENSSSCP00000008057}; OS Sus scrofa (Pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; OC Sus. OX NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000008057, ECO:0000313|Proteomes:UP000008227}; RN [1] {ECO:0000313|Ensembl:ENSSSCP00000008057, ECO:0000313|Proteomes:UP000008227} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Porcine genome sequencing project; RL Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSSSCP00000008057} RP IDENTIFICATION. RG Ensembl; RL Submitted (MAY-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSSCP00000008057}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FP102523; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9823.ENSSSCP00000008057; -. DR PaxDb; F1RIY7; -. DR Ensembl; ENSSSCT00000008276; ENSSSCP00000008057; ENSSSCG00000007544. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F1RIY7; -. DR OMA; ASHREHE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Reactome; R-SSC-1221632; Meiotic synapsis. DR Proteomes; UP000008227; Chromosome 3. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008227}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:F1RIY7}; KW Reference proteome {ECO:0000313|Proteomes:UP000008227}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 378 399 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 411 430 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 557 598 {ECO:0000256|SAM:Coils}. FT COILED 605 625 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 908 AA; 99191 MW; 3493B5D8DBDD94C0 CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLVT TAHAAEDGPA AAAAGDRAAR VTKQRGSAGR LALTLNHASR KGLSLAAGQS STGAVQGEAC LRPPVLDESL IREQTKVDHF WGLDDDGDLR GGTRAALQGN GDAAGGAAGA AGSNGYTCSD CSLLSSRGAA LTAHSTSFGP TSRIYSRDRN QRLGGASRFA DRIWRLAERT SSFSSFLVHL FQGAFAKCSC ESENAKLKSC ESKARESQIH ESQAHCGHGG RMDVREALGD DGRLRVNGES LCDGRKGKEP RETHSAPRSP SSRPGGLAGT AGRAFAHAGR AAVQTLRRAG AAGWFVSRTV LAVLRLALLA PGKTASEVLW WLGVGWYQFV TLISWLNVFL LTRCLRNICK FLILLIPLLL LLGAGVSLWG QGDVLAFLPV LNWTHLLGAQ RADDPSSTRT SGHPPPRQPL EAGTEAFHWH RMSEVERAVT PRVRQCHHMT RASRTVALLQ KLQARVEQMD GGGEGLLSXV RHAVGQHFEE MGAPGPSSSQ ADSEASHREH ELRISSLEAV LGKLTERSEA VQRELERTAR ATARMEEEQR LLGLVQHLEQ ELGRLKADLA GWQRGRSSCE AAVDARVRET LSLVLSGDQP EAALERLLQK WSSQFVSKEH LQVLLRELEL GILKNVTHHL AVTKQTPAPE TVVSAARGAG ITGITEAQAQ VLVNKALKLY SQDRTGLVDF ALESGGGSVL STRCSETFET KTALISLFGI PLWYFSQSPR VAIQPDMYPG NCWAFRGSQG YLVVRLSMKI QPSAFTLEHI PKTLSPTGNI TSAPKDFAVY GLEDEYQEEG QLLGRFTYDQ EGESLQMFHV LKRPESAFQI VELRIFSNWG HPEYTCLYRF RVHGEPIK // ID F1S4W0_PIG Unreviewed; 433 AA. AC F1S4W0; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 11-JUL-2012, sequence version 2. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSSCP00000007778}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSSSCP00000007778}; OS Sus scrofa (Pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; OC Sus. OX NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000007778, ECO:0000313|Proteomes:UP000008227}; RN [1] {ECO:0000313|Ensembl:ENSSSCP00000007778, ECO:0000313|Proteomes:UP000008227} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Porcine genome sequencing project; RL Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSSSCP00000007778} RP IDENTIFICATION. RG Ensembl; RL Submitted (MAY-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSSCP00000007778}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CU914282; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_005672949.1; XM_005672892.2. DR STRING; 9823.ENSSSCP00000007778; -. DR PaxDb; F1S4W0; -. DR Ensembl; ENSSSCT00000007992; ENSSSCP00000007778; ENSSSCG00000007305. DR GeneID; 100523033; -. DR CTD; 6676; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F1S4W0; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008227; Chromosome 17. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008227}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008227}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 130 153 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 159 184 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 197 231 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 433 AA; 47489 MW; B39663AE3E0C22A2 CRC64; MRRSPRPGSA ASPHKHTPNF YSDNSNSSVS VTSGDSCGHR SAGPGPGEPE GRRARGSSCG EPALSAGVPG GTTWAGSSRL KPAPRSHNAQ TACGAATVRG GASVSEEQLD LLSTLDLRQE MPPPRVSKSF LSLLFQVLSV LLTLVGDVLV SVYREVCSIR FLLTAVSLLS LFLAGLWLGL LYLAPPLENE PKEMLTLREY HAQVHSQGQQ LQQLQAELVK LHKEVSNVRA ANSERVAKLV FQRLNEDFVR KPDYALSSVG ASIDLEKTSH DYEDMDTAYF WNRFSFWNYA RPPTVILEPD VFPGNCWAFE GDKGQVVIRL PGRVQLSDIT LQHPPPSVAH TRGANSAPRD FAVYGLQVDD ETEVFLGKFT FDVEKSEIQT FHLQNDPPAA FPKVKIQILS NWGHPRFTCL YRVRAHGIRT SEGAGDNARG GPY // ID D5K8A6_PIG Unreviewed; 383 AA. AC D5K8A6; F1S510; DT 15-JUN-2010, integrated into UniProtKB/TrEMBL. DT 15-JUN-2010, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Sperm associated antigen 4-like protein {ECO:0000313|EMBL:ADE28544.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSSCP00000007726}; GN Name=SPAG4L {ECO:0000313|EMBL:ADE28544.1}; GN Synonyms=SUN5 {ECO:0000313|Ensembl:ENSSSCP00000007726}; OS Sus scrofa (Pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; OC Sus. OX NCBI_TaxID=9823 {ECO:0000313|EMBL:ADE28544.1}; RN [1] {ECO:0000313|Ensembl:ENSSSCP00000007726, ECO:0000313|Proteomes:UP000008227} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Porcine genome sequencing project; RL Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:ADE28544.1} RP NUCLEOTIDE SEQUENCE. RA Liu Y.G.; RL Submitted (JAN-2010) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|Ensembl:ENSSSCP00000007726} RP IDENTIFICATION. RG Ensembl; RL Submitted (MAY-2011) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CU179704; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CU856051; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; GU475008; ADE28544.1; -; mRNA. DR RefSeq; NP_001171381.1; NM_001177910.1. DR UniGene; Ssc.47021; -. DR STRING; 9823.ENSSSCP00000007726; -. DR Ensembl; ENSSSCT00000007939; ENSSSCP00000007726; ENSSSCG00000007256. DR GeneID; 100156284; -. DR KEGG; ssc:100156284; -. DR CTD; 140732; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008227; Chromosome 17. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008227}; KW Reference proteome {ECO:0000313|Proteomes:UP000008227}. FT COILED 158 185 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 383 AA; 43833 MW; 712F2FBCD1E09683 CRC64; MPRSPRSPGD SCDPPRDVAH VPREIRPRRI IQRGRNICRM AEEPLSNTSD AVLLPIRISV PAPGLTQCML GCMSWIACLV CFLRTQAHQV LFNNCRCRLL FHKLMEKTGV LVLCAFGFWV FSMHLPSKLE VWQDDSIHSP LQSLRMYQEK VRHHTGEIQD LRGNINQLIA KLQEMEAMSD EQKMAQKIMK MIQGDYIEKP DFALKSIGAT IDFEQTSATY NHDKARSYWN WIRLWNYAQP PDVILEPNVT PGNCWAFAGD RGQVTIRLAQ KVYLSNLTLQ HIPKTISLSG SLDTAPKDFV IYGMEGSPRE EVFLGAFQFQ PENIIQMFQL QNQPPRTFGA VKVKISSNWG NPRFTCLYRV RVHGSVTPPR EQPHQTPAHP KRD // ID F1S7S5_PIG Unreviewed; 1251 AA. AC F1S7S5; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 11-JUL-2012, sequence version 2. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSSCP00000016418}; GN Name=SUCO {ECO:0000313|Ensembl:ENSSSCP00000016418}; OS Sus scrofa (Pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; OC Sus. OX NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000016418, ECO:0000313|Proteomes:UP000008227}; RN [1] {ECO:0000313|Ensembl:ENSSSCP00000016418, ECO:0000313|Proteomes:UP000008227} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Porcine genome sequencing project; RL Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSSSCP00000016418} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSSCP00000016418}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEMK01052628; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CU856594; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; FP325183; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9823.ENSSSCP00000016418; -. DR PaxDb; F1S7S5; -. DR Ensembl; ENSSSCT00000016869; ENSSSCP00000016418; ENSSSCG00000015485. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F1S7S5; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000008227; Chromosome 9. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008227}; KW Reference proteome {ECO:0000313|Proteomes:UP000008227}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 1251 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003270052. FT COILED 933 953 {ECO:0000256|SAM:Coils}. FT COILED 1189 1209 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1251 AA; 139235 MW; 800960B8A624D3B3 CRC64; VLKGTRIRTM LLLQFVVQIM GLPSWHVCCK ESSSASSYYS QDDNCALEKE DVQFQNKDER EGPKNTESSG KVDSNLPIPP EEHKLKDDYI VTLQNTESKK LSPSVIETLP TVDLHEDSSS VVVGSENTEN ISSSSTSEIT PISKLDEIEK SGTIPIAKPS ETEQSETDCD VGEALDTSAP VDQPSFVSPP ESLVGQHIEN VSSSHGKGKI TKSEFESKVA SNDQGGGDPK SALNTSDNVK NESSDYTKSG EIDPTSVTSP KDPEDIPTFD EWKKKVMEVE KEKSQSMHPS SNGGLHATKK VQKNRNNYAS VECGAKILAA NPEAKSTSAI LIENMDLYML NPCSTKIWFV IELCEPIQVK QLDIANYELF SSTPKDFLVS ISDRYPTSKW IKLGTFHGRD ERNVQSFPLD EQMYAKYVKM FIKYIKVELV SHFGSEHFCP LSLIRVFGTS MVEEYEEIAD SQYQSERQEL FDEDYDYPLD YNTGEDKSSK NLLGSATNAI LNMVNIAANI LGAKTEDLTE GNKSISENGT ATTAPKLPES APVSTPLPSH EFVTTEERVH DTEPSLPDTA KESPIVQLVQ EEEEEASPST VTLLGSGEQE DETSPWFESE TQMFCSELTT ICCISSFSEY IYKWCSVRVA HYRQRSRTSV SKEKDYLVSA QPPLVLPVES VDVSVLQLPG GELDSKSKEK EVETVDLGDL SSTHRGDLIN HTVDAIELEP SHPQTLSQSF LLDITPEIGS SSKIEVSEPI KYEAGHTPSQ VIPQESSVEV DNETEKRSES VSSIEKPTVI YETNKLNEVM DNIVKEEVNT MQIITKLSET IVPPINTATV PDNEDGEPKM NIADTPKQIL TPAVDSSSLP EIKEEEQSTE DALLKGLQRT ATDFYAELQN SSDLGYANGN LIHGSNQKES VFMRLNNRIK ALEVNMSLSG RYLEELSQRY RKQMEEMQKA FNKTIVKLQN TSRIAEEQDQ RQTEAIQLLQ AQLTNMTQLV LNLSTTVAEL KHEVSDRQSY LVISLVLCVI LGLMLCMQRC RNSSQFDGDF ISKLPKSNQY PSPKRCFSSY DDMNLKRRTS FPLIRSKSLQ LTGEEVDPND LYIVEPLKFS PEKKKKRCKY KTEKIETIKP ADPLHPIANG DIKGRKPFTN QRDFSNIGEV YHSSYKGPPS EGSSETSSQS EESYFCGISA CTSLCNGQSQ KTKTEKRALK RRRSKVQDQG KLIKTLIQTK SGSLPSLHDI IKGNKEITVG TFGVTAVSGH I // ID F1SHF6_PIG Unreviewed; 2607 AA. AC F1SHF6; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 11-JUL-2012, sequence version 2. DT 11-NOV-2015, entry version 26. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSSCP00000002152}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSSSCP00000002152}; OS Sus scrofa (Pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; OC Sus. OX NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000002152, ECO:0000313|Proteomes:UP000008227}; RN [1] {ECO:0000313|Ensembl:ENSSSCP00000002152, ECO:0000313|Proteomes:UP000008227} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Porcine genome sequencing project; RL Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSSSCP00000002152} RP IDENTIFICATION. RG Ensembl; RL Submitted (MAY-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSSCP00000002152}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CT967321; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9823.ENSSSCP00000002152; -. DR PaxDb; F1SHF6; -. DR Ensembl; ENSSSCT00000002204; ENSSSCP00000002152; ENSSSCG00000001971. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; F1SHF6; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000008227; Chromosome 7. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 1: Evidence at protein level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008227}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Proteomics identification {ECO:0000213|PeptideAtlas:F1SHF6}; KW Reference proteome {ECO:0000313|Proteomes:UP000008227}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2607 AA; 289042 MW; 659999C84A9EE235 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVGLIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIVYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSVCSV ASSSDISLGS TTERRSEIVM EHSIVGADVH EPIVVLSSAE NVPQTEVGSS SSASTSTLTA ETGSENAERK LGTDSSVRTP GESSAISMGI VSVSSPDVSS VSELTNKEAT SQRPLSSSAS NRLSVSSLLA AGAPMSSSAS VPNLSSRETS SLESFVRRVA NIARTNATNN MNLSRSSSDN NTNTLGRNVM STATSPLMGA QSFPNLTTPG TTSTVTMSTS SVTSSSNVAT ATTVLSVGQS LSNTLTTSLT STSSESDTGQ EAEYSLYDFL DSCRASTLLA ELDDDEDLPE PDEEDDENED DNQEDQEYEE VMILRRPSLQ RRAGSRSDVT HHAVTSQLPQ VPAGAGSRPI GEQEEEEYET KGGRRRTWDD DYVLKRQFSA LVPAFDPRPG RTNVQQTTDL EIPPPGTPHS ELLEEVECTP SPRLALTLKV TGLGTTREVE LPLTNFRSTI FYYVQKLLQL SCNGNVKSDK LRRIWEPTYT IMYREMKDSD KEKENGKMGC WSIEHVEQYL GTDELPKNDL ITYLQKNADA AFLRHWKLTG TNKSIRKNRN CSQLIAAYKD FCEHGTKSGL NQGAISTLQS SDILNLTKEQ PQAKAGNGQN SCGVEDVLQL LRILYIVASD PYSRISQEEG DEQPQFTFPP DEFTSKKITT KILQQIEEPL ALASGALPDW CEQLTSKCPF LIPFETRQLY FTCTAFGASR AIVWLQNRRE ATVERTRTTS SVRRDDPGEF RVGRLKHERV KVPRGESLME WAENVMQIHA DRKSVLEVEF LGEEGTGLGP TLEFYALVAA EFQRTDLGAW LCDDNFPDDE SRHVDLGGGL KPPGYYVQRS CGLFTAPFPQ DSDELERITK LFHFLGIFLA KCIQDNRLVD LPISKPFFKL MCMGDIKSNM SKLIYESRGD RDLHCTESQS EASTEEGHDS LSVGSFEEDS KSEFILDPPK PKPPAWFNGI LTWEDFELVN PHRARFLKEI KELAIKRRQI LSNKGLSEDE KNTKLQELVL KNPSGSGPPL SIEDLGLNFQ FCPSSRIYGF TAVDLKPSGE DEMVTMDNAE EYVDLMFDFC MHTGIQKQME AFRDGFNKVF PMEKLSSFSH EEVQMILCGN QSPSWAAEDI INYTEPKLGY TRDSPGFLRF VRVLCGMSSD ERKAFLQFTT GCSTLPPGGL ANLHPRLTVV RKVDATDASY PSVNTCVHYL KLPEYSSEEI MRERLLAATM EKGFHLN // ID F1SNX8_PIG Unreviewed; 734 AA. AC F1SNX8; DT 03-MAY-2011, integrated into UniProtKB/TrEMBL. DT 11-JUL-2012, sequence version 2. DT 11-NOV-2015, entry version 26. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSSCP00000000098}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSSSCP00000000098}; OS Sus scrofa (Pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; OC Sus. OX NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000000098, ECO:0000313|Proteomes:UP000008227}; RN [1] {ECO:0000313|Ensembl:ENSSSCP00000000098, ECO:0000313|Proteomes:UP000008227} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Porcine genome sequencing project; RL Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSSSCP00000000098} RP IDENTIFICATION. RG Ensembl; RL Submitted (MAY-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSSCP00000000098}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CU655907; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9823.ENSSSCP00000000098; -. DR PaxDb; F1SNX8; -. DR Ensembl; ENSSSCT00000000100; ENSSSCP00000000098; ENSSSCG00000000094. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F1SNX8; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Reactome; R-SSC-1221632; Meiotic synapsis. DR Proteomes; UP000008227; Chromosome 5. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008227}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:F1SNX8}; KW Reference proteome {ECO:0000313|Proteomes:UP000008227}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 177 195 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 201 217 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 226 247 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 286 306 {ECO:0000256|SAM:Coils}. FT COILED 419 453 {ECO:0000256|SAM:Coils}. FT COILED 493 513 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 734 AA; 82024 MW; 5DBD22BAF855C25F CRC64; MSRRSQRLTR FSQGDDDGGS SSSGGSSVMG SQSTLFKDSP LRTLKRKSSN VKRLSPVPQL GPSSDAHTYY SESVVRESYI GSPRAASLAA SLTRGSILDD QLRGDPYWSE DLRVRRRRGT GGSESSKING LAEGKASEDF LGSSSGYSSE DDYVGYSQTD QRGSGSRLRN AVSRVGSLFW MVVTSPGRLF GLLYWWIGTT WYRLTTAASL LDVFVLTRRI SSPKTFLWFL LLLLLLTGLT YGAWYFYPYG LQMFHPAVVS WWASKGSGGQ HEVWASRDSS PHFQAEQRIL SRVHSLERRL EALAAEFSSN WQKEALRLER LELRQGAGGQ GGSGGLSQED TLALLEGLVS RREAALKEDF RRDTAAQIQE ELGTLRTEHQ QDSEDLFRKI VQASQESEAR LQQLRSEWQR SRMTQESFQE NSMKELGRLE GQLAALRQEL AALTLKQSSV EDQVGLLPQQ LQAVRDDVES QFPAWVSQFL LRGGGTRTGL LQREEMQAQL QELESKILAH VAEMQGKSAR EAVASLGLTL QKEGVIGVTE EQVHRIVKQA LKRYSEDRIG MVDYALESGV SGASVISTRC SETYETKTAL LSLFGIPLWY HSQSPRVILQ PDVHPGNCWA FQGPQGFAVV RLSARIRPTA VTLEHVPKSL SPNSTISSAP KDFAIFGFQE DLQQEGTLLG QFTYDQDGEP IQTFYFQNPK MATYQVVELR ILTNWGHPEY TCIYRFRVHG EPAH // ID F2D919_HORVD Unreviewed; 445 AA. AC F2D919; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:BAJ91590.1}; OS Hordeum vulgare var. distichum (Domesticated barley). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Pooideae; Triticodae; Triticeae; Hordeinae; Hordeum. OX NCBI_TaxID=112509 {ECO:0000313|EMBL:BAJ91590.1}; RN [1] {ECO:0000313|EMBL:BAJ91590.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Leaf {ECO:0000313|EMBL:BAJ91590.1}, and RC Shoot and root {ECO:0000313|EMBL:BAJ94537.1}; RX PubMed=21415278; DOI=10.1104/pp.110.171579; RA Matsumoto T., Tanaka T., Sakai H., Amano N., Kanamori H., Kurita K., RA Kikuta A., Kamiya K., Yamamoto M., Ikawa H., Fujii N., Hori K., RA Itoh T., Sato K.; RT "Comprehensive sequence analysis of 24,783 barley full-length cDNAs RT derived from 12 clone libraries."; RL Plant Physiol. 156:20-28(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK360381; BAJ91590.1; -; mRNA. DR EMBL; AK363333; BAJ94537.1; -; mRNA. DR EMBL; AK363606; BAJ94809.1; -; mRNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 170 190 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 445 AA; 47493 MW; EDEE5CE381B290D4 CRC64; MASPSISAAA VASPVTPVAL DPSPIASRPP AATAPVRKRP VLLLDQRPHP PTPTSRTAAA AAAAAAPLSQ ARRKRGLSSS GRPRWQTALS VAAKNAALLA VLLYLGDQAW RWAHPAPLAP PDNAALAGYT ARVDDVEASL ARAFGALKVQ LEAVDRKIDG EVGAARGELA ALLEEKRLAL EGQLNLLDAR TDDLNDALGG LRRMEFLRKD QFEAFLDEFK ESLGSNSGTE VDLDQVRALA REIVMREIEK HAADGVGRVD YAVGSAGGRV VRYSEAYDAG KRGGLLSALP FGGGDNGDQS QKILQPSFGE PGQCFPLKGS SGFVEIQLRK GIIPEAVTLE HVSKDVAYDM STAPKDCRLS GWYQGTHTET PPNHAAEMYT LTEFTYDLAK NNIQTFDITA PDVGVVNMVR LDFTSNHGSS ALTCIYRIRV HGHELVTPII ASPLP // ID F2E1M6_HORVD Unreviewed; 454 AA. AC F2E1M6; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:BAK01248.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:MLOC_74926.1}; OS Hordeum vulgare var. distichum (Domesticated barley). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Pooideae; Triticodae; Triticeae; Hordeinae; Hordeum. OX NCBI_TaxID=112509; RN [1] {ECO:0000313|EMBL:BAK01248.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Shoot and root {ECO:0000313|EMBL:BAK01248.1}; RX PubMed=21415278; DOI=10.1104/pp.110.171579; RA Matsumoto T., Tanaka T., Sakai H., Amano N., Kanamori H., Kurita K., RA Kikuta A., Kamiya K., Yamamoto M., Ikawa H., Fujii N., Hori K., RA Itoh T., Sato K.; RT "Comprehensive sequence analysis of 24,783 barley full-length cDNAs RT derived from 12 clone libraries."; RL Plant Physiol. 156:20-28(2011). RN [2] {ECO:0000313|EnsemblPlants:MLOC_74926.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Morex {ECO:0000313|EnsemblPlants:MLOC_74926.1}; RA Scholz U.; RT "The genome sequence of Hordeum vulgare subsp. vulgare."; RL Submitted (JUN-2012) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EnsemblPlants:MLOC_74926.1} RP IDENTIFICATION. RC STRAIN=subsp. vulgare {ECO:0000313|EnsemblPlants:MLOC_74926.1}; RG EnsemblPlants; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK370047; BAK01248.1; -; mRNA. DR EnsemblPlants; MLOC_74926.1; MLOC_74926.1; MLOC_74926. DR OMA; MEIARHS; -. DR Proteomes; UP000011116; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Complete proteome {ECO:0000313|Proteomes:UP000011116}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000011116}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 116 137 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 454 AA; 49947 MW; 38C13DAAF209DCEF CRC64; MSASTAAIPA ANTNGNHALS VDSHSSQDVR RRTVVVAKQK PTPELLTEGG VNGVSNDKVS SKKDIGHTSR GESVIDKPKY SSEAKKDAFP PSPSAEHRKK ITTKHRKTKW ETALSVLMKL CLLFTAFAWM GQVLWGWQNG DLSFTALDME SRLSKVEAFK KTTRMLQVQL DILDKKLGDE IGKAKRDITK QFDDKGKKLE TKMKTLEGKA DILDKSLTEL RDMGFVSKKE FDEILTQLKR KKGLDGTGSD ITLDDVRIFA KEIVEMEIAR HAADGLGMVD YALGSGGGKV VKHSGAFKKA KSILPRRSES HKMLEPSFGQ PGECFALEGS VGFVEIRLRT GIVPEAVTLE HVDKSVAFDR SSAPKDFQVS GWYQGADDDS DKQPRMPTSL GEFTYDLEKS NAQTFQLDRT TADASVVNMV RLDFSSNHGQ PELTCIYRFR VHGSEPGSLG TAAS // ID F2E9E6_HORVD Unreviewed; 571 AA. AC F2E9E6; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:BAK03968.1}; OS Hordeum vulgare var. distichum (Domesticated barley). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Pooideae; Triticodae; Triticeae; Hordeinae; Hordeum. OX NCBI_TaxID=112509 {ECO:0000313|EMBL:BAK03968.1}; RN [1] {ECO:0000313|EMBL:BAK03968.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Flower {ECO:0000313|EMBL:BAK03968.1}; RX PubMed=21415278; DOI=10.1104/pp.110.171579; RA Matsumoto T., Tanaka T., Sakai H., Amano N., Kanamori H., Kurita K., RA Kikuta A., Kamiya K., Yamamoto M., Ikawa H., Fujii N., Hori K., RA Itoh T., Sato K.; RT "Comprehensive sequence analysis of 24,783 barley full-length cDNAs RT derived from 12 clone libraries."; RL Plant Physiol. 156:20-28(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK372771; BAK03968.1; -; mRNA. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 514 532 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 552 570 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 571 AA; 62287 MW; 7CC81FF562A813D8 CRC64; MSKKKREGGG GGKGGGGGGG GGPTAVTDVL SMDGGLREVS VSVVFSVWCP LFLLRSQFLN SQTDDPSDLY GERKRDDNYC KVMPLEAYIF PADNASSPTC QSSSSPHRHR PEAEAEAEAE AVAPSNASSG NPPTEAALVE LDEFRSRILQ GKADNDSSRH HQRVADGATP THRLEPSGAE YNYAAASKGA KALAHNKEAK GAANILDGDK DRYLRNPCSA DDKFVVVQLS EETLVHTVAL ANLEHYSSNF KDVELYGSLS YPGEAWELLG RFTAENGKHA QRFVLAEPRW TRYLRLRLVS HYGSGFYCIL SYLEVYGIDA VERMLQDFIA SHSPDADAAK AAADARKDSG HNDTQLHAKH KQVEGSGRND SAGDVAKNNG SKVPAQGKEA VKQATGRVHG DVVLKILMQK LRSLELGLST LEEYTKVLNQ RYGGKLPDLH NGLTQTGKAL EKMKADVEDL VEWKDKVARD LGELRGWKSS VSGKLEHLVR ENAAMRWDVE EMRSIQQTLQ NKELAVLSIS LFLACLALFK LACDRLLLLF ASKEEEDRAG SGWLLVLTAS SITTLIVLLY S // ID F2EAF0_HORVD Unreviewed; 473 AA. AC F2EAF0; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:BAK04322.1}; DE Flags: Fragment; OS Hordeum vulgare var. distichum (Domesticated barley). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Pooideae; Triticodae; Triticeae; Hordeinae; Hordeum. OX NCBI_TaxID=112509 {ECO:0000313|EMBL:BAK04322.1}; RN [1] {ECO:0000313|EMBL:BAK04322.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Seed {ECO:0000313|EMBL:BAK04322.1}; RX PubMed=21415278; DOI=10.1104/pp.110.171579; RA Matsumoto T., Tanaka T., Sakai H., Amano N., Kanamori H., Kurita K., RA Kikuta A., Kamiya K., Yamamoto M., Ikawa H., Fujii N., Hori K., RA Itoh T., Sato K.; RT "Comprehensive sequence analysis of 24,783 barley full-length cDNAs RT derived from 12 clone libraries."; RL Plant Physiol. 156:20-28(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK373125; BAK04322.1; -; mRNA. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 416 434 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 454 472 Helical. {ECO:0000256|SAM:Phobius}. FT NON_TER 1 1 {ECO:0000313|EMBL:BAK04322.1}. SQ SEQUENCE 473 AA; 52013 MW; A187EC302AC1FA1E CRC64; TCQSSSSPHR HRPEAEAEAE AEAVAPSNAS SGNPPTEAAL VELDEFRSRI LQGKADNDSS RHHQRVADGA TPTHRLEPSG AEYNYAAASK GAKALAHNKE AKGAANILDG DKDRYLRNPC SADDKFVVVQ LSEETLVHTV ALANLEHYSS NFKDVELYGS LSYPGEAWEL LGRFTAENGK HAQRFVLAEP RWTRYLRLRL VSHYGSGFYC ILSYLEVYGI DAVERMLQDF IASHSPDADA AKAAADARKD SGHNDTQLHA KHKQVEGSGR NDSAGDVAKN NGSKVPAQGK EAVKQATGRV HGDVVLKILM QKLRSLELGL STLEEYTKVL NQRYGGKLPD LHNGLTQTGK ALEKMKADVE DLVEWKDKVA RDLGELRGWK SSVSGKLEHL VRENAAMRWN VEEMRSIQQT LQNKELAVLS ISLFLACLAL FKLACDRLLL LFASKEEEDR AGSGWLLVLT ASSITTLIVL LYS // ID F2PQ06_TRIEC Unreviewed; 911 AA. AC F2PQ06; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EGE03974.1}; GN ORFNames=TEQG_03007 {ECO:0000313|EMBL:EGE03974.1}; OS Trichophyton equinum (strain ATCC MYA-4606 / CBS 127.97) (Horse OS ringworm fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Trichophyton. OX NCBI_TaxID=559882 {ECO:0000313|Proteomes:UP000009169}; RN [1] {ECO:0000313|Proteomes:UP000009169} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4606 / CBS 127.97 {ECO:0000313|Proteomes:UP000009169}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS995730; EGE03974.1; -; Genomic_DNA. DR EnsemblFungi; EGE03974; EGE03974; TEQG_03007. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000009169; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009169}; KW Reference proteome {ECO:0000313|Proteomes:UP000009169}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 38 {ECO:0000256|SAM:SignalP}. FT CHAIN 39 911 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003283715. FT COILED 435 471 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 911 AA; 101061 MW; B113D1C43AFC8CA1 CRC64; MDRHLSPHGR KRRLRTDRIT SVFLAFLAVC SAPGPAAAES ADSSRLMAVD KSNTICQAHV SNDLGAEYIR YPICLETRWN AASATLGGER PATTRSASGI YADSKDSSGS VTVPVPDTAS SDSSKSKESG SSGSGDDADV ESPLDNSNFL SFEEWKNQNL AKAGQSAETM RRHRQDKGQK ARRRHTRSPQ MNDPLDGLGE ESEIDLEFGG FSTDESGVAS WERKDDGMAS PDNIDSIAAP GGAVVGGKED KHPSQPIFEL DGQDAENMPR KGIGRRKHAG TTCKERFNYA SFDCAATVLK TNPQCTGSSA VLNENKDSYM LNECRAKDKF LIMELCDDIL VDTVVLANYE FFSSIFRSFR VSVSDRYPIK ADKWRVLGTY EAANARQVQA FAVENPLIWA RYLKIDFLSH YGNEFYCPVS LVRVHGTTMM EEYKNDGEAA RADEEEDANA QEEAEQQRQQ EEQQQQEQKQ VDVVVHPDVS IPEVVINDQM VPLSNLSDHE LDELRCFVER NETESILLGL VSSKMCAIQE RAAHIASQPV TATRVKDEAA APASGSITST NTPEQIRSVS STRTPTASDR EETRRTSTGS SIAANGSHTE PTRMNSATYS PSPASPPPNP STQESFFKSV NKRLQMLESN STLSLLYIEE QSRILRDAFN KVEKRQLAKT STFLENLNST VLQELKEFRQ QYDHLWHSVF IEFEQQRQQY HREVYSVATQ LGVLADELVF QKRVAVIQSI FVLVCFGLVL FSRSSGTPYF EFPRNIVTRT RSFRSSSVTY DSPAPSASPS PPPMSRMGSS ILSRSEADDD HLHHNHSRHH RSPSEQTDYE VGNPTFTYSP PTPTSRTTTP ERTRKLRFSP DPQSGLAVSA TGSPATMSDP ELSLRKRPIK SVEVKHESES DAEQAEGDSF T // ID F2Q0I6_TRIEC Unreviewed; 577 AA. AC F2Q0I6; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGE07654.1}; GN ORFNames=TEQG_06638 {ECO:0000313|EMBL:EGE07654.1}; OS Trichophyton equinum (strain ATCC MYA-4606 / CBS 127.97) (Horse OS ringworm fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Trichophyton. OX NCBI_TaxID=559882 {ECO:0000313|Proteomes:UP000009169}; RN [1] {ECO:0000313|Proteomes:UP000009169} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4606 / CBS 127.97 {ECO:0000313|Proteomes:UP000009169}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS995763; EGE07654.1; -; Genomic_DNA. DR EnsemblFungi; EGE07654; EGE07654; TEQG_06638. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000009169; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009169}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009169}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 335 356 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 577 AA; 63000 MW; D801379CFAC9C430 CRC64; MAPPRRTRRL TPSASAGASN EADNPYLPSI ETQQTFSYGG SATPALPRPL GSLPAANTAA DVAASIEAAI TRPARPAHPA RPAARPLTES AGFHQIEDEA RKSPEKQRVT RGQQRRAESM TPPREPVRRM TPDIQLMGSL REASGEPEEQ HDQQQQQQQQ QQQQPDPVDL LADAIDGSSI SWDTERHLLA NERPAFGLSG WPRATSLRPQ MSPSQASSTS IQQNTQHQQH PQHQQHPQQH PQQHPQQHPQ KYTYQQLRGQ PQRSRATAER IERGIAIGPP VGLTTITTSN KNTPETPGID TPQSEHTPAS SRPPSALDHA AAPTLLPTLP SGATFGFMHV VCILLSIMMA LNGYLLRDEI ASAVRTIYSP GHHGTSSSSS NNCTESISQM MATVDQRLTS MTKDISLLQQ RVSNASPPPP PPAPRVNPLE PRRPNFFSLG FGATVDPYLS SPTLSSTTSF LHRLRRFAGG IRPGPSHVSA LQPWDDIGDC CSRVKSALAA AWPGEAESDP SLGPSFVRLG RWQYDIHGAH HIQRFAIAAA SAVPDLPSTS KVAVVARSNW GRREYTCLYR LRLHGRY // ID F2QPN0_PICP7 Unreviewed; 660 AA. AC F2QPN0; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 21. DE SubName: Full=Putative secreted protein {ECO:0000313|EMBL:CCA37358.1}; GN OrderedLocusNames=PP7435_Chr1-1234 {ECO:0000313|EMBL:CCA37358.1}; OS Komagataella pastoris (strain ATCC 76273 / CBS 7435 / CECT 11047 / OS NRRL Y-11430 / Wegner 21-1) (Yeast) (Pichia pastoris). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Phaffomycetaceae; Komagataella. OX NCBI_TaxID=981350 {ECO:0000313|EMBL:CCA37358.1, ECO:0000313|Proteomes:UP000006853}; RN [1] {ECO:0000313|EMBL:CCA37358.1, ECO:0000313|Proteomes:UP000006853} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 76273 / CBS 7435 / CECT 11047 / NRRL Y-11430 / Wegner 21-1 RC {ECO:0000313|Proteomes:UP000006853}; RX PubMed=21575661; DOI=10.1016/j.jbiotec.2011.04.014; RA Kueberl A., Schneider J., Thallinger G.G., Anderl I., Wibberg D., RA Hajek T., Jaenicke S., Brinkrolf K., Goesmann A., Szczepanowski R., RA Puehler A., Schwab H., Glieder A., Pichler H.; RT "High-quality genome sequence of Pichia pastoris CBS7435."; RL J. Biotechnol. 154:312-320(2011). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=CBS 7435; RA Kueberl A., Schneider J., Thallinger G.G., Anderl I., Wibberg D., RA Hajek T., Jaenicke S., Brinkrolf K., Goesmann A., Szczepanowski R., RA Puehler A., Schwab H., Glieder A., Pichler H.; RT "High-quality genome sequence of Pichia pastoris CBS 7435."; RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR839628; CCA37358.1; -; Genomic_DNA. DR EnsemblFungi; CCA37358; CCA37358; PP7435_Chr1-1234. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000006853; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006853}; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 16 {ECO:0000256|SAM:SignalP}. FT CHAIN 17 660 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003284610. FT TRANSMEM 494 511 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 660 AA; 75286 MW; EE620F8E2F98E729 CRC64; MLVAWFLLLL VSSCICNDDK LNDSNPLKEA EQEEDSRFMS FEEWKKKKID NTRNDKAIQD DQKSHNVYRE PERSYRNNEL NGVLGEDMEI DLEMFTGVDR DDEIGKVYQQ RFNYASFDCA ATIVKTNTEA KGASSILNEN KDSYLLNKCD VANQYAVVEL CQDILVDTVV LANYEFFSSG FKTVRFSVSD RFPVPANGWK VLGDFDASNT RSIQTFNIES PLIWARYLKI EILSHYGNEY YCPLSLVRVH GKTMMEKFKL EEVEEEEKAN TENGQNTNIA TQSKAIAANQ TNVQNKFISP NGKNITVLCK NSSKTNCDPE SAQVKPLKDE DENEEDCPVS FKHFSLDEFL TEHSKEICLE KDEQSSDHIS IEPPLSSSEP KTQESIYKNI IKRISLLESN ATLSLLYIEE QSRLLSNAFS KLEERQSLRF EAMLDSVNSS IQNQIQLIDD LKLYFKVEFE TLLAGSKTKH ERALVENIEL ISSISDDLVF QKKLIFFTIF AGLCLFAFVI FNRETYIEST YEDDHLEYIK DRNRYRDRSG DESSTLGSPE PFQPISRSST PDSSSPLTPL PGPQLAPIPI HLTNRKKATS FERKNKNHTD LIYKRTATNS YEVGSKTERP TESEPEANLE PNTGEYTDNE ADDVFSNPSK HEPSEATSSI // ID F2RML6_TRIT1 Unreviewed; 911 AA. AC F2RML6; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EGD92565.1}; GN ORFNames=TESG_00138 {ECO:0000313|EMBL:EGD92565.1}; OS Trichophyton tonsurans (strain CBS 112818) (Scalp ringworm fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Trichophyton. OX NCBI_TaxID=647933 {ECO:0000313|Proteomes:UP000009172}; RN [1] {ECO:0000313|Proteomes:UP000009172} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 112818 {ECO:0000313|Proteomes:UP000009172}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG698477; EGD92565.1; -; Genomic_DNA. DR EnsemblFungi; EGD92565; EGD92565; TESG_00138. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000009172; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009172}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 38 {ECO:0000256|SAM:SignalP}. FT CHAIN 39 911 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003286520. FT COILED 435 471 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 911 AA; 101061 MW; B113D1C43AFC8CA1 CRC64; MDRHLSPHGR KRRLRTDRIT SVFLAFLAVC SAPGPAAAES ADSSRLMAVD KSNTICQAHV SNDLGAEYIR YPICLETRWN AASATLGGER PATTRSASGI YADSKDSSGS VTVPVPDTAS SDSSKSKESG SSGSGDDADV ESPLDNSNFL SFEEWKNQNL AKAGQSAETM RRHRQDKGQK ARRRHTRSPQ MNDPLDGLGE ESEIDLEFGG FSTDESGVAS WERKDDGMAS PDNIDSIAAP GGAVVGGKED KHPSQPIFEL DGQDAENMPR KGIGRRKHAG TTCKERFNYA SFDCAATVLK TNPQCTGSSA VLNENKDSYM LNECRAKDKF LIMELCDDIL VDTVVLANYE FFSSIFRSFR VSVSDRYPIK ADKWRVLGTY EAANARQVQA FAVENPLIWA RYLKIDFLSH YGNEFYCPVS LVRVHGTTMM EEYKNDGEAA RADEEEDANA QEEAEQQRQQ EEQQQQEQKQ VDVVVHPDVS IPEVVINDQM VPLSNLSDHE LDELRCFVER NETESILLGL VSSKMCAIQE RAAHIASQPV TATRVKDEAA APASGSITST NTPEQIRSVS STRTPTASDR EETRRTSTGS SIAANGSHTE PTRMNSATYS PSPASPPPNP STQESFFKSV NKRLQMLESN STLSLLYIEE QSRILRDAFN KVEKRQLAKT STFLENLNST VLQELKEFRQ QYDHLWHSVF IEFEQQRQQY HREVYSVATQ LGVLADELVF QKRVAVIQSI FVLVCFGLVL FSRSSGTPYF EFPRNIVTRT RSFRSSSVTY DSPAPSASPS PPPMSRMGSS ILSRSEADDD HLHHNHSRHH RSPSEQTDYE VGNPTFTYSP PTPTSRTTTP ERTRKLRFSP DPQSGLAVSA TGSPATMSDP ELSLRKRPIK SVEVKHESES DAEQAEGDSF T // ID F2RW79_TRIT1 Unreviewed; 638 AA. AC F2RW79; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGD95701.1}; GN ORFNames=TESG_03171 {ECO:0000313|EMBL:EGD95701.1}; OS Trichophyton tonsurans (strain CBS 112818) (Scalp ringworm fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Trichophyton. OX NCBI_TaxID=647933 {ECO:0000313|Proteomes:UP000009172}; RN [1] {ECO:0000313|Proteomes:UP000009172} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 112818 {ECO:0000313|Proteomes:UP000009172}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG698489; EGD95701.1; -; Genomic_DNA. DR EnsemblFungi; EGD95701; EGD95701; TESG_03171. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000009172; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 3. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009172}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 325 346 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 638 AA; 69553 MW; 79B8955BADA85BE7 CRC64; MAPPRRTRRL TPSASAGASN EADNPYLPSI ETQQTFSYGG SATPALPRPL GSLPAANTAA DVAASIEAAI TRPARPAARP LTESAGFHQI EDEARKSPEK QRVTRGQQRR AESMTPPREP VRRMTPDIQL MGSLREASGE PEEQHDQQQQ QQQQQQQQPD PVDLLADAID GSSISWDTER HLLANERPAF GLSGWPRATS LRPQMSPSQA SSTSIQQNTQ HQQHPQHQQH PQQHPQQHPQ KYTYQQLRGQ PQRSRATAER IERGIAIGPP VGLTTITTSN KNTPETPGID TPQSEHTPAS SRPPSALDHA AAPTLLPTLP SGATFGFMHV VCILLSIMMA LNGYLLRGEI ASAVRTIYSP GHHGTSSSSS NNCTESISQM MATVDQRLTS MTKDISLLQQ EVSNASPPPP PAPRVNPLEP RRPNFFSLGF GATVDPYLSS PTLSSTTSFL HRLRRFAGGI RPGPSHVSAL QPWDDIGDCW CAGTTTTPTT TTTTPTSTKN ENKIQLSVEL GRPIVPEEVI VEHMPREATL DNGAAAPRLM ELWGEYTDSN DSSRVKSALA AAWPGEAESD PSLGPSFVRL GRWQYDIHGA HHIQRFAIAA ASAVPDLPST SKVAVVARSN WGRREYTCLY RLRLHGRY // ID F2SCL9_TRIRC Unreviewed; 304 AA. AC F2SCL9; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 29-OCT-2014, sequence version 2. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGD85174.2}; GN ORFNames=TERG_01451 {ECO:0000313|EMBL:EGD85174.2}; OS Trichophyton rubrum (strain ATCC MYA-4607 / CBS 118892) (Athlete's OS foot fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Trichophyton. OX NCBI_TaxID=559305 {ECO:0000313|EMBL:EGD85174.2, ECO:0000313|Proteomes:UP000008864}; RN [1] {ECO:0000313|Proteomes:UP000008864} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4607 / CBS 118892 {ECO:0000313|Proteomes:UP000008864}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG700648; EGD85174.2; -; Genomic_DNA. DR EnsemblFungi; EGD85174; EGD85174; TERG_01451. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR InParanoid; F2SCL9; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008864; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008864}; KW Reference proteome {ECO:0000313|Proteomes:UP000008864}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 16 {ECO:0000256|SAM:SignalP}. FT CHAIN 17 304 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003285922. FT COILED 51 71 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 304 AA; 33407 MW; 022EA46063A44F2B CRC64; MHVVCILLSI MMALNGYLLR DEIASAARSI LYSPSGHYGM PNCTESIGQM LATVDQRLTS MTKEISFLKQ EVNKATPPPA LPKANPLEPR RPNFFSLGFG ATVDPYLSSP TLSSTRSFLD RLRRFAGGIR PGPSPVSALQ PWDEIGDCWC ASTSTSPASA SKKEDKIQLS VELGRPIVPE EVIVEHMPRE ATLDNGAAAP HLMELWGEYT DANTVDEVRS SLAAVWPGEA ESGYAHEPSL GPSFVRLGRW QYDIHAAHHI QRFSVAAASV HDLPATSKVV VVARSNWGQR EYTCLYRLRL HGRL // ID F2SPP3_TRIRC Unreviewed; 918 AA. AC F2SPP3; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGD87795.1}; GN ORFNames=TERG_04041 {ECO:0000313|EMBL:EGD87795.1}; OS Trichophyton rubrum (strain ATCC MYA-4607 / CBS 118892) (Athlete's OS foot fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Trichophyton. OX NCBI_TaxID=559305 {ECO:0000313|EMBL:EGD87795.1, ECO:0000313|Proteomes:UP000008864}; RN [1] {ECO:0000313|Proteomes:UP000008864} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4607 / CBS 118892 {ECO:0000313|Proteomes:UP000008864}; RX PubMed=22951933; DOI=10.1128/mBio.00259-12; RA Martinez D.A., Oliver B.G., Graeser Y., Goldberg J.M., Li W., RA Martinez-Rossi N.M., Monod M., Shelest E., Barton R.C., Birch E., RA Brakhage A.A., Chen Z., Gurr S.J., Heiman D., Heitman J., Kosti I., RA Rossi A., Saif S., Samalova M., Saunders C.W., Shea T., RA Summerbell R.C., Xu J., Young S., Zeng Q., Birren B.W., Cuomo C.A., RA White T.C.; RT "Comparative genome analysis of Trichophyton rubrum and related RT dermatophytes reveals candidate genes involved in infection."; RL MBio 3:E259-E259(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG700651; EGD87795.1; -; Genomic_DNA. DR RefSeq; XP_003234990.1; XM_003234942.1. DR STRING; 559305.XP_003234990.1; -. DR EnsemblFungi; EGD87795; EGD87795; TERG_04041. DR GeneID; 10376294; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; F2SPP3; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008864; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008864}; KW Reference proteome {ECO:0000313|Proteomes:UP000008864}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 37 {ECO:0000256|SAM:SignalP}. FT CHAIN 38 918 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003287393. FT COILED 441 479 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 918 AA; 101332 MW; EF8003FB23CF8D7F CRC64; MDGHLSPHGL KHLRTDRITS AFLTFLAVCS APGPAAAESV DSSRLMAVDK SNTICQAHVS NDLGAEYIRY PICLETRWNA AASAPAAASA TTGGEGPSTT RSASGIYAES KDTSGSVTVP VPDTASSDSS ISKESGSSGN GDDADVESPL DNSNFLSFEE WKNQNLAKAG QSAETMRRHR QDKGQQARRR HTRSPQMNDP LDGLGEESEI DLEFGGFSTD ESGVASWERK DGGKASPDNI DSVTAPGGAV VGGKEDKHPS QPIFELDGQD AENVPRKGIG RRKHAGTTCK ERFNYASFDC AATVLKTNPQ CTGSSAVLNE NKDSYMLNEC RAKDKFLIME LCDDILVDTV VLANYEFFSS IFRSFRVSVS DRYPIKADKW RVLGTYEAAN ARQVQAFAVE NPLIWARYLK IDFLSHYGNE FYCPVSLVRV HGTTMMEEYK NDGEAARADE EEDANAQEEA EQQRQQEEQQ QQLEQKEADV VVHPDVSIPE VVINDQMVPL SNLSDRELDE LRCFVERNET ESILLGLVSS KMCAIQERAA HIASQPVTTT RVKDETAAPA SGSITSTNTP EQIRSVSSTR TPTASDREET RRSSTGSSIA ANGSHTEPTR MNSATYSPSP ASPPPNPSTQ ESFFKSVNKR LQMLESNSTL SLLYIEEQSR ILRDAFNKVE KRQLAKTSTF LENLNSTVLQ ELKEFRQQYD HLWHSVFIEF EQQRQQYHRE VYSVASQLGV LADELVFQKR VAVIQSIFVL VCFGLVLFSR SSGTPYLEFP RNIVTRTRSF RSSSVTYGSP APSASPSPPP MSRMGSSILS RSEADDDHLH HNHSRHHRSL SEQTDYEVGN PTFTYSPPTP TSRTTTPERT RKLRFSLEPR SGLAASATGS PATMSDPELS LRKRPIKSVE VKHEFESDAE QAEGDSFT // ID F2T834_AJEDA Unreviewed; 717 AA. AC F2T834; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGE79397.1}; GN ORFNames=BDDG_02336 {ECO:0000313|EMBL:EGE79397.1}; OS Ajellomyces dermatitidis (strain ATCC 18188 / CBS 674.68) (Blastomyces OS dermatitidis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Onygenales; Ajellomycetaceae; Blastomyces. OX NCBI_TaxID=653446 {ECO:0000313|EMBL:EGE79397.1, ECO:0000313|Proteomes:UP000007802}; RN [1] {ECO:0000313|Proteomes:UP000007802} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 18188 / CBS 674.68 {ECO:0000313|Proteomes:UP000007802}; RA Cuomo C., Klein B., Sullivan T., Heitman J., Young S., Zeng Q., RA Gargeya S., Alvarado L., Berlin A.M., Chapman S.B., Chen Z., RA Freedman E., Gellesch M., Goldberg J., Griggs A., Gujja S., RA Heilman E., Heiman D., Howarth C., Mehta T., Neiman D., Pearson M., RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., RA White J., Yandava C., Haas B., Nusbaum C., Birren B.; RT "Annotation of Blastomyces dermatitidis strain ATCC 18188."; RL Submitted (MAR-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:EGE79397.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=ATCC 18188 {ECO:0000313|EMBL:EGE79397.1}; RG The Broad Institute Genome Sequencing Platform; RG Broad Institute Genome Sequencing Center for Infectious Disease.; RA Cuomo C., Klein B., Sullivan T., Heitman J., Young S., Zeng Q., RA Gargeya S., Alvarado L., Berlin A.M., Chapman S.B., Chen Z., RA Freedman E., Gellesch M., Goldberg J., Griggs A., Gujja S., RA Heilman E., Heiman D., Howarth C., Mehta T., Neiman D., Pearson M., RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., RA White J., Yandava C., Haas B., Nusbaum C., Birren B.; RT "Annotation of Blastomyces dermatitidis strain ATCC 18188."; RL Submitted (MAR-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GG749414; EGE79397.1; -; Genomic_DNA. DR EMBL; GG749414; KMW66994.1; -; Genomic_DNA. DR EnsemblFungi; EGE79397; EGE79397; BDDG_02336. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000007802; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007802}. FT COILED 163 184 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 717 AA; 77866 MW; F77FF43B457E9E9B CRC64; MTGRRATSLR SGSRAQSTRP TRAAATANAA TATQADQSNP DLGNPSLPDV RTQQSFAYGS SKTPALPRQL EVDPSMGLSE MVDTLDDGLR QAQDRELARV EDPRNPSPER RQTRSMSLSM RSSMSPAPEP ASRRTPSRRT TATRGRAGSR RAASRQPTPE GQLLESLREV SEETENVKQE EEEAYTSTLP DTPSFNDSAS ISWTTERAIH GTLPREVNTG TRPNYYLRDP YGSRPSSSQG PSGLSFPPTR RPIFEESFPA NPHLSGPVDA SRAAAPTAVR RTLPPVPAFN QLRDEPRSKS TTSSTSSASN HTPSSSTHSS PVFVAATPAA ANVTSSQKRL AGIAKTPSAI LVIIGLFLTT FLAYFCRNHA CAFPHSLQTT MSHYLCRPTS SFTMDNSTSM YADAFHKLSS HVDQRLLDMA KDVATLKNEW NRRLPHLKEA LSRSPAAATD PLAPPKVNYA SVGMGAVVDP YLTSPTMSTS AGLVSRIGQY LAKVPRGSPP VAALQPWDGV GECWCAATRS NVSQLTILLG RPIVPEEVVV EHIPKGATLD PGSAPREMEL WAQYTARQPA AAAAAYPPGS SSSNPPSSPS SAPGRPPPPP YTPPYLRSPP IAHLHPSHSL HYLLPSRLRD AILTTLRQVY PDEPTTAYSE DALLGPSFFR VGRWQYNIHG DHHIQRFELD AVIDMPAARV EKVVFRVKSN WGAAHTCLYR VRLHGHL // ID F2UHC0_SALR5 Unreviewed; 1498 AA. AC F2UHC0; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGD76519.1}; GN ORFNames=PTSG_07636 {ECO:0000313|EMBL:EGD76519.1}; OS Salpingoeca rosetta (strain ATCC 50818 / BSB-021). OC Eukaryota; Choanoflagellida; Salpingoecidae; Salpingoeca. OX NCBI_TaxID=946362 {ECO:0000313|Proteomes:UP000007799}; RN [1] {ECO:0000313|Proteomes:UP000007799} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799}; RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N., RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., RA Gargeya S., Alvarado L., Berlin A., Chapman S.B., Chen Z., RA Freedman E., Gellesch M., Goldberg J., Griggs A., Gujja S., RA Heilman E., Heiman D., Howarth C., Mehta T., Neiman D., Pearson M., RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., RA White J., Yandava C., Haas B., Nusbaum C., Birren B.; RT "Annotation of Salpingoeca rosetta."; RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL832974; EGD76519.1; -; Genomic_DNA. DR RefSeq; XP_004991433.1; XM_004991376.1. DR EnsemblProtists; EGD76519; EGD76519; PTSG_07636. DR GeneID; 16071993; -. DR InParanoid; F2UHC0; -. DR Proteomes; UP000007799; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007799}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007799}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 1367 1394 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 1315 1335 FT COILED 1496 1498 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1498 AA; 155425 MW; A8BE6E314754B258 CRC64; MEPSSLMRET VDGDLASTVD DTLTFDEWRD FVRGHDYDVT HVPPSGDHLR ANYASDDCGA QMVAASPDIK KASAVLSSNR DSYLMVPCKA DTWFIVELCD HVLLQQITIA NFELFSSIAK TVEVSASEHY PTEEWEHIGT LHLKDVRSIQ KFQVESQQFA KFVRFHVTEH YGSEFYCPLS VVTCRGVTMI QEFNFLRAQG EPNGLDENNA DGASSQQTNA AHTSSLARTV MQGARLVGQL LWASDDDRMR GSSPPAMVVE VDSAYDDDEE EEEEKGEVQQ PQQQPQQQQH HQQHHQQHHQ QHQQEEEAHN AGGASSTASD SRSMEDVGAS TPMNTDMPAD VKPTSDTHID THSAVDGRDG NASERQQEQQ RQQQGQQHDQ PDHMRAAREQ EETAGRGDGA DGRGMHVDGT DHTQGSSTQE EGCLPTSALN DTKPSSLEAA SHEATQAENE STATPPSSSS SSSSSTSAGA TDDARDPQQQ QQQQQRRGVG FIVSVDGENE DDEGDMHKHG NHAAADDDDD DGATGAGDSA EKHNQAVPTT TPDHEDGTTS APNQSPSPSP APPPNTAGDT RDARDSSLIR GGAETRAGSD GGGGDGEGGN AEQVAPSSGR VADAADGGHS SAGEGDHVPP PSTHAADTVP AAGQGEDDED TVDANPDDEF AIRARDVVAA SARIADDSTD GGRGSAAVAA GTKAQAGGDS NGGPDSAENA GTPARSPAPR SSSSSSSTVS SKDEAPAQRS REPAELDGRD EDAVDDALAA APHSQDTTTD PASGDAAAIE GDGDGGGGGD HSGAGEGRAS GGLSKVAARL VSLSQGKPHA SAQAEGGDEN TPQHEQHQQH KEGAGAPSSP KAPSPEQEHV GSNSVEATAD ASPSTASSRP SPPVQSSSLS DASGSVVRGD TPTRAVPDTS TPSEEDASSP DADTNPDGIH PHREQQQQPP QVSAPVSADG DDAVGALSSS RQQPSDAVAQ DTAGRASSGD AAAPHASRPP QHTRSSPQRT ATATAALTSS SSPLRPTRTL PCNACAAVRW GCLPKAGRAG RPQHTTTTTA TPTTTTTDAA AHDGASDGEQ RNGSGDTRAS ASTNDDESSS DGTHESAGRS AASPSPQGRT GARGSEQLWW WWWQCPQLNM TNTVNMISVG MARSSAQCRT DRCTMAGGLP TVPVQLHRRH KAPDAATTTT STTTNNGTSN SSRDDQRSGR SSRSSDEQPV AVGGSVHASA ATSADNATAH ANATHTTDQT HRNATSPAPN VHKTANTADT TTTAAPTPAS GGGAGAPTAG VGSFTSRSKT ASFMTHLRAR IAELERDTSL TDTYLNQLSQ RTRRLEQDLT SAVRTLRDDA SSTRHDVMSL HADVLLLRAD VHRLESLVVV ATALALCALL LVCTVAVGGA VVYLRVARPS HASSAAPTAV RGGGGGATTV STAAYGDGHC NNGGQSDANL NGGGDRHRAL GRGGAVGVAD VGVAATTPPS LRKRKVHSNG STGLRIHTPP RPRHAKRT // ID F2UMM9_SALR5 Unreviewed; 2036 AA. AC F2UMM9; DT 31-MAY-2011, integrated into UniProtKB/TrEMBL. DT 31-MAY-2011, sequence version 1. DT 14-OCT-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGD78378.1}; GN ORFNames=PTSG_12883 {ECO:0000313|EMBL:EGD78378.1}; OS Salpingoeca rosetta (strain ATCC 50818 / BSB-021). OC Eukaryota; Choanoflagellida; Salpingoecidae; Salpingoeca. OX NCBI_TaxID=946362 {ECO:0000313|EMBL:EGD78378.1, ECO:0000313|Proteomes:UP000007799}; RN [1] {ECO:0000313|Proteomes:UP000007799} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799}; RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N., RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., RA Gargeya S., Alvarado L., Berlin A., Chapman S.B., Chen Z., RA Freedman E., Gellesch M., Goldberg J., Griggs A., Gujja S., RA Heilman E., Heiman D., Howarth C., Mehta T., Neiman D., Pearson M., RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., RA White J., Yandava C., Haas B., Nusbaum C., Birren B.; RT "Annotation of Salpingoeca rosetta."; RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL832982; EGD78378.1; -; Genomic_DNA. DR RefSeq; XP_004989701.1; XM_004989644.1. DR EnsemblProtists; EGD78378; EGD78378; PTSG_12883. DR GeneID; 16070252; -. DR InParanoid; F2UMM9; -. DR Proteomes; UP000007799; Unassembled WGS sequence. DR GO; GO:0005634; C:nucleus; IEA:InterPro. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR GO; GO:0006355; P:regulation of transcription, DNA-templated; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR004092; Mbt. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF02820; MBT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SMART; SM00561; MBT; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51079; MBT; 2. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007799}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007799}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1169 1189 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2036 AA; 227098 MW; 243DFCA14B1467FE CRC64; MDVDPDALLD FLSVPGDAQM TALEQLCMLL LLSDNVDRVF RQFPPSSFVP ALCKILVSDT ATPTVLEATL RALSFFLDIS MDCVGRITSC DGAISAICRH LEAADLTDAI TKDMAMQSTK VLEMISNRDT AALYRAGALR SCLHFVVEGQ SLVFLDALKS AMNVVVRCCS RIDEKDDYLP QVIEHLNTLL DSEQPDLRNT ALQCYGTLTD KLSRQKGDLG ALATEGTVRS LLRLICSTAS TASDAELEIA RIALNTLNAL WYGADTSRHD EDGRTALELA QAGEHQACAD ILSSPEQHME EAEDEEDEGE EEDDGGDGDA DAAESDASSP TRAAVMGTHS IMGDGTPDIK DEAARTSDAP NPLLDLSMMC AEQVVPPLAT ALTNLSSIQT QGMLLRVIAR FTAHLSQEDL SNFIQYGHNF SLEMILDPLP EILESDDGQA MLAAVTIVKH LVEKNVPSAR NDILRLGLVA RLATLAERAE ALAKETKPQE SEETIEPGMK LEAVDRKNPG LVCVATILEI DNSRPDCLHI RFDGWTDRYD YWADPTSTDL RPVGWCEHNS HTLQIPRGYS KRFDWQQYLR ETNAKAAPAH LLATVHVHGG QAKKLREALT KAKALYDKHF GANAVERDVV RRLREISTML VTLVRSTPNA CVTPTTHASL DFAVDERQGA STAKNNSNAT AGTPTGSGSS SNNPHPETED VPIVGLQRQL SSGSQLALQR KQDMVHCLVK LSELLVSDKS ISSYELCRSK FDESLLMFLT HVTRRASVSR ADGHTDLEMR RVLFRQVFGV QPSKTDNPAA LLVRKLLSVV EHTEALPVFQ FSKTTKDTLE VMNRALTLEV QRHPNSRELK CLDGNTLRVE PLTTVQQLLD FLKPRTKKKW YDYQRQKLSF FKRIRDGEKV TCKYESDFDQ NGVVYWIGTN GRRESSWVNP SETHLIYITT HIGRELPFGR VRDIVSRDSS PRNCHTNNNK SGWVSIDLGL LIKPSHYTLR HATGYSSSAL RTWSLEGSDN GKEWFTLREH EKDEHLRSPG DTHTWEITTA PPRPVCMFRI KQKGPDSDGK NHYISISGFE VYGEIVGVSK RSFDKALRQE VLQRQRQLEK LRRSADVIQP GVRVKRGRDW KWGDQDGNPP GPGTVTSTVK QGWVDVEWDD GGVNSYRMGN DGKYDLELIS DEERQMDEEE DTAERTAASK KKKKKREEAP AKAGEAMDVN SDDVDDDDHD GDDDDDDGDD DKDGGNATST GTVATAGSSE AERVRASLRA VAGDILTSDE DESDDDDDDD DDDDDDDDDD EVEEEDNMFT SSQSRAMFDL EHYLGRDELY HRRGPVAAFL RRRRHREASV SSSTWDSSTT LKRQYSALVP AYDPRPGHSN PPATVDLHVP GPDTHVSLPT PQQNKQQDPF VLYLMKDPEK GLQADNLAKL DPSCTILRQI QRHCFDANNW KPTDVWGKRF KIVYSNNKPT PKRKVSSQSD ARGPKQVTFD AQLTEALSKE VSKRHTDSSL SLLEVLFDMT QAVGGDESED AAAYLDATCR TMLRATEEIV FAIEKAEFVS RKLTSKLQRQ MADPLALVSG ALPSWCETLA RSCPVLFPFE ARKQFFSARA FGISRSIVWV QEKREAATRR GQNVSGNEPP PQEYQVGRQK HERVTVPRSE DKLFSWARNV LLTQASHKSV LEIEFKDEPA TGLGPSLEFY SLVCAEFQRK DYAMWLCDDH LGTATTTRDL GHGAKPPGFY VRRPAGLFPA PYPEEKLPAE VESKFELLGV FLAKAMQDHR LVDMPLSTAM LKIMRGQELD MQDLAVVAPD LFSTIKALHE VAARKRAIDA EFDVQEDRER AYEKLTVVFG ETECKVDDLG LYMVYLPPSR VFGLEEYPLC VSELCVCVCV CVCVCVFMMS LGFTFAQTTL TPFTHPPLIL LVLCGEQAIE WTFDELLAST EAKHGYSKDH PNYLAFLSLL VEFTAEERKR FLLFATGSPS LPPGGLQNLH PHFTIVRKGM DDAPADDTYP SVNTCVHYVK LPIYSSKEVM KKRLLTAMQS TGFHLN // ID F4I316_ARATH Unreviewed; 660 AA. AC F4I316; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Galactose-binding protein {ECO:0000313|EMBL:AEE30301.1}; GN OrderedLocusNames=At1g22882 {ECO:0000313|EMBL:AEE30301.1, GN ECO:0000313|TAIR:AT1G22882}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702 {ECO:0000313|EMBL:AEE30301.1, ECO:0000313|Proteomes:UP000006548}; RN [1] {ECO:0000313|EMBL:AEE30301.1, ECO:0000313|Proteomes:UP000006548} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548}; RX PubMed=11130712; DOI=10.1038/35048500; RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., RA White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., RA Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., RA Chung M.K., Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K., RA Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., RA Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L., RA Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E., RA Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., RA Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P., RA Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A., RA Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I., RA Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D., RA Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M., RA Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D., RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., RA Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.; RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis RT thaliana."; RL Nature 408:816-820(2000). RN [2] {ECO:0000313|Proteomes:UP000006548} RP GENOME REANNOTATION. RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548}; RG The Arabidopsis Information Resource (TAIR); RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP002684; AEE30301.1; -; Genomic_DNA. DR RefSeq; NP_683323.2; NM_148482.4. DR UniGene; At.45268; -. DR ProteinModelPortal; F4I316; -. DR STRING; 3702.AT1G22882.1; -. DR PaxDb; F4I316; -. DR PRIDE; F4I316; -. DR EnsemblPlants; AT1G22882.1; AT1G22882.1; AT1G22882. DR GeneID; 838894; -. DR KEGG; ath:AT1G22882; -. DR TAIR; AT1G22882; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; F4I316; -. DR OMA; KTEASMA; -. DR Proteomes; UP000006548; Chromosome 1. DR GO; GO:0005783; C:endoplasmic reticulum; IDA:TAIR. DR GO; GO:0005635; C:nuclear envelope; IDA:TAIR. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006548}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006548}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 610 630 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 642 659 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 507 527 {ECO:0000256|SAM:Coils}. FT COILED 563 608 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 660 AA; 74184 MW; 662735FC07BF50A5 CRC64; MQRSCRTRRR VSVNKFNGRN SFYKVSLSLV FLLWVLLFFS TLLISHGDGA KDEPLNDSMG MADPDDGQSD EKVVPFDGPL SLASASVDVT SDLSRNDDVN LSEESEDKEQ EAEISSTVSG NDIESKDTYL LKQSEINKKD TGIDAGSKYD DFPKKSEINN TGTWNDTEGK DDNNFLKQSQ LNKTGTGNDT ESSDNEFLEQ NQMNKTVLGN GTEINVSKVD QPSRAVPLGL DEFKSRASNS RNKSLSDQVS GVIHRMEPGG KEYNYASASK GAKVLSSNKE AKGAASILSR DNDKYLRNPC STEGKFVVVE LSEETLVNTI KIANFEHYSS NLKEFELQGT LVYPTDTWVH MGNFTASNVK HEQNFTLLEP KWVRYLKLNF ISHYGSEFYC TLSLIEVYGV DAVERMLEDL ISVQDNKNAY KPREGDSEHK EKPMQQIESL EGDDGADKST HREKEKEAPP ENMLAKTEAS MAKSSNKLSE PVEEMRHHQP GSRMPGDTVL KILMQKLRSL DLNLSILERY LEELNLRYGN IFKEMDREAG VREKAIVALR LDLEGMKERQ EGMVSEAEEM KEWRKRVEAE MEKAEKEKEN IRQSLEQVSK RLEWMEKKCL TVFTVCLGFG IIAVIAVVIG MGTGLAEKTG SGAWLLLLIS STFIMFVLSL // ID F4I8I0_ARATH Unreviewed; 596 AA. AC F4I8I0; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Galactose-binding protein {ECO:0000313|EMBL:AEE35193.1}; GN OrderedLocusNames=At1g71360 {ECO:0000313|EMBL:AEE35193.1, GN ECO:0000313|TAIR:AT1G71360}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702 {ECO:0000313|EMBL:AEE35193.1, ECO:0000313|Proteomes:UP000006548}; RN [1] {ECO:0000313|EMBL:AEE35193.1, ECO:0000313|Proteomes:UP000006548} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548}; RX PubMed=11130712; DOI=10.1038/35048500; RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., RA White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., RA Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., RA Chung M.K., Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K., RA Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., RA Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L., RA Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E., RA Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., RA Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P., RA Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A., RA Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I., RA Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D., RA Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M., RA Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D., RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., RA Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.; RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis RT thaliana."; RL Nature 408:816-820(2000). RN [2] {ECO:0000313|Proteomes:UP000006548} RP GENOME REANNOTATION. RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548}; RG The Arabidopsis Information Resource (TAIR); RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP002684; AEE35193.1; -; Genomic_DNA. DR RefSeq; NP_177292.4; NM_105805.5. DR UniGene; At.70452; -. DR ProteinModelPortal; F4I8I0; -. DR STRING; 3702.AT1G71360.1; -. DR PaxDb; F4I8I0; -. DR PRIDE; F4I8I0; -. DR EnsemblPlants; AT1G71360.1; AT1G71360.1; AT1G71360. DR GeneID; 843477; -. DR KEGG; ath:AT1G71360; -. DR TAIR; AT1G71360; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; F4I8I0; -. DR OMA; LERLEWM; -. DR Proteomes; UP000006548; Chromosome 1. DR GO; GO:0005783; C:endoplasmic reticulum; IDA:TAIR. DR GO; GO:0005635; C:nuclear envelope; IDA:TAIR. DR GO; GO:0006997; P:nucleus organization; IGI:TAIR. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006548}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006548}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 25 47 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 546 570 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 576 594 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 475 544 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 596 AA; 67272 MW; 9F8F796BFEE32E58 CRC64; MQRSRRALLV RRRVSETTSN GRNRFYKVSL SLVFLIWGLV FLSTLWISHV DGDKGRSLVD SVEKGEPDDE RADETAESVD ATSLESTSVH SNPGLSSDVD IAAAGESKGS ETILKQLEVD NTIVIVGNVT ESKDNVPMKQ SEINNNTVPG NDTETTGSKL DQLSRAVPLG LDEFKSRASN SRDKSLSGQV TGVIHRMEPG GKEYNYAAAS KGAKVLSSNK EAKGASSIIC RDKDKYLRNP CSTEGKFVVI ELSEETLVNT IKIANFEHYS SNLKDFEILG TLVYPTDTWV HLGNFTALNM KHEQNFTFAD PKWVRYLKLN LLSHYGSEFY CTLSLLEVYG VDAVERMLED LISIQDKNIL KLQEGDTEQK EKKTMQAKES FESDEDKSKQ KEKEQEASPE NAVVKDEVSL EKRKLPDPVE EIKHQPGSRM PGDTVLKILM QKIRSLDVSL SVLESYLEER SLKYGMIFKE MDLEASKREK EVETMRLEVE GMKEREENTK KEAMEMRKWR MRVETELEKA ENEKEKVKER LEQVLERLEW MEKKGVVVFT ICVGFGTIAV VAVVFGMGIV RAEKQGGLAW LLLLISSTFV MFILSL // ID F4JPE8_ARATH Unreviewed; 562 AA. AC F4JPE8; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Galactose-binding protein {ECO:0000313|EMBL:AEE84832.1}; GN OrderedLocusNames=At4g23950 {ECO:0000313|EMBL:AEE84832.1, GN ECO:0000313|TAIR:AT4G23950}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702 {ECO:0000313|EMBL:AEE84832.1, ECO:0000313|Proteomes:UP000006548}; RN [1] {ECO:0000313|EMBL:AEE84832.1, ECO:0000313|Proteomes:UP000006548} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548}; RX PubMed=10617198; DOI=10.1038/47134; RG EU; RG CSHL and WU Arabidopsis Sequencing Project; RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., RA Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., RA Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., RA Weichselgartner M., de Simone V., Obermaier B., Mache R., Mueller M., RA Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., RA Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., RA Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., RA Langham S.-A., McCullagh B., Bilham L., Robben J., RA van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., RA Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., RA Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., RA Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., RA Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., RA De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., RA van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., RA Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., RA Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., RA Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., RA Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., RA Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., RA Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., RA Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., RA Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., RA Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., RA Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., RA Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., RA Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., RA Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., RA Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., RA Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., RA Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., RA Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., RA Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., RA Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., RA Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., RA Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., RA Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., RA Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., RA Chen E., Marra M.A., Martienssen R., McCombie W.R.; RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis RT thaliana."; RL Nature 402:769-777(1999). RN [2] {ECO:0000313|Proteomes:UP000006548} RP GENOME REANNOTATION. RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548}; RG The Arabidopsis Information Resource (TAIR); RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP002687; AEE84832.1; -; Genomic_DNA. DR RefSeq; NP_001190819.1; NM_001203890.1. DR UniGene; At.54494; -. DR ProteinModelPortal; F4JPE8; -. DR STRING; 3702.AT4G23950.2; -. DR PaxDb; F4JPE8; -. DR PRIDE; F4JPE8; -. DR EnsemblPlants; AT4G23950.2; AT4G23950.2; AT4G23950. DR GeneID; 828495; -. DR KEGG; ath:AT4G23950; -. DR TAIR; AT4G23950; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; F4JPE8; -. DR OMA; IVKEQAN; -. DR Proteomes; UP000006548; Chromosome 4. DR ExpressionAtlas; F4JPE8; baseline and differential. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006548}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006548}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 502 524 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 544 561 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 402 422 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 562 AA; 63722 MW; 597C721DE7229259 CRC64; MARRGSCSTI CLNEKLQRFR IVRISDKADN FNSRSGSFFE RSISLVLLLW CFLFLVYSKL GQSHDDYGNA DRIGNYTDGS VSKTLNSTSS VFPQATEKEN NFCLLRKGQL QDVYEHVLVN NALLICKVVL PERRISKKTL EARDPRYVNL EDKSLKVNGS SQLVNNGTRY RLEPDGNGYN YASAMKGAKV VDHNKEAKGA SNVLGKDHDK YLRNPCSVSD KYVVIELAEE TLVDTVRIAN FEHYSSNPKE FSLSGSLSFP SDMWTPAGSF AAANVKQIQS FRLPEPKWLR YLKLNLVSHY GSEFYCTLSV VEVFGIDALE QMLEDLFVPS ETPPSKPAMV ELKTADEKQD GEIKSNRTDQ IGKETEAQKK KDDVVKTINI IGDKKYEVKE KHNVLKVMMQ KVKLIEMNLS LLEDSVKKMN DKQPEVSLEM KKTLVLVEKS KADIREITEW KGKMQEKELR DLELWKTLVA SRVESLARGN SALRLDVEKI VKEQANLESK ELGVLLISLF FVVLATIRLV STRLWAFLGM SITDKARSLW PDSGWVMILL SSSIMIFIHL LS // ID F4JPE9_ARATH Unreviewed; 561 AA. AC F4JPE9; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Galactose-binding protein {ECO:0000313|EMBL:AEE84831.1}; GN OrderedLocusNames=At4g23950 {ECO:0000313|EMBL:AEE84831.1, GN ECO:0000313|TAIR:AT4G23950}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702 {ECO:0000313|EMBL:AEE84831.1, ECO:0000313|Proteomes:UP000006548}; RN [1] {ECO:0000313|EMBL:AEE84831.1, ECO:0000313|Proteomes:UP000006548} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548}; RX PubMed=10617198; DOI=10.1038/47134; RG EU; RG CSHL and WU Arabidopsis Sequencing Project; RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., RA Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., RA Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., RA Weichselgartner M., de Simone V., Obermaier B., Mache R., Mueller M., RA Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., RA Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., RA Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., RA Langham S.-A., McCullagh B., Bilham L., Robben J., RA van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., RA Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., RA Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., RA Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., RA Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., RA De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., RA van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., RA Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., RA Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., RA Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., RA Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., RA Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., RA Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., RA Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., RA Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., RA Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., RA Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., RA Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., RA Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., RA Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., RA Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., RA Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., RA Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., RA Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., RA Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., RA Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., RA Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., RA Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., RA Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., RA Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., RA Chen E., Marra M.A., Martienssen R., McCombie W.R.; RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis RT thaliana."; RL Nature 402:769-777(1999). RN [2] {ECO:0000313|Proteomes:UP000006548} RP GENOME REANNOTATION. RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548}; RG The Arabidopsis Information Resource (TAIR); RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP002687; AEE84831.1; -; Genomic_DNA. DR RefSeq; NP_194126.5; NM_118527.5. DR UniGene; At.54494; -. DR ProteinModelPortal; F4JPE9; -. DR STRING; 3702.AT4G23950.2; -. DR PaxDb; F4JPE9; -. DR PRIDE; F4JPE9; -. DR EnsemblPlants; AT4G23950.1; AT4G23950.1; AT4G23950. DR GeneID; 828495; -. DR TAIR; AT4G23950; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000006548; Chromosome 4. DR ExpressionAtlas; F4JPE9; baseline and differential. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006548}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006548}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 501 523 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 543 560 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 402 422 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 561 AA; 63594 MW; 9DB9C0D16BF35FC7 CRC64; MARRGSCSTI CLNEKLQRFR IVRISDKADN FNSRSGSFFE RSISLVLLLW CFLFLVYSKL GQSHDDYGNA DRIGNYTDGS VSKTLNSTSS VFPQATEKEN NFCLLRKGQL QDVYEHVLVN NALLICKVVL PERRISKKTL EARDPRYVNL EDKSLKVNGS SQLVNNGTRY RLEPDGNGYN YASAMKGAKV VDHNKEAKGA SNVLGKDHDK YLRNPCSVSD KYVVIELAEE TLVDTVRIAN FEHYSSNPKE FSLSGSLSFP SDMWTPAGSF AAANVKQIQS FRLPEPKWLR YLKLNLVSHY GSEFYCTLSV VEVFGIDALE QMLEDLFVPS ETPPSKPAMV ELKTADEKQD GEIKSNRTDQ IGKETEAQKK KDDVVKTINI IGDKKYEVKE KHNVLKVMMQ KVKLIEMNLS LLEDSVKKMN DKQPEVSLEM KKTLVLVEKS KADIREITEW KGKMEKELRD LELWKTLVAS RVESLARGNS ALRLDVEKIV KEQANLESKE LGVLLISLFF VVLATIRLVS TRLWAFLGMS ITDKARSLWP DSGWVMILLS SSIMIFIHLL S // ID F4P736_BATDJ Unreviewed; 1154 AA. AC F4P736; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGF78795.1}; GN ORFNames=BATDEDRAFT_89980 {ECO:0000313|EMBL:EGF78795.1}; OS Batrachochytrium dendrobatidis (strain JAM81 / FGSC 10211) (Frog OS chytrid fungus). OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; Rhizophydiales; OC Rhizophydiales incertae sedis; Batrachochytrium. OX NCBI_TaxID=684364 {ECO:0000313|Proteomes:UP000007241}; RN [1] {ECO:0000313|EMBL:EGF78795.1, ECO:0000313|Proteomes:UP000007241} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JAM81 / FGSC 10211 {ECO:0000313|Proteomes:UP000007241}; RG US DOE Joint Genome Institute (JGI-PGF); RA Kuo A., Salamov A., Schmutz J., Lucas S., Pitluck S., Rosenblum E., RA Stajich J., Eisen M., Grigoriev I.V.; RT "The draft genome of Batrachochytrium dendrobatidis."; RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL882887; EGF78795.1; -; Genomic_DNA. DR RefSeq; XP_006680455.1; XM_006680392.1. DR EnsemblFungi; EGF78795; EGF78795; BATDEDRAFT_89980. DR GeneID; 18243711; -. DR InParanoid; F4P736; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000007241; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007241}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007241}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 337 363 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 370 390 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 756 780 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1154 AA; 128531 MW; ADEC07C4875C503C CRC64; MGTPAADSGE VSGTDLSDTL DSTSGVRRSK RAHRNINYSP IRTASPSKRI GHPAANHESS ISVNTIRRTS ISSTASDCSR SSRTSRATHV TSISSNNTLS PITTPTARSY SHRNVLFSTP QDPILSPSDS VLIKPDTTEQ PTSAWNHPFR STNNGIDSGH EINSTKSFSS TESTKRTPTK PKIARHKMAK SNPAASDSEL DNDLIPKTPN SSQLNPSKTP ESFVEITPSK FSLTSFFSPF RSPMNLKHIL ENVGMTPRTH KRKVAKSWLG RMATSSDQIE VEYESDLCDK SDDELGENRI LFPDIFAEEA VIEEPGPLLR FIDWLSDFYI VQLVIPIFSW IASIIALLIW GVCTLMWNIV YYLCGKPLGF IGWTFLRVAG FIFFTTFRVL GYHRNMCVAG LLCGLGVYLL TTKPGHSIDL LASLDPVNQS PQLQSMFTPN ISMIFEHLRS TFASTETEHI EINSQDTQSN TDSRPESNWK SQKQLVMLEK RVATVEKKLL LVTQGLVEVA GDMKSQHSRL EKSIEHTIAG QDSLKDTLSA QIADVTKQIF DLRNTLDALH DATEKTTIEQ SAYIKHLHDS LEIHKSSIVE LEKQIHEDVQ NLSVSNEVSK GLAEKLSTLS ETVTRLGQQL EAYGSVDDVV KRVMDTIRTS TDFPGFIAAT KSSETSEIEL PPELWKAIES KLNMFGDKDQ TIDVSNTAEA RLKTFILAQI DDIRVTYATN ADVDVRIRMA LKDAIKDYTQ SGQNDASLAA RVDLKMDEVK ALLQTKVEEL NKLEQETREK LQGAFSGTKK EYTQLDLHLK TLTEQIASHT TSLKEVMSEQ DRLGVFYDNQ KREFELIKTT VDTSTHWHNF LEQNRQALST IIHEQVDTHP NSHIILTQEQ TLSAIEVKLE SIVSKSSQDM AIIMNRLESL DSVSSSYAHL KGDAPLSRDA TESLIHRIVA SALEEYRADV LAIPDYALES AGARVVYNLT SSTFTTHFKP PVLGLFARVM GIRKASGRSP STALTQDISP GNCWAMSDSS GTLAISLAEP IIPTDMTIEH SHIQTSISDR HTSAPRQIEL WAVFDAVEFA KLDLNNNQVR LQTLGTLSSS NKKTQPAGIL LGDFEFNPMT AALKTYPLHR ILNIKVNMVV VRIKNNWGNP KWTCIYRVRI HGRE // ID F4PAQ5_BATDJ Unreviewed; 571 AA. AC F4PAQ5; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 11. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGF77583.1}; GN ORFNames=BATDEDRAFT_27399 {ECO:0000313|EMBL:EGF77583.1}; OS Batrachochytrium dendrobatidis (strain JAM81 / FGSC 10211) (Frog OS chytrid fungus). OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; Rhizophydiales; OC Rhizophydiales incertae sedis; Batrachochytrium. OX NCBI_TaxID=684364 {ECO:0000313|Proteomes:UP000007241}; RN [1] {ECO:0000313|EMBL:EGF77583.1, ECO:0000313|Proteomes:UP000007241} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JAM81 / FGSC 10211 {ECO:0000313|Proteomes:UP000007241}; RG US DOE Joint Genome Institute (JGI-PGF); RA Kuo A., Salamov A., Schmutz J., Lucas S., Pitluck S., Rosenblum E., RA Stajich J., Eisen M., Grigoriev I.V.; RT "The draft genome of Batrachochytrium dendrobatidis."; RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL882891; EGF77583.1; -; Genomic_DNA. DR RefSeq; XP_006681670.1; XM_006681607.1. DR EnsemblFungi; EGF77583; EGF77583; BATDEDRAFT_27399. DR GeneID; 18239321; -. DR InParanoid; F4PAQ5; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007241; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007241}; KW Reference proteome {ECO:0000313|Proteomes:UP000007241}. SQ SEQUENCE 571 AA; 63986 MW; 205FB2EA8A87A350 CRC64; MHLHFKFHSS LPQSAVLVAS LNMIKKLQKI QLFLEHTLKS LASQITNIDS NILTETRSDS STAGSVSDTT DSLPKNEIEH SLTDSLDFTQ STNSDVTSIA SFEVESNAIA DLNSNQITKT TSSDDMRPTQ PSLPSAHLPP QVKNEKERFN YASFDCGALV RAVNPEASSA TAILSNSKDQ YMLNKCSTNK FVEVELCEDI LVDTIMLANL EFFSSIFKDF KVYVADRYPP ITGWKIIGTF TGENSRERQI FKIDHPAFWA RYLRIEFLTH FGQEFYCPLT MLKVYGTRMI EDVKADEYDD DLSGTETILP IVPKQDSGDA SLSKDSVPLS QDLDIKDTTS CTLKTSTKLY SDTSMPETVL NDSPTVTSLQ SEMTRSNTKT ELDQSAKTRH DKTLSEKDEF ASIVLTTSTS GVLDENDTTS TLSSNSELTD LSHTSIPGIT PSVQPSFTGK KESIFKNIIK RLSAVEKSIA SQLISTEKHY QDLNDRLENF EVLQASIVGK RFDSYKAGYD LDFQTMLELF AKRMSEHDNV LKDKVRQLDE KLYIVNSLIE SLEFQSSKLT HEFSECIPRS I // ID F4PQB4_DICFS Unreviewed; 668 AA. AC F4PQB4; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 11. DE SubName: Full=SUN domain-containing protein 1 {ECO:0000313|EMBL:EGG22577.1}; GN Name=sun1 {ECO:0000313|EMBL:EGG22577.1}; GN ORFNames=DFA_04707 {ECO:0000313|EMBL:EGG22577.1}; OS Dictyostelium fasciculatum (strain SH3) (Slime mold). OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. OX NCBI_TaxID=1054147 {ECO:0000313|Proteomes:UP000007797}; RN [1] {ECO:0000313|EMBL:EGG22577.1, ECO:0000313|Proteomes:UP000007797} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=SH3 {ECO:0000313|EMBL:EGG22577.1, RC ECO:0000313|Proteomes:UP000007797}; RA Gloeckner G., Schaap P., Noegel A.A., Felder M., Eichinger L., RA Heidel A.J., Platzer M.; RT "Living fossils from the dawn of multicellularity."; RL Submitted (JAN-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883009; EGG22577.1; -; Genomic_DNA. DR RefSeq; XP_004360428.1; XM_004360371.1. DR EnsemblProtists; EGG22577; EGG22577; DFA_04707. DR GeneID; 14874741; -. DR KEGG; dfa:DFA_04707; -. DR KO; K19347; -. DR Proteomes; UP000007797; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007797}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007797}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 219 238 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 668 AA; 75227 MW; 70D2987EC08F2FD2 CRC64; MSTYKPTYGQ QKSPHRKRAT FTPATSRVAK YGFSSTDDED ESSPAYNNNN TSYDFDDGGT SSTTTTALFP PTHSSSSSSV LNNNTTINKQ QQRTTTTTRS NVVSQHQHNN GIDFSMNGGG GGNSFKASAA ADAKPTQAKV KATGGVGGNT GAVHQRHHHQ KQRSLVMRMF DTISYPFVYV FNLISRCLMW INVTLSTKVH SPVDMNHQNK GHVEKKSKVI GWVTLVSVLV LVLVYVLFYK SSFPINMTNH HQTGVDEQAV IQILHKYIKE HESKMENKFD NKLGIIKTEL ISTTMTESEK LKLEQKKENQ LINTLVDTMR EELTELIQTR SSNDPKELLE IYKAQLDSHV NRIMGDLTKH GKTEKEEIHT LITSAVNKLQ EAMKQSQGDI GKSGQDQLQK LLLAFEENAN SRLNIIVQSL DHQQTTEREK LLLISKKATE QIQSFREIME KNPDFISISK GLTALEQTQL LIDDALEKYS SDKTGRADFA LWVSGGSIAY DLEHYPITQT YQNDDVSWLD IATAWLRPVP RLNPPETILE PIRNIGDCWA FPGNNGTIGI HLSAPIIVRS VSIEHPNSKI TYHSESSPQE FEILGLKNST DTGVSLATFK YDIHKNRHLQ TFNFDNEQVY SDVVIKILSN YGYKYTCLYR IRVHGLLSGD YNDPRLTF // ID F4PTG8_DICFS Unreviewed; 868 AA. AC F4PTG8; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 10. DE SubName: Full=SUN domain-containing protein 2 {ECO:0000313|EMBL:EGG20850.1}; GN Name=sun2 {ECO:0000313|EMBL:EGG20850.1}; GN ORFNames=DFA_00715 {ECO:0000313|EMBL:EGG20850.1}; OS Dictyostelium fasciculatum (strain SH3) (Slime mold). OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. OX NCBI_TaxID=1054147 {ECO:0000313|Proteomes:UP000007797}; RN [1] {ECO:0000313|EMBL:EGG20850.1, ECO:0000313|Proteomes:UP000007797} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=SH3 {ECO:0000313|EMBL:EGG20850.1, RC ECO:0000313|Proteomes:UP000007797}; RA Gloeckner G., Schaap P., Noegel A.A., Felder M., Eichinger L., RA Heidel A.J., Platzer M.; RT "Living fossils from the dawn of multicellularity."; RL Submitted (JAN-2010) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883010; EGG20850.1; -; Genomic_DNA. DR RefSeq; XP_004358700.1; XM_004358643.1. DR EnsemblProtists; EGG20850; EGG20850; DFA_00715. DR GeneID; 14873246; -. DR KEGG; dfa:DFA_00715; -. DR Proteomes; UP000007797; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007797}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007797}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 720 739 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 210 233 {ECO:0000256|SAM:Coils}. FT COILED 300 329 {ECO:0000256|SAM:Coils}. FT COILED 595 615 {ECO:0000256|SAM:Coils}. FT COILED 693 713 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 868 AA; 99634 MW; 487D7C658C120BFA CRC64; MTHRQKQHRQ LGSSRLSLLL VLVVGLLFVA YTSQWHPFVD ASTTPLSDSE LSDTSIIQDN NNNNNPIQQQ QQQIDDTPET IAPLNPDDES HRKQQQQQDP PPPPPPTKKE EEQKDKIIDD ITYEEEEEEE EEGEIELNTD TTGIIQDEVT TQPEQQEKES TNNNNDNDNN NDINQLEEEK KPTKTIEPTT NNKELDKKKE EEQTTTKIND DQIEKNKEEI TSTKELLEKE EKDKPIIDDT TPKGNEKDGT TTTSQPITDN NTVEKKEPII GENEEEPTTN NKNEQDQNTG RPMTNITTVI EEVHKQIERE RLENELKERK AQKAEQLAQQ QILTISDKDD EEEDQHNPII NDTYPTTPLP KKPEDLPNQF NYASAECGAT VLATNREARE VSSILHSSKD RYMLNECGTD QWFVIELCEE IGIQIIELAN YEFFSSMFKT FTVYGSQQYP SMQWDSLGNF TANNVRKPQY FALKEKYWYK YIKIKFLTHY GNQVYCPISD IKVYGSTMVE DLKAGMENDI NMQKIIEEQL HGKLHYHGTP GTVGGSASIS NKGDSTNQNF ETTAGMLKNL IDIFKKTQKQ QPVFYHPPPQ TTTSATEEED QISQITEMYE QQSQKNPESI FKTLANRVKS LEINQSISNR YLEKLEAFYS ETIQSIRDDF NRLSEFFEKM AYLGADLEKR IAREKEEVEV KIARDFSKEI NYLRERIIRM EQKNEDDKNY YLMLLLATVI GTVFLSYMIS KSNNIVATTI GTSSAVLSQA YEHVLSPKFS LKHNRRNSLN VNQTYQQHQQ NGVQSSSSSL SNSITSPPIM SNGSGDHFHH KESNGHLHSL PLLDVSLSSS STPPIYQNND NNNNGGYNRK KKKKNKQY // ID F4R6L2_MELLP Unreviewed; 173 AA. AC F4R6L2; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG11900.1}; DE Flags: Fragment; GN ORFNames=MELLADRAFT_32759 {ECO:0000313|EMBL:EGG11900.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883091; EGG11900.1; -; Genomic_DNA. DR RefSeq; XP_007404275.1; XM_007404213.1. DR EnsemblFungi; EGG11900; EGG11900; MELLADRAFT_32759. DR GeneID; 18927239; -. DR KEGG; mlr:MELLADRAFT_32759; -. DR InParanoid; F4R6L2; -. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. FT NON_TER 1 1 {ECO:0000313|EMBL:EGG11900.1}. SQ SEQUENCE 173 AA; 19373 MW; 314D44C3A9A9C67D CRC64; QRPDYALSSG GAQIYYWMTS PTYKEKPATK LARLFNWLVG GPVKIEFHPP SVALDPDTNV GRCWAMEGQR GSLGIFLAQK IIIDELVIEH TDPSMAFELE SILQKFELFG LSEMLTSPVK LGEGVFNISS AHIQHFSIQP QIPVVIVIFN VLSNHGDPDF TCIYKLRIHG QMV // ID F4R7L0_MELLP Unreviewed; 404 AA. AC F4R7L0; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG11333.1}; GN ORFNames=MELLADRAFT_90810 {ECO:0000313|EMBL:EGG11333.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883092; EGG11333.1; -; Genomic_DNA. DR RefSeq; XP_007404968.1; XM_007404906.1. DR EnsemblFungi; EGG11333; EGG11333; MELLADRAFT_90810. DR GeneID; 18935694; -. DR KEGG; mlr:MELLADRAFT_90810; -. DR InParanoid; F4R7L0; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 46 63 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 59 86 {ECO:0000256|SAM:Coils}. FT COILED 126 146 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 404 AA; 45581 MW; 569D1736A531E383 CRC64; MPGQSTEHRA SPSHINVTER ESCQHSVGKR WGRMVKLLIS RSVRRWYYIL LSAVVLWFFI SYSERLQILN KKVNELEAEI RYIQEGNISG NNSTTLIMMD EKIIELVNQV RRLQETENLG PILNKLHKTK KKVADLELQV QHLQEVHDSW ISALSTVPRN IGLWNEITTL RVPSQINQLP SQPPKPTLQD STQDIIINTI TSIGRVLQMI EHLKNVAQDA LEICTKDKDG RRDFAFIQTG GSVIPSLTSE SISLNSSSRS WKTLFASLSG RPSVTLPDIV LIGDGSIGAC WAFRGSKGQL GIALSDPIKI TGVTVEHIGK ELAQDSIDVA PKDFELWGVV NDKDKHSEFF LFSSFYDTSK ASIAQTFEFT PTKEVYKKVI FKINSNNGNK QYTCVYRVRI HGVM // ID F4R8L4_MELLP Unreviewed; 241 AA. AC F4R8L4; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG11091.1}; GN ORFNames=MELLADRAFT_86358 {ECO:0000313|EMBL:EGG06545.1}, GN MELLADRAFT_92477 {ECO:0000313|EMBL:EGG11091.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). RN [2] {ECO:0000313|EMBL:EGG11091.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=98AG31 {ECO:0000313|EMBL:EGG11091.1}; RG US DOE Joint Genome Institute (JGI-PGF); RA Duplessis S., Cuomo C., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D., Hacquard S., Amselem J., Cantarel B., RA Readman C., Coutinho P., Feau N., Field M., Frey P., Gelhaye E., RA Goldberg J., Grabherr M., Kodira C., Kohler A., Kues U., Lindquist E., RA Lucas S., Mago R., Mauceli E., Morin E., Murat C., Pangilinan J., RA Park R., Pearson M., Quesneville H., Rouhier N., Sakthikumar S., RA Salamov A., Schmutz J., Selles B., Shapiro H., Tangay P., Tuskan G., RA Henrissat B., Van de Peer Y., Rouze P., Schein J., Ellis J., Dodds P., RA Zhong S., Hamelin R., Grigoriev I., Szabo L., Martin F.; RT "Obligate Biotrophy Features Unraveled by the Genomic Analysis of the RT Rust Fungi, Melampsora larici-populina and Puccinia graminis f. sp. RT tritici."; RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883107; EGG06545.1; -; Genomic_DNA. DR EMBL; GL883093; EGG11091.1; -; Genomic_DNA. DR RefSeq; XP_007405693.1; XM_007405631.1. DR RefSeq; XP_007409985.1; XM_007409923.1. DR EnsemblFungi; EGG06545; EGG06545; MELLADRAFT_86358. DR EnsemblFungi; EGG11091; EGG11091; MELLADRAFT_92477. DR GeneID; 18934164; -. DR GeneID; 18936262; -. DR KEGG; mlr:MELLADRAFT_86358; -. DR KEGG; mlr:MELLADRAFT_92477; -. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 241 AA; 27040 MW; 360CAC47CC94BA46 CRC64; MQLQRIHKRF HDLESRLRWV EQNYDMLSKT FQEDDSVWED ASSTGNIARF IKTSQEVKFM AEECTKQCMK DRNWRRDFAF FNTGGDIISD ITSKSMQPAT DSLGIRSPVG PEMAISGDAS SGACWAFPGA VGQIGIRLRR RIVVQAVTIE HIGAKLAQNE IASAPKDFEL YGIEGDGEDA ANLLLLQGFY NITGPSTMQT FEVKEANFQS AHQKVVLKIT SNHGNPEFTC LYRVRVHGKE A // ID F4RAF8_MELLP Unreviewed; 195 AA. AC F4RAF8; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG10789.1}; GN ORFNames=MELLADRAFT_93454 {ECO:0000313|EMBL:EGG10789.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883094; EGG10789.1; -; Genomic_DNA. DR RefSeq; XP_007406258.1; XM_007406196.1. DR EnsemblFungi; EGG10789; EGG10789; MELLADRAFT_93454. DR GeneID; 18936574; -. DR KEGG; mlr:MELLADRAFT_93454; -. DR InParanoid; F4RAF8; -. DR KO; K19347; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 195 AA; 21198 MW; FE0BDCEF97A8AE0B CRC64; MAREAFKQCI KDRDGPRDFA FMHTGGYVIQ SLTSKSYIAS KSWRRHLPWS KTSRSSNTNT KTPGTALSGD GSAGNCWAFV GSVGQLGIGL SSTIDITALS IEHIGAQLAQ NDITSAPREF ELWGVSDDGD AVASTTPLFQ GVYNISSPSS TLQTFELNKS HKFAYSKVLF RVTSNHGNPG FTCIYRVRIH GQSIL // ID F4RBI5_MELLP Unreviewed; 299 AA. AC F4RBI5; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG10355.1}; GN ORFNames=MELLADRAFT_94437 {ECO:0000313|EMBL:EGG10355.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883095; EGG10355.1; -; Genomic_DNA. DR RefSeq; XP_007406656.1; XM_007406594.1. DR EnsemblFungi; EGG10355; EGG10355; MELLADRAFT_94437. DR GeneID; 18936884; -. DR KEGG; mlr:MELLADRAFT_94437; -. DR InParanoid; F4RBI5; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 299 AA; 33406 MW; 6104F543D3B89B8F CRC64; MNNQSSFGRR VGLAVRRAIY HLTKHTATIV AVAALAVALK SFVQLQQIDQ RSDVIESRVQ WVEQSYKMLI LNTFAEDDLV WQGEMDTNHY HARWGQGDIA SFIKTAKDVK FMAEECMKQW TKDRNWRRDF ALWKTGGYII SNLTSSSVKP DTIRVWAGFS KAASNSVGPE MALRGDASIG ACWAFPGVVG QIGIGLSHQI VVEAITIEHI GAKMAQKDIT SAPKEFALWG LPEEEQDNAG LLLIQGVYEI TSSSTIQTFE AKEGISWPAY RKVLLKILSN HGNAEYTCLY RVRVHGKEV // ID F4RDA9_MELLP Unreviewed; 230 AA. AC F4RDA9; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG09368.1}; GN ORFNames=MELLADRAFT_95812 {ECO:0000313|EMBL:EGG09368.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883097; EGG09368.1; -; Genomic_DNA. DR RefSeq; XP_007407095.1; XM_007407033.1. DR EnsemblFungi; EGG09368; EGG09368; MELLADRAFT_95812. DR GeneID; 18937360; -. DR KEGG; mlr:MELLADRAFT_95812; -. DR InParanoid; F4RDA9; -. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 230 AA; 25662 MW; BD8869DDEB2EE32C CRC64; MMLKMFGEEG IEKNHDYTTS LVQDINQLIQ ASEEVKFMAE EGIKQCTKDR IGHQDYAALK TGRSVISKLT SSSVKPNTRS TCHDFSIPLL KDNSADLCGP EIEIFGDSSA GECWAFSGIV GQIGIDLSHQ IIVESVKIEH IGSEMAQGNI NLAPKNFEVW GSRAVGNDYE EVLLLRGIYN ITNSSVIQNF VVTEGNLQQF YKKVILKILS NHGNLQYTCL YRVRVHGKQF // ID F4RER2_MELLP Unreviewed; 290 AA. AC F4RER2; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG09144.1}; GN ORFNames=MELLADRAFT_96414 {ECO:0000313|EMBL:EGG09144.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883098; EGG09144.1; -; Genomic_DNA. DR RefSeq; XP_007407504.1; XM_007407442.1. DR EnsemblFungi; EGG09144; EGG09144; MELLADRAFT_96414. DR GeneID; 18937592; -. DR KEGG; mlr:MELLADRAFT_96414; -. DR InParanoid; F4RER2; -. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 290 AA; 32095 MW; A98C2C3F2847F9EF CRC64; MNNQNSFGRA VGLALRGAIQ PLMRQRAAII ALAALFVALK NLADLQQIHV KFIELEGRIQ WVEHNDDMLA KTFEDDEFAG EGGMDAHQYY KSISTGDITS YIKTSKEVKF MAEECMKQSL KDRNWRRDFA FLKTGGDIIS GLTSNTMKTT HNQQFSSVGP EMAISGDASS GACWAFPGRA GQIAISLSRR IKVQATTIEH IGAKLAQKEI TLAPKDFELW GIEGGGEVLA KRPLIKGFYD INNASTIQTF EVIEGDPGST YQKVLLKILS NHGNSQHTCL YRVRIHGKEV // ID F4RET1_MELLP Unreviewed; 411 AA. AC F4RET1; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG09154.1}; GN ORFNames=MELLADRAFT_96428 {ECO:0000313|EMBL:EGG09154.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883098; EGG09154.1; -; Genomic_DNA. DR RefSeq; XP_007407514.1; XM_007407452.1. DR EnsemblFungi; EGG09154; EGG09154; MELLADRAFT_96428. DR GeneID; 18937598; -. DR KEGG; mlr:MELLADRAFT_96428; -. DR InParanoid; F4RET1; -. DR KO; K19347; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 46 63 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 96 116 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 411 AA; 46458 MW; 3A4F8FB9C7A79683 CRC64; MPNQSTSHRP SRSHIRGEER DFYGHGIGKK WGLMVKLVIA RSIRRWYFIV FCALVLWFFR NYSAKCHITN KKLTELEAEV QYIREGNIPG DTSKTLNIMN EKLIELGNQV RRIQETENLH PVLNELRMTE KKVVDLETQV QYLQAVHNSW ISAISTVPDN IGSWNDKTTL QVPGQIAQLS QTPEPEFHDS TQDIIKDTMM KIGRALQMIE HFKTVAQDAL KTCTKDKDGR RDFAFLQTGG SVIASLTSKS ISLNSSLPDH LKSPRSWRTL FDSLMSGRPS VTLPDIVLIG NGSIGACWAF SGSKGQIGIA LSDPINISGV TVEHIGKELA EDSINVAPKD FELWGLVEGK DRHSEFFLFS GFYDTSKPSV SQTFEFTPTK EVYKKVIFKI NSNHGNQHYT CIYRVRIHGV V // ID F4RFH1_MELLP Unreviewed; 374 AA. AC F4RFH1; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG08930.1}; GN ORFNames=MELLADRAFT_84359 {ECO:0000313|EMBL:EGG08930.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883099; EGG08930.1; -; Genomic_DNA. DR RefSeq; XP_007407904.1; XM_007407842.1. DR EnsemblFungi; EGG08930; EGG08930; MELLADRAFT_84359. DR GeneID; 18933454; -. DR KEGG; mlr:MELLADRAFT_84359; -. DR InParanoid; F4RFH1; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 374 AA; 41565 MW; D165D5AF7FC8A8C3 CRC64; MPQKLRPNNG NPHSLGRRAG FRVREAVHEL LRHAPAVHVV LLIFLSITIC GRIHLLDQRL KKAEGELFYI RVHTTSQPCT SVWASSEGFQ SKQRDASNDP ETVKIADSHH PSSIIPTFVE NNSHHNDVAS DDRLQDVHSE KDSFQAWPED IIGDVEKGLQ TTTQEDSYQV QTGLPTSYMA REAFKQCIKD RDGLRDFAFM HTGGYVIRSL TSESHIRSSK TWRSHLPWSK TPGASIMNSE TPGTALSGDG TAGNCWAFRG SVGQLGIGLS SKIDITALSI EHIGAQLAQD DITSAPREFK LWGVSDDEDT VAGKTPLFKG VYSISSPSST LQMFELNKSH KLLFSKVLFK VISNYGHPVF TCLYRVRIHG QPIF // ID F4RGE9_MELLP Unreviewed; 365 AA. AC F4RGE9; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG08664.1}; GN ORFNames=MELLADRAFT_84672 {ECO:0000313|EMBL:EGG08664.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883100; EGG08664.1; -; Genomic_DNA. DR RefSeq; XP_007408250.1; XM_007408188.1. DR EnsemblFungi; EGG08664; EGG08664; MELLADRAFT_84672. DR GeneID; 18933568; -. DR KEGG; mlr:MELLADRAFT_84672; -. DR InParanoid; F4RGE9; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 365 AA; 40491 MW; 0C7B4E6DDFFB09F5 CRC64; MMPNLEVCQS SEGVPLSFGR SAGLRARKAI TELSRHAPAM HIWLLIFLAM TTCGRIHLLD RRLKKAEGDL QYMRVHLNCQ TCTGDSASVA ELRIPNLDVR GAVAVATGHV RVYTEIDDIE TSMNIVGKEI QTITQENELR TEDNNQLGRI TEDCSSSGNL DMETSYMARE AFKQCIKDRD GQRDFAFLKT GGYVIQSLTS PSFVPSTTWR RRLSWWIALR GSNLQASLPE IALKGDGSAG NCWPLEGPVG QLGIGLSRTI DITSLSIEHI GAQLAQNDIT AAPRDFELWG LDDNHNQSAD GTLLFRGVYR ISSPSSPLQT FEIPRTHPLV YSKVLFKITS NYGHSGFTCI YRVRIHGRPI VGKDI // ID F4RLI4_MELLP Unreviewed; 379 AA. AC F4RLI4; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG06747.1}; GN ORFNames=MELLADRAFT_86354 {ECO:0000313|EMBL:EGG06747.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883107; EGG06747.1; -; Genomic_DNA. DR RefSeq; XP_007410187.1; XM_007410125.1. DR EnsemblFungi; EGG06747; EGG06747; MELLADRAFT_86354. DR GeneID; 18934162; -. DR KEGG; mlr:MELLADRAFT_86354; -. DR InParanoid; F4RLI4; -. DR KO; K19347; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 379 AA; 41719 MW; CC04188C848D8A6D CRC64; MSTSEMRHKM PPPEKGNPHS FGRRAGFRVR AAVDQLSRHA PAMHVLLLIL LSITTCGRTH FLEQQLKKVE GDLFDMRVQL VQPCTSLSAS WEGLKYKQRG TFNDPDAVNI AESLHPSIIS PAFVKKNLHH VDEGAADTRF EDVHAGTDGV QVWPEDLIGD VEKGLQTTTQ EDPNQVQTGL RTSYMAREAF KQCIKDRDGP RDFAFMHTGG YVIQSLTSKS YIASKSWRRH LPWSKTSRSS NTNTQTPGTA LSGDGSAGNC WAFVGSVGQL GIGLSSTIDI TALSIEHIGA QLAQNDITSA PREFELWGVS DDGDTVASTT PLFQGVYNIS SPSSTLQTFE LNKSHKFAYS KVLFRVTSNH GNPGFTCIYR VRIHGQSIL // ID F4RMG3_MELLP Unreviewed; 232 AA. AC F4RMG3; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG06424.1}; GN ORFNames=MELLADRAFT_36137 {ECO:0000313|EMBL:EGG06424.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883108; EGG06424.1; -; Genomic_DNA. DR RefSeq; XP_007410258.1; XM_007410196.1. DR EnsemblFungi; EGG06424; EGG06424; MELLADRAFT_36137. DR GeneID; 18927531; -. DR KEGG; mlr:MELLADRAFT_36137; -. DR InParanoid; F4RMG3; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 232 AA; 26043 MW; C31B6FD0A3A30EEA CRC64; MEETDELNQI ISQSILRFSL TDGGINQPDY ALFSGGARII PDLTSATFDI KPKGFIKSTL SSLIGRGNLV IARGPSTVLE PDRSIGKCWP MNGNLGQIGI ALARKIVVKE IVIEHVQFSL AYELDSALRE FEVLGYDEVR KVWRTLGNGT FDIFDSKVNS IQRFQMNYPS EEEDDQFETG LVVLKVLSNH GNSEFTCLYR IRVLGERRNL GNGLVDHQSS SQEQEQERIE MV // ID F4RPG7_MELLP Unreviewed; 364 AA. AC F4RPG7; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG05526.1}; GN ORFNames=MELLADRAFT_87786 {ECO:0000313|EMBL:EGG05526.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883112; EGG05526.1; -; Genomic_DNA. DR RefSeq; XP_007411015.1; XM_007410953.1. DR EnsemblFungi; EGG05526; EGG05526; MELLADRAFT_87786. DR GeneID; 18934641; -. DR KEGG; mlr:MELLADRAFT_87786; -. DR InParanoid; F4RPG7; -. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 364 AA; 40215 MW; 2DEF0B8D0567D920 CRC64; MPSLEVCQSS EGVPLSFGRS AGLRARKAIT ELSRHAPAMH IWLLIFLAMT TCGRIHLLDR RLKKAEGDLQ YMRVHLNSQT SSGDTASMVE LRTPNPDVRG AVPVATGHVR VYTEIDDIEI SMNVDSEGIH TITQENKLRT GDNNQLGRIT EDCSSSGDLE METSYMAREA FKQCIKDRDG QRDFAFLKTG GYVIQSLTSP SFVPSTTWRR RLSWWRALRG SNLQASLPEI ALKGDGSAGN CWPLEGPVGQ LGIGLSRTID ITSLSIEHIG AQLAQNDITA APREFELWGL DDNHNQSANG TLLFKGVYSI SSPSSPLQNF EIPRTHPLVY SKVLFKITSN YGHSGYTCIY RVRIHGKPIA GKDI // ID F4RTF4_MELLP Unreviewed; 404 AA. AC F4RTF4; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG04347.1}; GN ORFNames=MELLADRAFT_89456 {ECO:0000313|EMBL:EGG04347.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883119; EGG04347.1; -; Genomic_DNA. DR RefSeq; XP_007412476.1; XM_007412414.1. DR EnsemblFungi; EGG04347; EGG04347; MELLADRAFT_89456. DR GeneID; 18935198; -. DR KEGG; mlr:MELLADRAFT_89456; -. DR InParanoid; F4RTF4; -. DR KO; K19347; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 46 63 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 59 79 {ECO:0000256|SAM:Coils}. FT COILED 96 116 {ECO:0000256|SAM:Coils}. FT COILED 126 146 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 404 AA; 45689 MW; 9DDDEF44FB651B20 CRC64; MPGQSTEHRA SRSHINVEER ESCQHSVGKR WGRMVKLLIS RSIRRWYYIL LSAVVLWVFI SYSERLQIVN KKVNELEAEI RYIQEGNISG NNSTTLNIMD EKLIELVNQV RRLQETENLG PILNELHKME KKVADLEIRV QYLQEVHDSW ILALSTVPHN IGLWNEITTL RVPSQINQLP SQPPKPTLQD STQEIIINAI TSISRVLQMI EHFKNVAQDA LEICTKDKDG RRDFAFIQTG GSVIASLTSE SISLNSSSRS WKTLFASLSG RPSVTLPDIV LIGDGSIGAC WAFRGSKGQL GIALSDPIKI TGVTVEHIGK ELAQDSIDVA PKDFELWGVV DDKDKHSEFF LFSGFYDTSK ASIAQTFEFT PTKEVYKKVI FKINSNNGNK QYTCVYRVRI HGVM // ID F4RV02_MELLP Unreviewed; 163 AA. AC F4RV02; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG03787.1}; GN ORFNames=MELLADRAFT_72518 {ECO:0000313|EMBL:EGG03787.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883122; EGG03787.1; -; Genomic_DNA. DR RefSeq; XP_007412901.1; XM_007412839.1. DR EnsemblFungi; EGG03787; EGG03787; MELLADRAFT_72518. DR GeneID; 18932119; -. DR KEGG; mlr:MELLADRAFT_72518; -. DR InParanoid; F4RV02; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. FT COILED 142 162 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 163 AA; 18956 MW; 3C1D01B2CC840EE6 CRC64; MLTPCRAPNQ SKKNKSNRSS THPTHSDSNF VIFELGDEIE IDHVVLANYE FFSSMYKLIR ITVSNSGLGG AGGIKWVEVG RFKTRNVRGI QVFPIKHLKG FYRYVRLDFL SHYGSKYFCP LSLVRIYGLT QIDAYGRDEE LERRRQLELK EFEDDLQDHE KTC // ID F4RZP4_MELLP Unreviewed; 380 AA. AC F4RZP4; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG02150.1}; GN ORFNames=MELLADRAFT_91615 {ECO:0000313|EMBL:EGG02150.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883133; EGG02150.1; -; Genomic_DNA. DR RefSeq; XP_007414687.1; XM_007414625.1. DR EnsemblFungi; EGG02150; EGG02150; MELLADRAFT_91615. DR GeneID; 18935978; -. DR KEGG; mlr:MELLADRAFT_91615; -. DR InParanoid; F4RZP4; -. DR KO; K19347; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. SQ SEQUENCE 380 AA; 41894 MW; A9BAD8DD5148EC5D CRC64; MSTSEMRHKM PPPEKGNPHS FGRRAGFRVR AAVDQLSRHA PAMHVLLLIL LSITTCGRTH FLEQRLKKVE GDLFDMRVQL VQPCTSLSAS WEGLKYKQRG TFNDPDAVNI AESLHPSIIS PAFVKKNLHH VDEGAADTRF EDVHAGTDGV QVWPEDLIGD VEKGLQTTTQ EDPNQVQTGL RTSYMAREAF KQCIKDRDGP RDFAFMHTGG YVIQSLTSKS YIASKSWRRH LPWSKTSRSS KYTNTKTPGT ALSGDGSAGN CWAFVGSVGQ LGIGLSSTID ITALSIEHIG AQLAQNDITS APREFELWGV SDDGDAVAST TPLFQGVYNI SSPSSTLQTF ELNKSHKFAY SKVLFRVTSN HGNPGFTCIY RVRIHGQSIL // ID F4S2N6_MELLP Unreviewed; 292 AA. AC F4S2N6; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG01096.1}; GN ORFNames=MELLADRAFT_92765 {ECO:0000313|EMBL:EGG01096.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883141; EGG01096.1; -; Genomic_DNA. DR RefSeq; XP_007415696.1; XM_007415634.1. DR EnsemblFungi; EGG01096; EGG01096; MELLADRAFT_92765. DR GeneID; 18936365; -. DR KEGG; mlr:MELLADRAFT_92765; -. DR InParanoid; F4S2N6; -. DR KO; K19347; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 39 57 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 292 AA; 31899 MW; FD49DF8F06980C72 CRC64; MPSLEVCQSS EGVPLSFGRS AGLRARKAIT ELSRHAPAMH IWLLIFLAMT TCGRIHLLDR RLKKAEGDLQ YMRVHLNSQT SSGDTASMVE LQTPNPDVQG AVPVATDGQR DFAFLKTGGY VIQSLTSPSF VPSTTWRQCL SWWRALRGSN LQASLPEIAL KGDGSAGNCW PLEGPVGQLG IGLSRTIDIT SLSIEHIGAQ LAQNDITAAP REFELWGLDD NHNQSANGTL LFKGVYSISS PSSPLQNFEI PRTHPLVYSK VLFKITSNYG HSGYTCIYRV RIHGKPIAGK DI // ID F4S3U4_MELLP Unreviewed; 334 AA. AC F4S3U4; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG00735.1}; GN ORFNames=MELLADRAFT_93073 {ECO:0000313|EMBL:EGG00735.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883144; EGG00735.1; -; Genomic_DNA. DR RefSeq; XP_007416006.1; XM_007415944.1. DR EnsemblFungi; EGG00735; EGG00735; MELLADRAFT_93073. DR GeneID; 18936457; -. DR KEGG; mlr:MELLADRAFT_93073; -. DR InParanoid; F4S3U4; -. DR KO; K19347; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 36 55 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 334 AA; 36893 MW; 2A8BB1F0A20E3CE6 CRC64; MPQKLRPNNG DPHSFGRRAG FRVRAAVHEL SRHTPAIHVV LLIFLSITIC GRIHLLDHRL KKAKGELFYI RVHTTSQPCT SVWASSEGFQ SKQRDAFNDP ETVKIAESHH LSSIIPAFVE SNSHHNDVAS DDRLQDVHSE KDSFQAWPED IIGDVEKGLQ TTTLKDSYQV QTGLPTSYMA REAFKQCIKD RDGASITNSE TPGTALSGDG TAGNCWAFRG SVGQLGIGLS SKIDITALSI EHIGAQLAQD DITSAPREFE LWGVSDDEDT VAGTTPLFKG VYSISSPSST LQMFELNKSH KLLFSKVLFK VISNYGNPVF TCLYRVRIHG QPIF // ID F4S5A2_MELLP Unreviewed; 163 AA. AC F4S5A2; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGG00195.1}; DE Flags: Fragment; GN ORFNames=MELLADRAFT_57672 {ECO:0000313|EMBL:EGG00195.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883150; EGG00195.1; -; Genomic_DNA. DR RefSeq; XP_007416598.1; XM_007416536.1. DR EnsemblFungi; EGG00195; EGG00195; MELLADRAFT_57672. DR GeneID; 18929122; -. DR KEGG; mlr:MELLADRAFT_57672; -. DR InParanoid; F4S5A2; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}. FT COILED 142 163 {ECO:0000256|SAM:Coils}. FT NON_TER 163 163 {ECO:0000313|EMBL:EGG00195.1}. SQ SEQUENCE 163 AA; 19166 MW; 1B3751EC6956E693 CRC64; MLTPCRAPNQ SKKNKSNRSS TDPTHSDSNF VIFELCDEIE IDHVVLANYE FFSSMYKLIR ITVSNSGLEG AGGIKWVEVG LFKTRNVRGI QVFPIKHLKG FYRYVRLDFL SHYGSEYFCP LSLVRIYGLT QIDAYRRDEE LERRRQLELK EFEDDLQDHE EEM // ID F4S6S0_MELLP Unreviewed; 411 AA. AC F4S6S0; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGF99670.1}; GN ORFNames=MELLADRAFT_94181 {ECO:0000313|EMBL:EGF99670.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883156; EGF99670.1; -; Genomic_DNA. DR RefSeq; XP_007417087.1; XM_007417025.1. DR EnsemblFungi; EGF99670; EGF99670; MELLADRAFT_94181. DR GeneID; 18936807; -. DR KEGG; mlr:MELLADRAFT_94181; -. DR InParanoid; F4S6S0; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 46 64 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 126 146 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 411 AA; 46281 MW; 6DCE742D20BF9FD0 CRC64; MPDQRTNHRH STSHIHIEER EIYQHSIGKK CGLVVKLLIS RSIRQSYLIV LSAFVLWFFT SYSATLHITN KNLIELQAEF RSIREGDISG DNSMALKIMD EKLKELGNHV RRIQDPENLQ PILNRLHMTE KKVDELETQV QHLQQVHDYW ISAFSIVSHD LGLWDEITTL QVPGQFTPLS ETPESTLKDS THDKINNAML NFRQFLQMIE HFKTIAQDAL KTCTKDKDGR RDFAFIQTGG SVIAGLTSKS ISLDRGVPAP LEGPRRWGTL FAPLSGLPSV TLPKIVLTGD GSIGACWAFN GSKGQLGIAL SHPINIAGVT VEHIGKELAQ DSIDAAPKDF ELWGLVEDKD VNSEFFLFGA FYNTSKASIA QTFEFTPTKD VYNKVIFKIN SNHGNQHHTC VYRVRIHGII L // ID F4SBI7_MELLP Unreviewed; 411 AA. AC F4SBI7; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGF97995.1}; GN ORFNames=MELLADRAFT_84107 {ECO:0000313|EMBL:EGF97995.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883189; EGF97995.1; -; Genomic_DNA. DR RefSeq; XP_007418735.1; XM_007418673.1. DR EnsemblFungi; EGF97995; EGF97995; MELLADRAFT_84107. DR GeneID; 18933345; -. DR KEGG; mlr:MELLADRAFT_84107; -. DR InParanoid; F4SBI7; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 46 64 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 126 146 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 411 AA; 46370 MW; EF9E3995823AD8A6 CRC64; MPDQRTNHRH STSHIHIEER EIYQHSIGKK CGLVVKLLIS RSIRQSYLIV LSAFVLWFFT SYSATLHITN KNLIELQAEF RSIREGDISG DNSMALKIMD EKLKELGNHV RRIQDPENLQ PILNRLHMTE KKVDELETQV QHLQQVHDYW ISAFSIVSHD LGLWDEITTL QVPGQFTPLS ETPESTLKDS THDKINNAML NFRQFLQMIE HFKTIAQDAL KTCTKDKDGR RDFAFIQTGG SVIAGLTSKS ISLDRGVPAP LEGPRRWGTL FAPLSGRPSV TLPEIVLTGD GSIGAFWAFN GSKGQLGIAL SHPINIAGVT VEHIGKELAQ DSIDAAPKDF ELWGLVEDKD VNSEFFLFGA FYDTSKASIA QTFEFTPTKD VYNKVIFKIN SNHGNQHHTC VYRVRIHGII L // ID F4SE70_MELLP Unreviewed; 404 AA. AC F4SE70; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGF97056.1}; GN ORFNames=MELLADRAFT_88584 {ECO:0000313|EMBL:EGF97056.1}; OS Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) (Poplar OS leaf rust fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. OX NCBI_TaxID=747676 {ECO:0000313|Proteomes:UP000001072}; RN [1] {ECO:0000313|Proteomes:UP000001072} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=98AG31 / pathotype 3-4-7 {ECO:0000313|Proteomes:UP000001072}; RX PubMed=21536894; DOI=10.1073/pnas.1019315108; RA Duplessis S., Cuomo C.A., Lin Y.-C., Aerts A., Tisserant E., RA Veneault-Fourrey C., Joly D.L., Hacquard S., Amselem J., RA Cantarel B.L., Chiu R., Coutinho P.M., Feau N., Field M., Frey P., RA Gelhaye E., Goldberg J., Grabherr M.G., Kodira C.D., Kohler A., RA Kuees U., Lindquist E.A., Lucas S.M., Mago R., Mauceli E., Morin E., RA Murat C., Pangilinan J.L., Park R., Pearson M., Quesneville H., RA Rouhier N., Sakthikumar S., Salamov A.A., Schmutz J., Selles B., RA Shapiro H., Tanguay P., Tuskan G.A., Henrissat B., Van de Peer Y., RA Rouze P., Ellis J.G., Dodds P.N., Schein J.E., Zhong S., Hamelin R.C., RA Grigoriev I.V., Szabo L.J., Martin F.; RT "Obligate biotrophy features unraveled by the genomic analysis of rust RT fungi."; RL Proc. Natl. Acad. Sci. U.S.A. 108:9166-9171(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL883361; EGF97056.1; -; Genomic_DNA. DR RefSeq; XP_007419674.1; XM_007419612.1. DR EnsemblFungi; EGF97056; EGF97056; MELLADRAFT_88584. DR GeneID; 18934927; -. DR KEGG; mlr:MELLADRAFT_88584; -. DR InParanoid; F4SE70; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001072; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001072}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001072}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 46 63 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 59 86 {ECO:0000256|SAM:Coils}. FT COILED 126 146 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 404 AA; 45503 MW; 507E32DFC59BC999 CRC64; MPGQSTEHRA CPSHINVTER ESCQHSVGKR WGRMVKLLIS RSVRRWYYIL LSAVVLWFFI SYSERLQILN KKVNELEAEI RYIQEGNISG NNSTTLIMMD EKIIELVNQV RRLQETENLG PILNKLHKTK KKVADLELQV QHLQEVHDSW ISALSTVPHN IGLWNEITTL RVPSQINQLP SQPPKPTLQD STQDIIINAI TSIGRVLQMI EHLKNVAQDA LEICTKDKDG RQDFAFIQTG GSVIPSLTSE SISLDSSSRS WKTLFASLSG RPSVTLPDIV LIGDGSIGAC WAFRGSKGQL GIALSDPIKI TGVTVEHIGK ELAQDSIDVA PKDFELWGVV DDKDKHSEFF LFSSFYDTSK ASIAQTFEFT PTKEVYKKVI FKINSNNGNK QYTCVYHVRI HGVM // ID F4W731_ACREC Unreviewed; 1291 AA. AC F4W731; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 11. DE SubName: Full=Protein C1orf9-like protein {ECO:0000313|EMBL:EGI70021.1}; GN ORFNames=G5I_01239 {ECO:0000313|EMBL:EGI70021.1}; OS Acromyrmex echinatior (Panamanian leafcutter ant) (Acromyrmex OS octospinosus echinatior). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Acromyrmex. OX NCBI_TaxID=103372 {ECO:0000313|Proteomes:UP000007755}; RN [1] {ECO:0000313|Proteomes:UP000007755} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21719571; DOI=10.1101/gr.121392.111; RA Nygaard S., Zhang G., Schiott M., Li C., Wurm Y., Hu H., Zhou J., RA Ji L., Qiu F., Rasmussen M., Pan H., Hauser F., Krogh A., RA Grimmelikhuijzen C.J., Wang J., Boomsma J.J.; RT "The genome of the leaf-cutting ant Acromyrmex echinatior suggests key RT adaptations to advanced social life and fungus farming."; RL Genome Res. 0:0-0(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL887802; EGI70021.1; -; Genomic_DNA. DR InParanoid; F4W731; -. DR Proteomes; UP000007755; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007755}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007755}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 938 959 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 113 133 {ECO:0000256|SAM:Coils}. FT COILED 378 398 {ECO:0000256|SAM:Coils}. FT COILED 864 892 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1291 AA; 143984 MW; 92F88B1FA78318B7 CRC64; MYVCLICRIS ETGQAVVLTL VDTAAELQNL AEPKIDHEFS SRSSIKNSSL LGTNQVRQSE HLKETVPTTQ IPPPPIVDET TERPNDTDVL EDEEEVLLLK KITAEEPPEV VVIVRAEQKI NTDELELRVE EEDAGQAQVS EELSSKIDDT FTTAPELNDT AARARLVGDS RDETAAVILN GLVTSGPTEP HEDIPSFSEW AQKRLEEAEK KKTHPNASVQ TPGGPGRSVS GMKIRNKNYA SPDCGAKIVA ANPEANSAKN VLVSTRDEYM LNACTSRVWF VVELCEAIQA KKIELANFEL FSSSPKDFSV YVSDRFPTKD WSPVGQFTAK DVKDVQSFTL HPHFFGKFIK VELQSHYGSE HFCPVSLFRA YGTSEFEVLE TETENETLEE KNTDEDDDED SDEEELLDGE SASNLFGSAR DAVLSVVKKA AEILVKSSGL TGNNITQIQQ SIDHGNILDN SYTSCTTPRY TILCGNCTDQ KFASVFQLVS CKNQQLDDLL RIDLVNRTLR RGKLCGLHGV EIESFWQERE EDKTKDDDLT HFNLAEDFQA TFLTSFFKPE YIVALCNVLA TKERKVVMNT SYEIPVNKSK DAASKNILST KDTDRVDITF HQTSSTPDPC TLDSSSSACK SAASSKEVRQ HPLTQDVKES ENISIATIET SSSFPESLAS QIKPTKTLSK EDLKKESSVP ILEPSKEFTE ETLQSKVLTT AAPLSNPTPT LKIAEELAVL GTPIETLSIS NVPLINIDSK ETETLMPDME SAETITQVKT EKSENNEQDG RQVKDLSEQE ARLSPQDHLS LDSLLSDLKD LEVDTANIQN GASSSSPTTQ PTANVVPQKE SVFLRLSNRI KILERNMSLS GQYLEELSRR YKKQVEEMQR SLERAVAAMG EESRKGEERD AKRAEEITAL REEIVILSKS VETLLYDRDS WRSRISAIVQ HALLICLEVI VIILILSYCR RREGFEEEKL ESDTRKDTMR RKSAENFSSH AAAKKIKKRR PSEIASYISG TYHELMIDDR PFETKKERKK KRKKEALTAG TKAVNIDAKQ EVVHYKSVLN AIPGGTTLPS RRASSIDPPH SKESQDSVDK RPESAPETAI GWFDDQIERI ERITQPTSED KTYKIESRMN SDLSRQDKLG KLGKPNTSLV IVNRSVEKSG PRLNTVERKN ETSKNGSFRA GSILKGTRLS SPSFMKTALG KRKLSTNSTN STNSEKWEWS QDSEHSNDRS SQSSPTDFKT LSQIISDRAN GSTANGLIEE SDESRSSSAT PTSIKKEKRS TGLKKMVRKF F // ID F4WDJ4_ACREC Unreviewed; 858 AA. AC F4WDJ4; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Protein unc-84-like protein A {ECO:0000313|EMBL:EGI67774.1}; GN ORFNames=G5I_03647 {ECO:0000313|EMBL:EGI67774.1}; OS Acromyrmex echinatior (Panamanian leafcutter ant) (Acromyrmex OS octospinosus echinatior). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Acromyrmex. OX NCBI_TaxID=103372 {ECO:0000313|Proteomes:UP000007755}; RN [1] {ECO:0000313|Proteomes:UP000007755} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21719571; DOI=10.1101/gr.121392.111; RA Nygaard S., Zhang G., Schiott M., Li C., Wurm Y., Hu H., Zhou J., RA Ji L., Qiu F., Rasmussen M., Pan H., Hauser F., Krogh A., RA Grimmelikhuijzen C.J., Wang J., Boomsma J.J.; RT "The genome of the leaf-cutting ant Acromyrmex echinatior suggests key RT adaptations to advanced social life and fungus farming."; RL Genome Res. 0:0-0(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL888087; EGI67774.1; -; Genomic_DNA. DR RefSeq; XP_011050839.1; XM_011052537.1. DR RefSeq; XP_011050840.1; XM_011052538.1. DR RefSeq; XP_011050841.1; XM_011052539.1. DR RefSeq; XP_011050842.1; XM_011052540.1. DR RefSeq; XP_011050843.1; XM_011052541.1. DR GeneID; 105143938; -. DR KEGG; aec:105143938; -. DR InParanoid; F4WDJ4; -. DR KO; K19347; -. DR Proteomes; UP000007755; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007755}; KW Reference proteome {ECO:0000313|Proteomes:UP000007755}. FT COILED 510 530 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 858 AA; 99709 MW; 02C589093D4076F6 CRC64; MENEQHHYEL RSRSRSRSHT PMVPNRSQQD PEVTERHYDL RNWSRERSHT PGEVTSSRRS GSRSLTGSVK THEKNMETIE ENKEGSVTES LTDNQSTKSE GSVVTIKKAE RRSERQRAKR QIFVNGQSES KEDASDKKAE RRHRSVTPRR VFTSDYSSEE GEREDPPSRP GSAYEIYKQA GEWWNVFPKT DYTYSPTSQC RYEIAPGILA MPNMSRRPIH VNNNGSTISQ TSHRNLSQAS RGTTESGISD MDTVDLKETT SLIHSFGDNC DMSRAGMSNS TAKTTLYKKT HVKQYTSHKE IIYSDPRCSS NFRSSYLSWA NTPIGKYDAS SHADSDTDLD DTYVKSSVKT NQSWRVVQWF TYFITFIVTC FRKTVEFFKF KTNGKRQYYV SQAYRSSNES KWSSLWQTLD RYTHNMYFFF VRMLVLDAWL LSRFTGIRKW LQEKSPRILW ITLLPLLLLF GGWCIVQCLS LLSDVKTVTE TAIEKSLNNV QWEDHQNKII EKILTDNEII KENLVNKADL VNRIEMLENR QTHQMDYLIN ITRTVEDRKQ SDADFRKEYD DKIINVENKL GVSELKNIAY SELKVIKDEF EELRKLYSEL KSCCNANAES IINQDIEKHV EKILLSYFPS GILKEDLGQN LQNLLASHNR EAQKVLDNAN VHTSDEHIRK IVKEVLRIYD ADKTGQVDYA LETAGGQIIS TRCTQRYDIK SRAFSLFGFT LYYESNNPRT VIQGNPIQPG VCWAFQDFPG YLLIQLRSFI YVTGFTLEHV SKLILPNENM SSAPRKFNVW GLLNENDLEP VMFGEYEFTY SDESLQYFPV QNTEVNKPYE YIELRIHSNH GQLDYTCLYR FRVHGRLA // ID F4WKJ1_ACREC Unreviewed; 1838 AA. AC F4WKJ1; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:EGI65282.1}; GN ORFNames=G5I_06252 {ECO:0000313|EMBL:EGI65282.1}; OS Acromyrmex echinatior (Panamanian leafcutter ant) (Acromyrmex OS octospinosus echinatior). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Acromyrmex. OX NCBI_TaxID=103372 {ECO:0000313|Proteomes:UP000007755}; RN [1] {ECO:0000313|Proteomes:UP000007755} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21719571; DOI=10.1101/gr.121392.111; RA Nygaard S., Zhang G., Schiott M., Li C., Wurm Y., Hu H., Zhou J., RA Ji L., Qiu F., Rasmussen M., Pan H., Hauser F., Krogh A., RA Grimmelikhuijzen C.J., Wang J., Boomsma J.J.; RT "The genome of the leaf-cutting ant Acromyrmex echinatior suggests key RT adaptations to advanced social life and fungus farming."; RL Genome Res. 0:0-0(2011). CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL888200; EGI65282.1; -; Genomic_DNA. DR InParanoid; F4WKJ1; -. DR Proteomes; UP000007755; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007755}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:EGI65282.1}; KW Reference proteome {ECO:0000313|Proteomes:UP000007755}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 174 194 {ECO:0000256|SAM:Coils}. FT COILED 1036 1058 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1838 AA; 202289 MW; 1C6E7066DDA8DF57 CRC64; MYSSGSPEGG TDTSENRGEF LEKLQRARSQ LKVNFISQPV LSRPGTTRLV VGNWALSSRK ESELCIHNSD GQQQATILRE DLPGFIFESN RGTKHSFTAE TSLGPEFAAG WTGKRGKRLR SKIEAIKQKV KVQAQEIYEC YFKAAQAQPR GVVAKLGAIV NQIEKASQKQ QSGSREWRNT LQTALEQLKI LLNEEGRVSA YELHSSGLVQ ALLVLLAAPS GPTPPTLRAT KLRMQRITAF KNCFQTKDTN KELNSAKILV HKLISVLESI EKLPVYLYDT PGSGYGLQIL TRRLRFRLEK ASSESALIDR SGRSLKMEPL STIQQLENHL LKMVAKQWHD HDRSTFAFVK RLKEENRVTF KYQHDFDENG LLYWIGTNAK TCSEWVNPGQ YGLVVVTSSD GRNLPYGHLE DILSRDPSAL NCHTNDDKRA WFSIDLGVWI IPSAYTLRHA RGYGRSALRN WMFQASKDGI TWITLYAHVD DCSLNEPGST STWALEPPSE ETQGWRHLRL QQIGKNASGQ THYLSVSGFE VYGEVTGVCE DLGRAAKEAE AGVRKQRRFI KMQVLKHLVA GVRVARGLDW KWRDQDGVPP GEGTVTGELH NGWIDVTWDH GGSNSYRMGA EGKYDLRLVG AGLDTDNATK CKSGGGVLTG RKSNSTPSLP DCTDTAMRSS VASTDQAASA DNLAAKQAAE SIAESVLSVA RAEAVVAVTG ESGANSTSEL SVVLHPRPDT TVTSDLATIV ESLTLNTDCP VNSTSNRASS SKPLFATVRG NKTSGGLLSL ETAEVLDRMR EGADRLRNNT NSFLSGELLG LVPVRISVSG ESDENLRIKS VPRHHPTGIA DVAKDCTREK EASSSTQNTT GGCPVVVTNP MSVSVPNLAC SDANNTLEST AATGLLETFA AMARRRTLGP AGGQHHLASN SNTNCNPIRG PNSVSSLVRL ALSPNFPGGL LSTAQSYPSL TSSGQVAGSG VTTTTGPGLG QALTMSLTST SSDSEQVSLE DFLESCGGVA TSSTGGGRTT GGPTLLTELE DDEDGVLEEE EDNEENDQEE EDEENEEEGD GCEGEYEEVM VSRNLLAAFM EEEAPQSSKR RAWDDEFVLK RQFSALIPAF DPRPGRTNIN QTTDLEVPSP GSETQVNSRI GSLPMPRLSL SLKGPGFPGI PDVEISLSDS HASIFQAVQE LMQLTELGSR QEKLKRIWEP TYTIIYKEAR DEESSGRATP IVTLYSRNPT QNTNACTVED ILQLLRHVFV LGTIRDEGIL AEQNESNDTT YWLHPDDFTS KKITNKIVQQ IQDPLALAAG ALPNWCEELA RSCPFLLPFE TRRLYFSCTA FGASRSIVWL QTQRDAILER QRAPGLSPRR DDSHEFRVGR LKHERVSVPR GEKLLDWAEQ VLKVHASRKS ILEVAFIGEE GTGLGPTLEF FALVAAELQR KDLSLWLCDD TADDNATRIL NEEQTCISGE KIRPAGYYVT RVSGLFPAPL PQDSACCDRA VRYFWFLGVF LAKVLQDNRL VDLPLSRPFL KLMCRGDISN NVNEKIGLTG VTQESMSSSM SSSFISEEGE NDAVYSSLEP CPWYAGLLDI EDLVEVDPVR GEFLREIQNA ITKRDRTCSD GPSSTDEETS LYITHPSGTS VAIEDLALTM TYSPSSKVFQ HDQVELMEGG LDITVTRENV REYANLTINY CLNQGIYRQL EAFKSGFSKV FPMEKLHVFS PDEMRAMLCG EQNPQWTRED LLNYTEPKLG YTKESPGFQR FVNVLLSLTG SERKAFLQFA TGCSALPPGG LCNLHPRLTV VRKVDAGSGG YPSVNTCVHY LKLPEYPTEE ILRERLLAAT RERGFHLN // ID F4WXT4_ACREC Unreviewed; 280 AA. AC F4WXT4; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Protein unc-84-like protein A {ECO:0000313|EMBL:EGI61012.1}; GN ORFNames=G5I_10774 {ECO:0000313|EMBL:EGI61012.1}; OS Acromyrmex echinatior (Panamanian leafcutter ant) (Acromyrmex OS octospinosus echinatior). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; OC Vespoidea; Formicidae; Myrmicinae; Acromyrmex. OX NCBI_TaxID=103372 {ECO:0000313|Proteomes:UP000007755}; RN [1] {ECO:0000313|Proteomes:UP000007755} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21719571; DOI=10.1101/gr.121392.111; RA Nygaard S., Zhang G., Schiott M., Li C., Wurm Y., Hu H., Zhou J., RA Ji L., Qiu F., Rasmussen M., Pan H., Hauser F., Krogh A., RA Grimmelikhuijzen C.J., Wang J., Boomsma J.J.; RT "The genome of the leaf-cutting ant Acromyrmex echinatior suggests key RT adaptations to advanced social life and fungus farming."; RL Genome Res. 0:0-0(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL888427; EGI61012.1; -; Genomic_DNA. DR InParanoid; F4WXT4; -. DR Proteomes; UP000007755; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007755}; KW Reference proteome {ECO:0000313|Proteomes:UP000007755}. SQ SEQUENCE 280 AA; 31199 MW; 96A970246DC785B1 CRC64; MTVSHLCDKY SASASLKIIK ADLGNLRSHL GTLSLEVKNV MEMRDELKSK LKEVGSVIPK MSEAILNLRN EVSEGGAILS IRDTEPYSTG APVLNLFGIP LCQQQNTPRA MIQTGVLPGE CWAFKGSSGS VVIRLLGHVH VSGVSLEHIS SLISPTGETA TAPKDFSVWG LSDLDDKKPF SFGSFMYDNT GSPLQYFEVQ NRGKKAYDII EVKVHSNSGN PEYTCIYRIR VHGTLSETYQ VRENHIFPKI LLFSYFKSDI NKKSSFRIRE KLLFNVTENY // ID F5HC96_CRYNB Unreviewed; 718 AA. AC F5HC96; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAL18987.1}; GN OrderedLocusNames=CNBI2480 {ECO:0000313|EMBL:EAL18987.1}; OS Cryptococcus neoformans var. neoformans serotype D (strain B-3501A) OS (Filobasidiella neoformans). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. OX NCBI_TaxID=283643 {ECO:0000313|EMBL:EAL18987.1, ECO:0000313|Proteomes:UP000001435}; RN [1] {ECO:0000313|Proteomes:UP000001435} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=B-3501A {ECO:0000313|Proteomes:UP000001435}; RX PubMed=15653466; DOI=10.1126/science.1103773; RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., Bruno D., RA Vamathevan J., Miranda M., Anderson I.J., Fraser J.A., Allen J.E., RA Bosdet I.E., Brent M.R., Chiu R., Doering T.L., Donlin M.J., RA D'Souza C.A., Fox D.S., Grinberg V., Fu J., Fukushima M., Haas B.J., RA Huang J.C., Janbon G., Jones S.J.M., Koo H.L., Krzywinski M.I., RA Kwon-Chung K.J., Lengeler K.B., Maiti R., Marra M.A., Marra R.E., RA Mathewson C.A., Mitchell T.G., Pertea M., Riggs F.R., Salzberg S.L., RA Schein J.E., Shvartsbeyn A., Shin H., Shumway M., Specht C.A., RA Suh B.B., Tenney A., Utterback T.R., Wickes B.L., Wortman J.R., RA Wye N.H., Kronstad J.W., Lodge J.K., Heitman J., Davis R.W., RA Fraser C.M., Hyman R.W.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307:1321-1324(2005). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAL18987.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEY01000044; EAL18987.1; -; Genomic_DNA. DR RefSeq; XP_773634.1; XM_768541.1. DR STRING; 283643.XP_773634.1; -. DR EnsemblFungi; EAL18987; EAL18987; CNBI2480. DR GeneID; 4938208; -. DR KEGG; cnb:CNBI2480; -. DR EuPathDB; FungiDB:CNBI2480; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001435; Chromosome 9. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001435}. FT COILED 146 193 {ECO:0000256|SAM:Coils}. FT COILED 348 368 {ECO:0000256|SAM:Coils}. FT COILED 664 684 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 718 AA; 80189 MW; DC750D46F0FAEF72 CRC64; MLTPCRADEH WVVVELCDEI RIEAVEIAVW EFFSGVVREV RVSVGGEDEE DDAEEPGQDD VAGRGHRWKQ VGSFIGKNVR GSQTFSLSQP TSFHRFIRLD FPSYFGSEYY CPVSSLKVYG MNQMEAFKWE QKQLSAVAKD RDRTGNREHE EEERRAKERR EREKKERDER DKQEQREREL DELEKLLHEQ AGRLVPELLT ESGLFSSIDE TAPTNVPTVV SKRDGDSDSP PTNESMATSL IESTSIESTS IESPTSIESP STSYTRAVPP RSDSSESIYA FIIRRLNALE GNSSLVARYI EEQAKVMRSM LKQVQVGWDE WKGEWEDEDR GRWQQERMRQ EDRLGRVLSQ LEQQRIAFDA ERKAIETQLR VLADQLGYER RRGIAQLIIM VVIILLGAAS RSSTMDAILT PLLKEARRRR SDYYHRKSLS GPLAGLHIDM GAGRPPAIIG QARPTSTTPS AHPHRHSSST PTPRLKTSLS RAGSGHRSNT SLKRRGIVPQ VPPSYRSVSS SEFTFSPLSH LPPTSSPSPA NIPNPNPNPR NVRVSFPPPR QTPPPPSVSS RKLAQSAHLH HLHTTAAAAA AREDTERGIT ASMRRRRMRS SLVNDDNEQQ TTVSGLGSGK ADAGGGGGGG GGEEAERVVG AEDNSQGEWG TDDFDTEADD FDTEAEAEAE VSKVEDQVRD KKDSETDRKE QDQLGETEQQ PVREKRGVQG EHVGLARA // ID F6GW16_VITVI Unreviewed; 586 AA. AC F6GW16; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CCB44151.1}; GN OrderedLocusNames=VIT_18s0089g00340 {ECO:0000313|EMBL:CCB44151.1}; OS Vitis vinifera (Grape). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; Vitales; Vitaceae; Vitis. OX NCBI_TaxID=29760 {ECO:0000313|Proteomes:UP000009183}; RN [1] {ECO:0000313|Proteomes:UP000009183} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Pinot noir / PN40024 {ECO:0000313|Proteomes:UP000009183}; RX PubMed=17721507; DOI=10.1038/nature06148; RG The French-Italian Public Consortium for Grapevine Genome Characterization.; RA Jaillon O., Aury J.-M., Noel B., Policriti A., Clepet C., RA Casagrande A., Choisne N., Aubourg S., Vitulo N., Jubin C., Vezzi A., RA Legeai F., Hugueney P., Dasilva C., Horner D., Mica E., Jublot D., RA Poulain J., Bruyere C., Billault A., Segurens B., Gouyvenoux M., RA Ugarte E., Cattonaro F., Anthouard V., Vico V., Del Fabbro C., RA Alaux M., Di Gaspero G., Dumas V., Felice N., Paillard S., Juman I., RA Moroldo M., Scalabrin S., Canaguier A., Le Clainche I., Malacrida G., RA Durand E., Pesole G., Laucou V., Chatelet P., Merdinoglu D., RA Delledonne M., Pezzotti M., Lecharny A., Scarpelli C., Artiguenave F., RA Pe M.E., Valle G., Morgante M., Caboche M., Adam-Blondon A.-F., RA Weissenbach J., Quetier F., Wincker P.; RT "The grapevine genome sequence suggests ancestral hexaploidization in RT major angiosperm phyla."; RL Nature 449:463-467(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN594956; CCB44151.1; -; Genomic_DNA. DR RefSeq; XP_002271455.1; XM_002271419.2. DR RefSeq; XP_010644023.1; XM_010645721.1. DR RefSeq; XP_010644024.1; XM_010645722.1. DR RefSeq; XP_010644025.1; XM_010645723.1. DR ProteinModelPortal; F6GW16; -. DR STRING; 29760.VIT_18s0089g00340.t01; -. DR EnsemblPlants; VIT_18s0089g00340.t01; VIT_18s0089g00340.t01; VIT_18s0089g00340. DR GeneID; 100249908; -. DR KEGG; vvi:100249908; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; F6GW16; -. DR Proteomes; UP000009183; Chromosome 18. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009183}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009183}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 24 46 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 521 541 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 562 584 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 493 520 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 586 AA; 65758 MW; C941AD8D2AAB6829 CRC64; MQRSRRALLQ RRALEKAIIG RSRLYKVSLS LVFVLWGLVF LLSLWISHGD GYQDGSGMPL IGISTWDEAK QGLNLGSCSV DEHSLIETNS DNSYEGSRND AETKDFTNEL HSKGNVKSTL PVEEGSEVEK SSSDVKSEKD TPKNDRLSRA VPPGLDEFKS KAISYKSKSV TGQAGNVIHR VEPGGADYNY ASASKGAKVL ASNKEAKGAS NILGKDKDKY LRNPCSAEEK FVVIELSEET LVDTIEIANF EHYSSNPKDF ELLGSSVFPT DEWVKLGNFT AANVKHAQRF ALHEPKWVRY LKLNLLSHHG TEFYCTLSVV EVYGVDAVER MLEDLISVQD NPFVPEEITA EKKSIPSQPE PTEGNNLYQK PVSETESDPL LDKPEAIKSN MPDPVEEIRH QQVGRMPGDT VLKILMQKVQ SLDLSLSVLE RYLEDLNSRY GNIFKEFDKE IEEKDVLLEN IRSDIRNFLD SKEIITKDVS DLISWKSLVS LQLDNLLKDN ALLRAEVQKV QEDQTHMENK GIAVFLICLI FGFWAFARLL VDMMLSVYMA VSVNNRSDKS RNFCGTSSSW VFLLLSCSII IVILSL // ID F6HU31_VITVI Unreviewed; 402 AA. AC F6HU31; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CCB58254.1}; GN OrderedLocusNames=VIT_02s0025g01670 {ECO:0000313|EMBL:CCB58254.1}; OS Vitis vinifera (Grape). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; Vitales; Vitaceae; Vitis. OX NCBI_TaxID=29760 {ECO:0000313|Proteomes:UP000009183}; RN [1] {ECO:0000313|Proteomes:UP000009183} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Pinot noir / PN40024 {ECO:0000313|Proteomes:UP000009183}; RX PubMed=17721507; DOI=10.1038/nature06148; RG The French-Italian Public Consortium for Grapevine Genome Characterization.; RA Jaillon O., Aury J.-M., Noel B., Policriti A., Clepet C., RA Casagrande A., Choisne N., Aubourg S., Vitulo N., Jubin C., Vezzi A., RA Legeai F., Hugueney P., Dasilva C., Horner D., Mica E., Jublot D., RA Poulain J., Bruyere C., Billault A., Segurens B., Gouyvenoux M., RA Ugarte E., Cattonaro F., Anthouard V., Vico V., Del Fabbro C., RA Alaux M., Di Gaspero G., Dumas V., Felice N., Paillard S., Juman I., RA Moroldo M., Scalabrin S., Canaguier A., Le Clainche I., Malacrida G., RA Durand E., Pesole G., Laucou V., Chatelet P., Merdinoglu D., RA Delledonne M., Pezzotti M., Lecharny A., Scarpelli C., Artiguenave F., RA Pe M.E., Valle G., Morgante M., Caboche M., Adam-Blondon A.-F., RA Weissenbach J., Quetier F., Wincker P.; RT "The grapevine genome sequence suggests ancestral hexaploidization in RT major angiosperm phyla."; RL Nature 449:463-467(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN596251; CCB58254.1; -; Genomic_DNA. DR STRING; 29760.VIT_02s0025g01670.t01; -. DR EnsemblPlants; VIT_02s0025g01670.t01; VIT_02s0025g01670.t01; VIT_02s0025g01670. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; F6HU31; -. DR Proteomes; UP000009183; Chromosome 2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009183}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009183}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 343 362 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 383 400 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 233 260 {ECO:0000256|SAM:Coils}. FT COILED 314 334 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 402 AA; 44466 MW; 8CBA28769305BAF3 CRC64; MPSQLVNITH HLEPDGTEYN YASVSKGAKV VAHNKEAKGA SNILGKDHDK YLRNACSVGE KFVVVELAEE TLVDAIKIAN FEHYSSNVKE FTLSGSLSYP TEKWFLLGNF VAANVKHAQS FKLPEPKWVS VFEVYGVDAI ERMLEDLIVA NEDPTPGKFV NPNSSSMPSS EPIDRKIKGE LQIGVGKGTE NTGDAPIARV GMTKDPAAMH KIPDPVVEVR QMPTGRIPGD TVLKILMQKV RSLELNLSVL EEYIKELNRR EGNVLPELDK ELSRISLLLE KSRAEIKDLL EWKEITEKGI TDLESWKTAV SSQVQELARE NDMLRLDVKK VVTEQSSLEN KELAVVAVSF SIACVAVLKL VSDRVLTLFG AAQSGEVGQK SRGWVLILVS SSMMIFITFL CS // ID F6I756_VITVI Unreviewed; 410 AA. AC F6I756; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CCB62774.1}; GN OrderedLocusNames=VIT_13s0175g00100 {ECO:0000313|EMBL:CCB62774.1}; OS Vitis vinifera (Grape). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; Vitales; Vitaceae; Vitis. OX NCBI_TaxID=29760 {ECO:0000313|Proteomes:UP000009183}; RN [1] {ECO:0000313|Proteomes:UP000009183} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Pinot noir / PN40024 {ECO:0000313|Proteomes:UP000009183}; RX PubMed=17721507; DOI=10.1038/nature06148; RG The French-Italian Public Consortium for Grapevine Genome Characterization.; RA Jaillon O., Aury J.-M., Noel B., Policriti A., Clepet C., RA Casagrande A., Choisne N., Aubourg S., Vitulo N., Jubin C., Vezzi A., RA Legeai F., Hugueney P., Dasilva C., Horner D., Mica E., Jublot D., RA Poulain J., Bruyere C., Billault A., Segurens B., Gouyvenoux M., RA Ugarte E., Cattonaro F., Anthouard V., Vico V., Del Fabbro C., RA Alaux M., Di Gaspero G., Dumas V., Felice N., Paillard S., Juman I., RA Moroldo M., Scalabrin S., Canaguier A., Le Clainche I., Malacrida G., RA Durand E., Pesole G., Laucou V., Chatelet P., Merdinoglu D., RA Delledonne M., Pezzotti M., Lecharny A., Scarpelli C., Artiguenave F., RA Pe M.E., Valle G., Morgante M., Caboche M., Adam-Blondon A.-F., RA Weissenbach J., Quetier F., Wincker P.; RT "The grapevine genome sequence suggests ancestral hexaploidization in RT major angiosperm phyla."; RL Nature 449:463-467(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FN596766; CCB62774.1; -; Genomic_DNA. DR STRING; 29760.VIT_13s0175g00100.t01; -. DR EnsemblPlants; VIT_13s0175g00100.t01; VIT_13s0175g00100.t01; VIT_13s0175g00100. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; F6I756; -. DR Proteomes; UP000009183; Chromosome 13. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009183}; KW Reference proteome {ECO:0000313|Proteomes:UP000009183}. FT COILED 150 170 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 410 AA; 45218 MW; B02026CEF3ABDD38 CRC64; MSASTVSITA NTAARRRPVV IGEKKPNIEL LSGDAGVSQF NGIAGEDKLT GGGGKDLSHS IRGETILERS KEMIRKLALK SADSSGGSLV AVPDFERRIA EVESFLKTTT KMMQVQVEVV DRKIESEVGG LRRELSKKIE EKAGDFNNHL EKLDSKSETL EKKLGELGAM EFLRKEDFDK IFDELKNAKS ADYGDREMSL DEIRGIAREI VEKEIERHAA DGLGRVDYAL SSSGAMVVRH SEPYILGKGS GWFPKTSLTG VHRDSEKMLK PSFGEPGQCF PLKGDSGFVQ IRLRTTIIPE AITLEHVDKM VAYDRSSAPK DCRVYGWHQG HDTDIAAETG SMFLLAEFSY DLEKSNAQTF NVLDLVGSGL VDMVRFDFAS NHGSPSHTCI YRLRVHGHEL DSVSMLAMQS // ID F6Q9Q6_ORNAN Unreviewed; 2610 AA. AC F6Q9Q6; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 26. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOANP00000022126}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSOANP00000022126}; OS Ornithorhynchus anatinus (Duckbill platypus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Monotremata; Ornithorhynchidae; Ornithorhynchus. OX NCBI_TaxID=9258 {ECO:0000313|Ensembl:ENSOANP00000022126, ECO:0000313|Proteomes:UP000002279}; RN [1] {ECO:0000313|Ensembl:ENSOANP00000022126, ECO:0000313|Proteomes:UP000002279} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000022126, RC ECO:0000313|Proteomes:UP000002279}; RX PubMed=18464734; DOI=10.1038/nature06936; RA Warren W.C., Hillier L.W., Marshall Graves J.A., Birney E., RA Ponting C.P., Grutzner F., Belov K., Miller W., Clarke L., RA Chinwalla A.T., Yang S.P., Heger A., Locke D.P., Miethke P., RA Waters P.D., Veyrunes F., Fulton L., Fulton B., Graves T., Wallis J., RA Puente X.S., Lopez-Otin C., Ordonez G.R., Eichler E.E., Chen L., RA Cheng Z., Deakin J.E., Alsop A., Thompson K., Kirby P., RA Papenfuss A.T., Wakefield M.J., Olender T., Lancet D., Huttley G.A., RA Smit A.F., Pask A., Temple-Smith P., Batzer M.A., Walker J.A., RA Konkel M.K., Harris R.S., Whittington C.M., Wong E.S., Gemmell N.J., RA Buschiazzo E., Vargas Jentzsch I.M., Merkel A., Schmitz J., Zemann A., RA Churakov G., Kriegs J.O., Brosius J., Murchison E.P., RA Sachidanandam R., Smith C., Hannon G.J., Tsend-Ayush E., McMillan D., RA Attenborough R., Rens W., Ferguson-Smith M., Lefevre C.M., Sharp J.A., RA Nicholas K.R., Ray D.A., Kube M., Reinhardt R., Pringle T.H., RA Taylor J., Jones R.C., Nixon B., Dacheux J.L., Niwa H., Sekita Y., RA Huang X., Stark A., Kheradpour P., Kellis M., Flicek P., Chen Y., RA Webber C., Hardison R., Nelson J., Hallsworth-Pepin K., Delehaunty K., RA Markovic C., Minx P., Feng Y., Kremitzki C., Mitreva M., Glasscock J., RA Wylie T., Wohldmann P., Thiru P., Nhan M.N., Pohl C.S., Smith S.M., RA Hou S., Nefedov M., de Jong P.J., Renfree M.B., Mardis E.R., RA Wilson R.K.; RT "Genome analysis of the platypus reveals unique signatures of RT evolution."; RL Nature 453:175-183(2008). RN [2] {ECO:0000313|Ensembl:ENSOANP00000022126} RP IDENTIFICATION. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000022126}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOANP00000022126}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAPN01002960; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_001512305.1; XM_001512255.3. DR ProteinModelPortal; F6Q9Q6; -. DR STRING; 9258.ENSOANP00000022126; -. DR Ensembl; ENSOANT00000022130; ENSOANP00000022126; ENSOANG00000014033. DR GeneID; 100077752; -. DR KEGG; oaa:100077752; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; F6Q9Q6; -. DR KO; K12231; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000002279; Chromosome 4. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002279}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000002279}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289256 MW; 4CA7F554AF0C0D10 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGAVSGP SSACKPGRST TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDA NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDAGH NLPTILVEIT ATILDQEDDD DGHLLALQII RDLVDKGGDL FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPSTS SQPILSAPGP IKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE GENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCTQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERASGET SLIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QTFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DSSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTTL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV SGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGIDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTAASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKMERRSES VTEQSVVSGP DVHEPIVVLS SAENVPQAEV GSSSSASTST LTAEAGCENA ERKLGPESSV RTPGETSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TTGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLSNFR STIFYYVQKL LQFSCNGSVK SDKLRRIWEP TYTIMYREMK DSDKEKESGK MGCWSIEHVE QFLGTDELPK NDLITYLQKN ADSAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGSK SGLSQGAIST LQNCDILSLV KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYTRTSH EEGDEQLQFH FPPDEFTSKK LTTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSTVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTEL GTWLCDDDFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKSLS EDEKNTKLQD LMLKNPSGSG PSLSIEDLGL NFQFCPSSRV YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMQTGIQK QMDAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID F6QKD9_XENTR Unreviewed; 2528 AA. AC F6QKD9; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSXETP00000030896}; GN Name=hectd1 {ECO:0000313|Ensembl:ENSXETP00000030896}; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000030896, ECO:0000313|Proteomes:UP000008143}; RN [1] {ECO:0000313|Ensembl:ENSXETP00000030896} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20431018; DOI=10.1126/science.1183670; RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J., RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., RA Blitz I.L., Blumberg B., Dichmann D.S., Dubchak I., Amaya E., RA Detter J.C., Fletcher R., Gerhard D.S., Goodstein D., Graves T., RA Grigoriev I.V., Grimwood J., Kawashima T., Lindquist E., Lucas S.M., RA Mead P.E., Mitros T., Ogino H., Ohta Y., Poliakov A.V., Pollet N., RA Robert J., Salamov A., Sater A.K., Schmutz J., Terry A., Vize P.D., RA Warren W.C., Wells D., Wills A., Wilson R.K., Zimmerman L.B., RA Zorn A.M., Grainger R., Grammer T., Khokha M.K., Richardson P.M., RA Rokhsar D.S.; RT "The genome of the Western clawed frog Xenopus tropicalis."; RL Science 328:633-636(2010). RN [2] {ECO:0000313|Ensembl:ENSXETP00000030896} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSXETP00000030896}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAMC01016385; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01016386; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01016387; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01016388; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8364.ENSXETP00000030896; -. DR PaxDb; F6QKD9; -. DR Ensembl; ENSXETT00000030896; ENSXETP00000030896; ENSXETG00000014149. DR Xenbase; XB-GENE-1010869; hectd1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; F6QKD9; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000008143; Unassembled WGS sequence. DR ExpressionAtlas; F6QKD9; baseline. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008143}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008143}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1268 1288 {ECO:0000256|SAM:Coils}. FT COILED 1650 1674 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2528 AA; 280512 MW; 54D5511BB85C3DD3 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVEGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DASLETCVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAQH GLTEELLSRM AAAGGTVSGP SSACKTGRGT SGGPSTSGDS KISNQVSTIV SLLSTLCRGS PVVTHDLLRA ELLDSMESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDEKKKKDS NREEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCHSDAGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGDDL FLDQLARLGV ISKVSTLAGP TSDDENEEDS KPEKVRIHLF PLYSAHKGTQ AAASQLKEDE PQEDAKELQQ GRPYHWRDWS VIRGRDCLYI WSDAAALELS NGSNGWFRFI LDGKLATMYS SGSPEGGSDS SESRSEFLEK LQRARSQVKP STSSQPILST PGPGKLTVGN WSLTCLKDGE IAIHNSDGQQ ATILKEDLPG FVFESNRGTK HSFTAETSLG SEFVTGWTGK RGRKLKSKLE KTKQKVRTMA RDLYDDHFKA VESMPRGVVV TLRNIATQLE SAWELHTNRQ CIEGENTWRD LMKTALENLI VLLKDENTIS PYEMCSSGLV QALLTVLNNN EDCDIKQDCG QLVERLNVFK TAFSENEDDE SRPAVALVRK LIAVLESIER LPLHLYDTPG SSYNLQILTR RLRFRLERAP GETSLIDRTG RMLKMEPLAT VESLEQYLLK MVAKQWYDFD RSSFVFVRKL REGQSCVFRH QHDFDDNGIM YWIGTNAKTA YEWVNPAAYG LVVVTSSEGR NLPYGRLEDI LSRDSSALNC HTNDDKSAWF AIDLGLWVVP SAYTLRHARG YGRSALRNWV FQVSKDGQNW TTLYTHMDDC SLNEPGSTAT WPLDPAREEK QGWRHVRIKQ TGKNASGQTH YLSLSGFELY GNVTGVCEDQ LGKAAKEAEA NLRRQRRLVR SQVLKYMVPG ARVIRGIDWK WRDQDGSAQG EGTVTGELHN GTPPSWSSLV KNNCPDKAPP SSSSSCVVVG SVAGSGSRKG SSSSVCSVAS SSDVSLSCAK TERRAEEQVS DIHHDPILLL SSNQAASGSS TCPPGGETVG EGGDRKAGEA PAISMGMVSI SSPDVSSVSE LSNKEVAVPR PLGSSASNRL SVSSLLAAGA PMSSSASVPN LSSRETSSLE SFVRRVANIA RTNATNNMNL SRSSSDNNTN TLGRNAVSSA TSPLMGAQSF PNLTTTGTTS TVTMSTSSVT SSNVATATTG LSVGQSLSNT LTTSLTSTSS ESDTGQEAEY SLYDFLDSCR ASTLLAELDD DEDLPEPDEE DDENEDDNQE EQEYEEVMEE EEYETKGGRR RTWDDDYVLK RQFSALVPAF DPRPGRTNVQ QTTDLEIPAP GTPHSELLEE VECAPAPRLA LTLKVTGLGS GREVELPLNN FRSTIFFYVQ RLLQLSCNGA IKTDKLRRIW EPTYTIMYRE MKDSDKQKEC GRLGCWSVEH VEQSLGTDAL PKNDLITYLQ RNADPGFLRR WKLTGTNKSI RKNRNCSQLI AAYKDFCENG CKSLSMPAAL ATLQSADILS HSREQAQAKA GSSQNSCGVE DVLQLLRILF IVASDPYSAR TPQEDGEDML LFSVPPEEFT SKKITTKIVQ QIEEPLALAS GALPDWCEQL TSKCPFLIPF ETRQLYFTCT AFGASRAIVW LQNRREATVE RSRTASAVRR DDPGEFRVGR LKHERVKVPR GESLMEWAEN VMQIHADRKS VLEVSKHKAT IMKKLSYINF IMPEVQPLDF FLWPTLDLYL SLCPTDVLRE KDLTVKYKLE HWKVTSVVLA YFISYINRDC VQLCNVSVLT QWIDPFHSQN TFWCSAPVSL PLIKMGRKVK GFFHKLFSCR EESEHCTESQ SEASTEDGHD ALSVGSFEED CKSEFILDPP KPKPPAWFQG ILTWEDFELI NPHRARFLRD IRELAVKRRQ ILGNRCLSED EKNTQLQELM LKNPSGSGPP VSIEDLGLNF QFCPSSRVYG FSAVDLRPNG EDEMVTIDNA EEYVDLMFDF CMQTGVQKQM EAFRSGFNKV FPMEKLGSFS PEEVQMILCG NQSPSWSAED IINYTEPKLG YTRESPGFLR FVRVLCGMSS DERKAFLQFT TGCSTLPPGG LANLHPRLTV VRKVDATDAS YPSVNTCVHY LKLPEYSSEE IMRDRLLAAT MEKGFHLN // ID F6RAR8_HORSE Unreviewed; 808 AA. AC F6RAR8; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000015443}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSECAP00000015443}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000015443, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000015443, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015443, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000015443} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015443}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000015443}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9796.ENSECAP00000015400; -. DR PaxDb; F6RAR8; -. DR Ensembl; ENSECAT00000018903; ENSECAP00000015443; ENSECAG00000017201. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000002281; Chromosome 13. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 253 276 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 282 303 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 315 334 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 400 420 {ECO:0000256|SAM:Coils}. FT COILED 452 486 {ECO:0000256|SAM:Coils}. FT COILED 499 519 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 808 AA; 90127 MW; F50EB2550E8C50F2 CRC64; MDFSRLHMYT PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLVT TACATEEGQA GDAPSCPSSA ASLRDRAART AKQRRSASKL AFSVNHASGK AMSLAVGRRG SGGLQGAACL QPPVLDESLI REQTKVDHFW GLDDDGDLKG GSEAAVQGNG DLAARSNGFT CRDCSLLSGR EDALTTLPVA HGTSSRVYSR DRTQKRDDSK GQKRPDTHAA RANCCWFSGA RACRLLMQTL RRIGAAGWFV LKAVLSVVWL AVVAPGKAAS GVLWWLGIGW YQLVTLISWL NVFLLTRCLR NICKFLILVI PLLLLLGAGL SLWGQGDFRS FLPLLNWTHM YGAQRPSEPR NTLTPAAPPP ARPPEAGDEA FPRLQMSEVE RQMTFLSGQC HSHDQKLREL TVLIQELQAQ VHQMDAGSEG VLPLVKRVVE QRLKETDYMT FHQDHELRIS NLEEILGRMT ERSEAIQREL EQTKQRTMRA RGRRLLSVVE HLELELGHLQ AELSDWQRLK APCESADSVH EQVDARVRET LKLMFSGDEQ DASLEWLLQK FSSQFVSKDD VQVLLRDLEL QILKNITHHI SVTKQMPTSE TVVSAVNEAG ISGITEAQAR AIVDNALKLY SQDKTGMVDF ALESGGGSIL STRCAETYET KTALLSLFGI PLWYFSQSPR VVIQPDIYPG NCWAFRGSQG YLVVRLSMQI RPTTFTLEHI PKTLSPTGNI TSAPKDFAVY GLENEYQEEG ELLGRFTYDQ DGESLQMFPV PKRPEGAFQI VELRILSNWG HPEYTCLYRF RVHGEPVE // ID F6RJA0_HORSE Unreviewed; 818 AA. AC F6RJA0; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000015407}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSECAP00000015407}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000015407, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000015407, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015407, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000015407} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015407}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000015407}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9796.ENSECAP00000015400; -. DR PaxDb; F6RJA0; -. DR Ensembl; ENSECAT00000018861; ENSECAP00000015407; ENSECAG00000017201. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000002281; Chromosome 13. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 263 286 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 292 313 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 325 344 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 410 430 {ECO:0000256|SAM:Coils}. FT COILED 462 496 {ECO:0000256|SAM:Coils}. FT COILED 509 529 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 818 AA; 90749 MW; B7D3714E42208823 CRC64; MDFSRLHMYT PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLVT TACATEEGQA GDAPSCPSSA ASLRDRAART AKQRRSASKL AFSVNHASGK AMSLAVGRRG SGGLQGAACL QPPVLDESLI REQTKVDHFW GLDDDGDLKG GSEAAVQGNG DLAARSNGFT CRDCSLLSGR EDALTTLPVA HGTSSRVYSR DRTQKRDDSK GQKRPDTHAA VHSPSSRPGG AAGPAGRGLA HAGRLLMQTL RRIGAAGWFV LKAVLSVVWL AVVAPGKAAS GVLWWLGIGW YQLVTLISWL NVFLLTRCLR NICKFLILVI PLLLLLGAGL SLWGQGDFRS FLPLLNWTHM YGAQRPSEPR NTLTPAAPPP ARPPEAGDEA FPRLQMSEVE RQMTFLSGQC HSHDQKLREL TVLIQELQAQ VHQMDAGSEG VLPLVKRVVE QRLKETDYMT FHQDHELRIS NLEEILGRMT ERSEAIQREL EQTKQRTMRA RGRRLLSVVE HLELELGHLQ AELSDWQRLK APCESADSVH EQVDARVRET LKLMFSGDEQ DASLEWLLQK FSSQFVSKDD VQVLLRDLEL QILKNITHHI SVTKQMPTSE TVVSAVNEAG ISGITEAQAR AIVDNALKLY SQDKTGMVDF ALESGGGSIL STRCAETYET KTALLSLFGI PLWYFSQSPR VVIQPDIYPG NCWAFRGSQG YLVVRLSMQI RPTTFTLEHI PKTLSPTGNI TSAPKDFAVY GLENEYQEEG ELLGRFTYDQ DGESLQMFPV PKRPEGAFQI VELRILSNWG HPEYTCLYRF RVHGEPVE // ID F6RJG8_HORSE Unreviewed; 917 AA. AC F6RJG8; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000015400}; DE Flags: Fragment; GN Name=SUN1 {ECO:0000313|Ensembl:ENSECAP00000015400}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000015400, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000015400, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015400, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000015400} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015400}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000015400}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9796.ENSECAP00000015400; -. DR PaxDb; F6RJG8; -. DR Ensembl; ENSECAT00000018854; ENSECAP00000015400; ENSECAG00000017201. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6RJG8; -. DR OMA; MKLNYES; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002281; Chromosome 13. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 362 385 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 391 412 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 424 443 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 509 529 {ECO:0000256|SAM:Coils}. FT COILED 561 595 {ECO:0000256|SAM:Coils}. FT COILED 608 628 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSECAP00000015400}. SQ SEQUENCE 917 AA; 102202 MW; 1E36C3AEC3E50A77 CRC64; FEIVNMDFSR LHMYTPPQCV PENTGYTYAL SSSYSSDALD FETEHKLDPV FDSPRMSRRS LRLVTTACAT EEGQAGDAPS CPSSAASLRD RAARTAKQRR SASKLAFSVN HASGKAMSLA VGRRGSGGLQ GAACLQPPVL DESLIREQTK VDHFWGLDDD GDLKGGSEAA VQGNGDLAAR SNGFTCRDCS LLSGREDALT TLPVAHGTSS RVYSRDRTQK RGASFCVDRI WWLAKYTSSS FSSFLVQLFQ VVLMKLNYES ENYKLKSYES KDRESESYKS KSRESTAHSS YCGRVNVTEL FREDGRLGVH GESLCDDSKG QKRPDTHAAV HSPSSRPGGA AGPAGRGLAH AGRLLMQTLR RIGAAGWFVL KAVLSVVWLA VVAPGKAASG VLWWLGIGWY QLVTLISWLN VFLLTRCLRN ICKFLILVIP LLLLLGAGLS LWGQGDFRSF LPLLNWTHMY GAQRPSEPRN TLTPAAPPPA RPPEAGDEAF PRLQMSEVER QMTFLSGQCH SHDQKLRELT VLIQELQAQV HQMDAGSEGV LPLVKRVVEQ RLKETDYMTF HQDHELRISN LEEILGRMTE RSEAIQRELE QTKQRTMRAR GRRLLSVVEH LELELGHLQA ELSDWQRLKA PCESADSVHE QVDARVRETL KLMFSGDEQD ASLEWLLQKF SSQFVSKDDV QVLLRDLELQ ILKNITHHIS VTKQMPTSET VVSAVNEAGI SGITEAQARA IVDNALKLYS QDKTGMVDFA LESGGGSILS TRCAETYETK TALLSLFGIP LWYFSQSPRV VIQPDIYPGN CWAFRGSQGY LVVRLSMQIR PTTFTLEHIP KTLSPTGNIT SAPKDFAVYG LENEYQEEGE LLGRFTYDQD GESLQMFPVP KRPEGAFQIV ELRILSNWGH PEYTCLYRFR VHGEPVE // ID F6RMZ5_MACMU Unreviewed; 1402 AA. AC F6RMZ5; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000018073}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSMMUP00000018073}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000018073, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000018073, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000018073, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|Ensembl:ENSMMUP00000018073} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000018073}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMMUP00000018073}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9544.ENSMMUP00000018073; -. DR Ensembl; ENSMMUT00000019300; ENSMMUP00000018073; ENSMMUG00000007280. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F6RMZ5; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000006718; Chromosome 1. DR ExpressionAtlas; F6RMZ5; baseline. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}. FT COILED 1084 1104 {ECO:0000256|SAM:Coils}. FT COILED 1134 1154 {ECO:0000256|SAM:Coils}. FT COILED 1340 1360 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMMUP00000018073}. SQ SEQUENCE 1402 AA; 156330 MW; AF595F639F502495 CRC64; ERFLARPFLS TNQHLARWGS PLIQGKVQLP SQQPRHSRPS HELCSKEEKS ATLPKLISLV VSSETTDFRN KTMDSRRDRE RRKRVLEGKL QLPRALARTQ RARDEGRRAW TSRRPQQRRS PESCEAPLSA RLWGPRRGLP GREPLRSRSA SATAFRIIGP ILALLLRLLH LGFGSGGCRE DVPPSDRGKK EEKMKKHRRA LALVSCLFLC SLVWLPSWRV CCKESSSASA SSYYSQDDNC ALENEDVQFP KKNTESKKLS PPVVETLPTV DLHEESSNAV VDSETVENIS SSSTSEITPI SKLDEIEKSG TIPIAKPSET EQSETDCDVG EALDASAPIE QPSFVSPPDS LVGQHIENVS SSHGKGKITK SEFESKVSAS EQDGGDQKSA LNASDNVKNE SSDYTKPGDI DPTSVTSPKD PEDIPTFDEW KKKVMEVEKE KSQSMHPSSN GGSHATKKVQ KNRNNYASVE CGAKILAANP EAKSTSAILI ENMDLYMLNP CSTKIWFVIE LCEPIQVKQL DIANYELFSS TPKDFLVSIS DRYPTNKWIK LGTFHGRDER NVQSFPLDEQ MYAKYVKVEL VSHFGSEHFC PLSLIRVFGT SMVEEYEEIA DSQYHSERQE LFDEDYDYPL DYNTGEDKSS KNLLGSATNA ILNMVNIAAN ILGAKTEDLT EGNKSISENA TATAAPKMPE STPVSTPVPS PAYVTTEVDT NDMELSTPDT PKESPIVQLV QEEEEEASPS TVTLLGSGEQ EDESSPWFES ETQIFCSELT TICCISSFSE YIYKWCSVRV ALYWQRSRTA LSKGKDYLVS AQPPLLPAES VDISVLQPLS GELENKNIER EAETVVLGDL SSSMHQDDLV NHTVDAVELE PSHSQTLSQS LLLDITPEIN PLPKIEVSES VEYEAGHITS QVIPQESSVE IDNEAEQKSE SFSSIEKPSV TYETNKVNEV VDNIIKEDVN SMQIFTKLSE TIVPPINTAT VPDNEDGEAK MNVADTAKQT LISVVDSSSF PEVKEEEQSP EDALLRGLQR TATDFYAELQ NSTDLGYANG NLVHGSNQKE SVFMRLNNRI KALEVNMSLS GRYLEELSQR YRKQMEEMQK AFNKTIVKLQ NTSRIAEEQD QRQTEAIQLL QAQLTNMTQL VSNLSATVAE LKREVSDRQS YLVISLVLCV VLGLMLCMQR CRNTSQFDGD YISKLPKSNQ YPSPKRCFSS YDDMNLKRRT SFPLMRSKSL QLTGKEVDPN DLYIVEPLKF SPEKKKKRCK YKIEKIETIK PAEPLHPIAN GDIKGRKPFT NQRDFSNIGE VYHSSYKGPP SEGSSETSSQ SEESYFCGIS ACTSLCNGQS QKTKTEKRAL KRRRSKVQDQ GKLIKTLIQT KSGSLPSLHD IIKGNKEITV GTFGVTAVSG HI // ID F6RQM4_MONDO Unreviewed; 730 AA. AC F6RQM4; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMODP00000036771}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSMODP00000036771}; OS Monodelphis domestica (Gray short-tailed opossum). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Didelphimorphia; Didelphidae; Monodelphis. OX NCBI_TaxID=13616 {ECO:0000313|Ensembl:ENSMODP00000036771, ECO:0000313|Proteomes:UP000002280}; RN [1] {ECO:0000313|Ensembl:ENSMODP00000036771, ECO:0000313|Proteomes:UP000002280} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=17495919; DOI=10.1038/nature05805; RA Mikkelsen T.S., Wakefield M.J., Aken B., Amemiya C.T., Chang J.L., RA Duke S., Garber M., Gentles A.J., Goodstadt L., Heger A., Jurka J., RA Kamal M., Mauceli E., Searle S.M., Sharpe T., Baker M.L., Batzer M.A., RA Benos P.V., Belov K., Clamp M., Cook A., Cuff J., Das R., Davidow L., RA Deakin J.E., Fazzari M.J., Glass J.L., Grabherr M., Greally J.M., RA Gu W., Hore T.A., Huttley G.A., Kleber M., Jirtle R.L., Koina E., RA Lee J.T., Mahony S., Marra M.A., Miller R.D., Nicholls R.D., Oda M., RA Papenfuss A.T., Parra Z.E., Pollock D.D., Ray D.A., Schein J.E., RA Speed T.P., Thompson K., VandeBerg J.L., Wade C.M., Walker J.A., RA Waters P.D., Webber C., Weidman J.R., Xie X., Zody M.C., Baldwin J., RA Abdouelleil A., Abdulkadir J., Abebe A., Abera B., Abreu J., RA Acer S.C., Aftuck L., Alexander A., An P., Anderson E., Anderson S., RA Arachi H., Azer M., Bachantsang P., Barry A., Bayul T., Berlin A., RA Bessette D., Bloom T., Bloom T., Boguslavskiy L., Bonnet C., RA Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S., RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., RA Costello M., D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., RA Dhargay N., Dooley K., Dooley E., Doricent M., Dorje P., Dorjee K., RA Dupes A., Elong R., Falk J., Farina A., Faro S., Ferguson D., RA Fisher S., Foley C.D., Franke A., Friedrich D., Gadbois L., Gearin G., RA Gearin C.R., Giannoukos G., Goode T., Graham J., Grandbois E., RA Grewal S., Gyaltsen K., Hafez N., Hagos B., Hall J., Henson C., RA Hollinger A., Honan T., Huard M.D., Hughes L., Hurhula B., Husby M.E., RA Kamat A., Kanga B., Kashin S., Khazanovich D., Kisner P., Lance K., RA Lara M., Lee W., Lennon N., Letendre F., LeVine R., Lipovsky A., RA Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y., Lubonja R., RA Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C., RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O., RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., RA Mulrain L., Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., RA Nicol R., Norbu C., Norbu N., Novod N., O'Neill B., Osman S., RA Markiewicz E., Oyono O.L., Patti C., Phunkhang P., Pierre F., RA Priest M., Raghuraman S., Rege F., Reyes R., Rise C., Rogov P., RA Ross K., Ryan E., Settipalli S., Shea T., Sherpa N., Shi L., Shih D., RA Sparrow T., Spaulding J., Stalker J., Stange-Thomann N., RA Stavropoulos S., Stone C., Strader C., Tesfaye S., Thomson T., RA Thoulutsang Y., Thoulutsang D., Topham K., Topping I., Tsamla T., RA Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M., Wilkinson J., RA Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D., Zimmer A., RA Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J., Chin C., RA Gnerre S., MacCallum I., Graves J.A., Ponting C.P., Breen M., RA Samollow P.B., Lander E.S., Lindblad-Toh K.; RT "Genome of the marsupial Monodelphis domestica reveals innovation in RT non-coding sequences."; RL Nature 447:167-177(2007). RN [2] {ECO:0000313|Ensembl:ENSMODP00000036771} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMODP00000036771}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 13616.ENSMODP00000036771; -. DR Ensembl; ENSMODT00000038367; ENSMODP00000036771; ENSMODG00000009597. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6RQM4; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002280; Chromosome 8. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002280}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002280}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 187 211 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 243 263 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 382 402 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 730 AA; 82050 MW; AC8D17418BD090AB CRC64; MSRRSQRLIR YSQGDDDGGS SSSSASSSLM IGQHVPFKDS PLRALRRKSG SMKRLSPAPH LGSASKPHTS YYSESVVSES YGRGFRAPSV TKSSILHEQL DSDSSWSGDL MVGRRRGIGG PESSKINMQT EDKISYDTYG SSSGYSSEDD YSETQEGRPL WGSSFPPCPT RNLGMDQPSS VSRLSNAFFQ AGAFLWMVVT FPGRFFGLFY WWLGTTWYRL TTAASLLDVF VLTRCFASLK KLLLSFLMMM LLASLAFGAW YFYPYGLKTF HPALFSWWAA KGGNKREVWE PVDSNSYFKA EQHILSRVHA LERRLETVAS EFSALWQKEA TRMEHLELRL QQGASGNGGV KGLSQEDSMI FLEGLWSRRE AVLKEECRRN AMTHIQEELT ILRAEYQQYL DNQKKIIQAF QDLESRFLQL KSDWQSLSQE EASQRQAAVE ALQHDVSLCP LVGSGILPRI GHATHAPHSK PGNCCRGICS GIFPSQGQEM QAMVQAQLRD LEHRILTQMA EEQGKFVRET AARVEQTLDK EGEVGITEEK VHRIVNQALK RYSEDRIGLV DYALESGGAS VISTRCSETY DTKTALLSLF GIPLWYRSQS PRAILQPDVY PGNCWAFQGT QGFAVIRLSA LIRPTAVTLE HVPKALTPIS NIPSAPKDFI ILGLNEDLQQ EGAALGYFTY DQNGEPIQTF HLQSNNTAIY QVVELRILSN WGHPEYTCIY RFRVHGEPVY // ID F6S3X7_XENTR Unreviewed; 675 AA. AC F6S3X7; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSXETP00000058750}; DE Flags: Fragment; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000058750, ECO:0000313|Proteomes:UP000008143}; RN [1] {ECO:0000313|Ensembl:ENSXETP00000058750} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20431018; DOI=10.1126/science.1183670; RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J., RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., RA Blitz I.L., Blumberg B., Dichmann D.S., Dubchak I., Amaya E., RA Detter J.C., Fletcher R., Gerhard D.S., Goodstein D., Graves T., RA Grigoriev I.V., Grimwood J., Kawashima T., Lindquist E., Lucas S.M., RA Mead P.E., Mitros T., Ogino H., Ohta Y., Poliakov A.V., Pollet N., RA Robert J., Salamov A., Sater A.K., Schmutz J., Terry A., Vize P.D., RA Warren W.C., Wells D., Wills A., Wilson R.K., Zimmerman L.B., RA Zorn A.M., Grainger R., Grammer T., Khokha M.K., Richardson P.M., RA Rokhsar D.S.; RT "The genome of the Western clawed frog Xenopus tropicalis."; RL Science 328:633-636(2010). RN [2] {ECO:0000313|Ensembl:ENSXETP00000058750} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSXETP00000058750}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAMC01026206; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8364.ENSXETP00000058750; -. DR PaxDb; F6S3X7; -. DR Ensembl; ENSXETT00000061455; ENSXETP00000058750; ENSXETG00000029959. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6S3X7; -. DR OMA; ATERNEW; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008143; Unassembled WGS sequence. DR Bgee; F6S3X7; -. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008143}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008143}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 173 193 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 243 263 {ECO:0000256|SAM:Coils}. FT COILED 301 350 {ECO:0000256|SAM:Coils}. FT COILED 429 449 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSXETP00000058750}. SQ SEQUENCE 675 AA; 76213 MW; 81BDFA1D96EDC86A CRC64; RSLRRKVTTL KHAPTPRQSA HHSHYSETSS YITRDKLGHT SELPETDYDA SYWGDNLVRS RVGLERSYQS KNGVSETQLS YDKSTSSSGY SSEEDFTGML GVMADVQQQN HISVSFSCHL SSAHCLSLLH GTGRVFGLLY WWFGTTWYRL TTSASLLDVF ILTRHYSMLK KPLIILLLLL LLALLGAGLW HFYPYGLGGL TIIPTSLFSV KHTPKGEGIP LKQEETRSQA QSLLSEAEFI SRMESLERKF HSLEKGLTLL QQQNMAKPKE TEVPRDVGVS REEIFQIFSE LSSDREAALM DSIQQQEATK AKNNLRNLRE EQQGNLQEMV QKMHNMFKDV EAEIVQLKTD MKSSATDDLN KNLVEVEGRL SGELLGIKEQ LKAVRKTQAD LSQQVETVPK QIQGVRDGVE LLFPKWLRTQ MEDGRTGPLA ELFLRRDELQ KHLVELERKI LAGIATERNE WAARAHTSVD RELQAGGLSG ITREEVHEIV NRAIQTYSED RIGMVDYALE SSGASVINTR CSETFETKTA LLSLFGVPLW YQSQSPRVIL QPDLNPGNCW AFRGSQGYAV IRLSSPIHPT AVTIDHIPRS LSPKATISSA PKDFSVYGLE EESQKEGLLL GNFTYNQYGK PIQTFSIQGG DIPTYQLVEL RIQSNWGHPE YTCIYRFRVH GETEV // ID F6S4J7_XENTR Unreviewed; 918 AA. AC F6S4J7; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSXETP00000046766}; GN Name=sun1 {ECO:0000313|Ensembl:ENSXETP00000046766}; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000046766, ECO:0000313|Proteomes:UP000008143}; RN [1] {ECO:0000313|Ensembl:ENSXETP00000046766} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20431018; DOI=10.1126/science.1183670; RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J., RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., RA Blitz I.L., Blumberg B., Dichmann D.S., Dubchak I., Amaya E., RA Detter J.C., Fletcher R., Gerhard D.S., Goodstein D., Graves T., RA Grigoriev I.V., Grimwood J., Kawashima T., Lindquist E., Lucas S.M., RA Mead P.E., Mitros T., Ogino H., Ohta Y., Poliakov A.V., Pollet N., RA Robert J., Salamov A., Sater A.K., Schmutz J., Terry A., Vize P.D., RA Warren W.C., Wells D., Wills A., Wilson R.K., Zimmerman L.B., RA Zorn A.M., Grainger R., Grammer T., Khokha M.K., Richardson P.M., RA Rokhsar D.S.; RT "The genome of the Western clawed frog Xenopus tropicalis."; RL Science 328:633-636(2010). RN [2] {ECO:0000313|Ensembl:ENSXETP00000046766} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSXETP00000046766}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAMC01119010; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01119011; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8364.ENSXETP00000046766; -. DR PaxDb; F6S4J7; -. DR Ensembl; ENSXETT00000046766; ENSXETP00000046766; ENSXETG00000021642. DR Xenbase; XB-GENE-995941; sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6S4J7; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008143; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008143}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008143}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 401 419 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 426 444 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 519 539 {ECO:0000256|SAM:Coils}. FT COILED 605 625 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 918 AA; 102754 MW; 4BBB6A7CBC4AF79D CRC64; MRMDYSHLHT YAPPQCLPDN TGYTYALSSS YSSEALDFET IHKLAPVFDS PRMSRRSLRL KTSMGNYADN SATSICAGTL SKCSLRTDRR QRKMQQHNSS RQSSSSHQSA SRKSVTNTSF QSQSSFNSQI ADTSVLSSVL DASVIREQTE VSSIWGLDDE ELIKDGNTTV IQSNGDFNSA ETQTTMVNGY TCSDCSIVSQ RNDALTALSA SYSSSARVYS RDRSQKREIP FYMHKAMLLF KNTASSLATV VVRLLHMVML KLGCDLKDHS DQTCFTLSVW GKGVYVAAHL NYCGSVNVKD FLKEDGLLRI NGKSLCDDYN GTKHHEMRTT IHTQSSWARG VTGTLWHTLY YTGYLLLQAV RSVGAAGWFV SRKMLSFLWL AIVSPGRAAS SLLWWLGTGW YQLATLVSLL NVFILTRCLS KLSKLLLLLL PLLILLGNLG LYLWGSDYIL LPAFGGLRIF SSDVLEETAH SLEPSPESTT ISSPGTKEEG LPYDTDRIRE LEKQFGLMGR KHNGHMEDYK KLNVLVLKIQ EQVQQMNDES HLSSIITNIT ISVSLFYTND ASSSTASNHE ARIVHLEALF AKLSQSQAIE EDRTLAESRT RGGGGNSKKR RIESLEEEFE TYKAAFTNRQ TAQTSCDLPD CLLQKVDARV KESVNMMFAS QENIPESLLQ WLSANYVNKG DFNSRLQELE LKILQNITHH VILTKQVPSA KVVETAITGA IDGISKQETQ AMINNALRLY SQDRTGMADF ALESGGGSIL GTRCSETYGT KTALMSLFGI PLWYFSQSPR VVIQPDMYPG NCWAFKGTQG YLVVRLSRMI YPTAFSIEHI PKSLSPLGNI TSAPKDFAVY GLDDEYQEDG QVLIRAVYDQ EGEPLQIFHI MEEYKKPFQI VELRIFSNWG HQDFTCLYRF RAHGTPVQ // ID F6SF78_ORNAN Unreviewed; 729 AA. AC F6SF78; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOANP00000016766}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSOANP00000016766}; OS Ornithorhynchus anatinus (Duckbill platypus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Monotremata; Ornithorhynchidae; Ornithorhynchus. OX NCBI_TaxID=9258 {ECO:0000313|Ensembl:ENSOANP00000016766, ECO:0000313|Proteomes:UP000002279}; RN [1] {ECO:0000313|Proteomes:UP000002279} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Glennie {ECO:0000313|Proteomes:UP000002279}; RX PubMed=18464734; DOI=10.1038/nature06936; RA Warren W.C., Hillier L.W., Marshall Graves J.A., Birney E., RA Ponting C.P., Grutzner F., Belov K., Miller W., Clarke L., RA Chinwalla A.T., Yang S.P., Heger A., Locke D.P., Miethke P., RA Waters P.D., Veyrunes F., Fulton L., Fulton B., Graves T., Wallis J., RA Puente X.S., Lopez-Otin C., Ordonez G.R., Eichler E.E., Chen L., RA Cheng Z., Deakin J.E., Alsop A., Thompson K., Kirby P., RA Papenfuss A.T., Wakefield M.J., Olender T., Lancet D., Huttley G.A., RA Smit A.F., Pask A., Temple-Smith P., Batzer M.A., Walker J.A., RA Konkel M.K., Harris R.S., Whittington C.M., Wong E.S., Gemmell N.J., RA Buschiazzo E., Vargas Jentzsch I.M., Merkel A., Schmitz J., Zemann A., RA Churakov G., Kriegs J.O., Brosius J., Murchison E.P., RA Sachidanandam R., Smith C., Hannon G.J., Tsend-Ayush E., McMillan D., RA Attenborough R., Rens W., Ferguson-Smith M., Lefevre C.M., Sharp J.A., RA Nicholas K.R., Ray D.A., Kube M., Reinhardt R., Pringle T.H., RA Taylor J., Jones R.C., Nixon B., Dacheux J.L., Niwa H., Sekita Y., RA Huang X., Stark A., Kheradpour P., Kellis M., Flicek P., Chen Y., RA Webber C., Hardison R., Nelson J., Hallsworth-Pepin K., Delehaunty K., RA Markovic C., Minx P., Feng Y., Kremitzki C., Mitreva M., Glasscock J., RA Wylie T., Wohldmann P., Thiru P., Nhan M.N., Pohl C.S., Smith S.M., RA Hou S., Nefedov M., de Jong P.J., Renfree M.B., Mardis E.R., RA Wilson R.K.; RT "Genome analysis of the platypus reveals unique signatures of RT evolution."; RL Nature 453:175-183(2008). RN [2] {ECO:0000313|Ensembl:ENSOANP00000016766} RP IDENTIFICATION. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000016766}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOANP00000016766}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9258.ENSOANP00000016766; -. DR Ensembl; ENSOANT00000016769; ENSOANP00000016766; ENSOANG00000010580. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6SF78; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002279; Unassembled WGS sequence. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002279}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002279}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 175 195 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 226 247 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 415 449 {ECO:0000256|SAM:Coils}. FT COILED 489 509 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 729 AA; 81890 MW; B30165AAB5B6AC7E CRC64; MSRRSQRLKR YSQGDDDGGS SSSGGSSLVG SQHSLFKDSP IRTLKRKSSN VKRLSPAPHL GTSSNTHTTY YSESMVSESY VGGPRSSPLA RSSILHDHQQ DSDLYWNEDL LVRRRRDTGG TESSKINGLT ENKVTYDTYG SSSGYSSEDD YTGHLGEDQY SSGSRLKKAA SRAGSFLWMV VTSPGQLFGL CYWWIGTTWY RLTTNASLLD VFVLTRSVRF PSLKKLLLLL LMLLLLASLA YGAWYFYPYG LQTLYPAVLS WWAGRASVDS SKRDVMWESG RPTARIQDEQ LVLSQVHGLE RRLEALAAEF SSHWQKEALR LERLELWQVS HEATLTLLEG LVSRREAVLR EDFRVGLANR IQRGHPQSIN LKGERAKSER LDHRRPLGVR ARGRNRVTVV ACGWESTSST QEALRASMLQ ELGRLEGQLA DLRQELATLT LKQATVAEQV EGFPQKIQAV QDEVESQFPG WISRFLLRDE AVGTKLLQRE ELHDQLQKLE HKILAHLAQE RGKSAQEAAA GLGVVLRKEG VTGVTEEQVH HIVSQALKRY SEDQIGLVDY ALESGGASVI NTRCSETYET KTALLSLFGI PLWYHSQSPR TILHPDVYPG NCWAFRGPQG FAVVRLSARI RPTAVTLEHV SKSLLPSSTL LSAPKDFVIL GLDEETQQEG TPMGRFTYES ARKTPIQTFQ LEDTQSTTYQ VVELRILSNW GHPEYTCIYR FRVHGEPQA // ID F6SIE6_MONDO Unreviewed; 140 AA. AC F6SIE6; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMODP00000000018}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSMODP00000000018}; OS Monodelphis domestica (Gray short-tailed opossum). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Didelphimorphia; Didelphidae; Monodelphis. OX NCBI_TaxID=13616 {ECO:0000313|Ensembl:ENSMODP00000000018, ECO:0000313|Proteomes:UP000002280}; RN [1] {ECO:0000313|Ensembl:ENSMODP00000000018, ECO:0000313|Proteomes:UP000002280} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=17495919; DOI=10.1038/nature05805; RA Mikkelsen T.S., Wakefield M.J., Aken B., Amemiya C.T., Chang J.L., RA Duke S., Garber M., Gentles A.J., Goodstadt L., Heger A., Jurka J., RA Kamal M., Mauceli E., Searle S.M., Sharpe T., Baker M.L., Batzer M.A., RA Benos P.V., Belov K., Clamp M., Cook A., Cuff J., Das R., Davidow L., RA Deakin J.E., Fazzari M.J., Glass J.L., Grabherr M., Greally J.M., RA Gu W., Hore T.A., Huttley G.A., Kleber M., Jirtle R.L., Koina E., RA Lee J.T., Mahony S., Marra M.A., Miller R.D., Nicholls R.D., Oda M., RA Papenfuss A.T., Parra Z.E., Pollock D.D., Ray D.A., Schein J.E., RA Speed T.P., Thompson K., VandeBerg J.L., Wade C.M., Walker J.A., RA Waters P.D., Webber C., Weidman J.R., Xie X., Zody M.C., Baldwin J., RA Abdouelleil A., Abdulkadir J., Abebe A., Abera B., Abreu J., RA Acer S.C., Aftuck L., Alexander A., An P., Anderson E., Anderson S., RA Arachi H., Azer M., Bachantsang P., Barry A., Bayul T., Berlin A., RA Bessette D., Bloom T., Bloom T., Boguslavskiy L., Bonnet C., RA Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S., RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., RA Costello M., D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., RA Dhargay N., Dooley K., Dooley E., Doricent M., Dorje P., Dorjee K., RA Dupes A., Elong R., Falk J., Farina A., Faro S., Ferguson D., RA Fisher S., Foley C.D., Franke A., Friedrich D., Gadbois L., Gearin G., RA Gearin C.R., Giannoukos G., Goode T., Graham J., Grandbois E., RA Grewal S., Gyaltsen K., Hafez N., Hagos B., Hall J., Henson C., RA Hollinger A., Honan T., Huard M.D., Hughes L., Hurhula B., Husby M.E., RA Kamat A., Kanga B., Kashin S., Khazanovich D., Kisner P., Lance K., RA Lara M., Lee W., Lennon N., Letendre F., LeVine R., Lipovsky A., RA Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y., Lubonja R., RA Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C., RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O., RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., RA Mulrain L., Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., RA Nicol R., Norbu C., Norbu N., Novod N., O'Neill B., Osman S., RA Markiewicz E., Oyono O.L., Patti C., Phunkhang P., Pierre F., RA Priest M., Raghuraman S., Rege F., Reyes R., Rise C., Rogov P., RA Ross K., Ryan E., Settipalli S., Shea T., Sherpa N., Shi L., Shih D., RA Sparrow T., Spaulding J., Stalker J., Stange-Thomann N., RA Stavropoulos S., Stone C., Strader C., Tesfaye S., Thomson T., RA Thoulutsang Y., Thoulutsang D., Topham K., Topping I., Tsamla T., RA Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M., Wilkinson J., RA Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D., Zimmer A., RA Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J., Chin C., RA Gnerre S., MacCallum I., Graves J.A., Ponting C.P., Breen M., RA Samollow P.B., Lander E.S., Lindblad-Toh K.; RT "Genome of the marsupial Monodelphis domestica reveals innovation in RT non-coding sequences."; RL Nature 447:167-177(2007). RN [2] {ECO:0000313|Ensembl:ENSMODP00000000018} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMODP00000000018}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 13616.ENSMODP00000000018; -. DR Ensembl; ENSMODT00000000018; ENSMODP00000000018; ENSMODG00000000018. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6SIE6; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000002280; Chromosome 1. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002280}; KW Reference proteome {ECO:0000313|Proteomes:UP000002280}. SQ SEQUENCE 140 AA; 15897 MW; F70C80596B6216F4 CRC64; MTPGNCWAFS GDRGQVVIRL ARKIFLTNVT IQHIPKTISL SGSLDTAPKD FVVYVNFLAF DTDPHGINDK IKEETFLGAF LFQPENSIQM FPLQNSLCKS FNYIKLKILT NWGNPHFTCL YRVRAHGTIS RPAHDHYPQG // ID F6SQT7_MONDO Unreviewed; 458 AA. AC F6SQT7; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMODP00000002040}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSMODP00000002040}; OS Monodelphis domestica (Gray short-tailed opossum). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Didelphimorphia; Didelphidae; Monodelphis. OX NCBI_TaxID=13616 {ECO:0000313|Ensembl:ENSMODP00000002040, ECO:0000313|Proteomes:UP000002280}; RN [1] {ECO:0000313|Ensembl:ENSMODP00000002040, ECO:0000313|Proteomes:UP000002280} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=17495919; DOI=10.1038/nature05805; RA Mikkelsen T.S., Wakefield M.J., Aken B., Amemiya C.T., Chang J.L., RA Duke S., Garber M., Gentles A.J., Goodstadt L., Heger A., Jurka J., RA Kamal M., Mauceli E., Searle S.M., Sharpe T., Baker M.L., Batzer M.A., RA Benos P.V., Belov K., Clamp M., Cook A., Cuff J., Das R., Davidow L., RA Deakin J.E., Fazzari M.J., Glass J.L., Grabherr M., Greally J.M., RA Gu W., Hore T.A., Huttley G.A., Kleber M., Jirtle R.L., Koina E., RA Lee J.T., Mahony S., Marra M.A., Miller R.D., Nicholls R.D., Oda M., RA Papenfuss A.T., Parra Z.E., Pollock D.D., Ray D.A., Schein J.E., RA Speed T.P., Thompson K., VandeBerg J.L., Wade C.M., Walker J.A., RA Waters P.D., Webber C., Weidman J.R., Xie X., Zody M.C., Baldwin J., RA Abdouelleil A., Abdulkadir J., Abebe A., Abera B., Abreu J., RA Acer S.C., Aftuck L., Alexander A., An P., Anderson E., Anderson S., RA Arachi H., Azer M., Bachantsang P., Barry A., Bayul T., Berlin A., RA Bessette D., Bloom T., Bloom T., Boguslavskiy L., Bonnet C., RA Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S., RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., RA Costello M., D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., RA Dhargay N., Dooley K., Dooley E., Doricent M., Dorje P., Dorjee K., RA Dupes A., Elong R., Falk J., Farina A., Faro S., Ferguson D., RA Fisher S., Foley C.D., Franke A., Friedrich D., Gadbois L., Gearin G., RA Gearin C.R., Giannoukos G., Goode T., Graham J., Grandbois E., RA Grewal S., Gyaltsen K., Hafez N., Hagos B., Hall J., Henson C., RA Hollinger A., Honan T., Huard M.D., Hughes L., Hurhula B., Husby M.E., RA Kamat A., Kanga B., Kashin S., Khazanovich D., Kisner P., Lance K., RA Lara M., Lee W., Lennon N., Letendre F., LeVine R., Lipovsky A., RA Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y., Lubonja R., RA Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C., RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O., RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., RA Mulrain L., Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., RA Nicol R., Norbu C., Norbu N., Novod N., O'Neill B., Osman S., RA Markiewicz E., Oyono O.L., Patti C., Phunkhang P., Pierre F., RA Priest M., Raghuraman S., Rege F., Reyes R., Rise C., Rogov P., RA Ross K., Ryan E., Settipalli S., Shea T., Sherpa N., Shi L., Shih D., RA Sparrow T., Spaulding J., Stalker J., Stange-Thomann N., RA Stavropoulos S., Stone C., Strader C., Tesfaye S., Thomson T., RA Thoulutsang Y., Thoulutsang D., Topham K., Topping I., Tsamla T., RA Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M., Wilkinson J., RA Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D., Zimmer A., RA Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J., Chin C., RA Gnerre S., MacCallum I., Graves J.A., Ponting C.P., Breen M., RA Samollow P.B., Lander E.S., Lindblad-Toh K.; RT "Genome of the marsupial Monodelphis domestica reveals innovation in RT non-coding sequences."; RL Nature 447:167-177(2007). RN [2] {ECO:0000313|Ensembl:ENSMODP00000002040} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMODP00000002040}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 13616.ENSMODP00000002040; -. DR Ensembl; ENSMODT00000002083; ENSMODP00000002040; ENSMODG00000001669. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6SQT7; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002280; Chromosome 1. DR ExpressionAtlas; F6SQT7; baseline. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002280}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002280}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 175 197 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 225 245 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 458 AA; 50682 MW; 6EF5753A61417464 CRC64; MRRSPRPGSA ATPHKHNPDF YSDRSNSSVS ATSRDSGSRS KEGTRIRQGA GRDEGSFRDS ESIPCAGAAG GHWQAGSCLP IRAPRSRAEP ASSSKQGPRR VSQPPTPHPS PGLAFTPAPQ RLLPCLGPSS QMVTPQTMSS PHPAVVLQTL RFFPAVFGEI LGAFCRCGEV RSLRFLITAS LFFCLFAAAI WGAFLYFTPV WDEETREFLT LSEYHEKVHS QGLQLQQLQA ELDKLHTDVS SIRAANSERV AQLVFQRLNE DFVQKPDYAL SSVGASIDLD KTSHDYEDRD TAYFWNRFSF WNYAKPPTVI LEPDVFPGNC WAFKGAKGQV VIRLPGRVQL SDITLQHPPP SVAHSGGASS APRDFAVFGL QGDDKTEVFL GRFIFDVEKS EIQTFHLKNE PPIAFPKVKI QILSNWGHPR FTCLYRVRAH GLRSHDGHGE DSRTKGERNG ASETTTPH // ID F6SWL9_HORSE Unreviewed; 913 AA. AC F6SWL9; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000015320}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSECAP00000015320}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000015320, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000015320, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015320, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000015320} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000015320}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000015320}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9796.ENSECAP00000015400; -. DR PaxDb; F6SWL9; -. DR Ensembl; ENSECAT00000018761; ENSECAP00000015320; ENSECAG00000017201. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000002281; Chromosome 13. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 350 373 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 379 400 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 412 431 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 497 517 {ECO:0000256|SAM:Coils}. FT COILED 559 593 {ECO:0000256|SAM:Coils}. FT COILED 608 628 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 913 AA; 101396 MW; D8E9C29A1BDA6D79 CRC64; MDFSRLHMYT PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLVT TACATEEGQA GDAPSCPSSA ASLRDRAART AKQRRSASKL AFSVNHASGK AMSLAVGRRG SGGLQGAACL QPPVLDESLI REQTKVDHFW GLDDDGDLKG GSEAAVQGNG DLAARSNGFT CRDCSLLSGR EDALTTLPVA HGTSSRVYSR DRTQKRGASF CVDRIWWLAK YTSSSFSSFL VQLFQVVLMK LNYESENYKL KSYESKDRES ESYKSKSRES TAHSSYCGRV NVTELFREDG RLGDDSKGQK RPDTHAAVHS PSSRPGGAAG PAGRGLAHAG RLLMQTLRRI GAAGWFVLKA VLSVVWLAVV APGKAASGVL WWLGIGWYQL VTLISWLNVF LLTRCLRNIC KFLILVIPLL LLLGAGLSLW GQGDFRSFLP LLNWTHMYGA QRPSEPRNTL TPAAPPPARP PEAGDEAFPR LQMSEVERQM TFLSGQCHSH DQKLRELTVL IQELQAQVHQ MDAGSEGVLP LVKRVVEQRL KEVRADVPSG SATDYMTFHQ DHELRISNLE EILGRMTERS EAIQRELEQT KQRTMSGADE GRRLLSVVEH LELELGHLQA ELSDWQRLKA PCESADSVDA RVRETLKLMF SGDEQDASLE WLLQKFSSQF VSKDDVQVLL RDLELQILKN ITHHISVTKQ MPTSETVVSA VNEAGISGIT EAQARAIVDN ALKLYSQDKT GMVDFALESG GGSILSTRCA ETYETKTALL SLFGIPLWYF SQSPRVVIQP DIYPGNCWAF RGSQGYLVVR LSMQIRPTTF TLEHIPKTLS PTGNITSAPK DFAVYGLENE YQEEGELLGR FTYDQDGESL QMFPVPKRPE GAFQIVELRI LSNWGHPEYT CLYRFRVHGE PVE // ID F6T6D4_CALJA Unreviewed; 727 AA. AC F6T6D4; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000021388}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSCJAP00000021388}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000021388, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000021388, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000021388} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000021388}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01144889; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01144890; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01144891; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01144892; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000021388; -. DR Ensembl; ENSCJAT00000022609; ENSCJAP00000021388; ENSCJAG00000011606. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6T6D4; -. DR OMA; EHQQDSE; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008225; Chromosome 1. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 221 242 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 360 380 {ECO:0000256|SAM:Coils}. FT COILED 414 441 {ECO:0000256|SAM:Coils}. FT COILED 488 508 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 727 AA; 81317 MW; A7F64A8E505D6B60 CRC64; MSRRSQRLTR YSQGDDDGGS SSSGGSSVAG SQSTLFKDSP LRTLKRKSSN MKRPSPEPQL GLSSDPHTSY YSESLVRESY IGSPRAAFLA RSALEELHSD PDWGDHLRVR KRRGTGGSES SRASGLVVGK AAEDFLGSSS GYSSEDDYVG YSDADLQSSG SRLQSVVSRV GSLLWMVATS PGRLFRLLYW WAGTTWYRLT TAASLLDVFV LTRRFSSLKT FLWFLLLLLL LTCLTYGAWY FYPYGLQTFH PALVSWWTAK DSRREHEGWE SRDSSPHFQA EQHVLSRVHS LERRLEALAA EFSSNWQKEA MRLERLELQQ GTPAQGGSGG LSHEDTLALL EGLVSRREAA LREDFRRETA ARIQEELAAL RAEHQQDSED LFKKIVRASQ ESEAHIQQLK SEWQRTSMTQ EAFRESSVKE LRRLEDQLAG LQQELAALVQ KQSSVADEVH LLPQQIQATR DDVESQFPAW ISEFLARGGG GRVGLLQREE MQAQLRDLEN KILTHIAEMQ GKSAREAAAS LGLTLQKEGV IGVTEEQVHR IVKQALQRYS EDRIGLADYA LESGGASVIS TRCSETYETK TALLSLFGIP LWYHSQSPRV ILQPDVYPGN CWAFQGPQGF AVVRLSARIR PTAVTLEHVP KALSPNSTIS SAPKDFAIFG SSQNRRTEGC AREKGTSQEC ARPVASAQFQ APMMATYQVV ELRILTNWGH PEYTCIYRFR VHGEPTH // ID F6TCF6_MONDO Unreviewed; 928 AA. AC F6TCF6; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 27. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMODP00000036552}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSMODP00000036552}; OS Monodelphis domestica (Gray short-tailed opossum). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Didelphimorphia; Didelphidae; Monodelphis. OX NCBI_TaxID=13616 {ECO:0000313|Ensembl:ENSMODP00000036552, ECO:0000313|Proteomes:UP000002280}; RN [1] {ECO:0000313|Ensembl:ENSMODP00000036552, ECO:0000313|Proteomes:UP000002280} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=17495919; DOI=10.1038/nature05805; RA Mikkelsen T.S., Wakefield M.J., Aken B., Amemiya C.T., Chang J.L., RA Duke S., Garber M., Gentles A.J., Goodstadt L., Heger A., Jurka J., RA Kamal M., Mauceli E., Searle S.M., Sharpe T., Baker M.L., Batzer M.A., RA Benos P.V., Belov K., Clamp M., Cook A., Cuff J., Das R., Davidow L., RA Deakin J.E., Fazzari M.J., Glass J.L., Grabherr M., Greally J.M., RA Gu W., Hore T.A., Huttley G.A., Kleber M., Jirtle R.L., Koina E., RA Lee J.T., Mahony S., Marra M.A., Miller R.D., Nicholls R.D., Oda M., RA Papenfuss A.T., Parra Z.E., Pollock D.D., Ray D.A., Schein J.E., RA Speed T.P., Thompson K., VandeBerg J.L., Wade C.M., Walker J.A., RA Waters P.D., Webber C., Weidman J.R., Xie X., Zody M.C., Baldwin J., RA Abdouelleil A., Abdulkadir J., Abebe A., Abera B., Abreu J., RA Acer S.C., Aftuck L., Alexander A., An P., Anderson E., Anderson S., RA Arachi H., Azer M., Bachantsang P., Barry A., Bayul T., Berlin A., RA Bessette D., Bloom T., Bloom T., Boguslavskiy L., Bonnet C., RA Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S., RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., RA Costello M., D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., RA Dhargay N., Dooley K., Dooley E., Doricent M., Dorje P., Dorjee K., RA Dupes A., Elong R., Falk J., Farina A., Faro S., Ferguson D., RA Fisher S., Foley C.D., Franke A., Friedrich D., Gadbois L., Gearin G., RA Gearin C.R., Giannoukos G., Goode T., Graham J., Grandbois E., RA Grewal S., Gyaltsen K., Hafez N., Hagos B., Hall J., Henson C., RA Hollinger A., Honan T., Huard M.D., Hughes L., Hurhula B., Husby M.E., RA Kamat A., Kanga B., Kashin S., Khazanovich D., Kisner P., Lance K., RA Lara M., Lee W., Lennon N., Letendre F., LeVine R., Lipovsky A., RA Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y., Lubonja R., RA Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C., RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O., RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., RA Mulrain L., Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., RA Nicol R., Norbu C., Norbu N., Novod N., O'Neill B., Osman S., RA Markiewicz E., Oyono O.L., Patti C., Phunkhang P., Pierre F., RA Priest M., Raghuraman S., Rege F., Reyes R., Rise C., Rogov P., RA Ross K., Ryan E., Settipalli S., Shea T., Sherpa N., Shi L., Shih D., RA Sparrow T., Spaulding J., Stalker J., Stange-Thomann N., RA Stavropoulos S., Stone C., Strader C., Tesfaye S., Thomson T., RA Thoulutsang Y., Thoulutsang D., Topham K., Topping I., Tsamla T., RA Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M., Wilkinson J., RA Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D., Zimmer A., RA Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J., Chin C., RA Gnerre S., MacCallum I., Graves J.A., Ponting C.P., Breen M., RA Samollow P.B., Lander E.S., Lindblad-Toh K.; RT "Genome of the marsupial Monodelphis domestica reveals innovation in RT non-coding sequences."; RL Nature 447:167-177(2007). RN [2] {ECO:0000313|Ensembl:ENSMODP00000036552} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMODP00000036552}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_007498355.1; XM_007498293.1. DR RefSeq; XP_007498356.1; XM_007498294.1. DR RefSeq; XP_007498357.1; XM_007498295.1. DR STRING; 13616.ENSMODP00000036552; -. DR Ensembl; ENSMODT00000038144; ENSMODP00000036552; ENSMODG00000008621. DR GeneID; 100021066; -. DR CTD; 23353; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6TCF6; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002280; Chromosome 6. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002280}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002280}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 391 412 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 424 442 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 570 604 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 928 AA; 105190 MW; FE5E32751FE92020 CRC64; MDFSRLHTYT PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLVT TAYTSDDGQV ENTHSYTRNT AYKDKISKTS KQHRNTNKQS LAITRTPRKA TSSSSLLSQS TSHASDASLR SSVLDESLIR EQTKVDHFWG LDDDGDLKGG TKNVIQGNGD LATGGTDTTL VNGYTCTDCS MLSERKDVLT AYSTSHVPSS RIYSRERSQK RGASLYMNRI LRLAKHTAAS FSSLLVQLFQ VVLMKLGYES ENHKLKNYEF KDCESKSYKT KSHESKAHSN YCGSMNVKEF LREDGHLSVN GESLCDDCKG KKHLETYTTT HLQSSRSKRV ARTIWHTFSY TGYFLMQTLQ RIGATGWFVS KKVLSFLWLA IVSPGKAASG VFWWLGTGWY QFVTLISWLN VFLLTRCLPK ICKLLLLLIP LLLLLGIGLY LWNMESFLSL LPIFNWTTIH RTPKIDESRY FFKPDSSLVN QPVEGDVKFF DWHRIGEIER QMALLSDRCH NSDEDYGKVT LLLQKLQAKV DQMDDDSGTL SLIKNVVGQH LKEMKSDSIS SSKTDFLAFH QEHELRILKL EDLLGKLSEK SKIIQEELDQ TKSRTFSGID EQQHLLSKVK HLEMELGHLK SELLTWQGLK TSCAKVETMH EKVDTQIRET IKLMFSGDQQ DGSLEWLLQW LSSKFVSKGD LQILLRDLEL QILKNITHHM SMTKEIPSSE TVVNAVNSLG ISGITEAQAH AIVNNALKLY SQDKTGMVDF ALESGGGSIL STRCSETYET KTALISLFGI PLWYFSQSPR VVIQPDIYPG NCWAFKGSQG YLVVRLSMMI HPTAFTIEHI PKTLSPTGNI TSAPKDFSVY GLDNEYQEEG MLLGQFVYNQ EGESLQMFHA MKSPGKAFQI VELRILSNWG HPEYTCLYRF RVHGEPIK // ID F6TPB5_HORSE Unreviewed; 726 AA. AC F6TPB5; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000001880}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSECAP00000001880}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000001880, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000001880, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000001880, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000001880} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000001880}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000001880}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_001499892.2; XM_001499842.3. DR STRING; 9796.ENSECAP00000001880; -. DR PaxDb; F6TPB5; -. DR Ensembl; ENSECAT00000002633; ENSECAP00000001880; ENSECAG00000000797. DR GeneID; 100070178; -. DR KEGG; ecb:100070178; -. DR CTD; 25777; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6TPB5; -. DR KO; K19347; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002281; Chromosome 28. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 173 191 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 222 243 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 282 302 {ECO:0000256|SAM:Coils}. FT COILED 372 410 {ECO:0000256|SAM:Coils}. FT COILED 413 440 {ECO:0000256|SAM:Coils}. FT COILED 487 507 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 726 AA; 81744 MW; 6454151E3E89C9CF CRC64; MSRRSQRLTR YSQGDDDGSS SSGGSSVMGS QSTLFKDSPL RTLKKKSSNM KRLSPAPQLG PSSDAHTSYY SESVVRESYI GSPRAAALAR SSIFDDQLHG DSYWSEDLRV RRRRGTGGTE SSKINGLAEN KLSEDFFGSS SGYSSEDDYA GYSETDQRSS GSRLRSAVSR AGSFFWMVVT SPGRLFGLLY WWVGTTWYRL TTAASLLDVF VLTRRFSSLK TFLWFLLLLL LLTGLTYGAW YFYPYWLQTF HPAVVSWWAG KGNSQQHEVW ESRESSPQFQ AEQRLLSRVH SLERRLEALA AEFSSNWQKE AMRLERLELR QGATGQGGSG SLSHEDTLGL LEGLVSRREA ALKEDFRRDT AARIQEELVT LRAEHQQDLE DLFKKIVQAS QESEAQLQQL KSEWQRMTQE SFRENSMKEL GRLEGQLAGL RQELAALTLK QSSVVDQVDL LPQQIQAVRD DVESQFPAWV SQFLLRGGGT RAGLLQREEI QAQLQELESK ILAHMAEMQG KSAREAAASL GLTLQKEGVI GVTEEQVHRI VKQALKRYSE DRIGMVDYAL ESGGASVIST RCSETYETKT ALLSLFGIPL WYHSQSPRVI LQPDVHPGNC WAFQGPQGFA VVRLSARIRP TAVTLEHVPK SLSPNSTISS APKDFAIFGF DEDLQQEGTL LGQFTYDQDG EPIQTFYFQD PKMATYQVVE LRILTNWGHP EYTCIYRFRV HGEPAH // ID F6TVL7_CALJA Unreviewed; 352 AA. AC F6TVL7; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000045027}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSCJAP00000045027}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000045027, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000045027, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000045027} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000045027}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01140041; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01140042; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01140043; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01140044; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000044146; -. DR Ensembl; ENSCJAT00000060377; ENSCJAP00000045027; ENSCJAG00000014725. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000008225; Chromosome 8. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}. FT COILED 97 117 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 352 AA; 40037 MW; BD034B06B207E406 CRC64; MSRTIKVRMA AIFFSDCSEE ASGSGNALLA EHENPDVNGV TRSWKIILST MFTLTFLLVG LLSHQWLKET EDPQKSRRLY AIIAEYGSRI YKYQARLHMS KEQLELLKKE SQTLENNFRE ILFLTEQIDV LKALLRDMKD GMDNNHSWST HGDPAEDPDH TEEMSNLVDY VLKKLREDQV QMADYALKSA GASIIEAGTS ESYKNNKAKL YWHGIGFLNH EMPPDTIPRC LPWKVLGFGS QGHTLIKLAR KIIPTAVTME HISEKVSPSG NISSAPKEFS IYGITKKCEG EEIFLGQFIY NKTGTTIQTF ELQHAVSEYL LCVKLNIFSN WGHPNYTCLY RFRVHGIPGS HI // ID F6TYD4_XENTR Unreviewed; 354 AA. AC F6TYD4; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSXETP00000059030}; GN Name=sun3 {ECO:0000313|Ensembl:ENSXETP00000059030}; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000059030, ECO:0000313|Proteomes:UP000008143}; RN [1] {ECO:0000313|Ensembl:ENSXETP00000059030} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20431018; DOI=10.1126/science.1183670; RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J., RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., RA Blitz I.L., Blumberg B., Dichmann D.S., Dubchak I., Amaya E., RA Detter J.C., Fletcher R., Gerhard D.S., Goodstein D., Graves T., RA Grigoriev I.V., Grimwood J., Kawashima T., Lindquist E., Lucas S.M., RA Mead P.E., Mitros T., Ogino H., Ohta Y., Poliakov A.V., Pollet N., RA Robert J., Salamov A., Sater A.K., Schmutz J., Terry A., Vize P.D., RA Warren W.C., Wells D., Wills A., Wilson R.K., Zimmerman L.B., RA Zorn A.M., Grainger R., Grammer T., Khokha M.K., Richardson P.M., RA Rokhsar D.S.; RT "The genome of the Western clawed frog Xenopus tropicalis."; RL Science 328:633-636(2010). RN [2] {ECO:0000313|Ensembl:ENSXETP00000059030} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSXETP00000059030}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAMC01017613; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01017614; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8364.ENSXETP00000059030; -. DR PaxDb; F6TYD4; -. DR Ensembl; ENSXETT00000066361; ENSXETP00000059030; ENSXETG00000012087. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6TYD4; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008143; Unassembled WGS sequence. DR ExpressionAtlas; F6TYD4; baseline. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008143}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008143}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 85 105 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 128 148 {ECO:0000256|SAM:Coils}. FT COILED 158 178 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 354 AA; 41179 MW; 3B40C67D79527D05 CRC64; MLRRSERNRI NLKSPSSESK ERSKATMSPK PTSGKEKLET AQPPITLQGT RRIKHSNPYV TKDIRKGESM EIEVASTSRN NNFDCLYEFV FLFAVFAFVL LLIYIRSQLI SCILLEQKMR QQNTDMTLKE ISRMKDRFQE ILNDVSEQKR TQMTKMMVQE IKNELKKWEE DNVQVKDYAL YSLGATIIKD KTSQSLKSDN LHWSFLGILS WPYTSCPEEI LKPDVYPGKC WTFPGSQGQV LIKLSAKIIP VAVTLQHISK TISPSKNYSS APRDFSVFVS KWHLLASVGE MLFFFIYNNW EKSLIKISLP CLQNDDTSRF QFIQLRILSN WGNEKYTSVY RFQVHQELPV QLRS // ID F6UPU6_HORSE Unreviewed; 356 AA. AC F6UPU6; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000020248}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSECAP00000020248}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000020248, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000020248, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000020248, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000020248} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000020248}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000020248}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_005609188.1; XM_005609131.1. DR STRING; 9796.ENSECAP00000020248; -. DR PaxDb; F6UPU6; -. DR Ensembl; ENSECAT00000024379; ENSECAP00000020248; ENSECAG00000022760. DR GeneID; 100066102; -. DR KEGG; ecb:100066102; -. DR CTD; 256979; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6UPU6; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002281; Chromosome 4. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 44 63 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 356 AA; 40462 MW; 69C72EB93ED18475 CRC64; MSGRPKSRKG ARVIRVHSEE ASSTSSSDTV LSEHGNPDSN RLTGSWKIII SMAFILTFLL IGLRNHMWLK ETEFPQKSRP FYSVIAEYGS RLYNYQARRR MPKEQVELVK KESRTLENNF REILFLIEQI DVLKALLRDM QNGLHSYSWN PDGDPTEVQN HTEEISNLVN YVLKKLREDQ VQMADYALKS AGASIIEAGT SESYKNSKAK LYWHGIGFLN YEMPPDIILQ PDVHPGKCWA FPGSQGHALI KLAMKIIPTA VTMEHISEKV SPSGDISSAP KEFSVYGMSK QCEGEEIFLG QFVYNRTGAT IQTFELQHKV PESLLCVKLK ILSNWGHLKY TCLYRFRVHG TPGDDT // ID F6UTZ6_HORSE Unreviewed; 1230 AA. AC F6UTZ6; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000001528}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSECAP00000001528}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000001528, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000001528, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000001528, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000001528} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000001528}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000001528}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9796.ENSECAP00000001528; -. DR PaxDb; F6UTZ6; -. DR Ensembl; ENSECAT00000002158; ENSECAP00000001528; ENSECAG00000000971. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F6UTZ6; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000002281; Chromosome 5. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}. FT COILED 912 932 {ECO:0000256|SAM:Coils}. FT COILED 962 982 {ECO:0000256|SAM:Coils}. FT COILED 1168 1188 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSECAP00000001528}. SQ SEQUENCE 1230 AA; 136856 MW; 0A668F40184C9E7A CRC64; LPSWRVCCKE SSSASSYYSQ DDNCALENED VQFQKKDERE GPINAELSGK VGSNLPIPPE EHKLKDDYIV DVQNTESETL SPSVIETLPA VDLHEDSSSV VVGSENTENT SSSSTSEITP VSKLDEIEKS GTIPIAKPRE TEQSETDCDV GETLEANAPV DQPAFVNPPE SLVGQHIENV SSSHGKGKIT KSEFESKVSA SDQSNGDPKS ALNASDNLKN ESSDYTKPGE IDPTSVTSPK DPEDIPTFDE WKKKVMEVEK EKSQSMHPSS NGGLHATKKV QKNRNNYASV ECGAKILAAN PEAKSTSAIL IENMDLYMLN PCSTKIWFVI ELCEPIQVKQ LDIANYELFS STPKDFLVSI SDRYPTNKWI KLGTFHGRDE RNVQSFPLDE QMYAKYVKMF IKYIKVELIS HFGSEHFCPL SLIRVFGTSM VEEYEEIADS QYQSERQELF DEDYDYPLDY NNGEDKSSKN LLGSATNAIL NMVNIAANIL GAKTEDLMEG NKSISENATA TTPPKMPESA PVPTPVPSPE FVTTEGHVHD TQPSSPDTPK ESPIVQLVQE EEEEASPSTV TLLGSGEQED ESSPWFESET QIFCSELTTI CCISSFAEYI YKWCSVRIAL YRQRSRTDVS KEKDYLVSAQ PPLLLPAESV DVSVLQPPSG ELDGKSKEKE TETTVLGDLS DMHQGDLINH TVDAIELEPS HPQTLSQSVL LDVTPEINSL SKTELSEPIK YEAGHTPSQV ITQESSVEVD NETEKKSESF SSVEKSTVIY ETNKLNEVMD NIVKEDLNSM QIITKLTETI VPPVNTATVP DSEDGEAKMS VADTPKQILT PVVDSSSVPE VKEEEQSPED ALLRGLQRTA TDFYAELQNS TDLGYANGNL VHGSNQKESV FMRLNNRIKA LEVNMSLSGR YLEELSQRYR KQMEEMQKAF NKTIVKLQNT SRIAEEQDQR QTEAIQLLQA QLTNMTQLVS NLSTTVAELK REVSDRQSYL VISLVLCVVL GLMLCMQRCR NTSPFDGDYI SKLPKSNQYP SPKRCFSSYD DMNLKRRTSF PLIRSKSLQL TGKEVDPNDL YIVEPLKFSP EKKKKRCKYK TEKIETIKPA DRLHPIANGD IRGRKPFTNQ RDFSNMGEVY HSSYKGPPSE GSSETSSQSE ESYFCGISAC TSLCNGQSQK TKTEKRALKR RRSKVQDQGK LIKTLIQTKS GSLPSLHDII KGNKEITVGT FGVTAVSGHI // ID F6V9Z4_CIOIN Unreviewed; 770 AA. AC F6V9Z4; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 2. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCINP00000022579}; GN Name=LOC100179144 {ECO:0000313|Ensembl:ENSCINP00000022579}; OS Ciona intestinalis (Transparent sea squirt) (Ascidia intestinalis). OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. OX NCBI_TaxID=7719 {ECO:0000313|Ensembl:ENSCINP00000022579, ECO:0000313|Proteomes:UP000008144}; RN [1] {ECO:0000313|Ensembl:ENSCINP00000022579} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15114417; DOI=10.1007/s00239-003-2559-6; RA Gissi C., Iannelli F., Pesole G.; RT "Complete mtDNA of Ciona intestinalis reveals extensive gene RT rearrangement and the presence of an atp8 and an extra trnM gene in RT ascidians."; RL J. Mol. Evol. 58:376-389(2004). RN [2] {ECO:0000313|Ensembl:ENSCINP00000022579} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCINP00000022579}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EAAA01002408; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002127085.1; XM_002127049.3. DR STRING; 7719.ENSCINP00000022579; -. DR Ensembl; ENSCINT00000022825; ENSCINP00000022579; ENSCING00000011949. DR GeneID; 100179144; -. DR KEGG; cin:100179144; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F6V9Z4; -. DR OrthoDB; EOG7MPRDC; -. DR Proteomes; UP000008144; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008144}; KW Reference proteome {ECO:0000313|Proteomes:UP000008144}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 770 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003343724. SQ SEQUENCE 770 AA; 87865 MW; 8AA2959DBD2B8216 CRC64; MPLRCFWIFL IGIILILSSS RHRAENIEKK EEETAKDVPI SNEEQTENNF KLDKENESET PLDLKIETDK QIDEKNVKHE NIPIIPDNTN VKFEKKSEET ENENKENPQN TTESVVETSD SENLTKSPEN GNIGFEADGS THVTLNDEKI SMEEMKTTTP GVEVEMVNDP KDQEIIDKIL NQSVDEPTKE DVNLTKIVDE INNTILNDSH TESKPDVLPT TNDDLTQTVI EPKTTHTVHL NKAVDSEDQE TTTTTTTTTT EVKTEEQNEQ ILEPKSKINK PEKEEEETEA VPTADKEDTT TSNDNEVENT DKQEVIVNSN KTTTANAEEK QNEENVKLEV DSIQSFEEWK KKQIQDREEA AVKVTSQSPP RAIRTKKTQV NYASQDCGAK ILTQNPEAKH VSAILDENKD MYMLNPCSAN IWFVVELCEP IQIRQLQVAN LELFSSAPHI FDISISERYP AREWRPLGTF EARNERTVQT FAPPREELMF AKYIKFEMKS HFGKEHFCPL TLIRVLGVSM VEEYEETEDK NNEKSEMGRE SGEQKIIKNI HDDDVVNEDG KESGSIEKMF KIMKNAVGTL LGNGGSEQNT TNETNISNET LDEKTNKTEE DDKIDPKWES PVTLVTNKSE VAPPVLRKTP IVTLVETGDG QEVLMEEDFN PHLHSLRFLD KISRDKFTAH LCWFLELIYR VSVMSCIMQP YNNQTSADPS LYAYVLHKKV NQEQQVPPKV TEFKEKSKDF VKIEEVQKAA PLDDAVEPKK IKIFTKNQKI // ID F6X9N3_MACMU Unreviewed; 376 AA. AC F6X9N3; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000012321}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSMMUP00000012321}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000012321, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000012321, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000012321, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|Ensembl:ENSMMUP00000012321} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000012321}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMMUP00000012321}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9544.ENSMMUP00000012321; -. DR Ensembl; ENSMMUT00000013145; ENSMMUP00000012321; ENSMMUG00000009414. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6X9N3; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000006718; Chromosome 10. DR ExpressionAtlas; F6X9N3; baseline. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}. FT COILED 155 175 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 376 AA; 42327 MW; 48B347CC6165672B CRC64; MPRSSRSPGD PGAPLEDVAH NPRPRRIAQR GRNTSRMVED TSSNMNDNFL LPVRINAQAL GLTQCMLGCV SWFTCFACSL RTQAQQVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HLPSKMKVWQ DDSINGPLQS LRLYQEKVRH HSGEIQDLRG SMNLLIAKLQ EMEAMSDEQK VAQKIMKMIH GDYIEKPDFA LKSTGASIDF EHTSATYNHE KAHSYWNWIQ LWNYAQPPDL AEEPNVTPGN CWAFEGNRGQ VTIQLAQKVY LSNLTLQHIP KTISPSGSLD TAPKDFVIYG MEGSPKEEVF LGAFQFQPES IIQMFPLQNQ PARAFGAVKV KISSNWGNPA FTCLYRVRVH GSVAPPXEQA SPEPLP // ID F6X9P3_MACMU Unreviewed; 351 AA. AC F6X9P3; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000012320}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSMMUP00000012320}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000012320, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000012320, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000012320, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|Ensembl:ENSMMUP00000012320} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000012320}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMMUP00000012320}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSMMUT00000013144; ENSMMUP00000012320; ENSMMUG00000009414. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000006718; Chromosome 10. DR ExpressionAtlas; F6X9P3; baseline. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}. FT COILED 130 150 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 351 AA; 39655 MW; 925006F38BC20742 CRC64; MPRSSRSPGD PGAPLEDVAH NPRPRRIAQR GRNTSRMVED TSSNMTWFTC FACSLRTQAQ QVLFNTCRCK LLCQKLMEKT GILLLCAFGF WMFSIHLPSK MKVWQDDSIN GPLQSLRLYQ EKVRHHSGEI QDLRGSMNLL IAKLQEMEAM SDEQKVAQKI MKMIHGDYIE KPDFALKSTG ASIDFEHTSA TYNHEKAHSY WNWIQLWNYA QPPDLAEEPN VTPGNCWAFE GNRGQVTIQL AQKVYLSNLT LQHIPKTISP SGSLDTAPKD FVIYGMEGSP KEEVFLGAFQ FQPESIIQMF PLQNQPARAF GAVKVKISSN WGNPAFTCLY RVRVHGSVAP PXEQASPEPL P // ID F6YJT5_HORSE Unreviewed; 442 AA. AC F6YJT5; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000019432}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSECAP00000019432}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000019432, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000019432, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000019432, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000019432} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000019432}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000019432}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9796.ENSECAP00000019432; -. DR PaxDb; F6YJT5; -. DR Ensembl; ENSECAT00000023460; ENSECAP00000019432; ENSECAG00000021900. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6YJT5; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002281; Chromosome 22. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 142 162 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 168 193 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 206 240 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 442 AA; 48297 MW; 6DD1B40792DF7B22 CRC64; MRRSPRPGSA ASPHKSTPNC YSDNSNSSVS VTSGESSGHR SAGLGPGEPE GRRARGSSCG EPALSAGVPG TARAGSSRQK PAPRSHTGRT ACGAANREGR GLGSPLRTPV LIFEPLAHVG VSSSFPELDP TPLRLSLFLS HLLFQVLSVL LSLVGDMLVS VYREVCSIRF LLTAVSLLSL FLAALWWGLL YLIPLAENEP KEMLTLSEYH ERVRSQGQQL QQLQAELDKL YKEVSSVRAA NSERVAKLVF QRLNEDFVRK PDYALSSVGA SIDLEKTSHD YGDANTAYFW NRFSFWNYAR PPTVILEPDV FPGNCWAFEG DQGQVVIRLP SRVQLSDITL QHPPPSVAHT SGANSAPRDF AVYGLQVDDE TEVFLGKFTF DVEKSEIQTF HLQNDPPAAF PKVKIQILSN WGHPRFTCLY RVRAHGMRTS EGAGESATGG PH // ID F6YMU8_HORSE Unreviewed; 2610 AA. AC F6YMU8; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000021258}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSECAP00000021258}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000021258, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000021258, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000021258, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000021258} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000021258}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000021258}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_001489913.2; XM_001489863.4. DR ProteinModelPortal; F6YMU8; -. DR STRING; 9796.ENSECAP00000021245; -. DR PaxDb; F6YMU8; -. DR PRIDE; F6YMU8; -. DR Ensembl; ENSECAT00000025553; ENSECAP00000021258; ENSECAG00000023505. DR GeneID; 100050203; -. DR KEGG; ecb:100050203; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR KO; K12231; -. DR Proteomes; UP000002281; Chromosome 1. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289332 MW; D317821500702739 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRGT TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALQK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNTVDL DMKQDCSQLV ERINVFKTAF SENEDEESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTPL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAIST LQSSDILSLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EEGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID F6YN41_HORSE Unreviewed; 2613 AA. AC F6YN41; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000021245}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSECAP00000021245}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000021245, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000021245, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000021245, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000021245} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000021245}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000021245}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9796.ENSECAP00000021245; -. DR PaxDb; F6YN41; -. DR Ensembl; ENSECAT00000025538; ENSECAP00000021245; ENSECAG00000023505. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; F6YN41; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000002281; Chromosome 1. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1248 1268 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2613 AA; 289731 MW; 7B285D0C7042B053 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRGT TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALQK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNVSLC NSTEQSIYNE FRKFIINVFK TAFSENEDEE SRPAVALIRK LIAVLESIER LPLHLYDTPG STYNLQILTR RLRFRLERAP GETALIDRTG RMLKMEPLAT VESLEQYLLK MVAKQWYDFD RSSFVFVRKL REGQNFIFRH QHDFDENGII YWIGTNAKTA YEWVNPAAYG LVVVTSSEGR NLPYGRLEDI LSRDNSALNC HSNDDKNAWF AIDLGLWVIP SAYTLRHARG YGRSALRNWV FQVSKDGQNW TPLYTHVDDC SLNEPGSTAT WPLDPPKDEK QGWRHVRIKQ MGKNASGQTH YLSLSGFELY GTVNGVCEDQ LGKAAKEAEA NLRRQRRLVR SQVLKYMVPG ARVIRGLDWK WRDQDGSPQG EGTVTGELHN GWIDVTWDAG GSNSYRMGAE GKFDLKLAPG YDPDTVASPK PVSSTVSGTT QSWSSLVKNN CPDKTSAAAG SSSRKGSSSS VCSVASSSDI SLGSTKTERR SEIVMEHSIV SGADVHEPIV VLSSAENVPQ TEVGSSSSAS TSTLTAETGS ENAERKLGPD SSVRTPGESS AISMGIVSVS SPDVSSVSEL TNKEAASQRP LSSSASNRLS VSSLLAAGAP MSSSASVPNL SSRETSSLES FVRRVANIAR TNATNNMNLS RSSSDNNTNT LGRNVMSTAT SPLMGAQSFP NLTTPGTTST VTMSTSSVTS SSNVATATTV LSVGQSLSNT LTTSLTSTSS ESDTGQEAEY SLYDFLDSCR ASTLLAELDD DEDLPEPDEE DDENEDDNQE DQEYEEVMIL RRPSLQRRAG SRSDVTHHAV TSQLPQVPAG AGSRPIGEQE EEEYETKGGR RRTWDDDYVL KRQFSALVPA FDPRPGRTNV QQTTDLEIPP PGTPHSELLE EVECTPSPRL ALTLKVTGLG TTREVELPLT NFRSTIFYYV QKLLQLSCNG NVKSDKLRRI WEPTYTIMYR EMKDSDKEKE NGKMGCWSIE HVEQYLGTDE LPKNDLITYL QKNADAAFLR HWKLTGTNKS IRKNRNCSQL IAAYKDFCEH GTKSGLNQGA ISTLQSSDIL SLTKEQPQAK AGNGQNSCGV EDVLQLLRIL YIVASDPYSR ISQEEGDEQP QFTFPPDEFT SKKITTKILQ QIEEPLALAS GALPDWCEQL TSKCPFLIPF ETRQLYFTCT AFGASRAIVW LQNRREATVE RTRTTSSVRR DDPGEFRVGR LKHERVKVPR GESLMEWAEN VMQIHADRKS VLEVEFLGEE GTGLGPTLEF YALVAAEFQR TDLGAWLCDD NFPDDESRHV DLGGGLKPPG YYVQRSCGLF TAPFPQDSDE LERITKLFHF LGIFLAKCIQ DNRLVDLPIS KPFFKLMCMG DIKSNMSKLI YESRGDRDLH CTESQSEAST EEGHDSLSVG SFEEDSKSEF ILDPPKPKPP AWFNGILTWE DFELVNPHRA RFLKEIKDLA IKRRQILSNK GLSEDEKNTK LQELVLKNPS GSGPPLSIED LGLNFQFCPS SRIYGFTAVD LKPSGEDEMI TMDNAEEYVD LMFDFCMHTG IQKQMEAFRD GFNKVFPMEK LSSFSHEEVQ MILCGNQSPS WAAEDIINYT EPKLGYTRDS PGFLRFVRVL CGMSSDERKA FLQFTTGCST LPPGGLANLH PRLTVVRKVD ATDASYPSVN TCVHYLKLPE YSSEEIMRER LLAATMEKGF HLN // ID F6YNE8_ORNAN Unreviewed; 358 AA. AC F6YNE8; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOANP00000004888}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSOANP00000004888}; OS Ornithorhynchus anatinus (Duckbill platypus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Monotremata; Ornithorhynchidae; Ornithorhynchus. OX NCBI_TaxID=9258 {ECO:0000313|Ensembl:ENSOANP00000004888, ECO:0000313|Proteomes:UP000002279}; RN [1] {ECO:0000313|Ensembl:ENSOANP00000004888, ECO:0000313|Proteomes:UP000002279} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000004888, RC ECO:0000313|Proteomes:UP000002279}; RX PubMed=18464734; DOI=10.1038/nature06936; RA Warren W.C., Hillier L.W., Marshall Graves J.A., Birney E., RA Ponting C.P., Grutzner F., Belov K., Miller W., Clarke L., RA Chinwalla A.T., Yang S.P., Heger A., Locke D.P., Miethke P., RA Waters P.D., Veyrunes F., Fulton L., Fulton B., Graves T., Wallis J., RA Puente X.S., Lopez-Otin C., Ordonez G.R., Eichler E.E., Chen L., RA Cheng Z., Deakin J.E., Alsop A., Thompson K., Kirby P., RA Papenfuss A.T., Wakefield M.J., Olender T., Lancet D., Huttley G.A., RA Smit A.F., Pask A., Temple-Smith P., Batzer M.A., Walker J.A., RA Konkel M.K., Harris R.S., Whittington C.M., Wong E.S., Gemmell N.J., RA Buschiazzo E., Vargas Jentzsch I.M., Merkel A., Schmitz J., Zemann A., RA Churakov G., Kriegs J.O., Brosius J., Murchison E.P., RA Sachidanandam R., Smith C., Hannon G.J., Tsend-Ayush E., McMillan D., RA Attenborough R., Rens W., Ferguson-Smith M., Lefevre C.M., Sharp J.A., RA Nicholas K.R., Ray D.A., Kube M., Reinhardt R., Pringle T.H., RA Taylor J., Jones R.C., Nixon B., Dacheux J.L., Niwa H., Sekita Y., RA Huang X., Stark A., Kheradpour P., Kellis M., Flicek P., Chen Y., RA Webber C., Hardison R., Nelson J., Hallsworth-Pepin K., Delehaunty K., RA Markovic C., Minx P., Feng Y., Kremitzki C., Mitreva M., Glasscock J., RA Wylie T., Wohldmann P., Thiru P., Nhan M.N., Pohl C.S., Smith S.M., RA Hou S., Nefedov M., de Jong P.J., Renfree M.B., Mardis E.R., RA Wilson R.K.; RT "Genome analysis of the platypus reveals unique signatures of RT evolution."; RL Nature 453:175-183(2008). RN [2] {ECO:0000313|Ensembl:ENSOANP00000004888} RP IDENTIFICATION. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000004888}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOANP00000004888}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAPN01122392; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPN01122393; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_007659855.1; XM_007661665.1. DR RefSeq; XP_007659863.1; XM_007661673.1. DR STRING; 9258.ENSOANP00000004888; -. DR Ensembl; ENSOANT00000004889; ENSOANP00000004888; ENSOANG00000003087. DR GeneID; 100075480; -. DR KEGG; oaa:100075480; -. DR CTD; 256979; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F6YNE8; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002279; Chromosome 4. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002279}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002279}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 43 62 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 114 141 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 358 AA; 40666 MW; 817958D1016F91A0 CRC64; MSARLNLRRN ARFSGIPSEE AKSGTSRADQ QPEVQNLNPP RKIWKIITTV VLLLILLLSG FYNKEWLKET RISQNTLELY DVLADYGFKL YQNQVRGRDS KGHQERLRMG TVDLKNNLRE ILALKEQINT LKAELNHAMK EINNFSLHAD GDVREGQGRE TVSDKEMSKM VNYVLKKLRE DQVEMADYAL KSAGASIVEA GTSENYKNEK AKLYWYGLGF LNYEMPPDVI LQPAVHPGNC WAFPGPQGHA IIKLARKVIP KAVTLEHISE RISPSGNITS APKDFSIYGL RDECEGEGIF LGQFMYDKTG TAVQTFHLKV EVSEFLSCVK LKILTNWGHP KFTCVYRFRV HGNPESNM // ID F7AW58_HORSE Unreviewed; 374 AA. AC F7AW58; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000016357}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSECAP00000016357}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000016357, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000016357, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000016357, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000016357} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000016357}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000016357}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9796.ENSECAP00000016347; -. DR PaxDb; F7AW58; -. DR Ensembl; ENSECAT00000019944; ENSECAP00000016357; ENSECAG00000018621. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000002281; Chromosome 22. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}. FT COILED 158 185 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 374 AA; 42793 MW; 96DCEC7738097E42 CRC64; MPRHSRSPRD LGDPPEDVAH TCRDARPRRV LQRGRNTCRT PEEPSPNRND SFLLPIRINA TAPGLTQCML GCMSWITCLA CFLRTQAHQV LFNTCRCKLL FQKLMEKTGV LVLCVFGFWV FSVHLPSQVQ VWQDDSISTP LQSLRMYQEK VRHHTGEIQD LRGSMNQLIA KLQEMEAMSD EQKMAQKIMK MIQGDYIEKP DFALKSIGAS IDFEQTSATY NHDKARSYWN WIRLWNYAQP PDKAEEPNVT PGNCWAFSGD RGQVTIQLAQ KVYLSNLTLQ HIPRTISLSG SLDTAPKDFV IYGMEGSPRE EVFLGAFQFQ PENIIQTFQL QNQPARTFDA VKVKISSNWG NPRFTCLYRV RVHGSVTPPR EQPS // ID F7B1D6_HORSE Unreviewed; 375 AA. AC F7B1D6; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000016347}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSECAP00000016347}; OS Equus caballus (Horse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus. OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000016347, ECO:0000313|Proteomes:UP000002281}; RN [1] {ECO:0000313|Ensembl:ENSECAP00000016347, ECO:0000313|Proteomes:UP000002281} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000016347, RC ECO:0000313|Proteomes:UP000002281}; RX PubMed=19892987; DOI=10.1126/science.1178158; RG Broad Institute Genome Sequencing Platform; RG Broad Institute Whole Genome Assembly Team; RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., RA Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., RA Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L., RA Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J., RA Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J., RA Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R., RA Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F., RA Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L., RA Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M., RA White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.; RT "Genome sequence, comparative analysis, and population genetics of the RT domestic horse."; RL Science 326:865-867(2009). RN [2] {ECO:0000313|Ensembl:ENSECAP00000016347} RP IDENTIFICATION. RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000016347}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSECAP00000016347}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9796.ENSECAP00000016347; -. DR PaxDb; F7B1D6; -. DR Ensembl; ENSECAT00000019933; ENSECAP00000016347; ENSECAG00000018621. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7B1D6; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002281; Chromosome 22. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002281}; KW Reference proteome {ECO:0000313|Proteomes:UP000002281}. FT COILED 158 185 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 375 AA; 42938 MW; 9E72A91DFB9F7AC8 CRC64; MPRHSRSPRD LGDPPEDVAH TCRDARPRRV LQRGRNTCRT PEEPSPNRND SFLLPIRINA TAPGLTQCML GCMSWITCLA CFLRTQAHQV LFNTCRCKLL FQKLMEKTGV LVLCVFGFWV FSVHLPSQVQ VWQDDSISTP LQSLRMYQEK VRHHTGEIQD LRGSMNQLIA KLQEMEAMSD EQKMAQKIMK MIQGDYIEKP DFALKSIGAS IDFEQTSATY NHDKARSYWN WIRLWNYAQP PDGRFLEPNV TPGNCWAFSG DRGQVTIQLA QKVYLSNLTL QHIPRTISLS GSLDTAPKDF VIYGMEGSPR EEVFLGAFQF QPENIIQTFQ LQNQPARTFD AVKVKISSNW GNPRFTCLYR VRVHGSVTPP REQPS // ID F7B1Y2_CALJA Unreviewed; 691 AA. AC F7B1Y2; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000049139}; DE Flags: Fragment; GN Name=SUN2 {ECO:0000313|Ensembl:ENSCJAP00000049139}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000049139, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000049139, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000049139} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000049139}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01144889; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01144890; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01144891; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01144892; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000021388; -. DR Ensembl; ENSCJAT00000052954; ENSCJAP00000049139; ENSCJAG00000011606. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000008225; Chromosome 1. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 222 243 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 255 276 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 394 414 {ECO:0000256|SAM:Coils}. FT COILED 446 473 {ECO:0000256|SAM:Coils}. FT COILED 520 540 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSCJAP00000049139}. SQ SEQUENCE 691 AA; 76776 MW; 06310777E3EC0F7D CRC64; ASPLSAEQGL RGSPVWAAGA FRVSSGEEST SHRAMSRRSQ RLTRYSQGDD DGGSSSSGGS SVAGSQSTLF KDSPLRTLKR KSSNMKRPSP EPQLGLSSDP HTSYYSESLV RESYIGSPRA AFLARSALEE LHSDPDWGDH LRVRKRRGTG GSESSRASGL VVGKAAEDFL GSSSGYSSED DYVGYSDADL QSSGSRLQSV VSRVGSLLWM VATSPGRLFR LLYWWAGTTW YRLTTAASLL DVFVLTRRFS SLKTFLWFLL LLLLLTCLTY GAWYFYPYGL QTFHPALVSW WTAKDSRREH EGWESRDSSP HFQAEQHVLS RVHSLERRLE ALAAEFSSNW QKEAMRLERL ELQQGTPAQG GSGGLSHEDT LALLEGLVSR REAALREDFR RETAARIQEE LAALRAEHQQ DSEDLFKKIV RASQESEAHI QQLKSEWQSM TQEAFRESSV KELRRLEDQL AGLQQELAAL VQKQSSVADE VHLLPQQIQA TRDDVESQFP AWISEFLARG GGGRVGLLQR EEMQAQLRDL ENKILTHIAE MQGKSAREAA ASLGLTLQKE GVIGVTEEQV HRIVKQALQR YSEDRIGLAD YALESGGASV ISTRCSETYE TKTALLSLFG IPLWYHSQSP RVILQPDVYP GNCWAFQGPQ GFAVVRLSAR IRPTAVTLEH VPKALSPNST ISSAPKDFAI F // ID F7B4N3_MONDO Unreviewed; 2612 AA. AC F7B4N3; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMODP00000021945}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSMODP00000021945}; OS Monodelphis domestica (Gray short-tailed opossum). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Didelphimorphia; Didelphidae; Monodelphis. OX NCBI_TaxID=13616 {ECO:0000313|Ensembl:ENSMODP00000021945, ECO:0000313|Proteomes:UP000002280}; RN [1] {ECO:0000313|Ensembl:ENSMODP00000021945, ECO:0000313|Proteomes:UP000002280} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=17495919; DOI=10.1038/nature05805; RA Mikkelsen T.S., Wakefield M.J., Aken B., Amemiya C.T., Chang J.L., RA Duke S., Garber M., Gentles A.J., Goodstadt L., Heger A., Jurka J., RA Kamal M., Mauceli E., Searle S.M., Sharpe T., Baker M.L., Batzer M.A., RA Benos P.V., Belov K., Clamp M., Cook A., Cuff J., Das R., Davidow L., RA Deakin J.E., Fazzari M.J., Glass J.L., Grabherr M., Greally J.M., RA Gu W., Hore T.A., Huttley G.A., Kleber M., Jirtle R.L., Koina E., RA Lee J.T., Mahony S., Marra M.A., Miller R.D., Nicholls R.D., Oda M., RA Papenfuss A.T., Parra Z.E., Pollock D.D., Ray D.A., Schein J.E., RA Speed T.P., Thompson K., VandeBerg J.L., Wade C.M., Walker J.A., RA Waters P.D., Webber C., Weidman J.R., Xie X., Zody M.C., Baldwin J., RA Abdouelleil A., Abdulkadir J., Abebe A., Abera B., Abreu J., RA Acer S.C., Aftuck L., Alexander A., An P., Anderson E., Anderson S., RA Arachi H., Azer M., Bachantsang P., Barry A., Bayul T., Berlin A., RA Bessette D., Bloom T., Bloom T., Boguslavskiy L., Bonnet C., RA Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S., RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., RA Costello M., D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., RA Dhargay N., Dooley K., Dooley E., Doricent M., Dorje P., Dorjee K., RA Dupes A., Elong R., Falk J., Farina A., Faro S., Ferguson D., RA Fisher S., Foley C.D., Franke A., Friedrich D., Gadbois L., Gearin G., RA Gearin C.R., Giannoukos G., Goode T., Graham J., Grandbois E., RA Grewal S., Gyaltsen K., Hafez N., Hagos B., Hall J., Henson C., RA Hollinger A., Honan T., Huard M.D., Hughes L., Hurhula B., Husby M.E., RA Kamat A., Kanga B., Kashin S., Khazanovich D., Kisner P., Lance K., RA Lara M., Lee W., Lennon N., Letendre F., LeVine R., Lipovsky A., RA Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y., Lubonja R., RA Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C., RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O., RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., RA Mulrain L., Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., RA Nicol R., Norbu C., Norbu N., Novod N., O'Neill B., Osman S., RA Markiewicz E., Oyono O.L., Patti C., Phunkhang P., Pierre F., RA Priest M., Raghuraman S., Rege F., Reyes R., Rise C., Rogov P., RA Ross K., Ryan E., Settipalli S., Shea T., Sherpa N., Shi L., Shih D., RA Sparrow T., Spaulding J., Stalker J., Stange-Thomann N., RA Stavropoulos S., Stone C., Strader C., Tesfaye S., Thomson T., RA Thoulutsang Y., Thoulutsang D., Topham K., Topping I., Tsamla T., RA Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M., Wilkinson J., RA Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D., Zimmer A., RA Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J., Chin C., RA Gnerre S., MacCallum I., Graves J.A., Ponting C.P., Breen M., RA Samollow P.B., Lander E.S., Lindblad-Toh K.; RT "Genome of the marsupial Monodelphis domestica reveals innovation in RT non-coding sequences."; RL Nature 447:167-177(2007). RN [2] {ECO:0000313|Ensembl:ENSMODP00000021945} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMODP00000021945}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 13616.ENSMODP00000021945; -. DR Ensembl; ENSMODT00000022331; ENSMODP00000021945; ENSMODG00000017590. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; F7B4N3; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000002280; Chromosome 1. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002280}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000002280}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1247 1267 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2612 AA; 289658 MW; 3FCF02E9B4D1EA9E CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPATAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDEKKKKDA NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDAGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDL FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPSTS SQPILSVPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNVSPF FFTQAPQNQL VIERINVFKT AFSENEDDES RPAVALIRKL IAVLESIERL PLHLYDTPGS TYNLQILTRR LRFRLERASG ETSLIDRTGR MLKMEPLATV ESLEQYLLKM VAKQWYDFDR SSFVFVRKLR EGQNFVFRHQ HDFDENGIIY WIGTNAKTAY EWVNPAAYGL VVVTSSEGRN LPYGRLEDIL SRDSSALNCH SNDDKNAWFA IDLGLWVIPS AYTLRHARGY GRSALRNWVF QVSKDGQNWT TLYTHVDDCS LNEPGSTATW PLDPPKDEKQ GWRHVRIKQM GKNASGQTHY LSLSGFELYG TVNGVCEDQL GKAAKEAEAN LRRQRRLVRS QVLKYMVPGA RVIRGIDWKW RDQDGSPQGE GTVTGELHNG WIDVTWDAGG SNSYRMGAEG KFDLKLAPGY DPDTAASPKP VSSTVSGTTQ SWSSLVKNNC PDKTSAAAGS SSRKGSSSSV CSVASSSDIS LGSTKMERRS ESVMEQSIVS GTDVHEPIVV LSSAESMPQA EVGSSSSAST STLTADTGSE NAERKLGPDS SVRTAGESSA ISMGIVSVSS PDVSSVSELT NKEAASQRPL SSSASNRLSV SSLLAAGAPM SSSASVPNLS SRETSSLESF VRRVANIART NATNNMNLSR SSSDNNTNTL GRNVMSTATS PLMGAQSFPN LTTTGTTSTV TMSTSSVTSS SNVATATTVL SVGQSLSNTL TTSLTSTSSE SDTGQEAEYS LYDFLDSCRA STLLAELDDD EDLPEPDEED DENEDDNQED QEYEEVMILR RPSLQRRAGS RSDVTHHAVT SQLPQVPSGA GSRPLGEQEE EEYETKGGRR RTWDDDYVLK RQFSALVPAF DPRPGRTNVQ QTTDLEIPPP GTPHSELLEE VECTPSPRLA LTLKVTGLGT TREVELPLTN FRSTIFYYVQ KLLQFSCNGN VKSDKLRRIW EPTYTIMYRE MKDSDKEKEN GKMGCWSIEH VEQYLGTDEL PKNDLITYLQ KNADSAFLRH WKLTGTNKSI RKNRNCSQLI AAYKDFCEHG SKSGLSQGTI STFQNCDILS LAKEQPQAKA GNGQNSCGVE DVLQLLRILY IVASDPYSRS SQEEGDEQLQ FNFPPDEFTS KKITTKILQQ IEEPLALASG ALPDWCEQLT SKCPFLIPFE TRQLYFTCTA FGASRAIVWL QNRREATVER TRTTSTVRRD DPGEFRVGRL KHERVKVPRG DSLMEWAENV MQIHADRKSV LEVEFLGEEG TGLGPTLEFY ALVAAEFQRT ELGTWLCDDD FPDDESRHVD LGGGLKPPGY YVQRSCGLFT APFPQDSDEL ERITKLFHFL GIFLAKCIQD NRLVDLPISK PFFKLMCMGD IKSNMSKLIY ESRGDRDLHC TESQSEASTE EGHDSLSVGS FEEDSKSEFI LDPPKPKPPA WFNGILTWED FELVNPHRAR FLKEIKDLAI KRRQILGNKS LSEDEKNTKL QDLMLKNPSG SGPPLSIEDL GLNFQFCPSS RVYGFTAVDL KPRGEDEIIT MDNAEEYVDL MFDFCMQTGI QKQMEAFRDG FNKVFPMEKL SSFSHEEVQM ILCGNQSPSW AAEDIINYTE PKLGYTRDSP GFLRFVRVLC GMSSDERKAF LQFTTGCSTL PPGGLANLHP RLTVVRKVDA TDASYPSVNT CVHYLKLPEY SSEEIMRERL LAATMEKGFH LN // ID F7B8S5_XENTR Unreviewed; 184 AA. AC F7B8S5; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSXETP00000026372}; GN Name=sun3 {ECO:0000313|Ensembl:ENSXETP00000026372}; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000026372, ECO:0000313|Proteomes:UP000008143}; RN [1] {ECO:0000313|Ensembl:ENSXETP00000026372} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20431018; DOI=10.1126/science.1183670; RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J., RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., RA Blitz I.L., Blumberg B., Dichmann D.S., Dubchak I., Amaya E., RA Detter J.C., Fletcher R., Gerhard D.S., Goodstein D., Graves T., RA Grigoriev I.V., Grimwood J., Kawashima T., Lindquist E., Lucas S.M., RA Mead P.E., Mitros T., Ogino H., Ohta Y., Poliakov A.V., Pollet N., RA Robert J., Salamov A., Sater A.K., Schmutz J., Terry A., Vize P.D., RA Warren W.C., Wells D., Wills A., Wilson R.K., Zimmerman L.B., RA Zorn A.M., Grainger R., Grammer T., Khokha M.K., Richardson P.M., RA Rokhsar D.S.; RT "The genome of the Western clawed frog Xenopus tropicalis."; RL Science 328:633-636(2010). RN [2] {ECO:0000313|Ensembl:ENSXETP00000026372} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSXETP00000026372}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAMC01017613; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01017614; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR Ensembl; ENSXETT00000026372; ENSXETP00000026372; ENSXETG00000012087. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000008143; Unassembled WGS sequence. DR ExpressionAtlas; F7B8S5; baseline. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008143}; KW Reference proteome {ECO:0000313|Proteomes:UP000008143}. SQ SEQUENCE 184 AA; 21319 MW; DDEAF689821D9306 CRC64; LKKWEEDNVQ VKDYALYSLG ATIIKDKTSQ SLKSDNLHWS FLGILSWPYT SCPEEILKPD VYPGKCWTFP GSQGQVLIKL SAKIIPVAVT LQHISKTISP SKNYSSAPRD FSVFHLWAKC YFFLFMHNYI FASKSLIKIS LPCLQNDDTS RFQFIQLRIL SNWGNEKYTS VYRFQVHQEL PVQL // ID F7BKT5_MONDO Unreviewed; 333 AA. AC F7BKT5; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMODP00000011782}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSMODP00000011782}; OS Monodelphis domestica (Gray short-tailed opossum). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Didelphimorphia; Didelphidae; Monodelphis. OX NCBI_TaxID=13616 {ECO:0000313|Ensembl:ENSMODP00000011782, ECO:0000313|Proteomes:UP000002280}; RN [1] {ECO:0000313|Ensembl:ENSMODP00000011782, ECO:0000313|Proteomes:UP000002280} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=17495919; DOI=10.1038/nature05805; RA Mikkelsen T.S., Wakefield M.J., Aken B., Amemiya C.T., Chang J.L., RA Duke S., Garber M., Gentles A.J., Goodstadt L., Heger A., Jurka J., RA Kamal M., Mauceli E., Searle S.M., Sharpe T., Baker M.L., Batzer M.A., RA Benos P.V., Belov K., Clamp M., Cook A., Cuff J., Das R., Davidow L., RA Deakin J.E., Fazzari M.J., Glass J.L., Grabherr M., Greally J.M., RA Gu W., Hore T.A., Huttley G.A., Kleber M., Jirtle R.L., Koina E., RA Lee J.T., Mahony S., Marra M.A., Miller R.D., Nicholls R.D., Oda M., RA Papenfuss A.T., Parra Z.E., Pollock D.D., Ray D.A., Schein J.E., RA Speed T.P., Thompson K., VandeBerg J.L., Wade C.M., Walker J.A., RA Waters P.D., Webber C., Weidman J.R., Xie X., Zody M.C., Baldwin J., RA Abdouelleil A., Abdulkadir J., Abebe A., Abera B., Abreu J., RA Acer S.C., Aftuck L., Alexander A., An P., Anderson E., Anderson S., RA Arachi H., Azer M., Bachantsang P., Barry A., Bayul T., Berlin A., RA Bessette D., Bloom T., Bloom T., Boguslavskiy L., Bonnet C., RA Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S., RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., RA Costello M., D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., RA Dhargay N., Dooley K., Dooley E., Doricent M., Dorje P., Dorjee K., RA Dupes A., Elong R., Falk J., Farina A., Faro S., Ferguson D., RA Fisher S., Foley C.D., Franke A., Friedrich D., Gadbois L., Gearin G., RA Gearin C.R., Giannoukos G., Goode T., Graham J., Grandbois E., RA Grewal S., Gyaltsen K., Hafez N., Hagos B., Hall J., Henson C., RA Hollinger A., Honan T., Huard M.D., Hughes L., Hurhula B., Husby M.E., RA Kamat A., Kanga B., Kashin S., Khazanovich D., Kisner P., Lance K., RA Lara M., Lee W., Lennon N., Letendre F., LeVine R., Lipovsky A., RA Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y., Lubonja R., RA Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C., RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O., RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., RA Mulrain L., Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., RA Nicol R., Norbu C., Norbu N., Novod N., O'Neill B., Osman S., RA Markiewicz E., Oyono O.L., Patti C., Phunkhang P., Pierre F., RA Priest M., Raghuraman S., Rege F., Reyes R., Rise C., Rogov P., RA Ross K., Ryan E., Settipalli S., Shea T., Sherpa N., Shi L., Shih D., RA Sparrow T., Spaulding J., Stalker J., Stange-Thomann N., RA Stavropoulos S., Stone C., Strader C., Tesfaye S., Thomson T., RA Thoulutsang Y., Thoulutsang D., Topham K., Topping I., Tsamla T., RA Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M., Wilkinson J., RA Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D., Zimmer A., RA Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J., Chin C., RA Gnerre S., MacCallum I., Graves J.A., Ponting C.P., Breen M., RA Samollow P.B., Lander E.S., Lindblad-Toh K.; RT "Genome of the marsupial Monodelphis domestica reveals innovation in RT non-coding sequences."; RL Nature 447:167-177(2007). RN [2] {ECO:0000313|Ensembl:ENSMODP00000011782} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMODP00000011782}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_007500484.1; XM_007500422.1. DR RefSeq; XP_007500485.1; XM_007500423.1. DR RefSeq; XP_007500486.1; XM_007500424.1. DR STRING; 13616.ENSMODP00000011782; -. DR Ensembl; ENSMODT00000012005; ENSMODP00000011782; ENSMODG00000009429. DR GeneID; 100030253; -. DR KEGG; mdo:100030253; -. DR CTD; 256979; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7BKT5; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002280; Chromosome 6. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002280}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002280}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 12 32 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 92 119 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 333 AA; 38076 MW; D7BFCF1F8D9FFF24 CRC64; MERVRECFNQ IYWSWKIILS MMFLLACLII GFHNSERLKR TGLSNIPRQL YELSTDYGSK LYDYQARIRM PKVKMELLRV GSHNLESNSQ EILSLTKQID ILKALLKDIK NKMDNYILNP NTEAFGKQDD SDITNEEMVI LVNYVLKKLR EDQVQMADYA LKSAGASIVE AGTSESYKND KAKLYWHGIG FLSYEMPPDV ILQPDVHPGK CWAFPGSKGH TIIKLARKIT PTAVTMEHIS EKISPSGNTS SAPKDFSVYG LKEECKGEEI FLGQFMYNKK GTSVQTFHLQ NEVSEYLLCV KLKILNNWGH PKYTCLYRFR VHGKPETDGL GAP // ID F7BZ98_ORNAN Unreviewed; 240 AA. AC F7BZ98; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOANP00000012399}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSOANP00000012399}; OS Ornithorhynchus anatinus (Duckbill platypus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Monotremata; Ornithorhynchidae; Ornithorhynchus. OX NCBI_TaxID=9258 {ECO:0000313|Ensembl:ENSOANP00000012399, ECO:0000313|Proteomes:UP000002279}; RN [1] {ECO:0000313|Proteomes:UP000002279} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Glennie {ECO:0000313|Proteomes:UP000002279}; RX PubMed=18464734; DOI=10.1038/nature06936; RA Warren W.C., Hillier L.W., Marshall Graves J.A., Birney E., RA Ponting C.P., Grutzner F., Belov K., Miller W., Clarke L., RA Chinwalla A.T., Yang S.P., Heger A., Locke D.P., Miethke P., RA Waters P.D., Veyrunes F., Fulton L., Fulton B., Graves T., Wallis J., RA Puente X.S., Lopez-Otin C., Ordonez G.R., Eichler E.E., Chen L., RA Cheng Z., Deakin J.E., Alsop A., Thompson K., Kirby P., RA Papenfuss A.T., Wakefield M.J., Olender T., Lancet D., Huttley G.A., RA Smit A.F., Pask A., Temple-Smith P., Batzer M.A., Walker J.A., RA Konkel M.K., Harris R.S., Whittington C.M., Wong E.S., Gemmell N.J., RA Buschiazzo E., Vargas Jentzsch I.M., Merkel A., Schmitz J., Zemann A., RA Churakov G., Kriegs J.O., Brosius J., Murchison E.P., RA Sachidanandam R., Smith C., Hannon G.J., Tsend-Ayush E., McMillan D., RA Attenborough R., Rens W., Ferguson-Smith M., Lefevre C.M., Sharp J.A., RA Nicholas K.R., Ray D.A., Kube M., Reinhardt R., Pringle T.H., RA Taylor J., Jones R.C., Nixon B., Dacheux J.L., Niwa H., Sekita Y., RA Huang X., Stark A., Kheradpour P., Kellis M., Flicek P., Chen Y., RA Webber C., Hardison R., Nelson J., Hallsworth-Pepin K., Delehaunty K., RA Markovic C., Minx P., Feng Y., Kremitzki C., Mitreva M., Glasscock J., RA Wylie T., Wohldmann P., Thiru P., Nhan M.N., Pohl C.S., Smith S.M., RA Hou S., Nefedov M., de Jong P.J., Renfree M.B., Mardis E.R., RA Wilson R.K.; RT "Genome analysis of the platypus reveals unique signatures of RT evolution."; RL Nature 453:175-183(2008). RN [2] {ECO:0000313|Ensembl:ENSOANP00000012399} RP IDENTIFICATION. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000012399}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOANP00000012399}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSOANT00000012401; ENSOANP00000012399; ENSOANG00000007785. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000002279; Unassembled WGS sequence. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002279}; KW Reference proteome {ECO:0000313|Proteomes:UP000002279}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 240 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003354601. FT COILED 21 41 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 240 AA; 27144 MW; AEA07BAA8BA5432E CRC64; MDGLILFLFC LQVLGPDSTA LREYQEMVQL QAREIKDLRA ITDKLLATLQ EIRAMSDEQK IVQKILTMIQ GDYIEKPDFA LKSIGAAIDF EHTSATYSCS KARSYWNWFR LWDFAHSPEV ILEPNVTPGN CWPFLGHHGQ VVIRLARKIY LTNVTIQHIP KAVSLSGNLN AAPKDFAVYG VDDTGEDVFL GAFVFQADSA LQTFDLKNKH AKPFGSIKLK ITSNWGHPRF TCLYRVRAHG // ID F7BZA4_ORNAN Unreviewed; 299 AA. AC F7BZA4; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOANP00000012398}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSOANP00000012398}; OS Ornithorhynchus anatinus (Duckbill platypus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Monotremata; Ornithorhynchidae; Ornithorhynchus. OX NCBI_TaxID=9258 {ECO:0000313|Ensembl:ENSOANP00000012398, ECO:0000313|Proteomes:UP000002279}; RN [1] {ECO:0000313|Proteomes:UP000002279} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Glennie {ECO:0000313|Proteomes:UP000002279}; RX PubMed=18464734; DOI=10.1038/nature06936; RA Warren W.C., Hillier L.W., Marshall Graves J.A., Birney E., RA Ponting C.P., Grutzner F., Belov K., Miller W., Clarke L., RA Chinwalla A.T., Yang S.P., Heger A., Locke D.P., Miethke P., RA Waters P.D., Veyrunes F., Fulton L., Fulton B., Graves T., Wallis J., RA Puente X.S., Lopez-Otin C., Ordonez G.R., Eichler E.E., Chen L., RA Cheng Z., Deakin J.E., Alsop A., Thompson K., Kirby P., RA Papenfuss A.T., Wakefield M.J., Olender T., Lancet D., Huttley G.A., RA Smit A.F., Pask A., Temple-Smith P., Batzer M.A., Walker J.A., RA Konkel M.K., Harris R.S., Whittington C.M., Wong E.S., Gemmell N.J., RA Buschiazzo E., Vargas Jentzsch I.M., Merkel A., Schmitz J., Zemann A., RA Churakov G., Kriegs J.O., Brosius J., Murchison E.P., RA Sachidanandam R., Smith C., Hannon G.J., Tsend-Ayush E., McMillan D., RA Attenborough R., Rens W., Ferguson-Smith M., Lefevre C.M., Sharp J.A., RA Nicholas K.R., Ray D.A., Kube M., Reinhardt R., Pringle T.H., RA Taylor J., Jones R.C., Nixon B., Dacheux J.L., Niwa H., Sekita Y., RA Huang X., Stark A., Kheradpour P., Kellis M., Flicek P., Chen Y., RA Webber C., Hardison R., Nelson J., Hallsworth-Pepin K., Delehaunty K., RA Markovic C., Minx P., Feng Y., Kremitzki C., Mitreva M., Glasscock J., RA Wylie T., Wohldmann P., Thiru P., Nhan M.N., Pohl C.S., Smith S.M., RA Hou S., Nefedov M., de Jong P.J., Renfree M.B., Mardis E.R., RA Wilson R.K.; RT "Genome analysis of the platypus reveals unique signatures of RT evolution."; RL Nature 453:175-183(2008). RN [2] {ECO:0000313|Ensembl:ENSOANP00000012398} RP IDENTIFICATION. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000012398}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOANP00000012398}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9258.ENSOANP00000012398; -. DR Ensembl; ENSOANT00000012400; ENSOANP00000012398; ENSOANG00000007785. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7BZA4; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002279; Unassembled WGS sequence. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002279}; KW Reference proteome {ECO:0000313|Proteomes:UP000002279}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 299 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003354808. FT COILED 21 41 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 299 AA; 33562 MW; 40E364412146B9D7 CRC64; MDGLILFLFC LQVLGPDSTA LREYQEMVQL QAREIKDLRA ITDKLLATLQ EIRAMSDELT GVCRGMDTIN HSFSTAAEAI PLHFSKKKTR VNKILTMIQG DYIEKPDFAL KSIGAAIDFE HTSATYSCSK ARSYWNWFRL WDFAHSPEVI LEPNVTPGNC WPFLGHHGQV VIRLARKIYL TNVTIQHIPK AVSLSGNLNA APKDFAVYSN GAVARWILPR PRSPRCVLLP SQGVDDTGED VFLGAFVFQA DSALQTFDLK NKHAKPFGSI KLKITSNWGH PRFTCLYRVR AHGTMGEPE // ID F7C0N0_XENTR Unreviewed; 2516 AA. AC F7C0N0; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSXETP00000062619}; GN Name=hectd1 {ECO:0000313|Ensembl:ENSXETP00000062619}; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000062619, ECO:0000313|Proteomes:UP000008143}; RN [1] {ECO:0000313|Ensembl:ENSXETP00000062619} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20431018; DOI=10.1126/science.1183670; RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J., RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., RA Blitz I.L., Blumberg B., Dichmann D.S., Dubchak I., Amaya E., RA Detter J.C., Fletcher R., Gerhard D.S., Goodstein D., Graves T., RA Grigoriev I.V., Grimwood J., Kawashima T., Lindquist E., Lucas S.M., RA Mead P.E., Mitros T., Ogino H., Ohta Y., Poliakov A.V., Pollet N., RA Robert J., Salamov A., Sater A.K., Schmutz J., Terry A., Vize P.D., RA Warren W.C., Wells D., Wills A., Wilson R.K., Zimmerman L.B., RA Zorn A.M., Grainger R., Grammer T., Khokha M.K., Richardson P.M., RA Rokhsar D.S.; RT "The genome of the Western clawed frog Xenopus tropicalis."; RL Science 328:633-636(2010). RN [2] {ECO:0000313|Ensembl:ENSXETP00000062619} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSXETP00000062619}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAMC01016385; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01016386; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01016387; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01016388; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8364.ENSXETP00000030896; -. DR PaxDb; F7C0N0; -. DR Ensembl; ENSXETT00000060698; ENSXETP00000062619; ENSXETG00000014149. DR Xenbase; XB-GENE-1010869; hectd1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR Proteomes; UP000008143; Unassembled WGS sequence. DR ExpressionAtlas; F7C0N0; baseline. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008143}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008143}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1269 1289 {ECO:0000256|SAM:Coils}. FT COILED 1651 1675 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2516 AA; 278765 MW; 1751BDB9C90E11F3 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVEGAIK ALCNRLVVVE LNNRTSRDLA EQCVKKVLEL ICTRESGAVF EAGGLNCVLT FIRDSGHLVH KDTLHSAMAV VSRLCGKMEP QDASLETCVE SLSSLLKHED HQVSDGALRC FASLADRFTR RGVDPAPLAQ HGLTEELLSR MAAAGGTVSG PSSACKTGRG TSGGPSTSGD SKISNQVSTI VSLLSTLCRG SPVVTHDLLR AELLDSMESA LQGDERCVLD TMRLVDLLLV LLFEGRKALP KSSAGSTGRI PGLRRLDSSG ERSHRQLIDC IRSKDTDALI DAIDTGAFEV NFMDDVGQTL LNWASAFGTQ EMVEFLCERG ADVNRGQRSS SLHYAACFGR PQVAKTLLRH GANPDLRDED GKTPLDKARE RGHSEVVAIL QSPGDWMCPV NKGDEKKKKD SNREEEECNE PKGDPEMAPI YLKRLLPVFA QTFQQTMLPS IRKASLALIR KMIHFCSEAL LKEVCHSDAG HNLPTVLVEI TATVLDQEDD DDGHLLALQI IRDLVDKGDD LFLDQLARLG VISKVSTLAG PTSDDENEED SKPEKVRIHL FPLYSAHKGT QAAASQLKED EPQEDAKELQ QGRPYHWRDW SVIRGRDCLY IWSDAAALEL SNGSNGWFRF ILDGKLATMY SSGSPEGGSD SSESRSEFLE KLQRARSQVK PSTSSQPILS TPGPGKLTVG NWSLTCLKDG EIAIHNSDGQ QATILKEDLP GFVFESNRGT KHSFTAETSL GSEFVTGWTG KRGRKLKSKL EKTKQKVRTM ARDLYDDHFK AVESMPRGVV VTLRNIATQL ESAWELHTNR QCIEGENTWR DLMKTALENL IVLLKDENTI SPYEMCSSGL VQALLTVLNN NEDCDIKQDC GQLVERLNVF KTAFSENEDD ESRPAVALVR KLIAVLESIE RLPLHLYDTP GSSYNLQILT RRLRFRLERA PGETSLIDRT GRMLKMEPLA TVESLEQYLL KMVAKQWYDF DRSSFVFVRK LREGQSCVFR HQHDFDDNGI MYWIGTNAKT AYEWVNPAAY GLVVVTSSEG RNLPYGRLED ILSRDSSALN CHTNDDKSAW FAIDLGLWVV PSAYTLRHAR GYGRSALRNW VFQVSKDGQN WTTLYTHMDD CSLNEPGSTA TWPLDPAREE KQGWRHVRIK QTGKNASGQT HYLSLSGFEL YGNVTGVCED QLGKAAKEAE ANLRRQRRLV RSQVLKYMVP GARVIRGIDW KWRDQDGSAQ GEGTVTGELH NGTPPSWSSL VKNNCPDKAP PSSSSSCVVV GSVAGSGSRK GSSSSVCSVA SSSDVSLSCA KTERRAEEQV SDIHHDPILL LSSNQAASGS STCPPGGETV GEGGDRKAGE APAISMGMVS ISSPDVSSVS ELSNKEVAVP RPLGSSASNR LSVSSLLAAG APMSSSASVP NLSSRETSSL ESFVRRVANI ARTNATNNMN LSRSSSDNNT NTLGRNAVSS ATSPLMGAQS FPNLTTTGTT STVTMSTSSV TSSNVATATT GLSVGQSLSN TLTTSLTSTS SESDTGQEAE YSLYDFLDSC RASTLLAELD DDEDLPEPDE EDDENEDDNQ EEQEYEEVME EEEYETKGGR RRTWDDDYVL KRQFSALVPA FDPRPGRTNV QQTTDLEIPA PGTPHSELLE EVECAPAPRL ALTLKVTGLG SGREVELPLN NFRSTIFFYV QRLLQLSCNG AIKTDKLRRI WEPTYTIMYR EMKDSDKQKE CGRLGCWSVE HVEQSLGTDA LPKNDLITYL QRNADPGFLR RWKLTGTNKS IRKNRNCSQL IAAYKDFCEN GCKSLSMPAA LATLQSADIL SHSREQAQAK AGSSQNSCGV EDVLQLLRIL FIVASDPYSA RTPQEDGEDM LLFSVPPEEF TSKKITTKIV QQIEEPLALA SGALPDWCEQ LTSKCPFLIP FETRQLYFTC TAFGASRAIV WLQNRREATV ERSRTASAVR RDDPGEFRVG RLKHERVKVP RGESLMEWAE NVMQIHADRK SVLEVSYVKK LSYINFIMPE VQPLDFFLWP TSLCPTDKPN CSNPKIFNKT KQRKLILSVS QLRKGDMTYV SVLTQWIDPF HSQNTFWCSA PVSLPLIKSM DNLTLSKGLD QLLYASRGEE SEHCTESQSE ASTEDGHDAL SVGSFEEDCK SEFILDPPKP KPPAWFQGIL TWEDFELINP HRARFLRDIR ELAVKRRQIL GNRCLSEDEK NTQLQELMLK NPSGSGPPVS IEDLGLNFQF CPSSRVYGFS AVDLRPNGED EMVTIDNAEE YVDLMFDFCM QTGVQKQMEA FRSGFNKVFP MEKLGSFSPE EVQMILCGNQ SPSWSAEDII NYTEPKLGYT RESPGFLRFV RVLCGMSSDE RKAFLQFTTG CSTLPPGGLA NLHPRLTVVR KVDATDASYP SVNTCVHYLK LPEYSSEEIM RDRLLAATME KGFHLN // ID F7CMW6_MACMU Unreviewed; 142 AA. AC F7CMW6; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000037819}; DE Flags: Fragment; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSMMUP00000037819}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000037819, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000037819, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000037819, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|Ensembl:ENSMMUP00000037819} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000037819}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMMUP00000037819}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSMMUT00000044797; ENSMMUP00000037819; ENSMMUG00000020959. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000006718; Chromosome 10. DR ExpressionAtlas; F7CMW6; baseline. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMMUP00000037819}. SQ SEQUENCE 142 AA; 15643 MW; 90F68FF71EAAE12C CRC64; VFPGNCWAFE GDQGQVVIQL PGRVQLSDIT LQHPPPSVEH TGGANSAPRD FAVFVRLKFK TCKGLQVDDE TEVFLGKFTF DVEKSEIQTF HLQNDPPAAF PKVKIQILSN WGHPRFTCLY RVRAHGVRTS EGAEGSATGG PH // ID F7CXX7_ORNAN Unreviewed; 932 AA. AC F7CXX7; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOANP00000011145}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSOANP00000011145}; OS Ornithorhynchus anatinus (Duckbill platypus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Monotremata; Ornithorhynchidae; Ornithorhynchus. OX NCBI_TaxID=9258 {ECO:0000313|Ensembl:ENSOANP00000011145, ECO:0000313|Proteomes:UP000002279}; RN [1] {ECO:0000313|Ensembl:ENSOANP00000011145, ECO:0000313|Proteomes:UP000002279} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000011145, RC ECO:0000313|Proteomes:UP000002279}; RX PubMed=18464734; DOI=10.1038/nature06936; RA Warren W.C., Hillier L.W., Marshall Graves J.A., Birney E., RA Ponting C.P., Grutzner F., Belov K., Miller W., Clarke L., RA Chinwalla A.T., Yang S.P., Heger A., Locke D.P., Miethke P., RA Waters P.D., Veyrunes F., Fulton L., Fulton B., Graves T., Wallis J., RA Puente X.S., Lopez-Otin C., Ordonez G.R., Eichler E.E., Chen L., RA Cheng Z., Deakin J.E., Alsop A., Thompson K., Kirby P., RA Papenfuss A.T., Wakefield M.J., Olender T., Lancet D., Huttley G.A., RA Smit A.F., Pask A., Temple-Smith P., Batzer M.A., Walker J.A., RA Konkel M.K., Harris R.S., Whittington C.M., Wong E.S., Gemmell N.J., RA Buschiazzo E., Vargas Jentzsch I.M., Merkel A., Schmitz J., Zemann A., RA Churakov G., Kriegs J.O., Brosius J., Murchison E.P., RA Sachidanandam R., Smith C., Hannon G.J., Tsend-Ayush E., McMillan D., RA Attenborough R., Rens W., Ferguson-Smith M., Lefevre C.M., Sharp J.A., RA Nicholas K.R., Ray D.A., Kube M., Reinhardt R., Pringle T.H., RA Taylor J., Jones R.C., Nixon B., Dacheux J.L., Niwa H., Sekita Y., RA Huang X., Stark A., Kheradpour P., Kellis M., Flicek P., Chen Y., RA Webber C., Hardison R., Nelson J., Hallsworth-Pepin K., Delehaunty K., RA Markovic C., Minx P., Feng Y., Kremitzki C., Mitreva M., Glasscock J., RA Wylie T., Wohldmann P., Thiru P., Nhan M.N., Pohl C.S., Smith S.M., RA Hou S., Nefedov M., de Jong P.J., Renfree M.B., Mardis E.R., RA Wilson R.K.; RT "Genome analysis of the platypus reveals unique signatures of RT evolution."; RL Nature 453:175-183(2008). RN [2] {ECO:0000313|Ensembl:ENSOANP00000011145} RP IDENTIFICATION. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000011145}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOANP00000011145}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAPN01156741; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPN01156742; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPN01156743; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPN01156744; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPN01156745; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9258.ENSOANP00000011145; -. DR Ensembl; ENSOANT00000011147; ENSOANP00000011145; ENSOANG00000006996. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7CXX7; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002279; Chromosome 2. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002279}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002279}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 237 259 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 345 370 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 393 413 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 425 443 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 588 608 {ECO:0000256|SAM:Coils}. FT COILED 616 636 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 932 AA; 105767 MW; FD5128BA03E05380 CRC64; MDFSRLHMYT PPQCVPENTG YTYALSSSYS SEALNFETEH KLDPVFDSPR MSRRSLRLVT TANISNSGQT ENVHNSAENT TSLQDRTSKT VKQRRGVSKQ TVINNIPRKA ASNSSSFLSQ NNFTSNAIDV SLKSSVLDES LIREQTKVDH LWGLDDDTHF KGGSKTTVQE NGDLAVRETT MINGYICNDC SMLSERKDAL TTYSASHGPS SRIYSRDRSQ KSGATFHMNR ILRLAKYTAA TLTSLLFQLF QVVFLKLGYE SGSYKLKSFQ SNDYESNSYD LKSHELKAHS NYCGSVNVKE ILRDDGHLSV NGESLCDDCK GKKHLETHTS IHMQSSRSKR VARTIWHIFS YAGYFLMQTL QRIGVAGWYV SKKVLSFLWL AIVSPGKAAS GTFWWLGIGW YQFVALVSWL NVFLLTRCLR KVCKLFLLLI PLLLLLGLGL SLWDQGGFHS FLPVFNWKNM YRPQMVNDES RPFFKPQTDS SHLNQPSEGD TKLFDWHRMG EIERKMTFLS ERCHNYDEEY GKVTLLLQKL QARVDQMDDK SGSLTLIKNI VEEHLNEMKS AGTSDSKADY MAFYQKHELR ILKLEDLLGK ISEKSEVIQK ELEQAKSRTI SEGYEHQDLL SKVKHFEQEL AHLKSELLTW QGLKTSCEKI DTMHTRVDSQ VRETIKLMFS GDQQDGSLDW LLQWLSSKFV SKGDLQVLLQ DLELQILKNI SLHMSLTKET PTSETVITAV RSVGISGITE AQAHTIVNNA LKLYSQDKTG MVDFALESGG GSILSTRCSE TYETKTALIS LFGIPLWYFS QSPRVVIQPD IHPGNCWAFK GSQGYLVVRL SMMIYPTAFT LEHIPKTLSP TGNITSAPKV FSVYGLENEY QEEGLLLGQF TYDQAGESLQ MFQAAKKPEK AFQIVELRIS SNWGHPEYTC LYRFRVHGEP IK // ID F7DFA5_CALJA Unreviewed; 355 AA. AC F7DFA5; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000027113}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSCJAP00000027113}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000027113, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000027113, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000027113} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000027113}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01140041; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01140042; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01140043; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01140044; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000044146; -. DR Ensembl; ENSCJAT00000028665; ENSCJAP00000027113; ENSCJAG00000014725. DR Ensembl; ENSCJAT00000059284; ENSCJAP00000044146; ENSCJAG00000014725. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7DFA5; -. DR OMA; CVKLNIF; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008225; Chromosome 8. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}. FT COILED 97 117 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 355 AA; 40342 MW; BC60705ACA51C352 CRC64; MSRTIKVRMA AIFFSDCSEE ASGSGNALLA EHENPDVNGV TRSWKIILST MFTLTFLLVG LLSHQWLKET EDPQKSRRLY AIIAEYGSRI YKYQARLHMS KEQLELLKKE SQTLENNFRE ILFLTEQIDV LKALLRDMKD GMDNNHSWST HGDPAEDPDH TEEMSNLVDY VLKKLREDQV QMADYALKSA GASIIEAGTS ESYKNNKAKL YWHGIGFLNH EMPPDTILQP DVYPGKCWAF PGSQGHTLIK LARKIIPTAV TMEHISEKVS PSGNISSAPK EFSIYGITKK CEGEEIFLGQ FIYNKTGTTI QTFELQHAVS EYLLCVKLNI FSNWGHPNYT CLYRFRVHGI PGSHI // ID F7DFT8_CALJA Unreviewed; 272 AA. AC F7DFT8; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000027106}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSCJAP00000027106}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000027106, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000027106, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000027106} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000027106}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01140041; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01140042; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01140043; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01140044; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR Ensembl; ENSCJAT00000028658; ENSCJAP00000027106; ENSCJAG00000014725. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000008225; Chromosome 8. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}. SQ SEQUENCE 272 AA; 31041 MW; 2CE8FA8AB1AEE837 CRC64; MSKEQLELLK KESQTLENNF REILFLTEQI DVLKALLRDM KDGMDNNHSW STHGDPAEDP DHTEEMSNLV DYVLKKLRED QVQMADYALK SAGASIIEAG TSESYKNNKA KLYWHGIGFL NHEMPPDTIL QPDVYPGKCW AFPGSQGHTL IKLARKIIPT AVTMEHISEK VSPSGNISSA PKEFSIYVRS YLNLSFRLVI ERGRTKKCEG EEIFLGQFIY NKTGTTIQTF ELQHAVSEYL LCVKLNIFSN WGHPNYTCLY RFRVHGIPGS HI // ID F7DK05_XENTR Unreviewed; 1096 AA. AC F7DK05; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSXETP00000001147}; DE Flags: Fragment; GN Name=suco {ECO:0000313|Ensembl:ENSXETP00000001147}; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000001147, ECO:0000313|Proteomes:UP000008143}; RN [1] {ECO:0000313|Ensembl:ENSXETP00000001147} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20431018; DOI=10.1126/science.1183670; RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J., RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., RA Blitz I.L., Blumberg B., Dichmann D.S., Dubchak I., Amaya E., RA Detter J.C., Fletcher R., Gerhard D.S., Goodstein D., Graves T., RA Grigoriev I.V., Grimwood J., Kawashima T., Lindquist E., Lucas S.M., RA Mead P.E., Mitros T., Ogino H., Ohta Y., Poliakov A.V., Pollet N., RA Robert J., Salamov A., Sater A.K., Schmutz J., Terry A., Vize P.D., RA Warren W.C., Wells D., Wills A., Wilson R.K., Zimmerman L.B., RA Zorn A.M., Grainger R., Grammer T., Khokha M.K., Richardson P.M., RA Rokhsar D.S.; RT "The genome of the Western clawed frog Xenopus tropicalis."; RL Science 328:633-636(2010). RN [2] {ECO:0000313|Ensembl:ENSXETP00000001147} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSXETP00000001147}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAMC01030768; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8364.ENSXETP00000001147; -. DR PaxDb; F7DK05; -. DR Ensembl; ENSXETT00000001147; ENSXETP00000001147; ENSXETG00000000531. DR Xenbase; XB-GENE-965479; suco. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F7DK05; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000008143; Unassembled WGS sequence. DR Bgee; F7DK05; -. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008143}; KW Reference proteome {ECO:0000313|Proteomes:UP000008143}. FT COILED 778 798 {ECO:0000256|SAM:Coils}. FT COILED 1034 1054 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSXETP00000001147}. SQ SEQUENCE 1096 AA; 122592 MW; 0BEE63C58F19693F CRC64; VENSSGALPV ASFSEAGQSE SDCDIGGAVE VDPMSEPSFI KPPVRLVGHH TENISVQHSE RKATSQEQEE KVSKSPEAEQ EKPKLPLSSS EHTEKKKTDQ KEATGVDLVA PGGTKDIPTF DEWKKQVMEV EKEKSQSMHP SSNGGQHTVK VQKNRNNYAS VECGAKILSA NPEAKSTSAI LIENMDLYML NPCSTKIWFV IELCEPIQVK QLDIANYELF SSTPKDFLVS ISDRYPTNKW VKLGTFHARD ERNVQSFPLD EQMYAKYVKM FIKYIKVELI SHFGSEHFCP LSLIRVFGTS MVEEYEEIVD SQYSLERPEL YDEDEDYPID YNTRDDKSSK NLLGSATNAI LSMVNNLAAN MLGANTEDKS SEGKEQNGSK AENLTSEILD KDKAKPSEEM PTQIPTQITQ ETESLTAPTA HVVPLETTKE SPIVQLVQDE EEEQHGQSTV TLLESDDQEE ERTSWYAYET QIYCGELATI CCISSFSEYI YKWCSAIVAI YRQRSKTSES KRKEDSDEFT YQPLASQATV DASFGKTVSD PSESKMTGET DFILVNSGEI FSSNILNNNV ESIVLEPSQT QTISPSVLLH ITSSEKSALP TPVLETTKAE VGSDSKLMSS EALQAQLTDV KTSEVFAQVV TPTASLETVN IYDTVEQNIK DHADNINITT TEIFASQEST TAPIDSDLEE PQKPIVTQTI DSQTTADKKE EETVDDSVTG ALQRSTTDFY AELQNSTDLG YANGNQVHGS NQKESVFMRL NNRIKSLEVN MSLSSRYLEE LSQRYRKQME EMQKAFNKTI LKLQNTSKIA EEQDQRQTDA IQLLQAQLLN MSQLVSNLSA SMVELKREVS DRQSYLVISL VLCVILGIIT CFQRCRGSSQ FSEDDLSNIP KSNNYPSPKR CFSSYDDMNI KRRTTFPLIR SHSFQLSSKE VDPDDLYIVE PLKFSPEKKK KRCKYKSEKI ETLKPIDPLK VLPNGDVKTK KPFTNERDFA NIGEVYHSSY KGPPSEGSSE TSSQSEESYF CGIGACTALC NGTLQKTKTE KRAIKRRRSK LQDQGKFIKS LIQTKSGSMP SLHDIIKTNK ELTVGTFGVT TVSGRV // ID F7DL51_MACMU Unreviewed; 2125 AA. AC F7DL51; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000004174}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSMMUP00000004174}; GN ORFNames=EGK_18090 {ECO:0000313|EMBL:EHH27804.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000004174, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000004174, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000004174, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|EMBL:EHH27804.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CR-5 {ECO:0000313|EMBL:EHH27804.1}; RX PubMed=22002653; DOI=10.1038/nbt.1992; RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., RA Li Q., Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., RA Huang Z., Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., RA Huang Y., Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., RA Li B., Liu X., Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., RA Li Y., Wang W., Katze M.G., Su B., Nielsen R., Yang H., Wang J., RA Wang X., Wang J.; RT "Genome sequencing and comparison of two nonhuman primate animal RT models, the cynomolgus and Chinese rhesus macaques."; RL Nat. Biotechnol. 29:1019-1023(2011). RN [3] {ECO:0000313|Ensembl:ENSMMUP00000004174} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000004174}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001259; EHH27804.1; -; Genomic_DNA. DR STRING; 9544.ENSMMUP00000004174; -. DR Ensembl; ENSMMUT00000004424; ENSMMUP00000004174; ENSMMUG00000003120. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR OMA; NRQCIEG; -. DR TreeFam; TF323674; -. DR Proteomes; UP000006718; Chromosome 7. DR ExpressionAtlas; F7DL51; baseline. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 760 780 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2125 AA; 236632 MW; 3C8BF98D6E50E4CC CRC64; MCPVNKGDDK KKKDTNKDEE ECNEPKGDPE MAPIYLKRLL PVFAQTFQQT MLPSIRKASL ALIRKMIHFC SEALLKEVCD SDVGHNLPTI LVEITATVLD QEDDDDGHLL ALQIIRDLVD KGGDIFLDQL ARLGVISKVS TLAGPSSDDE NEEESKPEKE DEPQEDAKEL QQGKPYHWRD WSIIRGRDCL YIWSDAAALE LSNGSNGWFR FILDGKLATM YSSGSPEGGS DSSESRSEFL EKLQRARGQV KPSTSSQPIL SAPGPTKLTV GNWSLTCLKE GEIAIHNSDG QQATILKEDL PGFVFESNRG TKHSFTAETS LGSEFVTGWT GKRGRKLKSK LEKTKQKVRT MARDLYDDHF KAVESMPRGV VVTLRNIATQ LESSWELHTN RQCIESENTW RDLMKTALEN LIVLLKDENT ISPYEMCSSG LVQALLTVLN NSMDLDMKQD CSQLVERINV FKTAFSENED DESRPAVALI RKLIAVLESI ERLPLHLYDT PGSTYNLQIL TRRLRFRLER APGETALIDR TGRMLKMEPL ATVESLEQYL LKMVAKQWYD FDRSSFVFVR KLREGQNFIF RHQHDFDENG IIYWIGTNAK TAYEWVNPAA YGLVVVTSSE GRNLPYGRLE DILSRDNSAL NCHSNDDKNA WFAIDLGLWV IPSAYTLRHA RGYGRSALRN WVFQVSKDGQ NWTSLYTHVD DCSLNEPGST ATWPLDPPKD EKQGWRHVRI KQMGKNASGQ THYLSLSGFE LYGTVNGVCE DQLGKAAKEA EANLRRQRRL VRSQVLKYMV PGARVIRGLD WKWRDQDGSP QGEGTVTGEL HNGWIDVTWD AGGSNSYRMG AEGKFDLKLA PGYDPDTVAS PKPVSSTVSG TTQSWSSLVK NNCPDKTSAA AGSSSRKGSS SSVCSVASSS DISLGSTKTE RRSEIVMEHS IVSGADVHEP IVVLSSAENV PQTEVGSSSS ASTSTLTAET GSENAERKLG PDSSVRTPGE SSAISMGIVS VSSPDVSSVS ELTNKEAASQ RPLSSSASNR LSVSSLLAAG APMSSSASVP NLSSRETSSL ESFVRRVANI ARTNATNNMN LSRSSSDNNT NTLGRNVMST ATSPLMGAQS FPNLTTPGTT STVTMSTSSV TSSSNVATAT TVLSVGQSLS NTLTTSLTST SSESDTGQEA EYSLYDFLDS CRASTLLAEL DDDEDLPEPD EEDDENEDDN QEDQEYEEVM ILRRPSLQRR AGSRSDVTHH AVTSQLPQVP AGAGSRPIGE QEEEEYETKG GRRRTWDDDY VLKRQFSALV PAFDPRPGRT NVQQTTDLEI PPPGTPHSEL LEEVECTPSP RLALTLKVTG LGTTREVELP LTNFRSTIFY YVQKLLQLSC NGNVKSDKLR RIWEPTYTIM YREMKDSDKE KENGKMGCWS IEHVEQYLGT DELPKNDLIT YLQKNADAAF LRHWKLTGTN KSIRKNRNCS QLIAAYKDFC EHGTKSGLNQ GAISTLQSSD ILNLTKEQPQ AKAGNGQNSC GVEDVLQLLR ILYIVASDPY SRISQEDGDE QPQFTFPPDE FTSKKITTKI LQQIEEPLAL ASGALPDWCE QLTSKCPFLI PFETRQLYFT CTAFGASRAI VWLQNRREAT VERTRTTSSV RRDDPGEFRV GRLKHERVKV PRGESLMEWA ENVMQIHADR KSVLEVEFLG EEGTGLGPTL EFYALVAAEF QRTDLGAWLC DDNFPDDESR HVDLGGGLKP PGYYVQRSCG LFTAPFPQDS DELERITKLF HFLGIFLAKC IQDNRLVDLP ISKPFFKLMC MGDIKSNMSK LIYESRGDRD LHCTESQSEA STEEGHDSLS VGSFEEDSKS EFILDPPKPK PPAWFNGILT WEDFELVNPH RARFLKEIKD LAIKRRQILS NKGLSEDEKN TKLQELVLKN PSGSGPPLSI EDLGLNFQFC PSSRIYGFTA VDLKPSGEDE MITMDNAEEY VDLMFDFCMH TGIQKQMEAF RDGFNKVFPM EKLSSFSHEE VQMILCGNQS PSWAAEDIIN YTEPKLGYTR DSPGFLRFVR VLCGMSSDER KAFLQFTTGC STLPPGGLAN LHPRLTVVRK VDATDASYPS VNTCVHYLKL PEYSSEEIMR ERLLAATMEK GFHLN // ID F7DS70_MONDO Unreviewed; 1269 AA. AC F7DS70; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMODP00000004918}; GN Name=SUCO {ECO:0000313|Ensembl:ENSMODP00000004918}; OS Monodelphis domestica (Gray short-tailed opossum). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Didelphimorphia; Didelphidae; Monodelphis. OX NCBI_TaxID=13616 {ECO:0000313|Ensembl:ENSMODP00000004918, ECO:0000313|Proteomes:UP000002280}; RN [1] {ECO:0000313|Ensembl:ENSMODP00000004918, ECO:0000313|Proteomes:UP000002280} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=17495919; DOI=10.1038/nature05805; RA Mikkelsen T.S., Wakefield M.J., Aken B., Amemiya C.T., Chang J.L., RA Duke S., Garber M., Gentles A.J., Goodstadt L., Heger A., Jurka J., RA Kamal M., Mauceli E., Searle S.M., Sharpe T., Baker M.L., Batzer M.A., RA Benos P.V., Belov K., Clamp M., Cook A., Cuff J., Das R., Davidow L., RA Deakin J.E., Fazzari M.J., Glass J.L., Grabherr M., Greally J.M., RA Gu W., Hore T.A., Huttley G.A., Kleber M., Jirtle R.L., Koina E., RA Lee J.T., Mahony S., Marra M.A., Miller R.D., Nicholls R.D., Oda M., RA Papenfuss A.T., Parra Z.E., Pollock D.D., Ray D.A., Schein J.E., RA Speed T.P., Thompson K., VandeBerg J.L., Wade C.M., Walker J.A., RA Waters P.D., Webber C., Weidman J.R., Xie X., Zody M.C., Baldwin J., RA Abdouelleil A., Abdulkadir J., Abebe A., Abera B., Abreu J., RA Acer S.C., Aftuck L., Alexander A., An P., Anderson E., Anderson S., RA Arachi H., Azer M., Bachantsang P., Barry A., Bayul T., Berlin A., RA Bessette D., Bloom T., Bloom T., Boguslavskiy L., Bonnet C., RA Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S., RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., RA Costello M., D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., RA Dhargay N., Dooley K., Dooley E., Doricent M., Dorje P., Dorjee K., RA Dupes A., Elong R., Falk J., Farina A., Faro S., Ferguson D., RA Fisher S., Foley C.D., Franke A., Friedrich D., Gadbois L., Gearin G., RA Gearin C.R., Giannoukos G., Goode T., Graham J., Grandbois E., RA Grewal S., Gyaltsen K., Hafez N., Hagos B., Hall J., Henson C., RA Hollinger A., Honan T., Huard M.D., Hughes L., Hurhula B., Husby M.E., RA Kamat A., Kanga B., Kashin S., Khazanovich D., Kisner P., Lance K., RA Lara M., Lee W., Lennon N., Letendre F., LeVine R., Lipovsky A., RA Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y., Lubonja R., RA Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C., RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O., RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., RA Mulrain L., Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., RA Nicol R., Norbu C., Norbu N., Novod N., O'Neill B., Osman S., RA Markiewicz E., Oyono O.L., Patti C., Phunkhang P., Pierre F., RA Priest M., Raghuraman S., Rege F., Reyes R., Rise C., Rogov P., RA Ross K., Ryan E., Settipalli S., Shea T., Sherpa N., Shi L., Shih D., RA Sparrow T., Spaulding J., Stalker J., Stange-Thomann N., RA Stavropoulos S., Stone C., Strader C., Tesfaye S., Thomson T., RA Thoulutsang Y., Thoulutsang D., Topham K., Topping I., Tsamla T., RA Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M., Wilkinson J., RA Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D., Zimmer A., RA Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J., Chin C., RA Gnerre S., MacCallum I., Graves J.A., Ponting C.P., Breen M., RA Samollow P.B., Lander E.S., Lindblad-Toh K.; RT "Genome of the marsupial Monodelphis domestica reveals innovation in RT non-coding sequences."; RL Nature 447:167-177(2007). RN [2] {ECO:0000313|Ensembl:ENSMODP00000004918} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMODP00000004918}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_007480855.1; XM_007480793.1. DR STRING; 13616.ENSMODP00000004918; -. DR Ensembl; ENSMODT00000005022; ENSMODP00000004918; ENSMODG00000003997. DR GeneID; 100020559; -. DR CTD; 51430; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F7DS70; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000002280; Chromosome 2. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002280}; KW Reference proteome {ECO:0000313|Proteomes:UP000002280}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1269 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003356524. FT COILED 951 971 {ECO:0000256|SAM:Coils}. FT COILED 1001 1021 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1269 AA; 141707 MW; F3BFA041204E4B03 CRC64; MKKYRRALAL VSCLFLCSLV WFPSWRVCCK ESSSPSSYYS QNDNCVLENE DEQFQKKEET DKSINAELFE NIDSTMPSPP EYNTLVDDYN IEEQDTQSRI LSSPVLETLP SVDINGDSSS IATNIENVEN ISTSSTSEIT PVSKPNEIEN SSADIPLASL SESEQTETDC NIGGSLYSDP QVEKHGALGL DQHVEHSAFV GPPESLVGQH IENTSSSQDK GMVTKSEFES VSTSKQDANH QKSALNTSDN LKEGLDYNKN REIDPTSVVS PKDPGDIPTF DEWKKKVMEV EKEKSQSMHP ASNGGQHSTK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILIENMDLYM LNPCSTKIWF VIELCEPIQV KQFDIANYEL FSSTPKDFLV SISDRYPTNK WIKLGTFHGR DERTVQSFPL DEQMYAKYVK MFIKYIKVEL LSHFGSEHFC PLSLIRVFGT SMVEEYEEIA DSQYQLERQE LFDEDYDYPL DYSTGEDKSS KNLLGSATNA ILNMVNIAAN ILGAKTEDLA ETEGNKSLSK SSIPATAAPQ MPDPEPSPIS SSPEFITAEG HLPDIEPPTL DFSKESPIVQ LVQEEEEEPS PSTVTLLGND EQEEESPAWS DLETQAYCSE LATACCVSSF SEYLLKWCSV RVALHRLRSR TLGSRETCYP VTAQPPLLLT TETIDASVSQ PLSEELDGKM MEREGETPVA HNVSSRFHDD LINHTKDAIE LEPSHPPTVS QSDLLDVTPE IKSLSKVEVP ELVKHEVGQT VSQLFPQESI VEVSTEMEKK SETNVAIEKH SVIYETNAVG EVKDSSIRDD TNSIQVILKP SESVLPLEHM ASVADSEDEE AKVTITDTHK HIPPSDVESS PASDIREEEQ AAEDALLAIP VHGGLQRTAP DFYAELQNST ELGYANGNLV HGSNQKESVF MRLNNRIKAL EVNMSLSGRY LEELSQRYRK QMEEMQKAFN KTIIKLQNTS RIAEEQDQRQ TEAIQLLQAQ LTNMTQLVSN LSTAVADLKR EVSDRQSYLV ISLVLCVILG LMLCMQRCRN TSQFDGDYIS RLPKNNHYPS PKRCFSSYDD LNLKRRTSFP LIRSKSLQLT GREVDPDDLY IVEPLKFSPE KKKKRCKYKT EKIETVKPSD TSQPIANGDI KGRKPFTNQR DFSNMGEVYH SSYKGPPSEG SSETSSQSEE SYFCGISACT SLCNGQTQKT KTEKRALKRR RSRVQDQGKL IKTLIQTKSG SMPSLHDLIK GNKEITVGTF GVTAVSGHI // ID F7E8N4_XENTR Unreviewed; 352 AA. AC F7E8N4; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSXETP00000063579}; GN Name=sun3 {ECO:0000313|Ensembl:ENSXETP00000063579}; OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Amphibia; Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; OC Silurana. OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000063579, ECO:0000313|Proteomes:UP000008143}; RN [1] {ECO:0000313|Ensembl:ENSXETP00000063579} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20431018; DOI=10.1126/science.1183670; RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J., RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., RA Blitz I.L., Blumberg B., Dichmann D.S., Dubchak I., Amaya E., RA Detter J.C., Fletcher R., Gerhard D.S., Goodstein D., Graves T., RA Grigoriev I.V., Grimwood J., Kawashima T., Lindquist E., Lucas S.M., RA Mead P.E., Mitros T., Ogino H., Ohta Y., Poliakov A.V., Pollet N., RA Robert J., Salamov A., Sater A.K., Schmutz J., Terry A., Vize P.D., RA Warren W.C., Wells D., Wills A., Wilson R.K., Zimmerman L.B., RA Zorn A.M., Grainger R., Grammer T., Khokha M.K., Richardson P.M., RA Rokhsar D.S.; RT "The genome of the Western clawed frog Xenopus tropicalis."; RL Science 328:633-636(2010). RN [2] {ECO:0000313|Ensembl:ENSXETP00000063579} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSXETP00000063579}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAMC01017613; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAMC01017614; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR Ensembl; ENSXETT00000066422; ENSXETP00000063579; ENSXETG00000012087. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000008143; Unassembled WGS sequence. DR ExpressionAtlas; F7E8N4; baseline. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008143}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008143}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 85 105 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 128 148 {ECO:0000256|SAM:Coils}. FT COILED 158 178 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 352 AA; 40735 MW; 5C100AC0A5114DB3 CRC64; MLRRSERNRI NLKSPSSESK ERSKATMSPK PTSGKEKLET AQPPITLQGT RRIKHSNPYV TKDIRKGESM EIEVASTSRN NNFDCLYEFV FLFAVFAFVL LLIYIRSQLI SCILLEQKMR QQNTDMTLKE ISRMKDRFQE ILNDVSEQKR TQMTKMMVQE IKNELKKWEE DNVQVKDYAL YSLGATIIKD KTSQSLKSDN LHWSFLGILS WPYTSCPEEI LKPDVYPGKC WTFPGSQGQV LIKLSAKIIP VAVTLQHISK TISPSKNYSS APRDFSVFVI NFLYNVEGNI LGQFAIGLNF FLIKISLPCL QNDDTSRFQF IQLRILSNWG NEKYTSVYRF QVHQELPVQL RS // ID F7FC31_MACMU Unreviewed; 1253 AA. AC F7FC31; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000009545}; GN Name=SUCO {ECO:0000313|Ensembl:ENSMMUP00000009545}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000009545, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000009545, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000009545, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|Ensembl:ENSMMUP00000009545} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000009545}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMMUP00000009545}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_001100732.1; XM_001100732.2. DR UniGene; Mmu.4574; -. DR STRING; 9544.ENSMMUP00000018073; -. DR Ensembl; ENSMMUT00000010172; ENSMMUP00000009545; ENSMMUG00000007280. DR GeneID; 705776; -. DR KEGG; mcc:705776; -. DR CTD; 51430; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR NextBio; 19980240; -. DR Proteomes; UP000006718; Chromosome 1. DR ExpressionAtlas; F7FC31; baseline. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1253 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003351978. FT COILED 935 955 {ECO:0000256|SAM:Coils}. FT COILED 985 1005 {ECO:0000256|SAM:Coils}. FT COILED 1191 1211 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1253 AA; 139237 MW; 280A908330052300 CRC64; MKKHRRALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFPKKD EREGPINAES LGKSGSNLPV SPEEHKLKDD SIVDVQNTES KKLSPPVVET LPTVDLHEES SNAVVDSETV ENISSSSTSE ITPISKLDEI EKSGTIPIAK PSETEQSETD CDVGEALDAS APIEQPSFVS PPDSLVGQHI ENVSSSHGKG KITKSEFESK VSASEQDGGD QKSALNASDN VKNESSDYTK PGDIDPTSVT SPKDPEDIPT FDEWKKKVME VEKEKSQSMH PSSNGGSHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LVSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYHSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKSISEN ATATAAPKMP ESTPVSTPVP SPAYVTTEVD TNDMELSTPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWFE SETQIFCSEL TTICCISSFS EYIYKWCSVR VALYWQRSRT ALSKGKDYLV SAQPPLLPAE SVDISVLQPL SGELENKNIE REAETVVLGD LSSSMHQDDL VNHTVDAVEL EPSHSQTLSQ SLLLDITPEI NPLPKIEVSE SVEYEAGHIT SQVIPQESSV EIDNEAEQKS ESFSSIEKPS VTYETNKVNE VVDNIIKEDV NSMQIFTKLS ETIVPPINTA TVPDNEDGEA KMNVADTAKQ TLISVVDSSS FPEVKEEEQS PEDALLRGLQ RTATDFYAEL QNSTDLGYAN GNLVHGSNQK ESVFMRLNNR IKALEVNMSL SGRYLEELSQ RYRKQMEEMQ KAFNKTIVKL QNTSRIAEEQ DQRQTEAIQL LQAQLTNMTQ LVSNLSATVA ELKREVSDRQ SYLVISLVLC VVLGLMLCMQ RCRNTSQFDG DYISKLPKSN QYPSPKRCFS SYDDMNLKRR TSFPLMRSKS LQLTGKEVDP NDLYIVEPLK FSPEKKKKRC KYKIEKIETI KPAEPLHPIA NGDIKGRKPF TNQRDFSNIG EVYHSSYKGP PSEGSSETSS QSEESYFCGI SACTSLCNGQ SQKTKTEKRA LKRRRSKVQD QGKLIKTLIQ TKSGSLPSLH DIIKGNKEIT VGTFGVTAVS GHI // ID F7FC46_MACMU Unreviewed; 720 AA. AC F7FC46; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000009544}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSMMUP00000009544}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000009544, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000009544, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000009544, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|Ensembl:ENSMMUP00000009544} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000009544}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMMUP00000009544}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9544.ENSMMUP00000009544; -. DR Ensembl; ENSMMUT00000010171; ENSMMUP00000009544; ENSMMUG00000007284. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7FC46; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000006718; Chromosome 10. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 217 238 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 276 296 {ECO:0000256|SAM:Coils}. FT COILED 355 375 {ECO:0000256|SAM:Coils}. FT COILED 377 404 {ECO:0000256|SAM:Coils}. FT COILED 407 434 {ECO:0000256|SAM:Coils}. FT COILED 481 501 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 720 AA; 80364 MW; E0E0944A01BF59D3 CRC64; MSRRSQRLTR YSQGDDDGSS SSGGSSVAGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDAHTSYY SESLVRESYI GTAFPPRSAL EELHGDADWG EDLRVRRRRG TGGSESSRAS GLVGRKAAED FLGSSSGYSS EDDYMGYSDA DQQSSGSRLW NAVSRAGSLL WMVATSPGRL FRLLYWWAGT TWYRLTTAAS LLDVFVLTRR FSSLKTFLWF LLPLLLLTCL TYGAWYFYPY GLQTFHPALV SWWAAKDSRR QDEGWESRDS SHFQAEQRVM SRVHSLERRL EALAAEFSSN WQKEAMRLER LELRQGAPGQ GGGGGLSHED TLALLEGLVS RHEAALKEDF RREAAARIQE ELSALRAEHQ QDSEDLFKKI VRASQESEAR IQQLKSEWQS MTQESFRESS VKELRRLEDQ LAGLQQELAA LALKQSLVAD EVGLLPQQIQ AVRDDVESQF PACISQFLAR GGGGRVGLLQ REEMQAQLRE LESKILTHVA EMQGKSAREA AASLGMTLQK EGVIGVTEEQ VHRIVKQALQ RYSEDRIGLA DYALESGGAS VISTRCSETY ETKTALLSLF GIPLWYHSQS PRVILQPDVH PGNCWAFQGP QGFAVVRLSA RIRPTAVTLE HVPKALSPNS TISSAPKDFA IFGGEQKLAQ ENMLLTLFPT LSLMLLIPFF STQAPSMATY QVVELRILTN WGHPEYTCIY RFRVHGEPAH // ID F7FC53_MACMU Unreviewed; 1401 AA. AC F7FC53; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000009543}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSMMUP00000009543}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000009543, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000009543, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000009543, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|Ensembl:ENSMMUP00000009543} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000009543}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMMUP00000009543}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9544.ENSMMUP00000018073; -. DR Ensembl; ENSMMUT00000010170; ENSMMUP00000009543; ENSMMUG00000007280. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000006718; Chromosome 1. DR ExpressionAtlas; F7FC53; baseline. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}. FT COILED 1083 1103 {ECO:0000256|SAM:Coils}. FT COILED 1133 1153 {ECO:0000256|SAM:Coils}. FT COILED 1339 1359 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMMUP00000009543}. SQ SEQUENCE 1401 AA; 156059 MW; E83CFBE29FD64F81 CRC64; ERFLARPFLS TNQHLARWGS PLIQGKVQLP SQQPRHSRPS HELCSKEEKS ATLPKLISLV VSSETTDFRN KTMDSRRDRE RRKRVLEGKL QLPRALARTQ RARDEGRTTG ARRPQQRRSP ESCEAPLSAR LWGPRRGLPG REPLRSRSAS ATAFRIIGPI LALLLRLLHL GFGSGGCRED VPPSDRGKKE EKMKKHRRAL ALVSCLFLCS LVWLPSWRVC CKESSSASAS SYYSQDDNCA LENEDVQFPK KNTESKKLSP PVVETLPTVD LHEESSNAVV DSETVENISS SSTSEITPIS KLDEIEKSGT IPIAKPSETE QSETDCDVGE ALDASAPIEQ PSFVSPPDSL VGQHIENVSS SHGKGKITKS EFESKVSASE QDGGDQKSAL NASDNVKNES SDYTKPGDID PTSVTSPKDP EDIPTFDEWK KKVMEVEKEK SQSMHPSSNG GSHATKKVQK NRNNYASVEC GAKILAANPE AKSTSAILIE NMDLYMLNPC STKIWFVIEL CEPIQVKQLD IANYELFSST PKDFLVSISD RYPTNKWIKL GTFHGRDERN VQSFPLDEQM YAKYVKVELV SHFGSEHFCP LSLIRVFGTS MVEEYEEIAD SQYHSERQEL FDEDYDYPLD YNTGEDKSSK NLLGSATNAI LNMVNIAANI LGAKTEDLTE GNKSISENAT ATAAPKMPES TPVSTPVPSP AYVTTEVDTN DMELSTPDTP KESPIVQLVQ EEEEEASPST VTLLGSGEQE DESSPWFESE TQIFCSELTT ICCISSFSEY IYKWCSVRVA LYWQRSRTAL SKGKDYLVSA QPPLLPAESV DISVLQPLSG ELENKNIERE AETVVLGDLS SSMHQDDLVN HTVDAVELEP SHSQTLSQSL LLDITPEINP LPKIEVSESV EYEAGHITSQ VIPQESSVEI DNEAEQKSES FSSIEKPSVT YETNKVNEVV DNIIKEDVNS MQIFTKLSET IVPPINTATV PDNEDGEAKM NVADTAKQTL ISVVDSSSFP EVKEEEQSPE DALLRGLQRT ATDFYAELQN STDLGYANGN LVHGSNQKES VFMRLNNRIK ALEVNMSLSG RYLEELSQRY RKQMEEMQKA FNKTIVKLQN TSRIAEEQDQ RQTEAIQLLQ AQLTNMTQLV SNLSATVAEL KREVSDRQSY LVISLVLCVV LGLMLCMQRC RNTSQFDGDY ISKLPKSNQY PSPKRCFSSY DDMNLKRRTS FPLMRSKSLQ LTGKEVDPND LYIVEPLKFS PEKKKKRCKY KIEKIETIKP AEPLHPIANG DIKGRKPFTN QRDFSNIGEV YHSSYKGPPS EGSSETSSQS EESYFCGISA CTSLCNGQSQ KTKTEKRALK RRRSKVQDQG KLIKTLIQTK SGSLPSLHDI IKGNKEITVG TFGVTAVSGH I // ID F7FQP1_ORNAN Unreviewed; 1243 AA. AC F7FQP1; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOANP00000026064}; GN Name=SUCO {ECO:0000313|Ensembl:ENSOANP00000026064}; OS Ornithorhynchus anatinus (Duckbill platypus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Monotremata; Ornithorhynchidae; Ornithorhynchus. OX NCBI_TaxID=9258 {ECO:0000313|Ensembl:ENSOANP00000026064, ECO:0000313|Proteomes:UP000002279}; RN [1] {ECO:0000313|Proteomes:UP000002279} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Glennie {ECO:0000313|Proteomes:UP000002279}; RX PubMed=18464734; DOI=10.1038/nature06936; RA Warren W.C., Hillier L.W., Marshall Graves J.A., Birney E., RA Ponting C.P., Grutzner F., Belov K., Miller W., Clarke L., RA Chinwalla A.T., Yang S.P., Heger A., Locke D.P., Miethke P., RA Waters P.D., Veyrunes F., Fulton L., Fulton B., Graves T., Wallis J., RA Puente X.S., Lopez-Otin C., Ordonez G.R., Eichler E.E., Chen L., RA Cheng Z., Deakin J.E., Alsop A., Thompson K., Kirby P., RA Papenfuss A.T., Wakefield M.J., Olender T., Lancet D., Huttley G.A., RA Smit A.F., Pask A., Temple-Smith P., Batzer M.A., Walker J.A., RA Konkel M.K., Harris R.S., Whittington C.M., Wong E.S., Gemmell N.J., RA Buschiazzo E., Vargas Jentzsch I.M., Merkel A., Schmitz J., Zemann A., RA Churakov G., Kriegs J.O., Brosius J., Murchison E.P., RA Sachidanandam R., Smith C., Hannon G.J., Tsend-Ayush E., McMillan D., RA Attenborough R., Rens W., Ferguson-Smith M., Lefevre C.M., Sharp J.A., RA Nicholas K.R., Ray D.A., Kube M., Reinhardt R., Pringle T.H., RA Taylor J., Jones R.C., Nixon B., Dacheux J.L., Niwa H., Sekita Y., RA Huang X., Stark A., Kheradpour P., Kellis M., Flicek P., Chen Y., RA Webber C., Hardison R., Nelson J., Hallsworth-Pepin K., Delehaunty K., RA Markovic C., Minx P., Feng Y., Kremitzki C., Mitreva M., Glasscock J., RA Wylie T., Wohldmann P., Thiru P., Nhan M.N., Pohl C.S., Smith S.M., RA Hou S., Nefedov M., de Jong P.J., Renfree M.B., Mardis E.R., RA Wilson R.K.; RT "Genome analysis of the platypus reveals unique signatures of RT evolution."; RL Nature 453:175-183(2008). RN [2] {ECO:0000313|Ensembl:ENSOANP00000026064} RP IDENTIFICATION. RC STRAIN=Glennie {ECO:0000313|Ensembl:ENSOANP00000026064}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOANP00000026064}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9258.ENSOANP00000026064; -. DR Ensembl; ENSOANT00000029867; ENSOANP00000026064; ENSOANG00000000909. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F7FQP1; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000002279; Unassembled WGS sequence. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002279}; KW Reference proteome {ECO:0000313|Proteomes:UP000002279}. FT COILED 925 945 {ECO:0000256|SAM:Coils}. FT COILED 975 995 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1243 AA; 137349 MW; 77BF8FF03D676F38 CRC64; MKLLALPAWN RNPRLPSWQV CCKESPSATS YYTQDDNCPN EEEPIQKKEE AGRLSESTLP KDSVSNTANP PGDHFGDDCV ADDQSTQSNL LNPSAVETFP SVDLSEDSSS IAADTESVEN ISSSSTSQMT PISQPNEIEN SGAEASVASS IDVEHSETDC DIGGTPETDP QVKSSSFVSP PESLVGQHIE NASSSHGKGK ITKSEFESRV SEGAQGAEDQ KSPLNTSDNL KRERANYKKA VEIDPTSVVS PKDPGDIPTF DEWKKKVMEV EKEKSQSMHP SSNGGQHSTK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILIENMDLYM LNPCSTKIWF VIELCEPIQV KQLDIANYEL FSSTPKDFLV SISDRYPTNK WIKLGTFHGR DERNVQSFPL DEQMYAKYVK MFIKYIKVEL VSHFGSEHFC PLSLIRVFGT SMVEEYEEIA DSQYQSERQE LFDEDYDYPL DYNTGEDKSS KNLLGSATNA ILNMVNIAAN ILGAKTEDLV EAEGNKSLSE NTTTIAAPNM PEPTPVPSPG FVTTEVHPPD TEPSTLEFPK ESPIVQLVQE EEEETSPSTV TLLGSDEQEE ESLAWFEFET QIYCSEMAAS CCLPSFSEYL FKWCSVAVAL HRQRSTSSKK REPPETPQPP LALPTESVNV SVTRPPPGES DSRVVAGESE APVVGGLRST VQEELANHTV DAMELEPSHP QTVSQSLPLE VTPEMKLFSQ GEGYKPTKHE VGQTASQVFP RERTTEASTE TERISESAVA EEKHSAVFET GTLSEARDSS LRVDVSSAPM VPRLPETIAQ ADYPVTASDV EGGEARATPA DTQKPVSTPV VESSPVTEVR DEEQAPEDAL LSLPASGGLQ RTATDFYAEL QNSTDLGYAN GNLVHGSNQK ESVFMRLNNR IKALEVNMSL SGRYLEELSQ RYRKQMEEMQ KAFNKTIIKL QNTSRIAEEQ DQRQTEAIQL LQAQLTNMTQ LVSNLSSTVS ELKREVSDRQ SYLVISLVLC VILGLMLCMQ RCRNSSHFDE DYNSGLPKSN NYPSPKRCFS SYDDMNLKRR TSFPLIRSKS FQLSGKEADP DDLYIVEPLK FSPEKKKKRC KYKAEKIETI KPADPLHPVA NGDIKGRKPF TNQRDFSNMG EVYHSSYKGP PSEGSSETSS QSEESYFCGI SACTSLCNGQ PQKTKTEKRA LKRRRSKVQD QGKLLKTLIQ TKSGSMPSLH DIIKGNKEIT VGTFGVTAVS GHV // ID F7H0K0_MACMU Unreviewed; 438 AA. AC F7H0K0; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000027580}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSMMUP00000027580}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000027580, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000027580, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000027580, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|Ensembl:ENSMMUP00000027580} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000027580}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMMUP00000027580}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_001098382.1; XM_001098382.2. DR UniGene; Mmu.27419; -. DR STRING; 9544.ENSMMUP00000027580; -. DR Ensembl; ENSMMUT00000029475; ENSMMUP00000027580; ENSMMUG00000020959. DR GeneID; 704522; -. DR KEGG; mcc:704522; -. DR CTD; 6676; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7H0K0; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 19973576; -. DR Proteomes; UP000006718; Chromosome 10. DR ExpressionAtlas; F7H0K0; baseline. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 135 158 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 438 AA; 47954 MW; EE87BEC9299CA510 CRC64; MRRSSRPGSA SSSRKHTPNF FSENSSVSIT SEDSNGLRSA GPGPGDPEGR GARGPSCGEP ALSAGVPGGT TWAGSSRQKP APRSHNWQPA CGAATVRGGA SEPTGSPAVS EEPLDLLPTL DLRQEMPPSR VFKSFLSLLF QVLSVLLSLA GDVLVSMYRE VCSIRFLFTA VSLLSLFLAA IWLGLLYLVS PLENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL QKTSHDYADR NTAYFWNRFS FWNYARPPTV ILEPHVFPGN CWAFEGDQGQ VVIQLPGRVQ LSDITLQHPP PSVEHTGGAN SAPRDFAVFG LQVDDETEVF LGKFTFDVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP RFTCLYRVRA HGVRTSEGAE GSATGGPH // ID F7HGW8_CALJA Unreviewed; 437 AA. AC F7HGW8; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000032677}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSCJAP00000032677}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000032677, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000032677, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000032677} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000032677}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01105663; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000032677; -. DR Ensembl; ENSCJAT00000034536; ENSCJAP00000032677; ENSCJAG00000017715. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7HGW8; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008225; Chromosome 5. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 437 AA; 47932 MW; 9C61DFA0B2F23DFF CRC64; MRRSSRPGSA SSPRKHTPNF FSENSSMSVT SEDSNGRRSA GPEPGEPEGR IAQGRSCGEP ALSAGVPGGT TWAGSSQQKP APRSHNAETA CGAATVRGGA SEPTGSPVVS EEPLALLPTL DLRQEMPPPR LSKSFLSLLF QVLRVSLSLA GNALVSVYRE VCSIRFLFTA VSLLSLFLAV IWLGLLYLVS PSENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQRLS EDFVRKPDYA LSSVGASIDL EKTSHDYADR NTAYFWNRFR FWNYARPPTV ILEPDVSPGN CWAFEGDQGH VVIRLPSRVQ LSDITLQHPP PSVAHTGGAD SAPRDFAVFG LQVDDETEVF LGKFTFNVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP RFTCLYRVRA HGVRTSEGAE GSAQGPH // ID F7HLL7_MACMU Unreviewed; 809 AA. AC F7HLL7; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMMUP00000022451}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSMMUP00000022451}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000022451, ECO:0000313|Proteomes:UP000006718}; RN [1] {ECO:0000313|Ensembl:ENSMMUP00000022451, ECO:0000313|Proteomes:UP000006718} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000022451, RC ECO:0000313|Proteomes:UP000006718}; RX PubMed=17431167; DOI=10.1126/science.1139247; RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M., RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., RA Wilson R.K., Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., RA Hardison R.C., Makova K.D., Miller W., Milosavljevic A., Palermo R.E., RA Siepel A., Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J., RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., RA Dinh H.H., Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., RA Godfrey J., Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., RA Jhangiani S.N., Joshi V., Khan Z.M., Kirkness E.F., Cree A., RA Fowler R.G., Lee S., Lewis L.R., Li Z., Liu Y.-S., Moore S.M., RA Muzny D., Nazareth L.V., Ngo D.N., Okwuonu G.O., Pai G., Parker D., RA Paul H.A., Pfannkoch C., Pohl C.S., Rogers Y.-H.C., Ruiz S.J., RA Sabo A., Santibanez J., Schneider B.W., Smith S.M., Sodergren E., RA Svatek A.F., Utterback T.R., Vattathil S., Warren W., White C.S., RA Chinwalla A.T., Feng Y., Halpern A.L., Hillier L.W., Huang X., RA Minx P., Nelson J.O., Pepin K.H., Qin X., Sutton G.G., Venter E., RA Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P., Jones S.M., RA Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L., Csuros M., RA Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H., Liu Y., RA Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E., RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J., RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J., RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A., RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., RA Denby A., Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., RA Marklein A., Nielsen R., Vallender E.J., Clark A.G., Ferguson B., RA Hernandez R.D., Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., RA Pu L.-L., Ren Y., Smith D.G., Wheeler D.A., Schenck I., Ball E.V., RA Chen R., Cooper D.N., Giardine B., Hsu F., Kent W.J., Lesk A., RA Nelson D.L., O'brien W.E., Pruefer K., Stenson P.D., Wallace J.C., RA Ke H., Liu X.-M., Wang P., Xiang A.P., Yang F., Barber G.P., RA Haussler D., Karolchik D., Kern A.D., Kuhn R.M., Smith K.E., RA Zwieg A.S.; RT "Evolutionary and biomedical insights from the rhesus macaque RT genome."; RL Science 316:222-234(2007). RN [2] {ECO:0000313|Ensembl:ENSMMUP00000022451} RP IDENTIFICATION. RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000022451}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMMUP00000022451}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9544.ENSMMUP00000022451; -. DR Ensembl; ENSMMUT00000023986; ENSMMUP00000022451; ENSMMUG00000017056. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7HLL7; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000006718; Chromosome 3. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006718}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006718}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 284 307 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 314 333 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 408 428 {ECO:0000256|SAM:Coils}. FT COILED 453 487 {ECO:0000256|SAM:Coils}. FT COILED 500 520 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 809 AA; 89854 MW; 3571C5F4A25F286D CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADGGASSA VSLKNRAART AKQRRSTNKS AFSINHVSRQ VTSFGVSHSG TDSLQDAVTR QPPVLDESWI REQTTVDHFW GLDDDGDLKG GNKAAIQGNG DLGAAAATAH NGFSCSNCSM LSERKDVLTA HPAAPGPVSR VYSRDRNQKR DDCKGKRHLD AYRAGTLRHI WACAGYFLLQ TLRRIGAAGR AVSRTAWSAL WLAVVAPGKA ASGVFWWLGI GWYQFVTLIS WLNVFLLTRC LRNICKLLVL LVPLLLLLAG LSLRGQGDFF SFLPVLNWAS THRTQRVDDP QDVFKPATSR LNQPLQGDNE AFPWHWMSGM EQQVTSLSGQ CHHHGENLRE LTTLLQKLQA RVDQMDNGAA GPSTSVRDAV GQPLKETDFM AFHQEHEVRI SHLEDILGKL REKSEAIQKE LEQTKQKTVS AVGEQLLPTV EHLQLELDQL KSELSSWRHM KTGCETVDAL QERVDVQVRE TVKLLFSEDQ QGGSLEQLLQ RFSSQCVSRG DLHSKRDLEL QILRNVINDI SVTKRLPASE VVVSAVSEAG ASGITEAQAR AIVNNALKLY SQDKTGMVDF ALESGGGSIL STRCSETYET KTALMSLFGI PLWYFSQSPR VVIQPDIYPG NCWAFKGSQG YLVVRLSMMI HPAAFTLEHI PKTLSPTGNI SSAPKDFAVY GLENEYQEEG QLLGQFTYDQ DGESLQMFQA LKTPDDRVFQ IVELRIFSNW GHPEYTCLYR FRVHGEPVK // ID F7HS09_CALJA Unreviewed; 1375 AA. AC F7HS09; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000017462}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSCJAP00000017462}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000017462, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000017462, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000017462} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000017462}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01045002; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01045003; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01045004; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000017399; -. DR Ensembl; ENSCJAT00000018468; ENSCJAP00000017462; ENSCJAG00000009416. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000008225; Chromosome 18. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}. FT COILED 1057 1077 {ECO:0000256|SAM:Coils}. FT COILED 1107 1127 {ECO:0000256|SAM:Coils}. FT COILED 1313 1333 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSCJAP00000017462}. SQ SEQUENCE 1375 AA; 153030 MW; 2F8F352B2F04DA77 CRC64; LVRLPSQPSF RHSRLFHELC SKEENSATMP KLISLAVSSE IIVFPNKTMG SRRDRERENR VLEGKLPLPK GLARTQRARA DGRAWTSRRP QERQSPESFC EASLSAPLWG PRRGRPGREL LKSRSASATA LRTLRPILAL LLRLLHLGLG SGGCREDVPP SGRGKKEEKM KKHRQALALV SCLFLCSLVW LPSWRVCCKE SSSASASSYY SQDDNCALQN EDVQFQKKNT ESKKLSPPVV EILPTVDLHE DSSSVVVDSE TVENISSSST SEITPISKLD EIEKSGTIPV AKPSETEQSE TDCDVGEANA PIEQPSFVNP PDSLVGQHIE NVSSSHGKGK ITKSEFESKV SASEQGGGDP KSGLNASDNL KNESSDYTKP GEIDPTSVAS SKDPEDIPTF DEWKKKVMEV EKEKSQSMHS SSNGGSHATK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILIENMDLYM LNPCSTKIWF VIELCEPIQV KQLDIANYEL FSSTPKDFLV SISDRYPTNK WIKLGTFHGR DERNVQSFPL DEQMYAKYVK VELVSHFGSE HFCPLSLIRV FGTSMVEEYE EIADSQYHSE RQELFDEDYD YPLDYNTGED KSSKNLLGSA TNAILNMVNI AANILGAKTE ELTEGNKSIS ENATATAAPK MPESTPVSTP VPSPEDVTTE VHTHDMEPSR PDTPKESPIV QLVQEEEEEA SPSTVTLLGS GEQEDESSPW FESETQIFCS ELTTICCISS FSEYIYKWCS VGVALYRQRS RTAWSKGKDY LVSAEPPLLL PAESVDVSVL QPLSGELENK NIEREAETVV LGDLSSSMHQ DDLVNYTVDA IELEPSHSQT LSQSFLLDIT PEINLLPKIE VSESVKYEAG HIPSQVIPQE SSVEIDNEME QKSESFSSIE KPSIAYETNK VNEVTDNIVK QDVNSMQIFT KLSETIVPPI NAAVPDNEDG EAKMNIADTA KQTLTSVVDS SSLPEVKEEE QSPEDALLRG FQRTATDFYA ELQNSTDLGY ANGNLVHGSN QKESVFMRLN NRIKALEVNM SLSGRYLEEL SQRYRKQMEE MQKAFNKTIV KLQNTSRIAE EQDQRQTEAI QLLQAQLSNM TQLVSNLSTT VAELKREVSD RQSYLLISLV LCVILGLMLC MQRCRNTSQF DGDYISKLPK SNQYPSPKRC FSSYDDMNLK RRTSFPLIRS KSLQLTGKEV DPNDLYIVEP LKFSPEKKKK RCKYKIEKIE TIKPAEPLHP IANGDIKGRK PFTNQRDFSN MGEVYHSSYK GPPSEGSSET SSQSEESYFC GISACTSLCN GQSQKTKTEK RALKRRRSKV QDQGKLIKTL IQTKSGSLPS LHDIIKGNKE ITVGTFGVTA VSGHI // ID F7HW14_CALJA Unreviewed; 1250 AA. AC F7HW14; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000017446}; GN Name=SUCO {ECO:0000313|Ensembl:ENSCJAP00000017446}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000017446, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000017446, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000017446} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000017446}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01045002; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01045003; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01045004; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000017399; -. DR Ensembl; ENSCJAT00000018451; ENSCJAP00000017446; ENSCJAG00000009416. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR OrthoDB; EOG7MPRDC; -. DR Proteomes; UP000008225; Chromosome 18. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1250 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003361728. FT COILED 932 952 {ECO:0000256|SAM:Coils}. FT COILED 982 1002 {ECO:0000256|SAM:Coils}. FT COILED 1188 1208 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1250 AA; 139109 MW; AECB148416163F18 CRC64; MKKHRQALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALQ NEDVQFQKKD EREGPINAEL LGKSGSNLPI SPEEYKSKDD SIMDVQNTES KKLSPPVVEI LPTVDLHEDS SSVVVDSETV ENISSSSTSE ITPISKLDEI EKSGTIPVAK PSETEQSETD CDVGEANAPI EQPSFVNPPD SLVGQHIENV SSSHGKGKIT KSEFESKVSA SEQGGGDPKS GLNASDNLKN ESSDYTKPGE IDPTSVASSK DPEDIPTFDE WKKKVMEVEK EKSQSMHSSS NGGSHATKKV QKNRNNYASV ECGAKILAAN PEAKSTSAIL IENMDLYMLN PCSTKIWFVI ELCEPIQVKQ LDIANYELFS STPKDFLVSI SDRYPTNKWI KLGTFHGRDE RNVQSFPLDE QMYAKYVKMF IKYIKVELVS HFGSEHFCPL SLIRVFGTSM VEEYEEIADS QYHSERQELF DEDYDYPLDY NTGEDKSSKN LLGSATNAIL NMVNIAANIL GAKTEELTEG NKSISENATA TAAPKMPEST PVSTPVPSPE DVTTEVHTHD MEPSRPDTPK ESPIVQLVQE EEEEASPSTV TLLGSGEQED ESSPWFESET QIFCSELTTI CCISSFSEYI YKWCSVGVAL YRQRSRTAWS KGKDYLVSAE PPLLLPAESV DVSVLQPLSG ELENKNIERE AETVVLGDLS SSMHQDDLVN YTVDAIELEP SHSQTLSQSF LLDITPEINL LPKIEVSESV KYEAGHIPSQ VIPQESSVEI DNEMEQKSES FSSIEKPSIA YETNKVNEVT DNIVKQDVNS MQIFTKLSET IVPPINAAVP DNEDGEAKMN IADTAKQTLT SVVDSSSLPE VKEEEQSPED ALLRGFQRTA TDFYAELQNS TDLGYANGNL VHGSNQKESV FMRLNNRIKA LEVNMSLSGR YLEELSQRYR KQMEEMQKAF NKTIVKLQNT SRIAEEQDQR QTEAIQLLQA QLSNMTQLVS NLSTTVAELK REVSDRQSYL LISLVLCVIL GLMLCMQRCR NTSQFDGDYI SKLPKSNQYP SPKRCFSSYD DMNLKRRTSF PLIRSKSLQL TGKEVDPNDL YIVEPLKFSP EKKKKRCKYK IEKIETIKPA EPLHPIANGD IKGRKPFTNQ RDFSNMGEVY HSSYKGPPSE GSSETSSQSE ESYFCGISAC TSLCNGQSQK TKTEKRALKR RRSKVQDQGK LIKTLIQTKS GSLPSLHDII KGNKEITVGT FGVTAVSGHI // ID F7HZ43_CALJA Unreviewed; 2062 AA. AC F7HZ43; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000010612}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSCJAP00000010612}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000010612, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000010612, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000010612} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000010612}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01030721; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030722; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030723; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030724; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030725; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030726; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030727; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030728; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000010594; -. DR Ensembl; ENSCJAT00000011211; ENSCJAP00000010612; ENSCJAG00000005631. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR Proteomes; UP000008225; Chromosome 10. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 697 717 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2062 AA; 229450 MW; AABDAAA7359CDE72 CRC64; MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGS TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNVSLS SSKEQKYNEL MKERINVFKT AFSENEDDES RPAVALIRKL IAVLESIERL PLHLYDTPGS TYNLQILTRR LRFRLERAPG ETALIDRTGR MLKMEPLATV ESLEQYLLKM VAKQWYDFDR SSFVFVRKLR EGQNFIFRHQ HDFDENGIIY WTGTNAKTAY EWVNPAAYGL VVVTSSEGRN LPYGRLEDIL SRDNSALNCH SNDDKNAWFA IDLGLWVIPS AYTLRHARGY GRSALRNWVF QVSKDGQNWT SLYTHVDDCS LNEPGSTATW PLDPPKDEKQ GWRHVRIKQM GKNASGQTHY LSLSGFELYG TVNGVCEDQL GKAAKEAEAN LRRQRRLVRS QVLKYMVPGA RVIRGLDWKW RDQDGSPQGE GTVTGELHNG WIDVTWDAGG SNSYRMGAEG KFDLKLAPGY DPDTVASPKP VSSTVSGTTQ SWSSLVKNNC PDKTSAAAGS SSRKGSSSSV CSVASSSDIS LGSTKTERRS EIVMEHSIVS GADVHEPIVV LSSAENVPQT EVGSSSSAST STLTAETGSE NAERKLGPDS SVRTPGESSA ISMGIVSVSS PDVSSVSELT NKEAASQRPL SSSASNRLSV SSLLAAGAPM SSSASVPNLS SRETSSLESF VRRVANIART NATNNMNLSR SSSDNNTNTL GRNVMSTATS PLMGAQSFPN LTTPGTTSTV TMSTSSVTSS SNVATATTVL SVGQSLSNTL TTSLTSTSSE SDTGQEAEYS LYDFLDSCRA STLLAELDDD EDLPEPDEED DENEDDNQED QEYEEVMILR RPSLQRRAGS RSDVTHHAVT SQLPQVPAGA GSRPIGEQEE EEYETKGGRR RTWDDDYVLK RQFSALVPAF DPRPGRTNVQ QTTDLEIPPP GTPHSELLEE VECTPSPRLA LTLKVTGLGT TREVELPLIN FRSTIFYYVQ KLLQLSCNGN VKSDKLRRIW EPTYTIMYRE MKDSDKEKEN GKMGCWSLEH VEQYLGTDEL PKNDLITYLQ KNADAAFLRH WKLTGTNKSI RKNRNCSQLI AAYKDFCEHG TKSGLNQGAI STLQSSDILN LTKEQPQAKA GNGQNSCGVE DVLQLLRILY IVASDPYSRI SQEDGDEQPQ FTFPPDEFTS KKITTKILQQ IEEPLALASG ALPDWCEQLT SKCPFLIPFE TRQLYFTCTA FGASRAIVWL QNRREATVER TRTTSSVRRD DPGEFRVGRL KHERVKVPRG ESLMEWAENV MQIHADRKSV LEVEFLGEEG TGLGPTLEFY ALVAAEFQRT DLGAWLCDDN FPDDESRHVD LGGGLKPPGY YVQRSCGLFT APFPQDSDEL ERITKLFHFL GIFLAKCIQD NRLVDLPISK PFFKLMCMGD IKSNMSKLIY ESRGDRDLHC TESQSEASTE EGHDSLSVGS FEEDSKSEFI LDPPKPKPPA WFNGILTWED FELVNPHRAR FLKEIKDLAI KRRQILSNKG LSEDEKNTKL QELVLKNPSG SGPPLSIEDL GLNFQFCPSS RIYGFTAVDL KPSGEDEMIT MDNAEEYVDL MFDFCMHTGI QKQMEAFRDG FNKVFPMEKL SSFSHEEVQM ILCGNQSPSW AAEDIINYTE PKLGYTRDSP GFLRFVRVLC GMSSDERKAF LQFTTGCSTL PPGGLANLHP RLTVVRKVDA TDASYPSVNT CVHYLKLPEY SSEEIMRERL LAATMEKGFH LN // ID F7HZ46_CALJA Unreviewed; 2610 AA. AC F7HZ46; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:JAB08907.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000010609}; GN Name=HECTD1 {ECO:0000313|EMBL:JAB08907.1, GN ECO:0000313|Ensembl:ENSCJAP00000010609}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000010609, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000010609, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000010609} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. RN [3] {ECO:0000313|EMBL:JAB08907.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Bladder {ECO:0000313|EMBL:JAB08907.1}, RC Cerebellum {ECO:0000313|EMBL:JAB52508.1}, RC Cerebral cortex {ECO:0000313|EMBL:JAB31885.1}, RC Hippocampus {ECO:0000313|EMBL:JAB15949.1}, and RC Skeletal muscle {ECO:0000313|EMBL:JAB41831.1}; RX PubMed=25243066; DOI=10.1186/2047-217X-3-14; RA Maudhoo M.D., Ren D., Gradnigo J.S., Gibbs R.M., Lubker A.C., RA Moriyama E.N., French J.A., Norgren R.B.Jr.; RT "De novo assembly of the common marmoset transcriptome from NextGen RT mRNA sequences."; RL Gigascience 3:14-14(2014). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01030721; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030722; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030723; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030724; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030725; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030726; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030727; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030728; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; GAMT01002954; JAB08907.1; -; mRNA. DR EMBL; GAMS01007187; JAB15949.1; -; mRNA. DR EMBL; GAMR01002047; JAB31885.1; -; mRNA. DR EMBL; GAMQ01000020; JAB41831.1; -; mRNA. DR EMBL; GAMP01000247; JAB52508.1; -; mRNA. DR RefSeq; XP_002753836.1; XM_002753790.2. DR RefSeq; XP_009004149.1; XM_009005901.1. DR RefSeq; XP_009004150.1; XM_009005902.1. DR RefSeq; XP_009004151.1; XM_009005903.1. DR ProteinModelPortal; F7HZ46; -. DR STRING; 9483.ENSCJAP00000010594; -. DR Ensembl; ENSCJAT00000011208; ENSCJAP00000010609; ENSCJAG00000005631. DR GeneID; 100385615; -. DR KEGG; cjc:100385615; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR KO; K12231; -. DR Proteomes; UP000008225; Chromosome 10. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 2: Evidence at transcript level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:JAB08907.1}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289358 MW; B3944569AD5071A7 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGS TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWT GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLINFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSLEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAIST LQSSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EDGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID F7I0M5_CALJA Unreviewed; 880 AA. AC F7I0M5; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000017427}; GN Name=SUCO {ECO:0000313|Ensembl:ENSCJAP00000017427}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000017427, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000017427, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000017427} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000017427}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01045002; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01045003; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01045004; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000017399; -. DR Ensembl; ENSCJAT00000018430; ENSCJAP00000017427; ENSCJAG00000009416. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000008225; Chromosome 18. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 880 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003355830. FT COILED 612 632 {ECO:0000256|SAM:Coils}. FT COILED 818 838 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 880 AA; 97976 MW; D6BCC154966A354F CRC64; MKKHRQALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALQ NEDVQFQKKN TESKKLSPPV VEILPTVDLH EDSSSVVVDS ETVENISSSS TSEITPISKL DEIEKSGTIP VAKPSETEQS ETDCDVGEAN APIEQPSFVN PPDSLVGQHI ENVSSSHGKG KITKSEFESK VSASEQGGGD PKSGLNASDN LKNESSDYTK PGEIDPTSVA SSKDPEDIPT FDEWKKKVME VEKEKSQSMH SSSNGGSHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LVSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYHSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEEL TEGNKSISEN ATATAAPKMP ESTPVSTPVP SPEDVTTEVH THDMEPSRPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWYR KQMEEMQKAF NKTIVKLQNT SRIAEEQDQR QTEAIQLLQA QLSNMTQLVS NLSTTVAELK REVSDRQSYL LISLVLCVIL GLMLCMQRCR NTSQFDGDYI SKLPKSNQYP SPKRCFSSYD DMNLKRRTSF PLIRSKSLQL TGKEVDPNDL YIVEPLKFSP EKKKKRCKYK IEKIETIKPA EPLHPIANGD IKGRKPFTNQ RDFSNMGEVY HSSYKGPPSE GSSETSSQSE ESYFCGISAC TSLCNGQSQK TKTEKRALKR RRSKVQDQGK LIKTLIQTKS GSLPSLHDII KGNKEITVGT FGVTAVSGHI // ID F7I378_CALJA Unreviewed; 817 AA. AC F7I378; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000029198}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSCJAP00000029198}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000029198, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000029198, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000029198} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000029198}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01159957; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01159958; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000029198; -. DR Ensembl; ENSCJAT00000030853; ENSCJAP00000029198; ENSCJAG00000015835. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7I378; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008225; Chromosome 2. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 296 319 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 326 345 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 413 440 {ECO:0000256|SAM:Coils}. FT COILED 465 499 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 817 AA; 91138 MW; 0E78671789878FCD CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAA TAGTPGDGRA ASADGCANNI ISLKNRAAKT AKQCRTANKS AFSINHVSRE VTSSGVSHSS TASLQDTVTR RPPVLDESWI REQTTVDHFW VGLDDDGDLK GGNKAAIQGN GDLAAAATAH NGYTCSECSM LSERKDALTA HPVARGPLSR VYSRGRNQKR DDCKGKTHLD AHTAAHLQSP RPSRQAGILR HIWACAGYFL LQTLHRIGAA GRAVSRMVWS ALWLAVIAPG KAASGVFWWL GIGWYQFVTL ISWLNVFLLT RCLRNICKFL VFLIPLFLLL AGLSLRGQDD FFSFLPVWNW VSLHRTQQVD DPKDILKPAT SHLNQPLQGD SEAFPWHWMR GVEQQVASLS GQCHHHGENL RELTAMLQKL QAQVDQMDNG AAGLSASVRD AVGQHLRETD VVAFHQEHEV RISHLEDILE KLREKSEAIQ KELEQTKQKT VRSSAVGEHL QLELDQLKSE LSSWRHVRTG CETVDAVRER VDVQVREMVK LLFSEDEQGG SLEQLLQRFS SQFVSKADLH MLLRDLELQI LRNVTHHISV TKHTPTSEAV VSAVSEAGMS GITEAQARAI VNNALKLYSQ DKTGMVDFAL ESGGGSILST RCSETYETKT ALMSLFGIPL WYFSQSPRVV IQPDIYPGNC WAFKGSQGYL VVRLSMMIYP AAFTLEHIPK TLSPTGNISS APKDFAVYGL ENEYQEEGQL LGQFTYDQDG ESLQMFQALK RPDDTAFQIV ELRIFSNWGH PEYTCLYRFR VHGKPVK // ID F7I3L0_CALJA Unreviewed; 504 AA. AC F7I3L0; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000046758}; DE Flags: Fragment; GN Name=SUN1 {ECO:0000313|Ensembl:ENSCJAP00000046758}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000046758, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000046758, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000046758} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000046758}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01159957; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01159958; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000029198; -. DR Ensembl; ENSCJAT00000061600; ENSCJAP00000046758; ENSCJAG00000015835. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000008225; Chromosome 2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 504 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003355157. FT COILED 93 120 {ECO:0000256|SAM:Coils}. FT COILED 145 179 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSCJAP00000046758}. SQ SEQUENCE 504 AA; 57061 MW; CC05BA8C1A7DD2B4 CRC64; RCLRNICKFL VFLIPLFLLL AGLSLRGQDD FFSFLPVWNW VSLHRTQQVD DPKDILKPAT SHLNQPLQGD SEAFPWHWMR GVEQQVASLS GQCHHHGENL RELTAMLQKL QAQVDQMDNG AAGLSASVRD AVGQHLRETD VVAFHQEHEV RISHLEDILE KLREKSEAIQ KELEQTKQKT VRSVANRHTP AGFGEHLQLE LDQLKSELSS WRHVRTGCET VDAVRERVDV QVREMVKLLF SEDEQGGSLE QLLQRFSSQF VSKADLHMLL RDLELQILRN VTHHISVTKH TPTSEAVVSA VSEAGMSGIT EAQARAIVNN ALKLYSQDKT GMVDFALESG GGSILSTRCS ETYETKTALM SLFGIPLWYF SQSPRVVIQP DIYPGNCWAF KGSQGYLVVR LSMMIYPAAF TLEHIPKTLS PTGNISSAPK DFAVYGLENE YQEEGQLLGQ FTYDQDGESL QMFQALKRPD DTAFQIVELR IFSNWGHPEY TCLYRFRVHG KPVK // ID F7I3Q4_CALJA Unreviewed; 379 AA. AC F7I3Q4; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000034582}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSCJAP00000034582}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000034582, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000034582, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000034582} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUN-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000034582}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01105926; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01105927; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01105928; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002747326.1; XM_002747280.2. DR STRING; 9483.ENSCJAP00000034574; -. DR Ensembl; ENSCJAT00000036520; ENSCJAP00000034582; ENSCJAG00000018611. DR GeneID; 100408020; -. DR KEGG; cjc:100408020; -. DR CTD; 140732; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; F7I3Q4; -. DR Proteomes; UP000008225; Chromosome 5. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 61 80 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 101 118 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 155 175 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 379 AA; 42758 MW; 12E0C889D6FEB91A CRC64; MPRSSRSPGD PGALLEDVAN NRRPRRIARR GQNTSRMAED PSPNMNDAFL LPVHINAQAP GLTQGMLGCV SWVSCLAFFL RTQAQRVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HVPSKMRVGQ DDSITTPLQS LRLYQEKVRH HSGEIQDLRG SINLLTAKLQ EMEAMSNEQN MAHKIMKMIQ GDYTEKPDFA LKSTGASIDF EHTSATYNHD KAHSYWKWIR LWNYAQPPDV ILEPNVTPGN CWAFEGDHGQ VTIRLAQKIY LSNLTLQHIP KTISLSGNLD TAPKDFVIYG MESSPSEEVF LGAFQFQPEN IIQMFPLQNQ PARAFGAVKV KISSNWGNPG FTCLYRVRVH GSVAPPGAQP HQNRYPERD // ID F7I3R7_CALJA Unreviewed; 379 AA. AC F7I3R7; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000034577}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSCJAP00000034577}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000034577, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000034577, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000034577} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000034577}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01105926; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01105927; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01105928; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000034574; -. DR Ensembl; ENSCJAT00000036515; ENSCJAP00000034577; ENSCJAG00000018611. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000008225; Chromosome 5. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 61 80 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 101 118 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 155 175 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 379 AA; 42801 MW; 954E993030FEB903 CRC64; MPRSSRSPGD PGALLEDVAN NRRPRRIARR GQNTSRMAED PSPNMNDAFL LPVHINAQAP GLTQGMLGCV SWVSCLAFFL RTQAQRVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HVPSKMRVGQ DDSITTPLQS LRLYQEKVRH HSGEIQDLRG SINLLTAKLQ EMEAMSNEQN MAHKIMKMIQ GDYTEKPDFA LKSTGASIDF EHTSATYNHD KAHSYWKWIR LWNYAQPPDV IREPNVTPGN CWAFEGDHGQ VTIRLAQKIY LSNLTLQHIP KTISLSGNLD TAPKDFVIYG MESSPSEEVF LGAFQFQPEN IIQMFPLQNQ PARAFGAVKV KISSNWGNPG FTCLYRVRVH GSVAPPGAQP HQNRYPERD // ID F7I3R9_CALJA Unreviewed; 379 AA. AC F7I3R9; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000034574}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSCJAP00000034574}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000034574, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000034574, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000034574} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000034574}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01105926; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01105927; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01105928; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000034574; -. DR Ensembl; ENSCJAT00000036512; ENSCJAP00000034574; ENSCJAG00000018611. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008225; Chromosome 5. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 61 80 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 101 118 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 155 175 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 379 AA; 42746 MW; 4CDFFBED19AA4B11 CRC64; MPRSSRSPGD PGALLEDVAN NRRPRRIARR GQNTSRMAED PSPNMNDAFL LPVHINAQAP GLTQGMLGCV SWVSCLAFFL RTQAQRVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HVPSKMRVGQ DDSITTPLQS LRLYQEKVRH HSGEIQDLRG SINLLTAKLQ EMEAMSNEQN MAHKIMKMIQ GDYTEKPDFA LKSTGASIDF EHTSATYNHD KAHSYWKWIR LWNYAQPPDL AEEPNVTPGN CWAFEGDHGQ VTIRLAQKIY LSNLTLQHIP KTISLSGNLD TAPKDFVIYG MESSPSEEVF LGAFQFQPEN IIQMFPLQNQ PARAFGAVKV KISSNWGNPG FTCLYRVRVH GSVAPPGAQP HQNRYPERD // ID F7I5G4_CALJA Unreviewed; 1410 AA. AC F7I5G4; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000017399}; GN Name=SUCO {ECO:0000313|Ensembl:ENSCJAP00000017399}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000017399, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000017399, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000017399} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000017399}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01045002; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01045003; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01045004; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000017399; -. DR Ensembl; ENSCJAT00000018401; ENSCJAP00000017399; ENSCJAG00000009416. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; F7I5G4; -. DR OMA; SSPWFES; -. DR TreeFam; TF105817; -. DR Proteomes; UP000008225; Chromosome 18. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}. FT COILED 1092 1112 {ECO:0000256|SAM:Coils}. FT COILED 1142 1162 {ECO:0000256|SAM:Coils}. FT COILED 1348 1368 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1410 AA; 156702 MW; 93FC86334016EB55 CRC64; MREFLALPFT STSQHLAHGH RLTQGKGLLR LPSQPSFRHS RLFHELCSKE ENSATMPKLI SLAVSSEIIV FPNKTMGSRR DRERENRVLE GKLPLPKGLA RTQRARADGR AFATSKGAWP ARTRAPESFC EASLSAPLWG PRRGRPGREL LKSRSASATA LRTLRPILAL LLRLLHLGLG SGGCREDVPP SGRGKKEEKM KKHRQALALV SCLFLCSLVW LPSWRVCCKE SSSASASSYY SQDDNCALQN EDVQFQKKNT ESKKLSPPVV EILPTVDLHE DSSSVVVDSE TVENISSSST SEITPISKLD EIEKSGTIPV AKPSETEQSE TDCDVGEANA PIEQPSFVNP PDGLVGQHIE NVSSSHGKGK ITKSEFESKV SASEQGGGDP KSGLNASDNL KNESSDYTKP GEIDPTSVAS SKDPEDIPTF DEWKKKVMEV EKEKSQSMHS SSNGGSHATK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILIENMDLYM LNPCSTKIWF VIELCEPIQV KQLDIANYEL FSSTPKDFLV SISDRYPTNK WIKLGTFHGR DERNVQSFPL DEQMYAKYVK VELVSHFGSE HFCPLSLIRV FGTSMVEEYE EIADSQYHSE RQELFDEDYD YPLDYNTGED KSSKNLLGSA TNAILNMVNI AANILGAKTE ELTEGNKSIS ENATATAAPK MPESTPVSTP VPSPEDVTTE VHTHDMEPSR PDTPKESPIV QLVQEEEEEA SPSTVTLLGS GEQEDESSPW FESETQIFCS ELTTICCISS FSEYIYKMVF SWSCSLSVAL YRQRSRTAWS KGKDYLVSAE PPLLLPAESV DVSVLQPLSG ELENKNIERE AETVVLGDLS SSMHQDDLVN YTVDAIELEP SHSQTLSQSF LLDITPEINL LPKIEVSESV KYEAGHIPSQ VIPQESSVEI DNEMEQKSES FSSIEKPSIA YETNKVNEVT DNIVKQDVNS MQIFTKLSET IVPPINAAVP DNEDGEAKMN IADTAKQTLT SVVDSSSLPE VKEEEQSPED ALLRGFQRTA TDFYAELQNS TDLGYANGNL VHGSNQKESV FMRLNNRIKA LEVNMSLSGR YLEELSQRYR KQMEEMQKAF NKTIVKLQNT SRIAEEQDQR QTEAIQLLQA QLSNMTQLVS NLSTTVAELK REVSDRQSYL LISLVLCVIL GLMLCMQRCR NTSQFDGDYI SKLPKSNQYP SPKRCFSSYD DMNLKRRTSF PLIRSKSLQL TGKEVDPNDL YIVEPLKFSP EKKKKRCKYK IEKIETIKPA EPLHPIANGD IKGRKPFTNQ RDFSNMGEVY HSSYKGPPSE GSSETSSQSE ESYFCGISAC TSLCNGQSQK TKTEKRALKR RRSKVQDQGK LIKTLIQTKS GSLPSLHDII KGNKEITVGT FGVTAVSGHI // ID F7I653_CALJA Unreviewed; 201 AA. AC F7I653; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 22-JUL-2015, entry version 10. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000034554}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSCJAP00000034554}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000034554, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000034554, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000034554} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000034554}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01105926; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01105927; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01105928; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR Ensembl; ENSCJAT00000036491; ENSCJAP00000034554; ENSCJAG00000018611. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000008225; Chromosome 5. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}. SQ SEQUENCE 201 AA; 22556 MW; 1CBC189E305E3E74 CRC64; MIQGDYTEKP DFALKSTGAS IDFEHTSATY NHDKAHSYWK WIRLWNYAQP PDVILEPNVT PGNCWAFEGD HGQVTIRLAQ KIYLSNLTLQ HIPKTISLSG NLDTAPKDFV IYCCSGAVSR QGMESSPSEE VFLGAFQFQP ENIIQMFPLQ NQPARAFGAV KVKISSNWGN PGFTCLYRVR VHGSVAPPGA QPHQNRYPER D // ID F7I8G3_CALJA Unreviewed; 2615 AA. AC F7I8G3; DT 27-JUL-2011, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCJAP00000010594}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSCJAP00000010594}; OS Callithrix jacchus (White-tufted-ear marmoset). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. OX NCBI_TaxID=9483 {ECO:0000313|Ensembl:ENSCJAP00000010594, ECO:0000313|Proteomes:UP000008225}; RN [1] {ECO:0000313|Ensembl:ENSCJAP00000010594, ECO:0000313|Proteomes:UP000008225} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Warren W., Ye L., Minx P., Worley K., Gibbs R., Wilson R.K.; RL Submitted (MAR-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCJAP00000010594} RP IDENTIFICATION. RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCJAP00000010594}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACFV01030721; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030722; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030723; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030724; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030725; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030726; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030727; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACFV01030728; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9483.ENSCJAP00000010594; -. DR Ensembl; ENSCJAT00000011192; ENSCJAP00000010594; ENSCJAG00000005631. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; F7I8G3; -. DR OMA; NRQCIEG; -. DR TreeFam; TF323674; -. DR Proteomes; UP000008225; Chromosome 10. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008225}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008225}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1250 1270 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2615 AA; 289796 MW; 052585985683C101 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGS TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNVSLS SSKEKNIVLK LITVFCINIA YSYTKGLNNN YHSRPAVALI RKLIAVLESI ERLPLHLYDT PGSTYNLQIL TRRLRFRLER APGETALIDR TGRMLKMEPL ATVESLEQYL LKMVAKQWYD FDRSSFVFVR KLREGQNFIF RHQHDFDENG IIYWTGTNAK TAYEWVNPAA YGLVVVTSSE GRNLPYGRLE DILSRDNSAL NCHSNDDKNA WFAIDLGLWV IPSAYTLRHA RGYGRSALRN WVFQVSKDGQ NWTSLYTHVD DCSLNEPGST ATWPLDPPKD EKQGWRHVRI KQMGKNASGQ THYLSLSGFE LYGTVNGVCE DQLGKAAKEA EANLRRQRRL VRSQVLKYMV PGARVIRGLD WKWRDQDGSP QGEGTVTGEL HNGWIDVTWD AGGSNSYRMG AEGKFDLKLA PGYDPDTVAS PKPVSSTVSG TTQSWSSLVK NNCPDKTSAA AGSSSRKGSS SSVCSVASSS DISLGSTKTE RRSEIVMEHS IVSGADVHEP IVVLSSAENV PQTEVGSSSS ASTSTLTAET GSENAERKLG PDSSVRTPGE SSAISMGIVS VSSPDVSSVS ELTNKEAASQ RPLSSSASNR LSVSSLLAAG APMSSSASVP NLSSRETSSL ESFVRRVANI ARTNATNNMN LSRSSSDNNT NTLGRNVMST ATSPLMGAQS FPNLTTPGTT STVTMSTSSV TSSSNVATAT TVLSVGQSLS NTLTTSLTST SSESDTGQEA EYSLYDFLDS CRASTLLAEL DDDEDLPEPD EEDDENEDDN QEDQEYEEVM ILRRPSLQRR AGSRSDVTHH AVTSQLPQVP AGAGSRPIGE QEEEEYETKG GRRRTWDDDY VLKRQFSALV PAFDPRPGRT NVQQTTDLEI PPPGTPHSEL LEEVECTPSP RLALTLKVTG LGTTREVELP LINFRSTIFY YVQKLLQLSC NGNVKSDKLR RIWEPTYTIM YREMKDSDKE KENGKMGCWS LEHVEQYLGT DELPKNDLIT YLQKNADAAF LRHWKLTGTN KSIRKNRNCS QLIAAYKDFC EHGTKSGLNQ GAISTLQSSD ILNLTKEQPQ AKAGNGQNSC GVEDVLQLLR ILYIVASDPY SRISQEDGDE QPQFTFPPDE FTSKKITTKI LQQIEEPLAL ASGALPDWCE QLTSKCPFLI PFETRQLYFT CTAFGASRAI VWLQNRREAT VERTRTTSSV RRDDPGEFRV GRLKHERVKV PRGESLMEWA ENVMQIHADR KSVLEVEFLG EEGTGLGPTL EFYALVAAEF QRTDLGAWLC DDNFPDDESR HVDLGGGLKP PGYYVQRSCG LFTAPFPQDS DELERITKLF HFLGIFLAKC IQDNRLVDLP ISKPFFKLMC MGDIKSNMSK LIYESRGDRD LHCTESQSEA STEEGHDSLS VGSFEEDSKS EFILDPPKPK PPAWFNGILT WEDFELVNPH RARFLKEIKD LAIKRRQILS NKGLSEDEKN TKLQELVLKN PSGSGPPLSI EDLGLNFQFC PSSRIYGFTA VDLKPSGEDE MITMDNAEEY VDLMFDFCMH TGIQKQMEAF RDGFNKVFPM EKLSSFSHEE VQMILCGNQS PSWAAEDIIN YTEPKLGYTR DSPGFLRFVR VLCGMSSDER KAFLQFTTGC STLPPGGLAN LHPRLTVVRK VDATDASYPS VNTCVHYLKL PEYSSEEIMR ERLLAATMEK GFHLN // ID F7W9G9_SORMK Unreviewed; 1084 AA. AC F7W9G9; DT 21-SEP-2011, integrated into UniProtKB/TrEMBL. DT 21-SEP-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=WGS project CABT00000000 data, contig 2.52 {ECO:0000313|EMBL:CCC13960.1}; GN ORFNames=SMAC_08081 {ECO:0000313|EMBL:CCC13960.1}; OS Sordaria macrospora (strain ATCC MYA-333 / DSM 997 / K(L3346) / OS K-hell). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Sordaria. OX NCBI_TaxID=771870 {ECO:0000313|EMBL:CCC13960.1, ECO:0000313|Proteomes:UP000001881}; RN [1] {ECO:0000313|EMBL:CCC13960.1, ECO:0000313|Proteomes:UP000001881} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-333 / DSM 997 / K(L3346) / K-hell RC {ECO:0000313|Proteomes:UP000001881}; RC TISSUE=Mycelium {ECO:0000313|EMBL:CCC13960.1}; RX PubMed=20386741; DOI=10.1371/journal.pgen.1000891; RA Nowrousian M., Stajich J.E., Chu M., Engh I., Espagne E., Halliday K., RA Kamerewerd J., Kempken F., Knab B., Kuo H.-C., Osiewacz H.D., RA Poeggeler S., Read N.D., Seiler S., Smith K.M., Zickler D., Kueck U., RA Freitag M.; RT "De novo assembly of a 40 Mb eukaryotic genome from short sequence RT reads: Sordaria macrospora, a model organism for fungal RT morphogenesis."; RL PLoS Genet. 6:E1000891-E1000891(2010). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:CCC13960.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CABT02000052; CCC13960.1; -; Genomic_DNA. DR RefSeq; XP_003347189.1; XM_003347141.1. DR STRING; 771870.XP_003347189.1; -. DR EnsemblFungi; CCC13960; CCC13960; SMAC_08081. DR GeneID; 10804609; -. DR KEGG; smp:SMAC_08081; -. DR EuPathDB; FungiDB:SMAC_08081; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; F7W9G9; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001881; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001881}; KW Reference proteome {ECO:0000313|Proteomes:UP000001881}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 1084 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003371191. FT COILED 667 687 {ECO:0000256|SAM:Coils}. FT COILED 722 749 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1084 AA; 116942 MW; C4566B86A21A132F CRC64; MRTPPTLFLG LLGLHAVAAA LPEPASVCES RTVNYITHTL PQQCLRTSWT TPTAVTSAVA TDTTSSEVTN NETAAPAPAN ETPQQQHHDQ PKPSAEHTQE QTKEEQEDED LASSTFMSFE EWKEMMLRKS GDPANSNKGG QQKHSGQQRA GGEHDQNGPS DTDSHGQNED GENSLNFDAL SEKVSELTTS PSGNPSTDHG GSGKAKKDDQ VVYEDGKTQY YRSKDAGKTC KERFSYASFD AGATVLKTSP GAKNAKAILV ENKDSYMLLE CHAKSKFVIV ELSDDILVDT VVLANFEFFS SMIRRFKVSV SDRYPVKLDK WVELGTFEAR NSRDIQAFLV EHPQIYTKYI RIEFLSHYGN EYYCPVSLLR VHGTRMLDTW KEPDDRHDDE QETIEAPPVQ EQLPQTPEPE QPSSQVEEPT VTSEPVPSTV TEAKEELHEE IEPVQMAEVG FTPWEPVFYR DLSLEVCALR SRTTGQPTRI SPGADKYNIS AGNPDLVKGQ NSTRSAAHET SVTKSSSAAS KSQETAKGQP ASSSASHSSA QPQASNTTAG SPNNKAPPAR PNTASNETAP SASSAAKPPS SSSSSSSSST TGTTGRNDSK ENANAGGNSS GSGAPSPNNK NNQQQKPNPG GSPTTSSHPA SPTVQESFFK TVHKRLTHLE SNTSLSLQYI EQQSRFLQDV LSKLERRQLT RVDTFLDTLN KTVLTELRNV RQQYDQIWQS TVIALETQRE QTEREVVALS GRLNVLADEV VFQKRMAILQ SVLLLSCLVL VIFNRTGSNS GGGGGGGNNA LGGNGGGMGG RPNSRGGWFD SPVQAAQRRS MRPGSGWISN MSMSMGMSSP FPFSTTASTS GLQQQQQSTT AAEPSQSGGE DTDAAVGGST GVDAAAEHNQ QQLHPNGNYG RQPPLQTQQQ HSYAYSRSND KALPLTPTSE YDSREGTPLV HASPLRQTST TIDEVLAGEG ADGNSPLYTQ SSFGPEPDCG PDQEGSSQSS SSGSESDGFT QETTTSFVQE SAGPAQNGVM NVRVRSNPAG ERSERVEDDM DLLPVDSIED QQQQTLRHRP RPPHPYHGSG TVKPLPALPE TSSS // ID F8MFY1_NEUT8 Unreviewed; 1020 AA. AC F8MFY1; DT 21-SEP-2011, integrated into UniProtKB/TrEMBL. DT 21-SEP-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGO59357.1}; GN ORFNames=NEUTE1DRAFT_145389 {ECO:0000313|EMBL:EGO59357.1}; OS Neurospora tetrasperma (strain FGSC 2508 / ATCC MYA-4615 / P0657). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Neurospora. OX NCBI_TaxID=510951 {ECO:0000313|Proteomes:UP000008065}; RN [1] {ECO:0000313|Proteomes:UP000008065} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=FGSC 2508 / P0657 {ECO:0000313|Proteomes:UP000008065}; RX PubMed=21750257; DOI=10.1534/genetics.111.130690; RA Ellison C.E., Stajich J.E., Jacobson D.J., Natvig D.O., Lapidus A., RA Foster B., Aerts A., Riley R., Lindquist E.A., Grigoriev I.V., RA Taylor J.W.; RT "Massive changes in genome architecture accompany the transition to RT self-fertility in the filamentous fungus Neurospora tetrasperma."; RL Genetics 189:55-69(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL891303; EGO59357.1; -; Genomic_DNA. DR RefSeq; XP_009849600.1; XM_009851298.1. DR EnsemblFungi; EGO59357; EGO59357; NEUTE1DRAFT_145389. DR GeneID; 20826577; -. DR KEGG; nte:NEUTE1DRAFT145389; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008065; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008065}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 496 519 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 111 135 {ECO:0000256|SAM:Coils}. FT COILED 164 320 {ECO:0000256|SAM:Coils}. FT COILED 363 383 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1020 AA; 115457 MW; C811BADE3138DF68 CRC64; MPPRRITRRS VVSPSPALSD TGTPKPGKRG TLIPVEQVRN QTPTRFSLSY GSSLVAMPDR NKTAAGTDLE TAFAEIHETV RTDNIKAEAR RRELDARRGS TTPGPRRPDP IEEETEEEEE EDEEVQEEKE EDDDDNQGGY YNDEEEEEPE PDPPRRATKP AQTLNKLQDQ IEKAKLLEKQ RAEERAAEER EKAEKDAAER ERKRVEKDKK DKEEREKREK AEREKKEQAA KAQQEAKAKA AREAQERAER EAKKRARDEE DQEQAELERA ERNARLKRER SEDARRQAEQ KHAAEAARKK EEQRQAREAS EAEMASLEEA KRQAMRPPPP PSKQLLSTPP TSRTRELVVP DTGNSYVEES DVYTDSEKMR EVLEEEVVRM AQQRRLARYT PEPPEPPRIA RRPASTLSNS FQHAPHQVDQ HQDLFDTEAK SMSDKQYPSF GKVSKPTAAR PNQTSRPRAE QSNTTNGETP PPPYTTAPPT FMQRLLKLIR RSTWGVWKLF TFLVPVLLIG LIVLTASSYG SPDSNTSIRW YGWKHWRSNV GQFIPSHPQL TDDQFNDLKD FILEQSSSTE SAVKNIQSLL PRMVHVKRGP NGDLIIQDDF WHALLDKMLK DSSVLTLDGT GDISEEHWDA LRPRLIKAGL FEKGPSDEHI LQIAEGTVSK SWERWVTKNG EKVAQVVKKH LPGDKGDGVT RDAAISRDEF VGLLKKRIAE HKEEIDGQLD SVKKGLETLI DTTVKAAISN SEGSLSKSEI TTLVRNIVKK EIPRAQLEAA AKDGIMRNYH DYVETQVNHF GLGNEAGIVL SESSPVYRLD SQALPGNKHL SKLLGKPKPI SSKDQVTLEA EYMLALSAWN DVGQCWCAGI TASRGAELAV EMANHVIPQA IVVEHVHPNA TNDPGSMPKD IEIWGYYPDA DDSKRLLAWM DELYPGEREA DMKRVDADNK KSLSLINRKY VKIGELEYDY AKTSGSHGMF VHKLSEELLD LDAATYKVLV RAKTNHGALD HTCIYRLKLF GEELEFEGEE // ID F8MME1_NEUT8 Unreviewed; 1098 AA. AC F8MME1; DT 21-SEP-2011, integrated into UniProtKB/TrEMBL. DT 21-SEP-2011, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGO57815.1}; GN ORFNames=NEUTE1DRAFT_129667 {ECO:0000313|EMBL:EGO57815.1}; OS Neurospora tetrasperma (strain FGSC 2508 / ATCC MYA-4615 / P0657). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Neurospora. OX NCBI_TaxID=510951 {ECO:0000313|Proteomes:UP000008065}; RN [1] {ECO:0000313|Proteomes:UP000008065} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=FGSC 2508 / P0657 {ECO:0000313|Proteomes:UP000008065}; RX PubMed=21750257; DOI=10.1534/genetics.111.130690; RA Ellison C.E., Stajich J.E., Jacobson D.J., Natvig D.O., Lapidus A., RA Foster B., Aerts A., Riley R., Lindquist E.A., Grigoriev I.V., RA Taylor J.W.; RT "Massive changes in genome architecture accompany the transition to RT self-fertility in the filamentous fungus Neurospora tetrasperma."; RL Genetics 189:55-69(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL891304; EGO57815.1; -; Genomic_DNA. DR RefSeq; XP_009850904.1; XM_009852602.1. DR EnsemblFungi; EGO57815; EGO57815; NEUTE1DRAFT_129667. DR GeneID; 20825354; -. DR KEGG; nte:NEUTE1DRAFT129667; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008065; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008065}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 1098 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003375173. FT COILED 665 685 {ECO:0000256|SAM:Coils}. FT COILED 720 747 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1098 AA; 119228 MW; ED6494339DD943F3 CRC64; MRTPPTLFLG LLGLHAVAAA LPEPASVCES RTVNYITHTL PQQCLRTAWT TPTAVTSAIA ADTTSSEVPS NETAAPAQAK ETQQQHPDQP KPSAEHTQEQ TKEEQEDEDL AASTFMSFEE WKEMMLRKSG DPANTKGGQK QPAQQRTGGE HDQNGPNSDT DSHRPGDDGE NPLNFDALSE KVSELTSSPS GDPSTDYGSD KARTDDQVVH EDGKTQYYRS KDAGKTCKER FSYSSFDAGA IVKKTSPGAK NAKAILVENK DSYMLLECHA KSKFVIVQLS DDILVDTVVL ANFEFFSSMI RQFKVSVSDR YPVKLDKWVE LGTFEARNSR DIQAFSVEHP QIYTKYIRIE FLSHYGNEYY CPVSLLRVHG TRMLDTWKEP DDRHDDEQET IEAPPVQEQL PQTPEPEQPS PQVGQPSVAS EPAPSTVTEL EEEAHQETEP VQAVELGFTP WEPVFYRDFS FEICDLRSRT TGQSTATSPE ADNKQGRNSD TAKEQASTGS AVHETLVPKA SSTASKPQEI AKAQPASSAA SHTPVPPQVS GTITGSPSNK APLSRSNTAS NETAPSVSPA AKPSGSSNST AGTTSRSDSK DYGNNASANA GTGGSPLNNS SQNNKNNQPR KPASGAGHGG SPTSSAPPLP TIQESFFKTV HKRLTHLESN TSLSLQYIEQ QSRFLQDVLS KLERRQLTRV DTFLDTLNKT VLTELRNVRQ QYDQIWQSTV IALETQREQT EREVVALSGR LNVLADEVVF QKRMAILQSV LLLSCLILVI FNRTGGGGGG GGVNGGGGIA LNSNRGTGGR PGSRGGGGGG GGWFDSPIQA VQRRSMKPGS GWISNMSMSM GMSSPFPFST TVSTSGVQQQ VTAVATAEAR SGSGEDADSV GTSTGVDIAA AQQRNQQQLH PNDNHNLGQR QHQHMLQTQQ HSYAYPRNND KALPLTPTSE YDSREGTPLV HTSPLRQTST TIDEVLAAED ADDDSQLYTQ SSFGPESECV PDQEESSRSS SSEFESGGLT QERTLEIYQE STEPNRNGVT NVPVRSNSAE ESSERIEEDN INLMPVDSIE YHQQQTLRPR ARPSRTHLGS ETVKPLPAVP ETSKFIIT // ID F8NVV0_SERL9 Unreviewed; 993 AA. AC F8NVV0; DT 21-SEP-2011, integrated into UniProtKB/TrEMBL. DT 21-SEP-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGO24261.1}; GN ORFNames=SERLADRAFT_449028 {ECO:0000313|EMBL:EGO24261.1}; OS Serpula lacrymans var. lacrymans (strain S7.9) (Dry rot fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Boletales; Coniophorineae; OC Serpulaceae; Serpula. OX NCBI_TaxID=578457 {ECO:0000313|Proteomes:UP000008064}; RN [1] {ECO:0000313|Proteomes:UP000008064} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=S7.9 {ECO:0000313|Proteomes:UP000008064}; RX PubMed=21764756; DOI=10.1126/science.1205411; RA Eastwood D.C., Floudas D., Binder M., Majcherczyk A., Schneider P., RA Aerts A., Asiegbu F.O., Baker S.E., Barry K., Bendiksby M., RA Blumentritt M., Coutinho P.M., Cullen D., de Vries R.P., Gathman A., RA Goodell B., Henrissat B., Ihrmark K., Kauserud H., Kohler A., RA LaButti K., Lapidus A., Lavin J.L., Lee Y.-H., Lindquist E., Lilly W., RA Lucas S., Morin E., Murat C., Oguiza J.A., Park J., Pisabarro A.G., RA Riley R., Rosling A., Salamov A., Schmidt O., Schmutz J., Skrede I., RA Stenlid J., Wiebenga A., Xie X., Kuees U., Hibbett D.S., RA Hoffmeister D., Hoegberg N., Martin F., Grigoriev I.V., RA Watkinson S.C.; RT "The plant cell wall-decomposing machinery underlies the functional RT diversity of forest fungi."; RL Science 333:762-765(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL945434; EGO24261.1; -; Genomic_DNA. DR RefSeq; XP_007318280.1; XM_007318218.1. DR EnsemblFungi; EGO24261; EGO24261; SERLADRAFT_449028. DR GeneID; 18816522; -. DR KEGG; sla:SERLADRAFT_449028; -. DR InParanoid; F8NVV0; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000008064; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008064}. FT COILED 267 287 {ECO:0000256|SAM:Coils}. FT COILED 603 623 {ECO:0000256|SAM:Coils}. FT COILED 639 659 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 993 AA; 110041 MW; 232C2E63377C0F7F CRC64; MSFSGTPLGQ GRRLDHRTFL NRQNPNTNHN NPRTVNPPTS PQRIIPTSYA YGAPTAATRS PPKPLSANST QELGQEGNTD EPALVRYAHI KQREQTTHTR PSAQLGPRTI TSPPRPEKWS VKDTSVNIAT AFLQAASSSH EMQSNNPNHS WASGSQTNLN VPRSTSVEYE KETQSISNRR LAPPPNRLSQ RSNRIPLSKQ VSVTQVPDSE GEDEAVDENG RAKTPFEQVV DITKRVLAPA TFYLRQRSQE PADHSNVSNG KDSSYDYSAE EREYQEMQGQ KEGEDQNGDQ DAKASSRRIN ATHKRNRMSM DNKAYRPSVS EMEESEDDLS DDGKKRRRRA KKKELGGGPL TTLPVASYGK RRRRKGRGSN RNLEEIGEEE QEGSSSGNEQ RSAPQRTSIP RGSAPPSVRG SIPPVHETSQ DESMDVEAGL QSIPEIDEPP VVDGYMGSQP VPARSKSFSI GGLLGKLVNR LWRGIVAVLH FLLDVCTAVA MLAGKVLGTI VDVILRRPAD MVSRTNPGPF VQLGKYLMVA LSVYAAWYAL RDPFLQWIPS RISGPPTYYA PDTPITDFTE FTARLQNIES ALSGLSLDHQ RSRSQLDVDA RSNAELINRI FALEARVKEE IKRSVEAQHL LQAATSRSLQ EVREEMEAVQ SHVQEVEAHP RESVERHAAE VSDEEARGKL KVLEERLGTV EGGVKEALEL GKNSVRLEAL QLLLSSGSAT GSGITIKSSD GHDVTSLIGH LVESAVLKYS QQDDLSRPDF ALHSAGARLI PSLTSQTLTI YPATLGARLY GLVSGQGTAT GRTPVTVLHH EVHNGYCWPM EGTKGSVGVM LAYPAYVSDF TIDHVSKEVA FDLRSAPRDM EVWGFVDGKD NLAKVREWQA DKAKRREEAR RIAEEEGLEY VEESEPEYPS MLPTSEPYIR IASFTYDIYS SRNIQTFPVS QEIKDLGIDF GVVVLVINNN WGRDEFTCIY RFRVHGELMG EMALPYPEET SES // ID F8P0H5_SERL9 Unreviewed; 991 AA. AC F8P0H5; DT 21-SEP-2011, integrated into UniProtKB/TrEMBL. DT 21-SEP-2011, sequence version 1. DT 14-OCT-2015, entry version 11. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGO23530.1}; GN ORFNames=SERLADRAFT_449900 {ECO:0000313|EMBL:EGO23530.1}; OS Serpula lacrymans var. lacrymans (strain S7.9) (Dry rot fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Boletales; Coniophorineae; OC Serpulaceae; Serpula. OX NCBI_TaxID=578457 {ECO:0000313|Proteomes:UP000008064}; RN [1] {ECO:0000313|Proteomes:UP000008064} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=S7.9 {ECO:0000313|Proteomes:UP000008064}; RX PubMed=21764756; DOI=10.1126/science.1205411; RA Eastwood D.C., Floudas D., Binder M., Majcherczyk A., Schneider P., RA Aerts A., Asiegbu F.O., Baker S.E., Barry K., Bendiksby M., RA Blumentritt M., Coutinho P.M., Cullen D., de Vries R.P., Gathman A., RA Goodell B., Henrissat B., Ihrmark K., Kauserud H., Kohler A., RA LaButti K., Lapidus A., Lavin J.L., Lee Y.-H., Lindquist E., Lilly W., RA Lucas S., Morin E., Murat C., Oguiza J.A., Park J., Pisabarro A.G., RA Riley R., Rosling A., Salamov A., Schmidt O., Schmutz J., Skrede I., RA Stenlid J., Wiebenga A., Xie X., Kuees U., Hibbett D.S., RA Hoffmeister D., Hoegberg N., Martin F., Grigoriev I.V., RA Watkinson S.C.; RT "The plant cell wall-decomposing machinery underlies the functional RT diversity of forest fungi."; RL Science 333:762-765(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL945435; EGO23530.1; -; Genomic_DNA. DR RefSeq; XP_007319292.1; XM_007319230.1. DR EnsemblFungi; EGO23530; EGO23530; SERLADRAFT_449900. DR GeneID; 18816590; -. DR KEGG; sla:SERLADRAFT_449900; -. DR InParanoid; F8P0H5; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008064; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008064}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 22 {ECO:0000256|SAM:SignalP}. FT CHAIN 23 991 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003381774. FT COILED 646 673 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 991 AA; 107186 MW; 7C2D91F545D9BB22 CRC64; MVSFTLLPST LIALLLALPA FSAPTTPNDP FHAIAILAPR QPEPPVCCLK PLQTLEPVED DVLLSFEEWK TKQFQMQQAQ ARADTANRSA AHSSAHSAAR ADAQRRGEKA PSDDGAAVLA SSSSPSSSGA TVPEHGQASY HPDDLLPPHF RVPLTDRFNY ANLDCSARVH TAHRSAKSPS SILSSKKDRY MLSPCAASAR KKEKQFVVVE LCEDIRIDTV QLANFEFFSG VFKDFSVSVA KTYTTNDDGW TLAGTYKAKN VRGVQSFHPP TSLRDFYRYI RIDFHSHYSN EYYCPVSLLR VYGLTHLEEW KWDRWEAESR AKLEESILAS TPVEVVEAPV PTEKPTMDTP EPVTSVPIHT IVHPSPSTQV EREELDNFMD TLESREVAQG DSVKPEPPSA PHASDDGIAY DGGKQHSHSH SHSSTSSVAR KINSSTTNIV GHGKPSTASP VATDSADVPP LPETSVLASL NADIQQVHAS HGHETAEHRP SSSSSLPSPD SIANTLTHAS VILSSVSSSS ITSVEISPPL ATQSLDGGGS SSGVRVVSSG SGAIHDNTSS SPHTSISSSV PPPPSSHVLP PISPPPMTTG GGESIYRTIM NRLTALEANH TLYALYVEEQ TTGVREMLRR LGEDVGRLEG LGKMQAQMYQ RTVHEWERQR RRLEIEHGEL MSRVNYLADE VILEKRLGIA QLCLLLAVLI FMGLTRGSRG EPFILDHGPA LLNRSVREWG RGKFALNLGG FKSRSPTPRG PEPGRVKREE ATASMSTAKL GDVLVDKGST PKGRKRSRTP SSLHTPTRLH HHRPWTPTGG GGGTHTHIRP RITRTNSHGA SSLGIVGGPG GSGGLGHVAP RSAKRWARTA HLHEVKMDRD KGKNAEGVGR YVGENVREDG GMGGVFLGAG GTENQPHPHH HHHQVKKVEL GLDTGKMVRG KAKPPRIVTE DLRRPLSPVD LTRRRVVETG ARAGVEEGMG DDGWVDTDME GSDMDLGGVG G // ID F8Q0W4_SERL3 Unreviewed; 923 AA. AC F8Q0W4; DT 21-SEP-2011, integrated into UniProtKB/TrEMBL. DT 21-SEP-2011, sequence version 1. DT 14-OCT-2015, entry version 8. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGN97942.1}; GN ORFNames=SERLA73DRAFT_169048 {ECO:0000313|EMBL:EGN97942.1}; OS Serpula lacrymans var. lacrymans (strain S7.3) (Dry rot fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Boletales; Coniophorineae; OC Serpulaceae; Serpula. OX NCBI_TaxID=936435 {ECO:0000313|Proteomes:UP000008063}; RN [1] {ECO:0000313|Proteomes:UP000008063} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=strain S7.3 {ECO:0000313|Proteomes:UP000008063}; RX PubMed=21764756; DOI=10.1126/science.1205411; RA Eastwood D.C., Floudas D., Binder M., Majcherczyk A., Schneider P., RA Aerts A., Asiegbu F.O., Baker S.E., Barry K., Bendiksby M., RA Blumentritt M., Coutinho P.M., Cullen D., de Vries R.P., Gathman A., RA Goodell B., Henrissat B., Ihrmark K., Kauserud H., Kohler A., RA LaButti K., Lapidus A., Lavin J.L., Lee Y.-H., Lindquist E., Lilly W., RA Lucas S., Morin E., Murat C., Oguiza J.A., Park J., Pisabarro A.G., RA Riley R., Rosling A., Salamov A., Schmidt O., Schmutz J., Skrede I., RA Stenlid J., Wiebenga A., Xie X., Kuees U., Hibbett D.S., RA Hoffmeister D., Hoegberg N., Martin F., Grigoriev I.V., RA Watkinson S.C.; RT "The plant cell wall-decomposing machinery underlies the functional RT diversity of forest fungi."; RL Science 333:762-765(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL945481; EGN97942.1; -; Genomic_DNA. DR EnsemblFungi; EGN97942; EGN97942; SERLA73DRAFT_169048. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008063; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008063}; KW Reference proteome {ECO:0000313|Proteomes:UP000008063}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 22 {ECO:0000256|SAM:SignalP}. FT CHAIN 23 923 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003382375. FT COILED 578 605 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 923 AA; 100579 MW; 9784217A6E5357A3 CRC64; MVSFTLLPST LIALLLALPA FSAPTTPNDP FHAIAILAPR QPEPPVCCLK PLQTLEPVED DVLLSFEEWK TKQFQMQQAQ ARADTANRSA AHSSAHSAAR ADAQRRGEKA PSDDGAAVLA SSSSPSSSGA TVPEHGQASY HPDDLLPPHF RVPLTDRFNY ANLDCSARVH TAHRSAKSPS SILSSKKDRY MLSPCAASAR KKEKQFVVVE LCEDIRIDTV QLANFEFFSG VFKDFSVSVA KTYTTNDDGW TLAGTYKAKN VRGVQSFHPP TSLRDFYRYI RIDFHSHYSN EYYCPVSLLR VYGLTHLEEW KWDRWEAESR AKLEESILAS TPVEVVEAPV PTEKPTMDTP EPVTSVPIHT IVHPSPSTQV EREELDNFMD TLESREVAQG DSVKPEPPSA PHASDDGIAY DGGKQHSHSH SHSSTSSVAR KINSSTTNIV GHGKPSTASP VATDSADVPP LPETSVLASL NADIQQVHAS HGHETAEHRP SSSSSLPSPD SIANTLTHAS VILSSVSSSS ITSVEISPPT IMNRLTALEA NHTLYALYVE EQTTGVREML RRLGEDVGRL EGLGKMQAQM YQRTVHEWER QRRRLEIEHG ELMSRVNYLA DEVILEKRLG IAQLCLLLAV LIFMGLTRGS RGEPFILDHG PALLNRSVRE WGRGKFALNL GGFKSRSPTP RGPEPGRVKR EEATASMSTA KLGDVLVDKG STPKGRKRSR TPSSLHTPTR LHHHRPWTPT GGGGGTHTHI RPRITRTNSH GASSLGIVGG PGGSGGLGHV APRSAKRWAR TAHLHEVKMD RDKGKNAEGV GRYVGENVRE DGGMGGVFLG AGGTENQPHP HHHHHQVKKV ELGLDTGKMV RGKAKPPRIV TEDLRRPLSP VDLTRRRVVE TGARAGVEEG MGDDGWVDTD MEGSDMDLGG VGG // ID F8Q1A4_SERL3 Unreviewed; 190 AA. AC F8Q1A4; DT 21-SEP-2011, integrated into UniProtKB/TrEMBL. DT 21-SEP-2011, sequence version 1. DT 16-SEP-2015, entry version 7. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGN98082.1}; GN ORFNames=SERLA73DRAFT_182953 {ECO:0000313|EMBL:EGN98082.1}; OS Serpula lacrymans var. lacrymans (strain S7.3) (Dry rot fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Boletales; Coniophorineae; OC Serpulaceae; Serpula. OX NCBI_TaxID=936435 {ECO:0000313|Proteomes:UP000008063}; RN [1] {ECO:0000313|Proteomes:UP000008063} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=strain S7.3 {ECO:0000313|Proteomes:UP000008063}; RX PubMed=21764756; DOI=10.1126/science.1205411; RA Eastwood D.C., Floudas D., Binder M., Majcherczyk A., Schneider P., RA Aerts A., Asiegbu F.O., Baker S.E., Barry K., Bendiksby M., RA Blumentritt M., Coutinho P.M., Cullen D., de Vries R.P., Gathman A., RA Goodell B., Henrissat B., Ihrmark K., Kauserud H., Kohler A., RA LaButti K., Lapidus A., Lavin J.L., Lee Y.-H., Lindquist E., Lilly W., RA Lucas S., Morin E., Murat C., Oguiza J.A., Park J., Pisabarro A.G., RA Riley R., Rosling A., Salamov A., Schmidt O., Schmutz J., Skrede I., RA Stenlid J., Wiebenga A., Xie X., Kuees U., Hibbett D.S., RA Hoffmeister D., Hoegberg N., Martin F., Grigoriev I.V., RA Watkinson S.C.; RT "The plant cell wall-decomposing machinery underlies the functional RT diversity of forest fungi."; RL Science 333:762-765(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL945481; EGN98082.1; -; Genomic_DNA. DR EnsemblFungi; EGN98082; EGN98082; SERLA73DRAFT_182953. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000008063; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008063}; KW Reference proteome {ECO:0000313|Proteomes:UP000008063}. SQ SEQUENCE 190 AA; 21025 MW; 2B28EE79B5C4BA11 CRC64; MSRDTVTRVA NLEVQIGSML NNQENMSEDL ASDRKGLAHL EETILKLELV SCEELLRRAY IQFSKDYVGK EDFALKSAGA RVVKSLTSSS SLCPSRSFWP FSTSPQCTHN PDIVLNEDLH AGSCWKVEET PSQLGIALAE PIVITDITID HIPQELTHEI GLAPKNIVVW GVLDGQDNIE KTICISFRTG // ID F8WIE5_MOUSE Unreviewed; 2610 AA. AC F8WIE5; DT 21-SEP-2011, integrated into UniProtKB/TrEMBL. DT 21-SEP-2011, sequence version 1. DT 11-NOV-2015, entry version 36. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|Ensembl:ENSMUSP00000046766}; GN Name=Hectd1 {ECO:0000313|Ensembl:ENSMUSP00000046766, GN ECO:0000313|MGI:MGI:2384768}; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090 {ECO:0000313|Ensembl:ENSMUSP00000046766, ECO:0000313|Proteomes:UP000000589}; RN [1] {ECO:0000213|PubMed:17242355} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=17242355; DOI=10.1073/pnas.0609836104; RA Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.; RT "Large-scale phosphorylation analysis of mouse liver."; RL Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007). RN [2] {ECO:0000313|Ensembl:ENSMUSP00000046766, ECO:0000313|Proteomes:UP000000589} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C57BL/6J {ECO:0000313|Ensembl:ENSMUSP00000046766, RC ECO:0000313|Proteomes:UP000000589}; RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112; RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., RA She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., RA Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., RA Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., RA Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., RA Lindblad-Toh K., Eichler E.E., Ponting C.P.; RT "Lineage-specific biology revealed by a finished genome assembly of RT the mouse."; RL PLoS Biol. 7:E1000112-E1000112(2009). RN [3] {ECO:0000213|PubMed:21183079} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001; RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R., RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.; RT "A tissue-specific atlas of mouse protein phosphorylation and RT expression."; RL Cell 143:1174-1189(2010). RN [4] {ECO:0000313|Ensembl:ENSMUSP00000046766} RP IDENTIFICATION. RC STRAIN=C57BL/6J {ECO:0000313|Ensembl:ENSMUSP00000046766}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMUSP00000046766}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC157213; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AC159644; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; NP_659037.2; NM_144788.2. DR RefSeq; XP_006515701.1; XM_006515638.2. DR UniGene; Mm.249391; -. DR ProteinModelPortal; F8WIE5; -. DR SMR; F8WIE5; 1266-1338, 1879-1966. DR STRING; 10090.ENSMUSP00000046766; -. DR MaxQB; F8WIE5; -. DR PaxDb; F8WIE5; -. DR PRIDE; F8WIE5; -. DR Ensembl; ENSMUST00000042052; ENSMUSP00000046766; ENSMUSG00000035247. DR GeneID; 207304; -. DR KEGG; mmu:207304; -. DR UCSC; uc007nmz.2; mouse. DR CTD; 25831; -. DR MGI; MGI:2384768; Hectd1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR KO; K12231; -. DR ChiTaRS; Hectd1; mouse. DR NextBio; 371924; -. DR Proteomes; UP000000589; Chromosome 12. DR Bgee; F8WIE5; -. DR ExpressionAtlas; F8WIE5; baseline and differential. DR Genevisible; F8WIE5; MM. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IDA:MGI. DR GO; GO:0001892; P:embryonic placenta development; IMP:MGI. DR GO; GO:0001779; P:natural killer cell differentiation; IMP:MGI. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IMP:MGI. DR GO; GO:0001843; P:neural tube closure; IMP:MGI. DR GO; GO:0051865; P:protein autoubiquitination; IMP:MGI. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IDA:MGI. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IMP:MGI. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IMP:MGI. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 1: Evidence at protein level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000589}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Proteomics identification {ECO:0000213|MaxQB:F8WIE5, KW ECO:0000213|PeptideAtlas:F8WIE5}; KW Reference proteome {ECO:0000313|Proteomes:UP000000589}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289229 MW; 502E46051EB4B42C CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSAAADS KLSNQVSTIV SLLSTLCRGS PLVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP RGDPEMAPLY LKRLLPVFAQ TFQHTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTTLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSALAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE GENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSIDL DMKQDCSQLV ERINVFKTAF SESEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET SLIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGVWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPAKDEKQGW RHVRLKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLA STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RAPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHVVTSQ LPQVPSGAGS RPVGEQEEEE YETKGGRRRA WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAISS LQSSDILNLT KEQPQAKAGN GQSPCGVEDV LQLLRILYIV ASDPYSRISQ EDGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GTWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILGNKSLS EDEKNTKLQE LVLRNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID F9F4F1_FUSOF Unreviewed; 845 AA. AC F9F4F1; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGU88138.1}; GN ORFNames=FOXB_01276 {ECO:0000313|EMBL:EGU88138.1}; OS Fusarium oxysporum (strain Fo5176) (Fusarium vascular wilt). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Nectriaceae; OC Fusarium; Fusarium oxysporum species complex. OX NCBI_TaxID=660025 {ECO:0000313|EMBL:EGU88138.1, ECO:0000313|Proteomes:UP000002489}; RN [1] {ECO:0000313|EMBL:EGU88138.1, ECO:0000313|Proteomes:UP000002489} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Fo5176 {ECO:0000313|EMBL:EGU88138.1, RC ECO:0000313|Proteomes:UP000002489}; RX PubMed=21942452; DOI=10.1094/MPMI-08-11-0212; RA Thatcher L.F., Gardiner D.M., Kazan K., Manners J.; RT "A highly conserved effector in Fusarium oxysporum is required for RT full virulence on Arabidopsis."; RL Mol. Plant Microbe Interact. 25:180-190(2012). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EGU88138.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AFQF01000478; EGU88138.1; -; Genomic_DNA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002489; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002489}; KW Reference proteome {ECO:0000313|Proteomes:UP000002489}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 25 {ECO:0000256|SAM:SignalP}. FT CHAIN 26 845 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003389011. FT COILED 654 681 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 845 AA; 91497 MW; 4A73F6DC77E2B02A CRC64; MPSLGPALLR SLALIAAWTQ AQTEAQSHEP NAIPDPIDLT CDARTINYIT HTLPQACLTS SWTSTSMSAT PSDANSTSNA SPDSAQSDAP PPNTQSESPS QPQPSASTAT PAKEESEDAA ADGDAKPFMS FEDWKAMMLK QTGQDPKDLH RNAKPRDRTP PDMGYGGLGE EDEISLNFGS YMDDTGEQKE RPSDLEQAKN EGSGKDGRVA IHRNKDAGKT CKERFSYSSF DAGATILKTS PGARNAKAIL VENKDSYLLF ECSAKEKWFI VELSDDVLID TVVLANFEFF SSMIQTFTVS VSDRYPVKRE EWKQIGVFQA ENSRAIQPFL VENAQIFSKY VRIDYLTHYG KQYYCPVSLL RIHGSRFFAA WNEGRDEDAN EAEEAEVTPQ ALPSGEESTK PQELESPPEP AKPSSMGLMP FCEVNTTSRL LFEPLFCSAS LNQTTQPISN HSSSVEVTSA VPTTKSPDER ARKGTSTQHT PRSEHPASEE PTASSSSTAV SPSATPAVSP TSPSSISSSN IESASNSTAS AASTKTTATP SSSSTPSTQK PSPANTVNAK KGSTGTASGS SASPTVQEGF FKSISKRLTI VETNLTLSLK YVEDQARHMS ETLHRTEQKQ LSKTTLFLEN LNKTVLAELR SVREQYDQIW QSTVLALESQ REQSNREIVA LSARLNLLAD EVVFQKRMAI VQAVLLMSCL ILVIFSRGVP LHHLAPFSDH AGLASYDGAS SAARVRAMHG SAYDGEDAVL LAARQRGQYT PTSRGDDGAT DLHRGPFADD IHHDRVECEQ LSPPPTPRSS GGFSSSSDLS PPSHDTQPNV LRRSIAQPTN SRKPLPALPE NPSSP // ID F9XID3_ZYMTI Unreviewed; 443 AA. AC F9XID3; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGP85475.1}; DE Flags: Fragment; GN ORFNames=MYCGRDRAFT_29926 {ECO:0000313|EMBL:EGP85475.1}; OS Zymoseptoria tritici (strain CBS 115943 / IPO323) (Speckled leaf OS blotch fungus) (Septoria tritici). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Dothideomycetidae; Capnodiales; Mycosphaerellaceae; OC Zymoseptoria. OX NCBI_TaxID=336722 {ECO:0000313|EMBL:EGP85475.1, ECO:0000313|Proteomes:UP000008062}; RN [1] {ECO:0000313|EMBL:EGP85475.1, ECO:0000313|Proteomes:UP000008062} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 115943 / IPO323 {ECO:0000313|Proteomes:UP000008062}; RX PubMed=21695235; DOI=10.1371/journal.pgen.1002070; RA Goodwin S.B., Ben M'barek S., Dhillon B., Wittenberg A.H.J., RA Crane C.F., Hane J.K., Foster A.J., Van der Lee T.A.J., Grimwood J., RA Aerts A., Antoniw J., Bailey A., Bluhm B., Bowler J., Bristow J., RA van der Burgt A., Canto-Canche B., Churchill A.C.L., Conde-Ferraez L., RA Cools H.J., Coutinho P.M., Csukai M., Dehal P., De Wit P., RA Donzelli B., van de Geest H.C., van Ham R.C.H.J., Hammond-Kosack K.E., RA Henrissat B., Kilian A., Kobayashi A.K., Koopmann E., Kourmpetis Y., RA Kuzniar A., Lindquist E., Lombard V., Maliepaard C., Martins N., RA Mehrabi R., Nap J.P.H., Ponomarenko A., Rudd J.J., Salamov A., RA Schmutz J., Schouten H.J., Shapiro H., Stergiopoulos I., RA Torriani S.F.F., Tu H., de Vries R.P., Waalwijk C., Ware S.B., RA Wiebenga A., Zwiers L.-H., Oliver R.P., Grigoriev I.V., Kema G.H.J.; RT "Finished genome of the fungal wheat pathogen Mycosphaerella RT graminicola reveals dispensome structure, chromosome plasticity, and RT stealth pathogenesis."; RL PLoS Genet. 7:E1002070-E1002070(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001203; EGP85475.1; -; Genomic_DNA. DR RefSeq; XP_003850499.1; XM_003850451.1. DR EnsemblFungi; Mycgr3T29926; Mycgr3P29926; Mycgr3G29926. DR GeneID; 13395152; -. DR KEGG; ztr:MYCGRDRAFT_29926; -. DR InParanoid; F9XID3; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008062; Chromosome 8. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008062}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008062}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 423 441 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 389 409 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EGP85475.1}. FT NON_TER 443 443 {ECO:0000313|EMBL:EGP85475.1}. SQ SEQUENCE 443 AA; 50101 MW; EB1A12D5D6FAC9B1 CRC64; ESPLDNANFL SFEEWKKQNL AKMGQSPEDL QRAQQINANR QRPGDPSVTD SVPASAAGQQ LAASPTLRSK DAGKTCKERT NYASFDCAAT ILKSNKECKS ASSVLVENKD SYMLNICSAD NKFFIVELCD DIQIDTVVLA NYEFFSSSFR HFKVSVSDRY PVKLEKWRDL GTFEARNTRE IQAFLVQNPL IWARYLRIEF LTHYGTEYYC PVSVLRVHGT TMWEDYRHQE ELARGEEDEL VLEAEVEAVP PVAQEPLAST VEDPKSTVHT TPPSVTKGPP TTQNNTSSMN TTRPAAAATS TTAPAPPQPS SQESFFKSFH KRLQQLESNS TLSLQYIEEQ SRILRDAFTK VEKRQLSTTT KFLSTLNSTV MAELQNYRLA YDQLWQSTVI ELEGQRESYQ DDMLALSRRL TMVADELVWQ KRMGIVQSTL LLLCLALVLF GRN // ID F9XJM3_ZYMTI Unreviewed; 700 AA. AC F9XJM3; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGP84456.1}; GN ORFNames=MYCGRDRAFT_95868 {ECO:0000313|EMBL:EGP84456.1}; OS Zymoseptoria tritici (strain CBS 115943 / IPO323) (Speckled leaf OS blotch fungus) (Septoria tritici). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Dothideomycetidae; Capnodiales; Mycosphaerellaceae; OC Zymoseptoria. OX NCBI_TaxID=336722 {ECO:0000313|EMBL:EGP84456.1, ECO:0000313|Proteomes:UP000008062}; RN [1] {ECO:0000313|EMBL:EGP84456.1, ECO:0000313|Proteomes:UP000008062} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 115943 / IPO323 {ECO:0000313|Proteomes:UP000008062}; RX PubMed=21695235; DOI=10.1371/journal.pgen.1002070; RA Goodwin S.B., Ben M'barek S., Dhillon B., Wittenberg A.H.J., RA Crane C.F., Hane J.K., Foster A.J., Van der Lee T.A.J., Grimwood J., RA Aerts A., Antoniw J., Bailey A., Bluhm B., Bowler J., Bristow J., RA van der Burgt A., Canto-Canche B., Churchill A.C.L., Conde-Ferraez L., RA Cools H.J., Coutinho P.M., Csukai M., Dehal P., De Wit P., RA Donzelli B., van de Geest H.C., van Ham R.C.H.J., Hammond-Kosack K.E., RA Henrissat B., Kilian A., Kobayashi A.K., Koopmann E., Kourmpetis Y., RA Kuzniar A., Lindquist E., Lombard V., Maliepaard C., Martins N., RA Mehrabi R., Nap J.P.H., Ponomarenko A., Rudd J.J., Salamov A., RA Schmutz J., Schouten H.J., Shapiro H., Stergiopoulos I., RA Torriani S.F.F., Tu H., de Vries R.P., Waalwijk C., Ware S.B., RA Wiebenga A., Zwiers L.-H., Oliver R.P., Grigoriev I.V., Kema G.H.J.; RT "Finished genome of the fungal wheat pathogen Mycosphaerella RT graminicola reveals dispensome structure, chromosome plasticity, and RT stealth pathogenesis."; RL PLoS Genet. 7:E1002070-E1002070(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001204; EGP84456.1; -; Genomic_DNA. DR RefSeq; XP_003849480.1; XM_003849432.1. DR EnsemblFungi; Mycgr3T95868; Mycgr3P95868; Mycgr3G95868. DR GeneID; 13402433; -. DR KEGG; ztr:MYCGRDRAFT_95868; -. DR InParanoid; F9XJM3; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008062; Chromosome 9. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008062}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008062}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 187 209 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 700 AA; 78335 MW; 004ABE4B3584B107 CRC64; MATPRVTRSR ASATPQPQAL EAAQHRPNFS YGTRGPAAPN AQVANDEANF DNVFQSSRAT PQPEPVAKPG KGKAKQQDSM EPESAVGPRP PTRFQNPGLY EDGEPGPPPR PPVPAFDAAP LRTAPRVQTP AAAPLRTAPR VQTPAAAPIA TAPVQPAGNV QPWSFTVFLH AVPRWVKEAT MFKNCMAFLF LATFLLGVLT AGLLTASFLP VGPLGPARDS LLGGFKRTLG LPYFDLPPNS TVVDTRLWGN SPALQEQINA QTYWNREFRN NILAQIEITG GLRRDLSLAN ETLSQLSQIL PSMMAVVERD GKLEIPEIFW DALLDKVGSD AAAPLWENFL RLNEDWLAKQ QNATLQGKLD TFQVDRKLID REEFTATLER NQAYIDSHLD EHLDAFRTKL LHDAQREVQR QATTILESSP AYKISRAQLS TLALTNLLMN INDAQRDINW LTRATDALID PRYTSKTQDS RIASAKTWYQ RLFIKLVPAA VPAYPPAMAL LGWDEAGQCW CTPTSKGETE AGQLGVLLGH TISPQKFYIE HVPARATRNI HSAPQEFELW AQMNSSAEVK RVHDIMVDDD YYYQYPPEKA FKNGGPCGKP PRGEDTWICI LRQKYDIHNH NHFMNWEVPF SPALNMTTNR IVVRAKTNWG APHTCFYRVR LTGVEVRDEP DPERPIPRME PKEIDNSYMM KTMAVDRRFI // ID G0MAC4_CAEBE Unreviewed; 1156 AA. AC G0MAC4; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=CBN-UNC-84 protein {ECO:0000313|EMBL:EGT40526.1}; GN Name=Cbn-unc-84 {ECO:0000313|EMBL:EGT40526.1}; GN ORFNames=CAEBREN_15154 {ECO:0000313|EMBL:EGT40526.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379788; EGT40526.1; -; Genomic_DNA. DR STRING; 135651.CBN15154; -. DR EnsemblMetazoa; CBN15154; CBN15154; CBN15154. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0MAC4; -. DR OMA; WKSEFAS; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 120 140 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 410 430 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 531 548 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 1156 AA; 130088 MW; 338D4A7E441955CA CRC64; MPPTNDDFDS HEWKSEFAST RSGRNSPNIF AKVRRKLLLT PPVRNARSPR LTDEELDALT GDLPYATNYT YAYSKIYDPS LPDHWEVPNL AGGATSRTLA EQEHYSAASL SRQLLYILRF PLYLIVHVIT YILEAFYHLI KLSTFTLWDY ILYIIKIAKI RYATYQDYRR RTALIRNRQD PFSVKAANFF RRFFEFLFYI VSTPYRLITM VSSNKNGVDQ YEYKSIKDQL ENERASRVTT RSQALGLSRR FQGLSQSPAR HSTPAVSKTT TNTITRITTR VFSNSLLGAA PSGNNTVENH TVETTPVTVT TRTVRERSVT PRFRTTRNSK AATKRGSIDF DSPEVEVDTP LSTYGLRSRN RISNLNTPEP TFDIGNAMAT STPLSPRDVF IPGYVHDGGK SSTVQTVVTW IGYLVLFPFF AARHIWYTIF DYGKSAYMKF SNYQPTAMEA IHVRDAGEPA PIYDYSTGML YMPWTTRVSN FFGSFFSAIK ESHQIVFAML YGVIQDTASY AAGLFRGLTD KKSSRFSMCQ LLGLLLALLL AFFLFGFLTS DNTAIRKELQ KDSNASKSPD NALPSVPFYV SAGNKVKHYI WMGKEYVVDL VFNGYNYAKP MLGRTITAPK YAWDLLASGC GAAGNLFGGI LAPIRNSTSN AWYFITGGLT DAGKSIGSSV TGAGKAVSGS VTGAGKAITG SVTGAGKSIY GLVTGAGKNV KNGLTTVVDT TKIAGYYIAT ESYNYLTSYF GNFLESTQSF FGYIYSSFKG FAWGIYDFIY NNLFVGPIRF FMNNYPGVLQ LGWNGARWVY DTFVLTVKTI VDWAVFLVTY PVGLATRAWV HISQYAPDDV VQVIPIPQAI TPTPDVEPII EEPEDAVDTL IKRNELGDEE LVIIPAPAPE PIPVPVPEHE PVDIHHTNVV ETIDKDAIIR EVTEKLRAEF QQEMSAQFEQ NYNSIIQKLK VENTKIQYDN NQLEAIIRQM IYEYDTDKTG KVDYALESGG GAVVTTRCSE TYKSYTRLTK FWDIPMYFWD ESPRIVIQRN SKSLFPGECW CFKEGRGYIA VQLSHYIDVA SVSYEHIGKE VAPEGDRTSA PKGVLVWAYK QIDDINSRVL LGDYTYDLEG PPLQFFIAKH KPDFPVKFVE LEITSNYGAP YTCLYRIRVH GKMIRM // ID G0MG29_CAEBE Unreviewed; 211 AA. AC G0MG29; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT56503.1}; GN ORFNames=CAEBREN_12503 {ECO:0000313|EMBL:EGT56503.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379793; EGT56503.1; -; Genomic_DNA. DR EnsemblMetazoa; CBN12503; CBN12503; CBN12503. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}. FT COILED 1 30 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 211 AA; 23873 MW; 1C724FF03948170F CRC64; MKEETEKVNE NESEKAKATI EEDLKDVRRE SVVKLEQSQS IMNITTNFKK TYIPPAIPTV STYSTVKQYN AASLIARSTI DKSLSSSSTL SSFIDQSSLV LVDRPEPPAN KAWCTSDKNP VLTVTLGYYI DPTAVSYQHS KWNGTIPDDA PKEYSVEILS IQSQLDSIQK PIEKDFKNMK EDAEETIENK FEKLKSTPKE DLKEVRMKPD E // ID G0MG31_CAEBE Unreviewed; 446 AA. AC G0MG31; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT56528.1}; GN ORFNames=CAEBREN_15508 {ECO:0000313|EMBL:EGT56528.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379793; EGT56528.1; -; Genomic_DNA. DR EnsemblMetazoa; CBN15508; CBN15508; CBN15508. DR InParanoid; G0MG31; -. DR OMA; ENEMERT; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 427 445 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 222 257 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 446 AA; 49988 MW; D17A192516F91DBB CRC64; MNKEMEKEKG NESKETKAMP EGFLNEVRMP SITITNNKSK TEETHVPPII PITSNSTLKR FNAASFIAGA TIDISLSSSS SLSKTWFLDF DQSSLVLVDR PEPPANKAWC TSDKNPVLTV NLGSYIDPTA VSYQHSKWNG TIPDDAPREY SVEVSLQSQL NSIQNHIGKD FLNTKDEAEK GNENETEKTK ATLEEGFKET RMKLDILSIH SQLNSFQKES NGKDFMNTKE EAEEENENEM ERTKATLEDV LKEIRMESRV ELKAIQSMIN STSHSEEAHA HVQQKVQAVV TNFTKNQYNA ANLIAGAAID ASLSSSSSLN PFIGFDQSSL VLVDRPEPPV DKAWCTSDKN PVLTVNLGYY IKPTAVSYQH SKWNRILPDD APKMYSVEFT YKTCSLLYSK GCCQECPECC EECDIKDINH DVLNEKVMVP IVYVAVMLFG CIVFFL // ID G0N268_CAEBE Unreviewed; 352 AA. AC G0N268; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT50589.1}; GN ORFNames=CAEBREN_02338 {ECO:0000313|EMBL:EGT50589.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379829; EGT50589.1; -; Genomic_DNA. DR STRING; 135651.CBN02338; -. DR EnsemblMetazoa; CBN02338; CBN02338; CBN02338. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0N268; -. DR OMA; NNTATEC; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 50 69 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 324 343 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 75 102 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 352 AA; 40344 MW; 35917613B66284F0 CRC64; MEIKVVDTVT NWEDSSSTDF EFIEKKDVVV EPKNKYEIWL EWLKYRIKNY SQLDVVLAVF MIFIVINSYK TSSDNERLFK MISNIDSRIT NLENQMEMIL NNTATECNLK RKNNIYSYFE DLFGEQHEEK PANPDVLKQE EPIGTVLVFR AAVGLFGSID NTPTRTNSLY GSSLVLVDHK WPPVDRQWCT TVQYPMLTAN LAKSITPTSI SYQHSKWGGK VPDGAPRKYT VRGCLDVDCD KTILLTDVCE YKSTGNSQQE QLCPVISSAT KTPIVKIQFK IIGNHGSQAE TCINYVRVYG KLAKEKEETT RKGKQPHQLS KKEFCVVLVS FLLMVLWVCA QIIKEEKNGK RH // ID G0NFX6_CAEBE Unreviewed; 350 AA. AC G0NFX6; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT59816.1}; GN ORFNames=CAEBREN_00461 {ECO:0000313|EMBL:EGT59816.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379878; EGT59816.1; -; Genomic_DNA. DR STRING; 135651.CBN00461; -. DR EnsemblMetazoa; CBN00461; CBN00461; CBN00461. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0NFX6; -. DR OMA; RESEKIC; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}. SQ SEQUENCE 350 AA; 41010 MW; 35F9D92BF5859D68 CRC64; MLMLTAGHRG THQYYTEQDI DCTGCGEGFS EWCKRLYYRI RHYFVLEIVT SVLLVVLLMN SRNNWIRSEE TYELISSLRY EIRLLEKQLE SCNCSVHEDS TEDISFDFKT TKPPITQSQN GHQDHRTSPI ENNQKLETEK INQHHENLRT VSSIPLVNAA DYISGARISW LLFSSEPNQM ILDRPNPPKN AEWCTETEKP NVTISLSRFI VPVYVSYQHS KWNHIVPNEA PRKYNVVACY DTRCNSWTPL ATDCEYNSRN EEQEQTCMID PTLNEMPIET VQFQFHENHG RNKETCISLL RVYGEPKDKK KNLGELRESE KICEDLAWSY KNWPVLYTMI IFFMRSGTVL // ID G0NFX7_CAEBE Unreviewed; 144 AA. AC G0NFX7; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 01-APR-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT59843.1}; GN ORFNames=CAEBREN_08397 {ECO:0000313|EMBL:EGT59843.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379878; EGT59843.1; -; Genomic_DNA. DR EnsemblMetazoa; CBN08397; CBN08397; CBN08397. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}. SQ SEQUENCE 144 AA; 16449 MW; 0FA59572B70FFE84 CRC64; MTTLQNQMDY YEKQNGKSLS PESHNQLESS TVVSNLFNCS DLSERSTSCP LEIINAASYM RGARVKNNDT GHSDQYVIFE RPNPPKNSVW CSEEKSPNLT IRLTEFIHPL YVSYQHSKWY GMVPNGAPRV FDLVNVVNNV KSRM // ID G0NFX8_CAEBE Unreviewed; 415 AA. AC G0NFX8; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT59909.1}; GN ORFNames=CAEBREN_13744 {ECO:0000313|EMBL:EGT59909.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379878; EGT59909.1; -; Genomic_DNA. DR STRING; 135651.CBN13744; -. DR EnsemblMetazoa; CBN13744; CBN13744; CBN13744. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0NFX8; -. DR OMA; WETIAEN; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 51 71 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 379 401 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 75 95 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 415 AA; 46737 MW; DE23925CAAA3576D CRC64; MAQLPPLRDA RSPRRIDATQ HFQCKERFTI IPNQPMKPKP WFTYPHPTNQ VIMEIVTAFV LILLLFNTGL LSSQVSKTNE LLAKVQLQVE KLENARGFSG HYAEPEPTTT TTEATPAPKI EHVKPPQSMC PTISIPETNS SGQPEIFNAA NYFLGASVDS KYSSESDNFA YGRDQSGYVI LDRKDPPPNR AWCSDETEPL LIINLAKTIQ PVAVSYQHSQ WNGTVPVGAP KVYDVEGCLD QNCLQWETIA ENCTYQSLES EQQEQVCAIP VNATRQYGYE KVQFRFRENH GDVVKTCAFL VRVYGEAGKS LVTKGDMEES EKICKDLTTA YHDSPFVYAN LKSKSCSVLY KNKCCSECPE CCEKCEIEDV NSKWIAPHII AVVVFLIIFM LFVFFCCFAN SKDDGSNDFE SFGKY // ID G0NFX9_CAEBE Unreviewed; 325 AA. AC G0NFX9; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT59797.1}; GN ORFNames=CAEBREN_08822 {ECO:0000313|EMBL:EGT59797.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379878; EGT59797.1; -; Genomic_DNA. DR STRING; 135651.CBN08822; -. DR EnsemblMetazoa; CBN08822; CBN08822; CBN08822. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0NFX9; -. DR OMA; ECCEQCE; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 51 70 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 282 312 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 76 96 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 325 AA; 36997 MW; C3C378DFEEBFF9CD CRC64; MKIEVRPIKK DADAEKSISP FALNEDNYNV DLDKDVFPAN RVLFTIQWRH YYIFETMVFV MLSFLVFNSF QLVSKVDKSD ELLLKLQIQI NKLEKRLGSD NVSDFSELEV TETVTQPILT TPVVTTTTTE PPKIKPIVLP TLICNSNYTS TIDAIEHFNA ANYFLGARVV SSLSSSSDNN PFFGRDQSEY VLLDRKEPPP NKAWCSDVKQ PNLTVKLTKN IEPFAVSYQH TKWNGTVPNG APKVYDLVKN KSCITLHKSN CCKDCPECCE QCEIRDMNTN EILLNITVSI LLIFLAFLVL IAIFAMVFSV YIHKKQVRSQ LQHAQ // ID G0NFY0_CAEBE Unreviewed; 584 AA. AC G0NFY0; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT59864.1}; GN ORFNames=CAEBREN_08870 {ECO:0000313|EMBL:EGT59864.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379878; EGT59864.1; -; Genomic_DNA. DR STRING; 135651.CBN08870; -. DR EnsemblMetazoa; CBN08870; CBN08870; CBN08870. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0NFY0; -. DR OMA; HFNAANI; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 2. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}. SQ SEQUENCE 584 AA; 67690 MW; 3EF5F6AC6407D796 CRC64; MVYQIKNELQ RNSKKQFIQD DGKTTEPPDT EIEKVVEEIA TTTTTTNQPA VEATVPVFPS QKPLKPLREH ENTFNRSIPH FNAANILLGA SVDERYSSGK IIWFQSNNWD YVILDRPESR PDKAWCTNNT LPILTINLFK YIRPVAVSYK HLKWNGKVPN GAPRVYDVMT CLDQDERRSP CKETTPLVNN CKYLSSGEQE QICLVPHNPN LLPTKKIQIQ FLSNHGDSKM TCVSQVRVYG EGYLKKRKPL EGHKERCESL IWHRKNYPFV YNNLITLLHY QIQSIETMVY QIKNELQINS KDKVIKNNEK TTEPPDTGIE KVVEETTTTT TTYQPFVEST VPAFPPQKPL KSLRVYEENK KTFNRSIPHF NAANILLGAS IDNHYSTGTA NRFYSNKWDY VILDRPELQQ DKAWCTNYTL PVLTINLFKY IRPVAVSYKH LKWNGRVPNG APRVYDVMTC LDQEKIGSPC EEISPLVNNC KYLSSGEQEQ ICLVPHNPNL LPTKKIQIQF QRNHGDSEMT CVSQVRVYGE GYLKKKKPLE GHKERCESLI WYRKNYPFIY NNLTLWMFSD VIGAEERKET QRKE // ID G0NFY1_CAEBE Unreviewed; 318 AA. AC G0NFY1; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT59795.1}; GN ORFNames=CAEBREN_12797 {ECO:0000313|EMBL:EGT59795.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379878; EGT59795.1; -; Genomic_DNA. DR EnsemblMetazoa; CBN12797; CBN12797; CBN12797. DR InParanoid; G0NFY1; -. DR OMA; CKECHIS; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 55 74 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 288 309 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 80 100 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 318 AA; 36557 MW; 690867AE0D6F960E CRC64; MHKNCKVVYE NDCCTDCPEC CRECETKDIN ISVILDVLKY SVIGYKWLKY RIRQCMLLEV IILIILLLLL HNSYKTTSQN ETILELISHL QTRIDKLEKM QSFSNVPTNK SDEEMRKSIG SSEESTTKVI EEIKTTTVPT TTEAPVKISP DAMELYKPPE PFNPSLPRFN AASYLSGARV SQNLSSHQKS YLDQSHYVIL EREPVQNKAW CTDSKDSVLT IVLAEKIRPV AVSYQHFKWN RIVPSGAPRA YDVKLIKNCK RLYENDCCAE CPECCKECHI SDWNYIEILK IIGITFFIGF WVILICIGIN ADKLKRGR // ID G0NFY2_CAEBE Unreviewed; 238 AA. AC G0NFY2; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT59832.1}; GN ORFNames=CAEBREN_05410 {ECO:0000313|EMBL:EGT59832.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379878; EGT59832.1; -; Genomic_DNA. DR EnsemblMetazoa; CBN05410; CBN05410; CBN05410. DR InParanoid; G0NFY2; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 42 61 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 206 226 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 238 AA; 28217 MW; D4B44243C422BD40 CRC64; MTVPEDKLKK TEADYSEMQW GTTSPPETKL WFRWMKHRIR QFMILEAVFF VILILLLNNS YKIISQNETN FEWASFSGRP TYELPLALNC KYDSSGPQEQ KCLVSYPDYL SDVYTIVFRF RKNHGNSKVT CAYLFRVFGK GMDPNEPIKA SKETCSSLIW YRKNYPFIYN NILEKNCKVW YNNDCCDECP ECCTECETQD LNYKTLLVSM AVFGTIFFFI HICYIVEKDK RKREGYIQ // ID G0NFY3_CAEBE Unreviewed; 376 AA. AC G0NFY3; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT59848.1}; GN ORFNames=CAEBREN_18314 {ECO:0000313|EMBL:EGT59848.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379878; EGT59848.1; -; Genomic_DNA. DR STRING; 135651.CBN18314; -. DR EnsemblMetazoa; CBN18314; CBN18314; CBN18314. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0NFY3; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}. SQ SEQUENCE 376 AA; 43146 MW; BD6FEA941F4E4A2F CRC64; MEDTSKIKYQ NERNFAMMNQ LKSQYRNIES QMTSILRVSR ATLAYPELSK ELKKKVFRGE RFWVMKEQNR TEETIEKICM SLKPSTTTKQ PRTEQLTTTT TIPTTTIPVK PATQVVPTST ITPKLPKHAD IKVPHFNAAS YLSGASVLFG LSSRPVNSSE FSFDQSSYAI LEREPVPNRA WCTNSEKPYL TISLSKYVRP VAVSYQHLKW NGTIPDGAPM EYDVVACYGK SCKVEVPLAL NCKYKSSGPQ EQKCLVPLSG NYVPEVDWIR FNFRKTYGNS KTICAYLIRV FAEGTVKPMK PSKETCSSLI WHRKNSPFFY NYILEKNCKI WYNNECCADC PECCTECVPK DVNYKLLLPI GNVYFIVHRY YARFLS // ID G0NG44_CAEBE Unreviewed; 471 AA. AC G0NG44; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=CBN-SUN-1 protein {ECO:0000313|EMBL:EGT59912.1}; GN Name=Cbn-sun-1 {ECO:0000313|EMBL:EGT59912.1}; GN ORFNames=CAEBREN_25731 {ECO:0000313|EMBL:EGT59912.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379878; EGT59912.1; -; Genomic_DNA. DR STRING; 135651.CBN25731; -. DR EnsemblMetazoa; CBN25731; CBN25731; CBN25731. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0NG44; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}. FT COILED 171 191 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 471 AA; 54011 MW; 835B4784F61A0654 CRC64; MALRHTISPQ LSNRHSPPVT RSVSRNGRPH LFEATSTPIT RKSLQPGQIH ISQDIERVFE SADDTENELN TSKFIYREHF TAKEMTSMKK EMWYDWVEYQ VRMIRRHFVP SADNILKILF ALAFISMIIR YSYDCLYPYQ PQIIHGATLD DQWKTEIKRT DEAISHLQTH FENMVDTHQN LANRIASLEE KLRNSEGFKE SMIDELKEIK AWNTYISDSI ISLKTELEER KSAKIIPTEE EKPIFESIPS ATSSLQILPH DIHANRFPSG VNVANSLIGA SIDNSCSSRS VSAKDGIFYD LLSYFGSFQE GYVLLDRELL SPGEAWCTYD KRATLTVKLA RFIKPTAVSY QHVRWNRIVP NHSPKLYDLV ACLDPCCTKS EPIVSDCEYR ASEDGHDEQE QFCIVPVNPD ATPIDRVQFR FRENHGNMEK TCAYLVRVYG EPTTPPKEKV PENGTSSHFE STVIDSMSDT V // ID G0NJR4_CAEBE Unreviewed; 815 AA. AC G0NJR4; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT32551.1}; GN ORFNames=CAEBREN_09951 {ECO:0000313|EMBL:EGT32551.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL379896; EGT32551.1; -; Genomic_DNA. DR STRING; 135651.CBN09951; -. DR EnsemblMetazoa; CBN09951; CBN09951; CBN09951. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; G0NJR4; -. DR OMA; ERCEETQ; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 815 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003405412. FT TRANSMEM 659 681 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 535 555 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 815 AA; 92143 MW; C1C1594B4A604D62 CRC64; MKLMQLLILA VLVLPKTQAN QEISFVKHWR DIFLLSSIET SSTCALSVEE CARSVPYNVT KKIMKTSGNE SEKETVPEKS IESFDEWTKK RRDAAVANQN GQTQKIIDPI PGTILRQEDV VVPLPPIPRP ARNFASRECG AKIIAANPEA ENAKAVVNEK DVDDYMRNPC QSAREKFIVI ELCEAIQIKK LAIGNFELFA SRPKTIQVFI SERYPPLSNW ISLGSFHLQD HHKNLQTFDV PSTSVYAKYV RINLEDHYGK EHYCIVSVVN VMGSTLADEY DKEEAAAQLM NVIEEKNDEP VTTLPPLEQN MPTQLPKPPK SPNLSLLPKD IFDFRHLKSS CSQCSVGKVS YLLCHILPRS SRPNKINSTP KPLNIKPSVS ENPSLKTELG IWAERSRLAN FEQSRRRNLA TIQRLMPQGN AKNLDKTEVH QANPSIVPIQ KDESPEKPTE KSFTMEDTRT PIQAPVQPVT SEKTSPPPKA KSEPILPAGG STNQREMVLM KLSKRIAAVE MNLTLSTEYL SELSKQYVTQ MSGYQQELKE TRKSAKKSAQ TAEAVMRSKM STVRRELRDL RQSIYLLQQL ENNRYKNVQN EMSRNIFMSS CHISSNVPPS PTLARLPLII PAINRRLENI TNFEKKIKKI YETAKSVMFG SITWNTDHLI VALISFNILA LSFLFAGVFY IHRRNKERCE ETKLTVRNEL RARIAKVGAE NRKLISKGMR RAELAVTAAV SSALKIEKSS NNRKAMTELE TALANLFAAQ QTRIEEQFEQ NQQILRDALA EGRRSRADDT LSAEDSESSS ETEHSKEDTP IFEQD // ID G0P266_CAEBE Unreviewed; 471 AA. AC G0P266; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT42887.1}; GN ORFNames=CAEBREN_18299 {ECO:0000313|EMBL:EGT42887.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL380021; EGT42887.1; -; Genomic_DNA. DR STRING; 135651.CBN18299; -. DR EnsemblMetazoa; CBN18299; CBN18299; CBN18299. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0P266; -. DR OMA; VPNHAPK; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}. FT COILED 171 191 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 471 AA; 54081 MW; 1F0C3FF1DC93A380 CRC64; MALRHTISPQ LSNRHSPPVT RSVSRNGRPH LYEATSTPIT RKSLQPGQIH ISQDIERVFE SADDTENELN TSKFIYREHF TAKEMTSMKK EMWYDWVEYQ VRMIRRHFVP SADNILKILF ALAFISMIIR YSYDCLYPYQ PQIIHGATLD DQWKTEIKRT DEAISHLQTH FENMVDTHQN LANRIASLEE KLRNNEGFKE SMIDELKEIK AWNTYISDSI NSLKTELEER KSAKIIPTEE EKPIFESIPS ATSSLQILPH DIHTNRFPSG VNVANSLIGA SIDNSCSSRS VSAKDGIFYD LLSYFGSFQE GYVLLDRELL SPGEAWCTYD KRATLTVKLA RFIKPIAVSY QHVRWNRIVP NHAPKLYDLV ACLDPCCTKS EPIVSDCEYR ASEDGHDEQE QFCIVPVNPD ATPIDRVQFR FRENHGNMEK TCAYLVRVYG EPTTPPKEKV PENGTSSHFE STVIDSMSDT V // ID G0P282_CAEBE Unreviewed; 384 AA. AC G0P282; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT42936.1}; GN ORFNames=CAEBREN_06472 {ECO:0000313|EMBL:EGT42936.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL380021; EGT42936.1; -; Genomic_DNA. DR STRING; 135651.CBN06472; -. DR EnsemblMetazoa; CBN06472; CBN06472; CBN06472. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0P282; -. DR OMA; TSIGRDS; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 354 373 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 384 AA; 43599 MW; A2E29AC1236EEA9B CRC64; MNEDRQLHRD IESAYGSDVS SESNSLRKRE SFSVQEPAVE KHITTLQAQF EKFQKQVSGK DIINSKNQSE TKSQNEGVKK MEDVLNELLK EVEVKPVELV ESTPNPTIPL QSPVSLVSKP QVNQLNTSFT RINVASYMLG AVLDKTRSSS SNLNPIFGWD QSDLVLLDRP DPPANRAWCT YEKNPVLTVN LAKYVNITAI SYQHSKWNGT IPDDAPKIYD VVACLDYYCD TWKTIAKNCE YEPDGIGQEQ LCNLSLNSIN STIGKVQFRF LRNHGNIEKT CVGLVRVYDD PPKEETSEKS RICSNLKKSY HNNTFVYKYL NLKSCDIVYA EGCCTECPEC CQECWIKDFD GNEFVFFIFI TATFVPLLII IICKHFNPHA FPVA // ID G0P5M0_CAEBE Unreviewed; 494 AA. AC G0P5M0; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT45510.1}; GN ORFNames=CAEBREN_04819 {ECO:0000313|EMBL:EGT45510.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL380085; EGT45510.1; -; Genomic_DNA. DR STRING; 135651.CBN04819; -. DR EnsemblMetazoa; CBN04819; CBN04819; CBN04819. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0P5M0; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 69 89 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 455 480 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 494 AA; 56560 MW; C6A94D93A13AA8B8 CRC64; MKIKLTEKDL EKSLEEKSEG KIKEKTEEKT EEKTALVSTE IENSLQTFVL AKENTTRNFC LFLIWRHGLL FALLGIFNFI ILGAIHIHVV HQNRDLKTEI NNLKQLYSTE DFDFVKRMTR IKNEIPEYTS VVLETKRMDS NGMYSIDRIV FKAEEYSPEN NKSVLGTVAV VYDKDPMDVQ LDPSSVIQCS MNSANNVTFA FNAADAIKGA FVVQGQSSKT VSVGDDFEDQ VSNFFGDVDN GFVVLDRKEI PVDKAWCSNE KNPLLTIRLS EYIKPTAVSY QHAHWSNVVP NGAPKLYDVV ACLNAVCNKT VPLVTDCEYK SGPGPCFQQQ FCKVSSDKKL PPTNKVQIHF RENHGNATKT CAYRIRVHGK PVIYIPEKSE VDQDEEVLAE FHKRKQIPRN SRFTNCSDLA WYHNNLPILY NARRHTCPRL YSEECCDECP RCCKDCRMEP GIASYIQLAV VVLVVLFIPI FIMFVLYQIY SILDKCYMKN NHYN // ID G0PBS1_CAEBE Unreviewed; 199 AA. AC G0PBS1; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT50647.1}; GN ORFNames=CAEBREN_05069 {ECO:0000313|EMBL:EGT50647.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL380218; EGT50647.1; -; Genomic_DNA. DR STRING; 135651.CBN05069; -. DR EnsemblMetazoa; CBN05069; CBN05069; CBN05069. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}. FT COILED 1 21 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 199 AA; 22426 MW; 8AC52C7FC93509A5 CRC64; MSNLESRMTN LEKRVELLSN GQFNRQNHQT PLEPLMETRE PPKPVQIQPQ SIQSIFDSGK RAQKLEPASE SSSVSLFNAA NFLVGASVDS SRSSSSNLSP FLTWDQTGLV LLDRPEPPVD KAYCTSEKNP VLTVNLAKYV KPISVSYQHS KWYGIIPSGA PKRYSVWACL DYYCEDMQPL VSNCEYKVTS DNQQEQMFP // ID G0PCF1_CAEBE Unreviewed; 493 AA. AC G0PCF1; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT51086.1}; GN ORFNames=CAEBREN_25235 {ECO:0000313|EMBL:EGT51086.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL380240; EGT51086.1; -; Genomic_DNA. DR STRING; 135651.CBN25235; -. DR EnsemblMetazoa; CBN25235; CBN25235; CBN25235. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G0PCF1; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 68 88 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 454 479 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 493 AA; 56435 MW; DE6C0E0001E0F560 CRC64; MKIKLTEKDL EKSLEEKSEE KIGEKIVEKT EETTALVPSK TESLQTFVLA KENTTRNFCL FLIWRHGLLF ALLGIFNFII LGAIHIHVVH QNRDLKTEIN SLKQLYSTED FDFVKRMTRI KNEFPEYTSV VLETKRMDLN GMYSIDRIVF KAEEYSTENN KAVLGTVAVV YDNDPMDVKL DPSSVIQCSM NSANNVTFAF NAADAIKGAF VVQGQSSKTV SVGDDFEDQV SNFFGDVDNG FVVLDRKEIP LDKAWCSNEK NPVLTIRLSE YIKPTAVSYQ HAHWSNVVPN GAPKLYDVVA CLNAACNKTV PLVTNCEYKS GPRPCFQQQF CKVTSDKKLP PTNKVQIRFR ENHGNASKTC AYRIRVHGKP VIYIPEKSEV DEDEEVLAEF HKRKQIPRNS RYTNCSDLAW YHNNLPILYN ARRHTCPRLY SEECCDECPG CCKHCRMEPG IASYIQLAVV VLVVLFIPIF IMFVLYQIYS VLDKCYMRNN HYN // ID G0PDS4_CAEBE Unreviewed; 815 AA. AC G0PDS4; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT52387.1}; GN ORFNames=CAEBREN_18173 {ECO:0000313|EMBL:EGT52387.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL380289; EGT52387.1; -; Genomic_DNA. DR STRING; 135651.CBN18173; -. DR EnsemblMetazoa; CBN18173; CBN18173; CBN18173. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; G0PDS4; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 815 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003406178. FT TRANSMEM 659 681 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 535 555 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 815 AA; 92138 MW; C4E93515AE9D182B CRC64; MKLTQLLILA VLVLPKTQAN QEISFVKHWR DIFLLSSIET SSTCALSVEE CARSVPYNVT KKIMKTSGNE SEKETVPEKS IESFDEWTKK RRDAAVANQN GQTQKIIDPI PGTILRQEDV VVPLPPIPRP ARNFASRECG AKIIAANPEA ENAKAVVNEK DVDDYMRNPC QSAREKFIVI ELCEAIQIKK LAIGNFELFA SRPKTIQVFI SERYPPLSNW ISLGSFHLQD HHKNLQTFDV PSTSVYAKYV RINLEDHYGK EHYCIVSVVN VMGSTLADEY DKEEAAAQLM NVIEEKNDEP VTTLPPLEQN MPTQLPKPPK SPNLSLLPKD IFDFRHLKSS CSQCSVGKVS YLLCHILPRS SRPNKINSTP KPLNIKPSVS ENPSLKTELG IWAERSRLAN FEQSRRRNLA TIQRLMPQGN AKNLDKTEVH QANPSILPIQ KDESPEKPTE KSFTTEDTRT PIQAPVQPLT SEKTSPPPKA KSEPILPAGG STNQREMVLM KLSKRIAAVE MNLTLSTEYL SELSKQYVTQ MSGYQQELKE TRKSAKKSAQ TAEAVMRSKM STVRRELRDL RQSIYLLQQL ENNRYKNVQN EMSRNIFMSS CHISSNVPPS PTLARLPLII PAINRRLENI TNFEKKIKKI YETVKSVMFG SITWNTDHLI VALISFNILA LSFLFAGVFY IHRRNKERCE ETKLTVRNEL RARIAKVGAE NRKLISKGMR RAELAVTAAV SSALKIEKSS NNRKAMTELE TALANLFAAQ QTRIEEQFEQ NQQILRNALA EGRRSRADDT LSAEDSESSS ETEHSKEDTP IFEQD // ID G0PMR9_CAEBE Unreviewed; 1239 AA. AC G0PMR9; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGT38408.1}; DE Flags: Fragment; GN ORFNames=CAEBREN_30160 {ECO:0000313|EMBL:EGT38408.1}; OS Caenorhabditis brenneri (Nematode worm). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=135651 {ECO:0000313|Proteomes:UP000008068}; RN [1] {ECO:0000313|Proteomes:UP000008068} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PB2801 {ECO:0000313|Proteomes:UP000008068}; RG Caenorhabditis brenneri Sequencing and Analysis Consortium; RA Wilson R.K.; RL Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL381492; EGT38408.1; -; Genomic_DNA. DR STRING; 135651.CBN30160; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; ENOG411132C; LUCA. DR InParanoid; G0PMR9; -. DR Proteomes; UP000008068; Unassembled WGS sequence. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008068}; KW Reference proteome {ECO:0000313|Proteomes:UP000008068}. FT COILED 474 494 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EGT38408.1}. FT NON_TER 1239 1239 {ECO:0000313|EMBL:EGT38408.1}. SQ SEQUENCE 1239 AA; 136764 MW; 58A654533402D0D9 CRC64; IRNLCISFIS KPFSEEEQKV KAAATQQPGT STKQELPNPN LVRKVLHQLL PIFCEIFQRS LNGVVRRTSL SLMRKIVENI GDLRQSAVSD EGVTPNSARK MSADVSNAAE SLVAVVVSVL DQEDDHEGHE QVLLILQSLL EKDADLWVTE LIRLGVFERV EAMAQEPPKG LEEVLKAIHL DGRSRVAPME IDFEHQQQPS SSAAGNEIMD TTPVADVPEG EGSSSAAEVA EPETSTPSSS SQQSIPKPKT TASSSASSAI LQVVSKLSGV ASLDKSAAAA AADKKPSKIV LNQGTPYRWK EWRIVRGPSS LFIWSDVLLI ELPFQSNGWF RYLADNDFHV QFVTGTANVD QQMTDEEKVE VPAWEMWSAK SSELQIKSIS SSNPSGQANT MVTTIKVQDD AGGFMFETGT GRKTNVMPEY ALPIDFHTGW SAHGVTTRKI KFRQDIQKRK VQELAWKLWN DHLREAHAKP REALLKLEEA ARTIEVTLRK LLNAKPRVSK QPVINKLHKY MDAMETVYTS VIDDRRLSTF EFSVSGIVPV IFALLSSMDK YPDVYRNVFI EKFSKGDALS KLALKMVAVL EASEKFPQYL YDSPGGSSFG LQLLSRRVRT KLEMIPRNDG KDHPDEQLVD KTGKIVKCEP LASIGSIRTY LMKMVARQWH DRDRSKFRYV KEIQELKQKG EAVVLRYVSD FDENGVIYWI GTNGRTAPSW SNPSSVKAVK ITCSDARQPF GKPDDLLSRD QNPINCHTSD DKNAHFTIDL GLFVVPTSYT LRHSRGYGRS ALRNWILQGS LDAKKWENVI VHVDDKSLGE PGSTASWHVA EKGTNAYRYY RIAQNGKNSS GQTHYLSCSG FEIYGDIVDV VAEAICEEAP KKESVPGTSS AGSSSSAAPL TKEQVIEMLP AREHNNKLKS GITLDSLVAM MQRSRNRIRG TYKISESKSK VVRGKDWRWE EQDGGEGKFV RIISPPENGW VDVTWDNGYS NSYRFGASGH FDIERVTSSG HRYSTPPFSS NVPSSVSFVH YDLNKHKIHY STGYGRGPKK SCILNRQNLW SIILNVRRSN FIGASTLSRF SSVKNTTPSG TSSSGGSSGG AIGKKSMSTT NLVDERQKAS GPSVASTGQA ASAESLQHQT PSLENLLARA MPQTFGRIAE NQEQEDEPMG GEESDSAASM RSAASSTSQV STGSSQQQQQ QQQDPDLTPR DSAGTPSTPR DDKNQALSVS APDLAAARQR QASAEAERM // ID G0RWV3_HYPJQ Unreviewed; 477 AA. AC G0RWV3; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 17. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EGR44370.1}; DE Flags: Fragment; GN ORFNames=TRIREDRAFT_52709 {ECO:0000313|EMBL:EGR44370.1}; OS Hypocrea jecorina (strain QM6a) (Trichoderma reesei). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Hypocreaceae; OC Trichoderma. OX NCBI_TaxID=431241 {ECO:0000313|Proteomes:UP000008984}; RN [1] {ECO:0000313|EMBL:EGR44370.1, ECO:0000313|Proteomes:UP000008984} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=QM6a {ECO:0000313|EMBL:EGR44370.1, RC ECO:0000313|Proteomes:UP000008984}; RX PubMed=18454138; DOI=10.1038/nbt1403; RA Martinez D., Berka R.M., Henrissat B., Saloheimo M., Arvas M., RA Baker S.E., Chapman J., Chertkov O., Coutinho P.M., Cullen D., RA Danchin E.G., Grigoriev I.V., Harris P., Jackson M., Kubicek C.P., RA Han C.S., Ho I., Larrondo L.F., de Leon A.L., Magnuson J.K., RA Merino S., Misra M., Nelson B., Putnam N., Robbertse B., Salamov A.A., RA Schmoll M., Terry A., Thayer N., Westerholm-Parvinen A., Schoch C.L., RA Yao J., Barabote R., Nelson M.A., Detter C., Bruce D., Kuske C.R., RA Xie G., Richardson P., Rokhsar D.S., Lucas S.M., Rubin E.M., RA Dunn-Coleman N., Ward M., Brettin T.S.; RT "Genome sequencing and analysis of the biomass-degrading fungus RT Trichoderma reesei (syn. Hypocrea jecorina)."; RL Nat. Biotechnol. 26:553-560(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL985092; EGR44370.1; -; Genomic_DNA. DR RefSeq; XP_006969723.1; XM_006969661.1. DR EnsemblFungi; EGR44370; EGR44370; TRIREDRAFT_52709. DR GeneID; 18485460; -. DR KEGG; tre:TRIREDRAFT_52709; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008984; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008984}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008984}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 458 475 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 369 389 {ECO:0000256|SAM:Coils}. FT COILED 424 451 {ECO:0000256|SAM:Coils}. FT NON_TER 477 477 {ECO:0000313|EMBL:EGR44370.1}. SQ SEQUENCE 477 AA; 52719 MW; 9CD4CD5AE5819FFD CRC64; MSFEDWKEMM LKETGQDPQD LHLRNSSGRQ KGDRPSPDVD EALGEEGEIS LTFDYGEGDG IRTGTYADSA TSNTGDDADE ALVSKDGKAP IHRSKDAGKT CKERFSYSSF DAGATILKAG PQAKNAKAIL VENKDSYMLL ECAAQNKYVI VELSDDILID TIVIANFEFF SSMIRHFRVS VSDRYPVKMD KWREVGIFEA ANSRDIQAFL VENPQIWAKY IRIEFLTHYG NEYYCPVSLL RVHGSRMLDS WKDSETGRED EASKDGGALD ADPAASAHLR SNATQARDAS TAPPSKQTNS TKPAADTVQP SPDSADATGT TSGTTSGTGK SRSGGTASAS APTPTVQASF FNSITKRLQQ VEANLTLSLK YVEDQSRLMQ EALQKTEQKQ VSKLTRFLGD LNHTVLAEMR NVRDQYEQIW QSTVLALESQ REQSERDIVA LSTRLNLLAD EVVFQKRMAI VQAILLLSCL FLVIFSR // ID G0S4E7_CHATD Unreviewed; 919 AA. AC G0S4E7; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGS20425.1}; GN ORFNames=CTHT_0022550 {ECO:0000313|EMBL:EGS20425.1}; OS Chaetomium thermophilum (strain DSM 1495 / CBS 144.50 / IMI 039719). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae; OC Chaetomium. OX NCBI_TaxID=759272 {ECO:0000313|Proteomes:UP000008066}; RN [1] {ECO:0000313|EMBL:EGS20425.1, ECO:0000313|Proteomes:UP000008066} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=DSM 1495 / CBS 144.50 / IMI 039719 RC {ECO:0000313|Proteomes:UP000008066}; RX PubMed=21784248; DOI=10.1016/j.cell.2011.06.039; RA Amlacher S., Sarges P., Flemming D., van Noort V., Kunze R., RA Devos D.P., Arumugam M., Bork P., Hurt E.; RT "Insight into structure and assembly of the nuclear pore complex by RT utilizing the genome of a eukaryotic thermophile."; RL Cell 146:277-289(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL988041; EGS20425.1; -; Genomic_DNA. DR RefSeq; XP_006692721.1; XM_006692658.1. DR EnsemblFungi; EGS20425; EGS20425; CTHT_0022550. DR GeneID; 18256293; -. DR KEGG; cthr:CTHT_0022550; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008066; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008066}; KW Reference proteome {ECO:0000313|Proteomes:UP000008066}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 919 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003408826. FT COILED 668 695 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 919 AA; 99346 MW; 0B7EAF1F106212F9 CRC64; MRAPADWTKK QIAPALFLLL GLHAVLVHGG PRSGSNGTAV TVTDVCESRT INYITHTLPQ QCLRTSWASP TTAAASETTA SSTTTTTVTA TTTATSSESV VNGSTGRTTQ DNGTSKEQDG QEELAASSFM SFEEWKEMML RKSGHDPAHV KAHKQRESHH RGGRDPSLDN GGDSFGEEGE ISLDFDALAE KVSEITSPKS GSVAPDSKKG EVREEQVLYD DGKTQYYRSK DAGKTCKERF SYSSFDAGAT VLKTSPGAKN AKAILVENKD SYMLLECRQK NKFVIVELSD DILVDTIVIA NFEFFSSMIR HFRVSASDRY PVKDNKWVEL GTFEARNSRD IQAFLVEHPH IFAKYIRIEF LTHYGNEFYC PVSLLRVHGT RMLDTWKEPS HDNEPEQVEG SPQEPAAPVS DVQQDNLSVS DNSSSTVLIQ VQDNFTEAVT ETSLTPWLPV FYSDLSLEAC PLRSPTAVQA TPVQPSANGH LHEPVAGNEK ATIWSSSSAN SDEPANGASA PMIHLSSESA MAQEKPSNPS PVGPNAQPNG AGTNSTARSF NGKSSDVHAD ASEASSSATT TTNRNKTTSI STSPSASPTV QESFFKTVNK RLQLLESNTS LSLQYIEQQS RFLQDVLLKM ERKHITRIDA FLDSLNKTVL TELRNVRTQY DQIWQSTVLA LETQREQSQR EIVALTSRLN VLADEVVFQK RMAILQSVLL LACLVLVIFS RGGLSVLDGG PSSLASLPGS FGTAAGGSSA FSPYRRYGYG RPLSPTPSAS SPFTVGAASD NNGTSSGSGG LAASAMPRYR STVLATNRDK SLPLTPTVSE YGSREGTPTV HKLTTNNGDS GEESPTRPST EQGDDIEDIG MPSTPPSERR KEGEGGGERE REGTKGEEEK PPPDRARSSV AQLAGLNMKP LPALPEDPS // ID G0SXD0_RHOG2 Unreviewed; 1225 AA. AC G0SXD0; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGU12705.1}; GN ORFNames=RTG_01263 {ECO:0000313|EMBL:EGU12705.1}; OS Rhodotorula glutinis (strain ATCC 204091 / IIP 30 / MTCC 1151) OS (Yeast). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Sporidiobolales; mitosporic Sporidiobolales; OC Rhodotorula. OX NCBI_TaxID=1001064 {ECO:0000313|EMBL:EGU12705.1, ECO:0000313|Proteomes:UP000006141}; RN [1] {ECO:0000313|EMBL:EGU12705.1, ECO:0000313|Proteomes:UP000006141} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 204091 / IIP 30 / MTCC 1151 RC {ECO:0000313|Proteomes:UP000006141}; RX PubMed=24526636; DOI=10.1128/genomeA.00046-14; RA Paul D., Magbanua Z., Arick M. II, French T., Bridges S.M., RA Burgess S.C., Lawrence M.L.; RT "Genome sequence of the oleaginous yeast Rhodotorula glutinis ATCC RT 204091."; RL Genome Announc. 2:E0004614-E0004614(2014). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EGU12705.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEVR02000003; EGU12705.1; -; Genomic_DNA. DR EnsemblFungi; EGU12705; EGU12705; RTG_01263. DR InParanoid; G0SXD0; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000006141; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006141}; KW Reference proteome {ECO:0000313|Proteomes:UP000006141}. FT COILED 454 483 {ECO:0000256|SAM:Coils}. FT COILED 673 700 {ECO:0000256|SAM:Coils}. FT COILED 716 736 {ECO:0000256|SAM:Coils}. FT COILED 897 917 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1225 AA; 131752 MW; 4A0DAA8F9DB8DD39 CRC64; MSRPPNSRSS HGRSPSVEGE IRRASVHSLN LSYAYGAPAS PRVGTSTSTR LQASPPRRVR SSTGAVEEED EGEQEDQAAG AGPADTGRAA EPAAVRYARL AQRKKDTGGN AYPPPPPVAY GGLQNTSVNI ANAFKAATSG LGGVVTGGRR EDFPPLNGRG DNGVDEAEEA DEDQEQRQPA KAPASAGKKR KKHAPKDPTY RHQAGQTSSS EESEFDQRAK GKKRTKVHSE EDDLADDPRS ANKPRRRSQK PADPSYNPTK DPSYHAGDTS TADSDNGAAT KRRKSKGKGR ASTGGRSALQ EAIPRGIRDG EIWYGKKRKG KRGSRRSTAG AEGDEQEGDE EDEEEDDLEY EGQDLGGMDP QDHPAGNDYL DDEADRTPPA ASYFLRLRSP SPQANGGAPT SHQQAAASSS RTNGAGPVDP AFAAFDRSLG HGNDSNSLSA SFDDSVLRGS SYDYSEEERI VQALEAQKKR QFEEEQRRRQ QAQPTPQHQR AAAAAPGTSG ATPMPTTGAY PLTPGAAGSP VSALRKRRLP GPPSMLGAGT PLGPIDDDED DVARAERMGG EWGRKCGNAL RPLVELVARL WRKTQDPLLH WGRIWKAIAA TLLLLGIVTA ILRSNRLTSS TSSSPSSFVA PSAPPDSFEG LVARLSDLES AMSRLSSASD NDRQQSSDDR EYVSRLSEQL ESLEATLSTE QARAKAALQS LEKSGESRSL AAEKVAQDFK GDIDSLQARI RSLTTEQQRD SADLRHLQTT VTAVSREVAA LDQQIAKVAK DVAAATDVER ITKIALDAIA RKLPGKVAVR LDDSGRLEID PAFWRVLKDA FVDKRAVERT VDAKIAALDG SKRNGLFGSS KEAKGQAGPP SWDDFLAANE GALKAWVASD LSSRTGSDAF VSKQTFLDFL RREIKLLKRD FEAKANENFE QMGQEILAKV AKQEDMRRKD ASLASHLNPF ARHHSAPTAD GPVTIKSSDG QNVTAIISSL VDSALLRYSK DVLARPDYAL YTAGGRVIRS LTSRTYEPYP LSRSRSVLAW ITGTSVPQGR SPVTALHPDR TPGSCWPFAG QHGQIGIQLS RRVVPTDITL EHISPDVALD GDVSSAPKDF EVWGIVDGPQ NVAKVTQFRL EEREAKRAAR AAGQDPLDDL DAIENEPTSM PPSANHILLA VGSFDPSAPS PVQSFPVTPS ARRLGLPVQV VVVKVLSNHG ESAYTCLYRI RVSGQTESQL LDASA // ID G0T1G4_RHOG2 Unreviewed; 1149 AA. AC G0T1G4; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGU11171.1}; GN ORFNames=RTG_02974 {ECO:0000313|EMBL:EGU11171.1}; OS Rhodotorula glutinis (strain ATCC 204091 / IIP 30 / MTCC 1151) OS (Yeast). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Sporidiobolales; mitosporic Sporidiobolales; OC Rhodotorula. OX NCBI_TaxID=1001064 {ECO:0000313|EMBL:EGU11171.1, ECO:0000313|Proteomes:UP000006141}; RN [1] {ECO:0000313|EMBL:EGU11171.1, ECO:0000313|Proteomes:UP000006141} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 204091 / IIP 30 / MTCC 1151 RC {ECO:0000313|Proteomes:UP000006141}; RX PubMed=24526636; DOI=10.1128/genomeA.00046-14; RA Paul D., Magbanua Z., Arick M. II, French T., Bridges S.M., RA Burgess S.C., Lawrence M.L.; RT "Genome sequence of the oleaginous yeast Rhodotorula glutinis ATCC RT 204091."; RL Genome Announc. 2:E0004614-E0004614(2014). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EGU11171.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEVR02000025; EGU11171.1; -; Genomic_DNA. DR EnsemblFungi; EGU11171; EGU11171; RTG_02974. DR InParanoid; G0T1G4; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000006141; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006141}; KW Reference proteome {ECO:0000313|Proteomes:UP000006141}. FT COILED 359 396 {ECO:0000256|SAM:Coils}. FT COILED 691 725 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1149 AA; 121185 MW; 31851474BD8C424B CRC64; MTVSTATEGV GGTASASQAT ATPTETPTQL PTTSSTPSPS SDSSLPPAAE PSSPPPPEPS PSVLPDLPVV EELPLTEVPP APEFLSFNEW RERYAVAADP SVARRAKKAA QRARQDVVGA SASGANGAAY DGDGADLGSL FVGGDEAGSK VGDRLVFQDI GSVPGVQVGV IDVIETDSIA VTPTSDMANP IQPLPDVGTG EPNDPLLLLK DRSNYAAFEC AAMVHRSSRQ SKGASAILVE KKDRYMLTPC SANPKFVDVE LCDEIQIDTL VLANFEFFSS TFKHFKASCS VDYPGKPADW HDLGTFRARN VRGIQVFKPI RNPHFCRYLR IDFLSHFGSE FYCPVSVLRV YGYTQLDAYR ESERKAKAIE EALAAAELIE EEVAQHERVL EDALRVEVDK LERLEDAGAK TVNATVEITP SPSTVPPIAE SGSPSPAPSS SASSPSNVSK LSQAVPSPLS VATSQPLPTA STEYSPSSTL SSTTAPPAAT TSPIEADTHT TSSATPFSTV AASTDTSDSA SSATVDVAEP ASSSNTATSS ATRIEPSSSS SIAVTPVSTL SPSSSSPTAE VPTSSNSSVP DATPNPLSGR PVTPSDIPVV ISRPPPAPRN DTHSASQPHV PLPPPSRPPV IQPTQPGESI YGTIMKRLTS LEHNQTLAMH FIEAQSSMLR EAFGRIERRL TDIEGSRGRQ EQSIRQALLD LEQQRVELER ERLALSTQVS KLTQEVRLEK RLTVAQLIGL LLLVIFVGFT RGIPTSPFLH LASTHLDTKR ETKRTQADLL ADRLGEAVGS RLQTEAVDEG EYRILVSAAE SSTNAPHLAD APAAPRRGHR ISPSVSLSRY NGTSSAFKRY PSISKAGPRR HYGIGSSKAT NGKGDSSRAR PWSPPTRHAS APPEEPPAMI LDGKAAKRRR PLADLAGSDG HAFEFPARSS STQPSSTSAV GSIATGAASS SASLSPIPFP STSLSSPAPT FSPSAAPHPQ LDLPINLGYL STSLGPPAER SDSAFSPAST NGFVGDDERD EGGYHTYSSD EEALASVFPS RTPSRDQMRA AALASSSPSS PGTTASPRIR PPKPQVPLRP ATSMGFRDER ATQKDKSRRT SRGPEAKQAA NSDDLPPLGP APTISTPSPP PETPAPSSLV QHKRSDTLA // ID G0TWI7_TRYVY Unreviewed; 473 AA. AC G0TWI7; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 8. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCC48325.1}; GN ORFNames=TVY486_0601160 {ECO:0000313|EMBL:CCC48325.1}; OS Trypanosoma vivax (strain Y486). OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; Trypanosoma; OC Duttonella. OX NCBI_TaxID=1055687 {ECO:0000313|EMBL:CCC48325.1}; RN [1] {ECO:0000313|EMBL:CCC48325.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=Y486 {ECO:0000313|EMBL:CCC48325.1}; RA Jackson A.P., Berry A., Aslett M., Allison H.C., Burton P., RA Vavrova-Anderson J., Brown R., Browne H., Corton N., Hauser H., RA Gamble J., Gilderthorp R., Marcello L., McQuillan J., Otto T.D., RA Quail M.A., Sanders M.J., van Tonder A., Ginger M.L., Field M.C., RA Barry J.D., Hertz-Fowler C., Berriman M.; RT "Antigenic diversity is generated by distinct evolutionary mechanisms RT in African trypanosome species."; RL Proc. Natl. Acad. Sci. U.S.A. 109:3416-3421(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE573022; CCC48325.1; -; Genomic_DNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 443 467 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 359 393 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 473 AA; 53181 MW; 39DB8FA68C6383AD CRC64; MKKYRFQFLA LFIAVCSLLI CFFYNSPGRS FNKERPSERA TRFTTNYASA YLGATLADFT PGCQGASSVL NEDGEKYMIC PCELRRKFFT VQLIRGIEVH ILTLVNNEHF SSGVKNFTVL GSNRYPTNEW RVLGHFKAEP WRGTQHFDVA PQQPVRFLRF LWATSHDKHS WCTLTSFKAF GVDVLETLTE DYTVSVEQQQ QEQEEEEVNS QHQHPSPKEG QEKSPLPAAS PPCDQADATL GTSCFQYGSQ SPGPRMSGED YTVPGALPYH GDVVGIGRGG NKFGSGSFSY LRNPSLEALL QSYCDYPLMA NNISMVCLPH ERHLYEFRAL GLCTSRGTFG GRGTLVPKLV PTSTALLMLS QINRQSKLLQ QGVEELRSRV EMAEKRLSHI DSTLLFLASH ARESKRIASD YREKLHTVLK EVEVLKSKHL LCMRMKHEDN GDVILRTLIV VLVGLSLFAV VLSCIAVRTT HSH // ID G0UNE0_TRYCI Unreviewed; 490 AA. AC G0UNE0; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 7. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CCC90900.1}; GN ORFNames=TCIL3000_6_1320 {ECO:0000313|EMBL:CCC90900.1}; OS Trypanosoma congolense (strain IL3000). OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; Trypanosoma; OC Nannomonas. OX NCBI_TaxID=1068625 {ECO:0000313|EMBL:CCC90900.1}; RN [1] {ECO:0000313|EMBL:CCC90900.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=IL3000 {ECO:0000313|EMBL:CCC90900.1}; RA Jackson A.P., Berry A., Aslett M., Allison H.C., Burton P., RA Vavrova-Anderson J., Brown R., Browne H., Corton N., Hauser H., RA Gamble J., Gilderthorp R., Marcello L., McQuillan J., Otto T.D., RA Quail M.A., Sanders M.J., van Tonder A., Ginger M.L., Field M.C., RA Barry J.D., Hertz-Fowler C., Berriman M.; RT "Antigenic diversity is generated by distinct evolutionary mechanisms RT in African trypanosome species."; RL Proc. Natl. Acad. Sci. U.S.A. 109:3416-3421(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE575319; CCC90900.1; -; Genomic_DNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 454 478 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 490 AA; 55290 MW; 18CD99B15060B77C CRC64; MKRHKVLVVL LIIVIAVKYT FLFLRHAERA RRDEKPPDRN TGFTTNYASA YLGATLTDFS PTCHDASSVL NEDDEKYMLC PCSTRRKFFT VQLIRGIEVR ILTLVNHEHF SSNVKNFTVL GSSKYPTNEW RVLGHFSADP RRGTQHFDVG PQQPVRFLRF LWATSHGEHS WCTLTTFKAF GVDVLETLTE DFTVSVEEEE QQREHFRHDD APPPPPLPIR EVLNMGHTRQ GNEYALVDAF GGDMITDDAS NADDSNVSVN TPKGHVCNYT INDAGKCGGS RNGNASKGHG QHHRGANPSI FNVDMVNQRY CSILLPPENV SGTCFPHERI LYEAHMLSAC MSKAVFSTKV TALAKPFTGN SVLLMLAQMS KQIKMMQQEV LEMASHQKEL QSKLGHTETT MQWLQSQLMD SRRNSIELRD RLQDAMKHVE VLKSKVSLQL SMGHNCEEDT MLRVMVVGSV SCSFLSLVLS CIAARAFYRP RRRKGLLPMG // ID G0V716_NAUCC Unreviewed; 626 AA. AC G0V716; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCC67264.1}; GN Name=NCAS0A07060 {ECO:0000313|EMBL:CCC67264.1}; GN OrderedLocusNames=NCAS_0A07060 {ECO:0000313|EMBL:CCC67264.1}; OS Naumovozyma castellii (strain ATCC 76901 / CBS 4309 / NBRC 1992 / NRRL OS Y-12630) (Yeast) (Saccharomyces castellii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Naumovozyma. OX NCBI_TaxID=1064592 {ECO:0000313|EMBL:CCC67264.1, ECO:0000313|Proteomes:UP000001640}; RN [1] {ECO:0000313|EMBL:CCC67264.1, ECO:0000313|Proteomes:UP000001640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 76901 / CBS 4309 / NBRC 1992 / NRRL Y-12630 RC {ECO:0000313|Proteomes:UP000001640}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=Type strain:CBS 4309; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Genome sequence of Naumovozyma castellii."; RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE576752; CCC67264.1; -; Genomic_DNA. DR RefSeq; XP_003673645.1; XM_003673597.1. DR STRING; 1064592.XP_003673645.1; -. DR EnsemblFungi; CCC67264; CCC67264; NCAS_0A07060. DR GeneID; 11527071; -. DR KEGG; ncs:NCAS_0A07060; -. DR eggNOG; ENOG410IE9E; Eukaryota. DR eggNOG; ENOG4111CR2; LUCA. DR InParanoid; G0V716; -. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000001640; Chromosome 1. DR GO; GO:0005825; C:half bridge of spindle pole body; IEA:EnsemblFungi. DR GO; GO:0016021; C:integral component of membrane; IEA:EnsemblFungi. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:EnsemblFungi. DR GO; GO:0005635; C:nuclear envelope; IEA:EnsemblFungi. DR GO; GO:0034399; C:nuclear periphery; IEA:EnsemblFungi. DR GO; GO:0006348; P:chromatin silencing at telomere; IEA:EnsemblFungi. DR GO; GO:0034087; P:establishment of mitotic sister chromatid cohesion; IEA:EnsemblFungi. DR GO; GO:0000741; P:karyogamy; IEA:EnsemblFungi. DR GO; GO:0045141; P:meiotic telomere clustering; IEA:EnsemblFungi. DR GO; GO:0000743; P:nuclear migration involved in conjugation with cellular fusion; IEA:EnsemblFungi. DR GO; GO:0030474; P:spindle pole body duplication; IEA:EnsemblFungi. DR GO; GO:0007129; P:synapsis; IEA:EnsemblFungi. DR GO; GO:0034398; P:telomere tethering at nuclear periphery; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001640}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001640}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 97 117 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 181 201 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 626 AA; 72013 MW; 2A4DA570A8AB6A70 CRC64; MSKQPSGSES SFEEIEDVDH SGVHQSIADL AFIEEEDGES YDPDYVESEG NGTDNEDYID DSILEEFSSE EEDDENESDD YLESGDEANY TASSYKWAPY TLILIVILIV GLTMGHVPTT SPDLSSPSTK ATFMNLQMQV NNLYQELSQR DEKSKSDFDR SVEIVVSKFE DNLKNLLPVD VVNLKSQLVA LNDQVNSLSD AVTSWQDRNN KKYHQFTLRN MTDLQEKLLS NLETNLPNEV PIVMNGTSSF LIIPELHTYL SNLISNILNI SSMNNTESHG GNINWEYNVN EYVKEILTNE LKYVDKDFFI KELNRRLQEN KERIWQEINI KLEDEKDKIR ELTYTNDQST FPQQYSSILL KKMIYQIYNT NQHQWEGDLD FASSFQGTRL MNHLTSPTWN HGNGVGPIEL LTSSRLTGST YWQCQDIKEC SWAIRFNKPI YLTKVSYIHG RFTNNLHMMN SAPKLISLYV RLAGNSPSVE EERLTKLARK YGQGQPFGRD KRYIKIAQFQ YSIESSMVRQ SLALPRWYIE SKPLIHSLVL QVDSNYGNEN FTSLKKFIIN GLTREDLDIM QSDKFLQRFQ EVPEYGAMPS SAVTSFVIQE HPPIQGNHNR KPDEEVPSFG DDELDT // ID G0VJA9_NAUCC Unreviewed; 586 AA. AC G0VJA9; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCC71588.1}; GN Name=NCAS0H02780 {ECO:0000313|EMBL:CCC71588.1}; GN OrderedLocusNames=NCAS_0H02780 {ECO:0000313|EMBL:CCC71588.1}; OS Naumovozyma castellii (strain ATCC 76901 / CBS 4309 / NBRC 1992 / NRRL OS Y-12630) (Yeast) (Saccharomyces castellii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Naumovozyma. OX NCBI_TaxID=1064592 {ECO:0000313|EMBL:CCC71588.1, ECO:0000313|Proteomes:UP000001640}; RN [1] {ECO:0000313|EMBL:CCC71588.1, ECO:0000313|Proteomes:UP000001640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 76901 / CBS 4309 / NBRC 1992 / NRRL Y-12630 RC {ECO:0000313|Proteomes:UP000001640}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=Type strain:CBS 4309; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Genome sequence of Naumovozyma castellii."; RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE576759; CCC71588.1; -; Genomic_DNA. DR RefSeq; XP_003677935.1; XM_003677887.1. DR STRING; 1064592.XP_003677935.1; -. DR EnsemblFungi; CCC71588; CCC71588; NCAS_0H02780. DR GeneID; 11528890; -. DR KEGG; ncs:NCAS_0H02780; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; G0VJA9; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001640; Chromosome 8. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001640}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001640}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 22 {ECO:0000256|SAM:SignalP}. FT CHAIN 23 586 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003410771. FT TRANSMEM 532 550 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 586 AA; 67530 MW; 7AD90489DA41A9E2 CRC64; MRRSFIDVCL IILLSITTRT QAQDNNTYPT ILNDTVPPLS FDNDNDNNKS HPFLSFEEWK EVKFQEQPLN MNNPPHLRNR EPVDPSCYKE RESIGDEMEI ELGFFAKDEQ EQEDKPYNRR YNYASLDCAA TVVKSNAEAI GSTSILVENK DQYLLNPCSA VNKFVIIELC EDVLVEEVEM ANFEFFSSTF KTIRLSVSDR FPVSRNGWVV LGEFEAENNL QIQQFNIKNP QIWARYLRVE VLSYYNNEFY CPISLIRAHG KTMMDEFKMG QIANDEVVST TKLEDVMTPD ETSNVTIEDE LESTDKINEG GNSHTTMDVA TVITRNSNTS INSKSNIEYF EKCTIWSYDD YSNVSVVPFE LSHVLENGHC KFEPLNFTNF FLKRQNDTFN NGDSSNNNGT SKRNGTCNVT MTPMAQSPTS PEESIFKNII KRLNALENNS TLTVLYIEEQ SQLLSKSFQS LTKEHGIKFA NLIDNFNTMI IARLDSLKEF AADLKDQSLQ ILTSQRLENE RFVAENSYRI RELEREVRLQ KFIVYSMFFV LMGLLTYLLF AKEVYIVDDL DNRETTPDKV VYTLKSKTQL SPNADM // ID G0W9R5_NAUDC Unreviewed; 690 AA. AC G0W9R5; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCD24526.1}; GN Name=NDAI0D02120 {ECO:0000313|EMBL:CCD24526.1}; GN OrderedLocusNames=NDAI_0D02120 {ECO:0000313|EMBL:CCD24526.1}; OS Naumovozyma dairenensis (strain ATCC 10597 / BCRC 20456 / CBS 421 / OS NBRC 0211 / NRRL Y-12639) (Saccharomyces dairenensis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Naumovozyma. OX NCBI_TaxID=1071378 {ECO:0000313|EMBL:CCD24526.1, ECO:0000313|Proteomes:UP000000689}; RN [1] {ECO:0000313|EMBL:CCD24526.1, ECO:0000313|Proteomes:UP000000689} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 10597 / BCRC 20456 / CBS 421 / NBRC 0211 / NRRL Y-12639 RC {ECO:0000313|Proteomes:UP000000689}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE580270; CCD24526.1; -; Genomic_DNA. DR RefSeq; XP_003669769.1; XM_003669721.1. DR EnsemblFungi; CCD24526; CCD24526; NDAI_0D02120. DR GeneID; 11494897; -. DR KEGG; ndi:NDAI_0D02120; -. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000000689; Chromosome 4. DR GO; GO:0005825; C:half bridge of spindle pole body; IEA:EnsemblFungi. DR GO; GO:0016021; C:integral component of membrane; IEA:EnsemblFungi. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:EnsemblFungi. DR GO; GO:0005635; C:nuclear envelope; IEA:EnsemblFungi. DR GO; GO:0034399; C:nuclear periphery; IEA:EnsemblFungi. DR GO; GO:0006348; P:chromatin silencing at telomere; IEA:EnsemblFungi. DR GO; GO:0034087; P:establishment of mitotic sister chromatid cohesion; IEA:EnsemblFungi. DR GO; GO:0000741; P:karyogamy; IEA:EnsemblFungi. DR GO; GO:0045141; P:meiotic telomere clustering; IEA:EnsemblFungi. DR GO; GO:0000743; P:nuclear migration involved in conjugation with cellular fusion; IEA:EnsemblFungi. DR GO; GO:0030474; P:spindle pole body duplication; IEA:EnsemblFungi. DR GO; GO:0007129; P:synapsis; IEA:EnsemblFungi. DR GO; GO:0034398; P:telomere tethering at nuclear periphery; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000689}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000689}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 131 150 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 234 254 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 690 AA; 79668 MW; 4895BECF3001B71B CRC64; MVLSTLEMNH TKDIDEESME LSKEPNGLRS QNYTLNALLH CDEGYNEDLD PDYDSYCSDY VLNNEFEDDE LSSNTESSGN GHSYDESDTA SSSQESIGDS SLEDVSDMDI EEGGEGFQGY MDILNSLNLR NILIFLFLIT SIYMGRWIYF RYRNDEAIFN DGGMLENFNP SSRSPPVSFL NLQKQINTLY HDLNKRDDKI TNDFDKSVKI VISQFEKNLR NLLPVDLVDF KSQLDSLKGQ VNNLSSIISN WQENEPPSVH DFIKRNRSSP FSMENITKFQ NSLIQELEKS LPNEIPIIMK GNNNNATTEN STAFLIIPEL HSYITDLVAQ TLNTSRDNLD EISKKWEYNM DTYITEYLTN ELKYVNKHDF LAELNSKLES NKLEIFEEMN NKLNDIRFQN DQSAFPQYSN SMLKKLVYDI YNSNMHQWQH DLDFASIFQG TRLLNHMTSN TWKNGNGILP IELLKPSKLL SSTYWQCANE NGNSKSGYKC SWAIRFQEPI YLTKISYVHG RFLNNLHMMN SAPRLISLYV RLTNNKDTEE LIRRAKKHNQ GQTFVKSYDY IKISQHTYSI DDLETRQTFP LPKWYIEMKP LVHSLVFQID DNYGNEKYTS LKKFLINGLT REDLNILKNN NGALRKSVKT VPDYIHQKVK NPTSQDTSSG NGSPLKIESS KLNPVMQGKV PAFGEDELDV // ID G0WCN8_NAUDC Unreviewed; 625 AA. AC G0WCN8; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCD25549.1}; GN Name=NDAI0F02310 {ECO:0000313|EMBL:CCD25549.1}; GN OrderedLocusNames=NDAI_0F02310 {ECO:0000313|EMBL:CCD25549.1}; OS Naumovozyma dairenensis (strain ATCC 10597 / BCRC 20456 / CBS 421 / OS NBRC 0211 / NRRL Y-12639) (Saccharomyces dairenensis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Naumovozyma. OX NCBI_TaxID=1071378 {ECO:0000313|EMBL:CCD25549.1, ECO:0000313|Proteomes:UP000000689}; RN [1] {ECO:0000313|EMBL:CCD25549.1, ECO:0000313|Proteomes:UP000000689} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 10597 / BCRC 20456 / CBS 421 / NBRC 0211 / NRRL Y-12639 RC {ECO:0000313|Proteomes:UP000000689}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE580272; CCD25549.1; -; Genomic_DNA. DR RefSeq; XP_003670792.1; XM_003670744.1. DR STRING; 1071378.XP_003670792.1; -. DR EnsemblFungi; CCD25549; CCD25549; NDAI_0F02310. DR GeneID; 11496887; -. DR KEGG; ndi:NDAI_0F02310; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000689; Chromosome 6. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000689}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000689}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 625 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003410899. FT TRANSMEM 576 598 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 392 412 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 625 AA; 71038 MW; 8193CF2336C527FE CRC64; MLGSSMKTVL IVLSGLVITN CDSHFPSIDK VSTIEPTTIS LSSSSESTLG PARHSGVVIH EQTMSYPDSS TSKASSQRRP DITSMEPKDD GAQPGEARPS SGSIQDTYTD NNEPNISIEK ERDIFLSFDA YKEAKLHEHD HHSSWKRHQQ QQYSSSSKKN IINTSEDSLG DEMEIELGFF LNGNDDDDND DCETDDEEVD LLDIDEACLN LLNSIDSNIE GEEQDDDDGS TGQKSNSLYK RRYNYASLDC AATIVKTNPE AMGSTSILVE NKDSYLLNPC SVKQKFIIIE LCEDILVEEI DIANFEFFSS TFKQIRVSVS DRFPVKENNN KDGGGWKILG KFEAINNREL QRFKIENPQI WARYLKIEIL SYYDNEFYCP ISLVRVHGKT MMDEFKSEQQ EQQQNNNQKD NKDALLLLSS ANNSCGNIQD KEILSKLITD NLIKSNATTI KKNLTKDDIL LQGKCKVKTP IAIQSFDKFL KNHTEKIQES ISSNKCPSIP IPEDSFLKNI VKRLSALEDN STLTLLYIEE QTRLLSQSLD RIKTNHSITL NDILLSNEKY IRANSMIINE LQNQIYFQKI LLSTMIIAIL GLFSYLLLSK EVYVFNENYI EESKEMEMRR HSTHR // ID G1KHB8_ANOCA Unreviewed; 716 AA. AC G1KHB8; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000007890}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSACAP00000007890}; OS Anolis carolinensis (Green anole) (American chameleon). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; OC Toxicofera; Iguania; Iguanidae; Polychrotinae; Anolis. OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000007890, ECO:0000313|Proteomes:UP000001646}; RN [1] {ECO:0000313|Ensembl:ENSACAP00000007890, ECO:0000313|Proteomes:UP000001646} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Genome Sequencing Platform; RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J., RA Lander E.S., Lindblad-Toh K.; RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard)."; RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSACAP00000007890} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSACAP00000007890}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_003221058.1; XM_003221010.2. DR STRING; 28377.ENSACAP00000007890; -. DR Ensembl; ENSACAT00000008056; ENSACAP00000007890; ENSACAG00000007922. DR GeneID; 100554232; -. DR KEGG; acs:100554232; -. DR CTD; 25777; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1KHB8; -. DR KO; K19347; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001646; Chromosome 5. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001646}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001646}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 178 196 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 228 248 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 285 312 {ECO:0000256|SAM:Coils}. FT COILED 362 382 {ECO:0000256|SAM:Coils}. FT COILED 403 430 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 716 AA; 80356 MW; 0FEBDB1F79069D31 CRC64; MSRRSQRLGG TVRYYQSEDD GNSSSSGGSS LLGGQQSLFK DISNSRSLRK KSGSVKRPSP APSLGTSPAT HTSFYESVVT ESYLNDGRGL SIRGSTSLDD PMDNSSYWSE DLSMRRRRAG AGGIETSKKI NGLSERKIYD TYASSSGYSS EDDYAGYTYS DQYTSGSGFK KAVSKVGAFL WMVVTSPGRF FGLLYWWIGT TWYRLTTAAS LLDVFVLTRS YPILKKLFLL LLLLLLLTAF GYGMWYFYPF GLQSSVFSWG AMKFSSPVTK KEYEAGDSGV IPAAQQQVLS RLQALERRFE TLEAAAALLE LQKGKSAEGS SLSHEDTLAL LDGLVKRREA AMREDFRTET HLHLQSKLDA FRSQMQQDFD HLQKKISQAS EETEGRMLQM GSQWQSSTQE GLMGSFRKEM DRLEAQLSNL KKEFGNLASD QKVLSKHVES LPRQIKEMRE DVEVQFPVWL SQFLSQSRKE GTGSLFLQQD ELQEHLLALE KKILARISED QEFSVQNVRV ALQREGVIGV TQEDVHRIVN QALNRYSEDR IGMFDYALES AGASVISTRC SETYETKTAL LSLFGIPLWY HSQSPRVILQ PEVLPGNCWA FQGSQGFAVI RLSSNIYPTA VTLEHISRSL SPKATIPSAP KDFAVYGLEE EGLQEGVLLG QFTYNQEGDP IQTFHLQDEN RTSAAFQLIE LRVLSNWGHP EYTCIYRFRV HGEPVS // ID G1KQQ3_ANOCA Unreviewed; 2570 AA. AC G1KQQ3; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 30. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000014864}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSACAP00000014864}; OS Anolis carolinensis (Green anole) (American chameleon). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; OC Toxicofera; Iguania; Iguanidae; Polychrotinae; Anolis. OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000014864, ECO:0000313|Proteomes:UP000001646}; RN [1] {ECO:0000313|Ensembl:ENSACAP00000014864, ECO:0000313|Proteomes:UP000001646} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Genome Sequencing Platform; RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J., RA Lander E.S., Lindblad-Toh K.; RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard)."; RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSACAP00000014864} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSACAP00000014864}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_008101611.1; XM_008103404.1. DR STRING; 28377.ENSACAP00000014864; -. DR Ensembl; ENSACAT00000015164; ENSACAP00000014864; ENSACAG00000015118. DR GeneID; 100556040; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; G1KQQ3; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000001646; Chromosome 2. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001646}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001646}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2570 AA; 285354 MW; A68AA92B13467E43 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAERTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTASGP SSACKPGRSS TGAPSANADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSTGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDEKKKKDA NKDEEESNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDAGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDL FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPSTT SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESAW ELHTNRQFIE GENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNNVDF DVKQDCSQLV ERINVFKTAF SENEDDESRP AIALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERASGET ALIDRTSRML KMEPLATVES LEQYLLKMVA KQWYDFDRAS FVFVRKLREG QTFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DSSALNCHTN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTTL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHIRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGIDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DSAASPKPVS STVSGTTQSW SSLVKNNCPD KMTAAAGSSS RKGSSSSVCS VASSSDISLG STKMERRSEN LMEQNIVSGT DVHEPIVVLS SADSMPQTEV GSASSASTST LTADVGNENV ERKLGPDNSI RATGESNAIS MGIVSVSSPD VSSVSELTNK EATSQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TTGTTSTVTM STSSVTSSSN VATATTVLSV CQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMEEEEY ETKGGRRRTW DDDYVLKRQF SALVPAFDPR PGRTNVQQTT DLEIPPPGTP HSELLEEVEC TPSPRLALTL KVTGLGTSRE VELPLTNFRS TIFYYVQKLL QLSCNGSVKT DKLRRIWEPT YTIVYREMKD SDKEKENGKM GCWSIEHVEQ YLGTDELPKN DLITYLQKNA DSAFLRHWKL TGTNKSIRKN RNCSQLIAAY KDFCEHGSKS SLNQGTISTL QNSDILSSIK EQPQAKAGSG QNSCGVEDVL QLLRILYIVA SDPYSTRTSQ EEGDEQLQFN FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQITSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TPSTVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTEL GAWLCDDDFP DDESRQVDIG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKGLS EDEKNTKLQE LMMKNRSGSG PALSIEDLGL NFQFCPSSRV YGFTAVDLKP GGEDELVTMD NAEEYVDLMF DFCMHTGIQK QMDAFRDGFN RVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID G1LDQ2_AILME Unreviewed; 359 AA. AC G1LDQ2; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMEP00000005034}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSAMEP00000005034}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000005034, ECO:0000313|Proteomes:UP000008912}; RN [1] {ECO:0000313|Ensembl:ENSAMEP00000005034} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). RN [2] {ECO:0000313|Ensembl:ENSAMEP00000005034} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSAMEP00000005034}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACTA01162303; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01170301; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01178299; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9646.ENSAMEP00000005034; -. DR Ensembl; ENSAMET00000005241; ENSAMEP00000005034; ENSAMEG00000004771. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1LDQ2; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008912; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008912}; KW Reference proteome {ECO:0000313|Proteomes:UP000008912}. SQ SEQUENCE 359 AA; 40504 MW; 18E565FE3D2B25CF CRC64; MSGRPKLRRS ARFFRGHSEE ASSSASLSTL LSEPELGNAD ANGLTQSWKI VISTASILTL LLIGLGNHMW LKETEFPQRS RQFYALIAEY GSRLYNYQAR LRMPKEQLEL LKKESQTLEN NFREILFLIE QIDVLKALLR DTRDGLHYSW NADGGKDPEP LEATEEEMSN LVNYVLKKLR EDQVQMADYA LKSAGASVIE AGTSESYKNN KAKLYWHGIG FLTYEMPPDI ILQPDVHPGK CWAFPGSQGH ALIKLARKIK PTAITMEHIS EKVSPSGNIS SAPKEFSVYG ISKQCEGEEI FLGQFVYNKT GSTVQTFKLQ HDVSESLLCV KLKILSNWGH PKYTCLYRFR VHGTPGENP // ID G1LH55_AILME Unreviewed; 920 AA. AC G1LH55; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMEP00000006247}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSAMEP00000006247}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000006247, ECO:0000313|Proteomes:UP000008912}; RN [1] {ECO:0000313|Ensembl:ENSAMEP00000006247} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). RN [2] {ECO:0000313|Ensembl:ENSAMEP00000006247} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSAMEP00000006247}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACTA01052952; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01060952; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9646.ENSAMEP00000006247; -. DR Ensembl; ENSAMET00000006502; ENSAMEP00000006247; ENSAMEG00000005870. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1LH55; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008912; Unassembled WGS sequence. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008912}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008912}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 389 410 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 422 439 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 507 534 {ECO:0000256|SAM:Coils}. FT COILED 569 603 {ECO:0000256|SAM:Coils}. FT COILED 618 638 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 920 AA; 102470 MW; 4749D4E000D29AA8 CRC64; MDFSWLHMYT PPQCVPENTG YTYALSSSYS SDALAFETEH RLDPVFDSPR MSRRSLRLVT TACAVEDGQA GDACSCVSST ASLKDRVARA AKQRRSVSKP AVSVNHTSRK VVSCAAGQSA ASMLSGAACL RPPVLDESLI REQTKVDHFW GLDDDGDLKG GNKAATQGNG DLAAEGTRSN GYTCSDCLLL AERKDTLTAH SAPRGTSPRL YSRDMNQKRG GPFHTERILW LAKDTSSCLS SFLVQLFRVV LMKLNYESEN YKLKSYESKD RESKSYKSES HESKAHSSHC GTVNVGGLLR EDGHLSVTGE SLCDDCKRKE LLEMHTAVRL QSSSPKSVAG AIWHVFSYTG HLLVQTLQRI GASGWSVLKM LLSVLWLAVL APGKAASGIF WWLGIGWYQF VTLISWLNVF LLTRCLRNIC KFLLLLIPLL LLLAAGLSLC GQGDFLSGLP VLNWTRIYGA QRVDGPESTF TPGESHLSQL LEDGDEAFRW FRRSEVERQL TSLSGQCRSH DEKLRELAAV LQKLQAQVDQ MDGDSEATLS LVQRVVGQHL KEMGADRLSG SQTDTMSFHQ EHELRLSNLE DVLGKLTEKS EAIRKELEQT KLRTASGAEE EQYLLSMVKH LELELGQLKS ELSSWQHLKT SCEEVDAQVR ETIRRMFSGE EKGGSLEWLL QTVSSRFVSK DDLQVLLRDL ELQILKNVTH YISVTKRVPD SETVVSAAKE AGISGITEAQ ARVIVNNALK LYSQDKTGMV DFALESGGGS VLSTRCSETY ETKTALISLF GIPLWYFSQS PRVVIQPDIH PGNCWAFRGS QGYLVVRLSM KIRPTTFTLE HIPKTLSPTG NITSAPKDFA VYGLENEYQE EGQLLGQFMY DQEGESLQMF HVLERPDGTF QIVELRILSN WGHPEYTCLY RFRVHGEPVK // ID G1LTI9_AILME Unreviewed; 438 AA. AC G1LTI9; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMEP00000010386}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSAMEP00000010386}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000010386, ECO:0000313|Proteomes:UP000008912}; RN [1] {ECO:0000313|Ensembl:ENSAMEP00000010386} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). RN [2] {ECO:0000313|Ensembl:ENSAMEP00000010386} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSAMEP00000010386}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACTA01008063; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01016063; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9646.ENSAMEP00000010386; -. DR Ensembl; ENSAMET00000010839; ENSAMEP00000010386; ENSAMEG00000009879. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1LTI9; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008912; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008912}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008912}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 135 158 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 438 AA; 47956 MW; 80E3A70C5E73B1F6 CRC64; MRRSPRPGSA TSPHKHTPNF YSDNSNSSVS VTSGDSSGHR SAGPGEPEGR RARGSSCGEP ALSAGVPGGT TWAGSSRQKP APGSHNGPIE PGRAEGGGGA QEPAGSPVVS EEQLDLLSTL DLRQEIPPPR VSKNFLSLLL QVLSVLLSLV GDVLVIVYRE VCSIRFLLTA VSLLSLFLAA LWWGLLYLVP PSENEPKEML TLSEYHERVR SQGQQLQQLQ AELNKLHKEV SSVRAANSER VAKIVFQRLN EDFVRKPDYA LSSVGASIDL EKTSHDYEDA NTAYFWNRFS FWNYARPPTV ILEPDVFPGN CWAFEGDQGQ VVIRLPGRVQ LSDITLQHPP PSVAHSGGAN SAPRDFAVYG LQVDDETEVF LGKFTFDVEK SEIQTFHLQN DPPNAFPKVK IQILSNWGHP RFTCLYRVRA HGMRISEGAG DSATGERH // ID G1LW79_AILME Unreviewed; 748 AA. AC G1LW79; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMEP00000011330}; DE Flags: Fragment; GN Name=SUN2 {ECO:0000313|Ensembl:ENSAMEP00000011330}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000011330, ECO:0000313|Proteomes:UP000008912}; RN [1] {ECO:0000313|Ensembl:ENSAMEP00000011330} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). RN [2] {ECO:0000313|Ensembl:ENSAMEP00000011330} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSAMEP00000011330}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACTA01113544; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9646.ENSAMEP00000011330; -. DR Ensembl; ENSAMET00000011813; ENSAMEP00000011330; ENSAMEG00000010757. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1LW79; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008912; Unassembled WGS sequence. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008912}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008912}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 218 234 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 243 264 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 392 430 {ECO:0000256|SAM:Coils}. FT COILED 433 460 {ECO:0000256|SAM:Coils}. FT COILED 507 527 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSAMEP00000011330}. SQ SEQUENCE 748 AA; 83647 MW; F916CEDFE2FE0DC9 CRC64; PIWASGAFRL SSRGGSPSYV VMSRRSQRLT RYSQGDDDGG SSSGGSSVMG SQSTLFKDSP LRTLKRKSSN MKRLSPAPQL GPSSDAHTSY YSESVVRESY FGSPRASSLA RSSILDDHLH SDPYWSEDLR GRRRRGTGGT ESSKLNGLAE NKSSEDFLGS SSGYSSEDDF AGYLETDHRS SGSRLRNAVS WAASCFWTLV TSPGRLFGLL YWWVGTTWYR LTTAASLLDV FVLTRRFSSV KTFLWFLLLL LLMTGLTYGA WYFYPYGLQT LQPAVVSWWA AKSSSGRQDM WESRDSSPFQ AEQHIMSRVH SLERRLEALA AEFSSNWQKE AVRLERLELR QGAAGGGGHV GLSQEDTLAL LEGLVSRREA ALKEDFRRDT AAWIQEELVS LRAEHQQDSE DLFKKIVQAS QESEARIQQL KSEWQRMTQE SFRENSMKEL ARLEGQLAGL RQELAALSLK QSSVADQVGL LPQQLQAVRD DVESQFPAWV SQFLLRGGGT RTGLVQREEL QAQLQELESK ILAHVAEMQG RSASEAAASL GLTLQKEGVI GVTEEQVQRI VNQALKRYSE DRIGMVDYAL ESGVLGASVI STRCSETYET KTALLSLFGI PLWYHSQSPR VILQPDVHPG NCWAFQGPQG FAVVRLSARI RPTAVTLEHV PKSLSPNSTI SSAPKDFSIF GFDEDLQQEG TLLGQFTYDQ DGEPIQTFYF QDTKMATYQV VELRILTNWG HPEYTCIYRF RVHGEPTH // ID G1LZ21_AILME Unreviewed; 141 AA. AC G1LZ21; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMEP00000012329}; GN Name=NR2C2AP {ECO:0000313|Ensembl:ENSAMEP00000012329}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000012329, ECO:0000313|Proteomes:UP000008912}; RN [1] {ECO:0000313|Ensembl:ENSAMEP00000012329} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). RN [2] {ECO:0000313|Ensembl:ENSAMEP00000012329} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSAMEP00000012329}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACTA01048572; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9646.ENSAMEP00000012329; -. DR Ensembl; ENSAMET00000012858; ENSAMEP00000012329; ENSAMEG00000011727. DR eggNOG; ENOG410IX85; Eukaryota. DR eggNOG; ENOG4111V3C; LUCA. DR GeneTree; ENSGT00390000017748; -. DR InParanoid; G1LZ21; -. DR OMA; NEETCWN; -. DR OrthoDB; EOG7NKKPC; -. DR TreeFam; TF300180; -. DR Proteomes; UP000008912; Unassembled WGS sequence. DR GO; GO:0070062; C:extracellular exosome; IEA:Ensembl. DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008912}; KW Reference proteome {ECO:0000313|Proteomes:UP000008912}. SQ SEQUENCE 141 AA; 16060 MW; B11AA8D350455ABE CRC64; MTHSLVCPET VSRVSSVLNR NTRQFGKKHL FDQDEETCWN SDQGLSQWVI LEFPQRIRVS QLQIQFQGGF SSRRGHLEGT GSRGGEALSK IVDFYPEDNN SLQTFPVPAA EVDQLKVTFE DATDFFGRVV IYHLRVLGER V // ID G1M3L5_AILME Unreviewed; 368 AA. AC G1M3L5; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMEP00000013927}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSAMEP00000013927}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000013927, ECO:0000313|Proteomes:UP000008912}; RN [1] {ECO:0000313|Ensembl:ENSAMEP00000013927} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). RN [2] {ECO:0000313|Ensembl:ENSAMEP00000013927} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSAMEP00000013927}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACTA01026846; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01034846; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9646.ENSAMEP00000013927; -. DR Ensembl; ENSAMET00000014508; ENSAMEP00000013927; ENSAMEG00000013221. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1M3L5; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008912; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008912}; KW Reference proteome {ECO:0000313|Proteomes:UP000008912}. FT COILED 156 176 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 368 AA; 41924 MW; 9B2376F75AF321F5 CRC64; MPRSSRSPVD LCSITRGCLN KFAPLKEVAP RSRNTCRITE NTLSNARDAF VLPVRIHAPA PGLTQCLLAC VSWITCLACF LRTQVHQILF NTCRCKLFIQ KLMEKTGVLV LCAFGFWVFS MHLPSKMEVW QDDSINSPLQ SLRMYQEKVR HHTGEIQDLR GNMTQLIAKL QLMEAMSDEQ KMAQKIMKMI QGDFIEKPDF ALKSIGASID FEQTSATYNH DKARSYWNWI RLWNYAQPPD VILEPNMTPG NCWAFSGDRG QVTIRLAQKV YLSNLTLQHI PKTISLSGSL DTAPKDFVIY GMEGSPREEV FLGAFQFQPE NIIQMFQLQN QPVRAFGAVK VKISSNWGNP RFTCLYRVRV HGSVTPPR // ID G1MAK1_AILME Unreviewed; 1360 AA. AC G1MAK1; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMEP00000016375}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSAMEP00000016375}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000016375, ECO:0000313|Proteomes:UP000008912}; RN [1] {ECO:0000313|Ensembl:ENSAMEP00000016375} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). RN [2] {ECO:0000313|Ensembl:ENSAMEP00000016375} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSAMEP00000016375}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACTA01090193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01098193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01106193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01114193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01122193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9646.ENSAMEP00000016375; -. DR Ensembl; ENSAMET00000017054; ENSAMEP00000016375; ENSAMEG00000015507. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; G1MAK1; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000008912; Unassembled WGS sequence. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008912}; KW Reference proteome {ECO:0000313|Proteomes:UP000008912}. FT COILED 1042 1062 {ECO:0000256|SAM:Coils}. FT COILED 1092 1112 {ECO:0000256|SAM:Coils}. FT COILED 1298 1318 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSAMEP00000016375}. SQ SEQUENCE 1360 AA; 150440 MW; 0FC4426AD6536D7F CRC64; DHPSHELCSK EKNNIAIPEL VILAVRRETI DLSIKTDSGR GGERERSVPE DKLQLPRGLA RTRHALGEGA GQCKEHGCTE GEPGRAGGAP LSAPPWAPRR GRPVSQPRRS RSPSAAALRT LGPILSLLLR LLHLGLGSGG DVPPSGRGKK EEKMKKYRRA LALVSCLSLC SLVWLPSWRV CCKESSSASS YYSQDDNCAL ENEDVQFQKK NMESKKLSPS VIETLHTIDL REDSSSVVVG SENIENISSS STSEITPISK LDEIEKSGTI PVAKPSESEQ SETDCDVGEA LEASAPADQP SFVSPPESLV GQHIENVSSS HGKGKITKSE FESKVSASDQ ASGDPKSSLN TSDNLKNESS DYTKPGEIDH TSVTSPKDPE DIPTFDEWKK KVMEVEKEKS QSMHPSSNGG LHATKKVQKN RNNYASVECG AKILAANPEA KSTSAILIEN MDLYMLNPCS TKIWFVIELC EPIQVKQLDI ANYELFSSTP KDFLVSISDR YPTNKWIKLG TFHGRDERNV QSFPLDEQMY AKYVKVELVS HFGSEHFCPL SLIRVFGTSM VEEYEEIADS QYQSERQELF DEDYDYPLDY NTGEDKSSKN LLGSATNAIL NMVNIAANIL GAKTEDLTEG NKSISENATA TAAPKMPDLA PVSTPVPSPE FVTTEGHIHE TELSSPDTPK ESPIVQLVQE EEEEASPSTV TLLGSGEQED ESSPWFESET QIFCSELTTI CCISSFSEYV YKWCSVRVAL YRQRSTTAVS KEKDDLVSAQ PPLPLPAESV DVLVLQPPSG ELDSRRKEKN AETIVPGDLS SMHQGDLINH SADAIELEPS HPQTLSQSLL LDVTPEIHSL SKTEVSEPIK YDAGPTPSQV IPQENSIEAD NEMEKKSESF SSIEKPAVIY ETNKFNEVMD NIVKEDTNSM HITTKLSETI VPPVNTASMP DSEDGEATVN TADTPKQILT PVMDSSSLPE VREEEQSPED ALLRGLQRTA TDFYAELQNS TDLGYTNGNL VHGSNQKESV FMRLNNRIKA LEVNMSLSGR YLEELSQRYR KQMEEMQKAF NKTIVKLQNT SRIAEEQDQR QTEAIQLLQA QLTNMTQLVS NLSTTVAELK REVSDRQSYL VISLVLCVVL GLMLCMQRCR NTSQFDGDYI SKLPKSNQYP SPKRCFSSYD DMNLKRRTSF PLIRSKSLQL TGKEVDPSDL YIVEPLKFSP EKKKKRCKYK TEKIETIKPA DPLHPVANGD LKGRKPFMNQ RDFSNMGEVY HSSYKGPPSE GSSETSSQSE ESYFCGISAC TSLCNGQSQK TKTEKRALKR RRSKVQDQGK LIKTLIQTKS GSLPSLHDII KGNKEITVGT FGVTAVSGHI // ID G1MAK8_AILME Unreviewed; 1251 AA. AC G1MAK8; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMEP00000016382}; GN Name=SUCO {ECO:0000313|Ensembl:ENSAMEP00000016382}; OS Ailuropoda melanoleuca (Giant panda). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; OC Ailuropoda. OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000016382, ECO:0000313|Proteomes:UP000008912}; RN [1] {ECO:0000313|Ensembl:ENSAMEP00000016382} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20010809; DOI=10.1038/nature08696; RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., RA Li B., Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., RA Jian M., Li J., Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., RA Ryder O.A., Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., RA Guo X., Wang B., Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., RA Wang G., Yu C., Nie W., Wang J., Wu Z., Liang H., Min J., Wu Q., RA Cheng S., Ruan J., Wang M., Shi Z., Wen M., Liu B., Ren X., Zheng H., RA Dong D., Cook K., Shan G., Zhang H., Kosiol C., Xie X., Lu Z., RA Zheng H., Li Y., Steiner C.C., Lam T.T., Lin S., Zhang Q., Li G., RA Tian J., Gong T., Liu H., Zhang D., Fang L., Ye C., Zhang J., Hu W., RA Xu A., Ren Y., Zhang G., Bruford M.W., Li Q., Ma L., Guo Y., An N., RA Hu Y., Zheng Y., Shi Y., Li Z., Liu Q., Chen Y., Zhao J., Qu N., RA Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X., Vinar T., Wang Y., RA Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y., Wang X., RA Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L., RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., RA Wang J., Wang J.; RT "The sequence and de novo assembly of the giant panda genome."; RL Nature 463:311-317(2010). RN [2] {ECO:0000313|Ensembl:ENSAMEP00000016382} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSAMEP00000016382}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACTA01090193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01098193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01106193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01114193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ACTA01122193; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9646.ENSAMEP00000016375; -. DR Ensembl; ENSAMET00000017061; ENSAMEP00000016382; ENSAMEG00000015507. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000008912; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008912}; KW Reference proteome {ECO:0000313|Proteomes:UP000008912}. FT COILED 933 953 {ECO:0000256|SAM:Coils}. FT COILED 983 1003 {ECO:0000256|SAM:Coils}. FT COILED 1189 1209 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1251 AA; 139194 MW; 18A423EABFBE0655 CRC64; MLKNCQILVI SNRLLLTSGI KLPSWRVCCK ESSSASSYYS QDDNCALENE DVQFQKKDER EGPFNAELSE KVGSNLPIPP EERKLKDDYI VDVENMESKK LSPSVIETLH TIDLREDSSS VVVGSENIEN ISSSSTSEIT PISKLDEIEK SGTIPVAKPS ESEQSETDCD VGEALEASAP ADQPSFVSPP ESLVGQHIEN VSSSHGKGKI TKSEFESKVS ASDQASGDPK SSLNTSDNLK NESSDYTKPG EIDHTSVTSP KDPEDIPTFD EWKKKVMEVE KEKSQSMHPS SNGGLHATKK VQKNRNNYAS VECGAKILAA NPEAKSTSAI LIENMDLYML NPCSTKIWFV IELCEPIQVK QLDIANYELF SSTPKDFLVS ISDRYPTNKW IKLGTFHGRD ERNVQSFPLD EQMYAKYVKM FIKYIKVELV SHFGSEHFCP LSLIRVFGTS MVEEYEEIAD SQYQSERQEL FDEDYDYPLD YNTGEDKSSK NLLGSATNAI LNMVNIAANI LGAKTEDLTE GNKSISENAT ATAAPKMPDL APVSTPVPSP EFVTTEGHIH ETELSSPDTP KESPIVQLVQ EEEEEASPST VTLLGSGEQE DESSPWFESE TQIFCSELTT ICCISSFSEY VYKWCSVRVA LYRQRSTTAV SKEKDDLVSA QPPLPLPAES VDVLVLQPPS GELDSRRKEK NAETIVPGDL SSMHQGDLIN HSADAIELEP SHPQTLSQSL LLDVTPEIHS LSKTEVSEPI KYDAGPTPSQ VIPQENSIEA DNEMEKKSES FSSIEKPAVI YETNKFNEVM DNIVKEDTNS MHITTKLSET IVPPVNTASM PDSEDGEATV NTADTPKQIL TPVMDSSSLP EVREEEQSPE DALLRGLQRT ATDFYAELQN STDLGYTNGN LVHGSNQKES VFMRLNNRIK ALEVNMSLSG RYLEELSQRY RKQMEEMQKA FNKTIVKLQN TSRIAEEQDQ RQTEAIQLLQ AQLTNMTQLV SNLSTTVAEL KREVSDRQSY LVISLVLCVV LGLMLCMQRC RNTSQFDGDY ISKLPKSNQY PSPKRCFSSY DDMNLKRRTS FPLIRSKSLQ LTGKEVDPSD LYIVEPLKFS PEKKKKRCKY KTEKIETIKP ADPLHPVANG DLKGRKPFMN QRDFSNMGEV YHSSYKGPPS EGSSETSSQS EESYFCGISA CTSLCNGQSQ KTKTEKRALK RRRSKVQDQG KLIKTLIQTK SGSLPSLHDI IKGNKEITVG TFGVTAVSGH I // ID G1MTG5_MELGA Unreviewed; 938 AA. AC G1MTG5; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 2. DT 11-NOV-2015, entry version 28. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMGAP00000001695}; DE Flags: Fragment; GN Name=SUN1 {ECO:0000313|Ensembl:ENSMGAP00000001695}; OS Meleagris gallopavo (Common turkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Meleagridinae; Meleagris. OX NCBI_TaxID=9103 {ECO:0000313|Ensembl:ENSMGAP00000001695, ECO:0000313|Proteomes:UP000001645}; RN [1] {ECO:0000313|Ensembl:ENSMGAP00000001695, ECO:0000313|Proteomes:UP000001645} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20838655; RA Dalloul R.A., Long J.A., Zimin A.V., Aslam L., Beal K., Blomberg L.A., RA Bouffard P., Burt D.W., Crasta O., Crooijmans R.P., Cooper K., RA Coulombe R.A., De S., Delany M.E., Dodgson J.B., Dong J.J., Evans C., RA Frederickson K.M., Flicek P., Florea L., Folkerts O., Groenen M.A., RA Harkins T.T., Herrero J., Hoffmann S., Megens H.J., Jiang A., RA de Jong P., Kaiser P., Kim H., Kim K.W., Kim S., Langenberger D., RA Lee M.K., Lee T., Mane S., Marcais G., Marz M., McElroy A.P., RA Modise T., Nefedov M., Notredame C., Paton I.R., Payne W.S., RA Pertea G., Prickett D., Puiu D., Qioa D., Raineri E., Ruffier M., RA Salzberg S.L., Schatz M.C., Scheuring C., Schmidt C.J., Schroeder S., RA Searle S.M., Smith E.J., Smith J., Sonstegard T.S., Stadler P.F., RA Tafer H., Tu Z.J., Van Tassell C.P., Vilella A.J., Williams K.P., RA Yorke J.A., Zhang L., Zhang H.B., Zhang X., Zhang Y., Reed K.M.; RT "Multi-platform next-generation sequencing of the domestic turkey RT (Meleagris gallopavo): genome assembly and analysis."; RL PLoS Biol. 8:E1000475-E1000475(2010). RN [2] {ECO:0000313|Ensembl:ENSMGAP00000001695} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMGAP00000001695}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9103.ENSMGAP00000001695; -. DR Ensembl; ENSMGAT00000002360; ENSMGAP00000001695; ENSMGAG00000002073. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1MTG5; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001645; Chromosome 16. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001645}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001645}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 360 383 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 403 426 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 438 466 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 509 529 {ECO:0000256|SAM:Coils}. FT COILED 576 610 {ECO:0000256|SAM:Coils}. FT COILED 618 638 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMGAP00000001695}. SQ SEQUENCE 938 AA; 105983 MW; 52D1ACCC43E90EC1 CRC64; FETVTMDFSR LHTYTPPQCV PENTGYTYAL SSSYSSDALD FEIEHKIDPV FDSPRMSRRS LRLAAAGVNK PDSARSDILH DSSYSGSLTF REQSSNMVKQ RKSINKQSGS VRAMPRKNLS SSPIFNQSSF LSRASDTSMV STVLDESSIR EQTEVDHFWG LDDDGDTKGS DATLLQRNGD IATAETQTTM INGYTCSDCS MLSERKEVLT AYSASSGPSS RIYSRDRSQR HASRGTYFYM SKILRLVKHT AASLASLLVQ LFQMVLLKQS YESKEEKSRI TQGRLLSGSA FRFPSGYIAA HSDYCGSMNI KEFYREDSHL GVNEESICDD CKGKKQLEIH TTEHMQSSRA KRVARTISHI FSYAGYFVLH VLRTVGAAGW FVSQKVLSLL WLALLSPGRA ASGIFRLLRA GWSQLLTLMS LLKVFILRKC LPKISRLLLF LIPLLFLLEG LWFWGFDGFI ALLPLLNRTR IDKVQSTDDS IYVPRPQPDS PRSVQPAKDT INTFDSARIN ELEKQMAFVS DRCHHHDEEY SKVLLLVHNL QDQVAQMGDR NEILKLIRNV MDQHLKVFKT DFLALHQEHN LRIVTLEDLL RKLSAEFKDI QKELEVAKAK AIRDGDEQNQ LLSRVKKLEL ELSQMKSELL TGGSVKTSCE KIDVIHEKVD AQVKESVKMM VFGDQHKDFP ESLLQWLTSN FVTRSDLQTL LQDLELQILK NITLQMAVTD QRITSEVVTN AVNNAGISGI TEAQAQIIVN NALKLYSQDK TGMVDFALES GGGSILSTRC SETYETKTAL ISLFGIPLWY FSQSPRVVIQ PDMYPGNCWA FKGSEGYLVV RLSMKIYPTA FSLEHIPKTL SPSGNITSAP RKFSVYGLDD EYQEEGTFLG QYVYDQEGEP LQMFTVVSLD DEKSENVFQI VELRILSNWG HAEYTCLYRF RVHGKPAE // ID G1MYU5_MELGA Unreviewed; 1246 AA. AC G1MYU5; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMGAP00000004154}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSMGAP00000004154}; OS Meleagris gallopavo (Common turkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Meleagridinae; Meleagris. OX NCBI_TaxID=9103 {ECO:0000313|Ensembl:ENSMGAP00000004154, ECO:0000313|Proteomes:UP000001645}; RN [1] {ECO:0000313|Ensembl:ENSMGAP00000004154, ECO:0000313|Proteomes:UP000001645} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20838655; RA Dalloul R.A., Long J.A., Zimin A.V., Aslam L., Beal K., Blomberg L.A., RA Bouffard P., Burt D.W., Crasta O., Crooijmans R.P., Cooper K., RA Coulombe R.A., De S., Delany M.E., Dodgson J.B., Dong J.J., Evans C., RA Frederickson K.M., Flicek P., Florea L., Folkerts O., Groenen M.A., RA Harkins T.T., Herrero J., Hoffmann S., Megens H.J., Jiang A., RA de Jong P., Kaiser P., Kim H., Kim K.W., Kim S., Langenberger D., RA Lee M.K., Lee T., Mane S., Marcais G., Marz M., McElroy A.P., RA Modise T., Nefedov M., Notredame C., Paton I.R., Payne W.S., RA Pertea G., Prickett D., Puiu D., Qioa D., Raineri E., Ruffier M., RA Salzberg S.L., Schatz M.C., Scheuring C., Schmidt C.J., Schroeder S., RA Searle S.M., Smith E.J., Smith J., Sonstegard T.S., Stadler P.F., RA Tafer H., Tu Z.J., Van Tassell C.P., Vilella A.J., Williams K.P., RA Yorke J.A., Zhang L., Zhang H.B., Zhang X., Zhang Y., Reed K.M.; RT "Multi-platform next-generation sequencing of the domestic turkey RT (Meleagris gallopavo): genome assembly and analysis."; RL PLoS Biol. 8:E1000475-E1000475(2010). RN [2] {ECO:0000313|Ensembl:ENSMGAP00000004154} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMGAP00000004154}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9103.ENSMGAP00000004154; -. DR Ensembl; ENSMGAT00000004867; ENSMGAP00000004154; ENSMGAG00000004336. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; G1MYU5; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000001645; Chromosome 10. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001645}; KW Reference proteome {ECO:0000313|Proteomes:UP000001645}. FT COILED 927 947 {ECO:0000256|SAM:Coils}. FT COILED 977 997 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMGAP00000004154}. SQ SEQUENCE 1246 AA; 137248 MW; 8F92B220F412F926 CRC64; KKIFSRMTCT VTCIRFCFVV WLPVWHVFCK DSLLSTVQYA SSDACVLAND GENIQEKNGK EAGPGLEPEH ADSSTQTYST EELLGDFIKS EQAAEVSETS QPEAQTPPSV DVNEASSSTV ASTESNSSSP TSEISTVSQP DAIENTRADI PVVSSIEAEQ SEPDCDIGGT LEADPQSEPS SFVSPQESLA GQHIENISSS HGKGKKTKSE FESKVEAAEK GADQKSALNA SENLKREKDY KKTGEIDPTS VITPKDPGDI PTFDEWKKKV MEVEKEKSQS MHPSAAGGQH STKKVQKNRN NYASVECGAK ILAANPEAKS TSAILMENMD LYMLNPCSTK IWFVIELCEP VQVKQLDIAN HELFSSTPKD FLVSISDRYP TNKWIKLGTF HARDERNVQS FPLDEQMYAK YVKMFIKYIK VELISHFGSE HFCPLSLIRV FGTSMVEEYE EIADSQYQSE RQELFDEDYD YPLDYNTGEE KSSKNLLGSA TNAILNMVNI AANMLGAKTE ETSEGNKSIS ENVTVTTPAS STAAPKLLEP TPVPSPELPT TDIPQIDKEQ VHVDLTKESP IVQLVQEYEE DTSQSTVTLL SSDDQEEEKS AWFELETEVY CYDLATVCCI STFTEYLFKC CSVTVAMHRQ HGKTEGKQEQ GDSAQLPQVV LPQSVPVSDE PLPEQLDTKA DKVPGSTVAV DFSSVVHEII SNETTAAVEL EPSHPQTVSQ SLLLEVTSEV KPLPTTEMLL EPSQEDAGQE APGVTPQVDS AEINAATEKA ESSVAEETVV VSETGVITEV KETSTRETAA TPVISKPTET VGQPENTVGI LASDAGEGKE STPEVQKPVS SPVESSVSVE TKEEDQATEE AFMSIPVSGG PQRTATDFYA ELQNSTDLGY ANGNLVHGSN QKESVFMRLN NRIKALEVNM SLSSRYLEEL SQRYRKQMEE MQKAFNKTII KLQNTSRIAE EQDQRQTEAI QLLQAQLTNM TQLVSNLSTT VAELKREVSD RQTYLVISLV LCVILGLVLC VQRCRSTSQF CEGYLSKIPK SNHYPSPKRC FSSYDDMNLK RRTSLPLVRS QSFQLSGKEV DPEDLYIVEP LKFSPEKKKK RCKYKSEKIE TIKPTAEPPH PIANGEIKGR KPFTNQRDFS NIGEVYHSSY KGPPSEGSSE TSSQSEESYF CGISACTSLC NGQAQKTKTE KRAIKRRRSK VSDQGKLIKT LIQTKSGSMP SLHDIIKGNK DITVGTLGVT AVSGHI // ID G1N254_MELGA Unreviewed; 129 AA. AC G1N254; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 2. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMGAP00000005678}; DE Flags: Fragment; OS Meleagris gallopavo (Common turkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Meleagridinae; Meleagris. OX NCBI_TaxID=9103 {ECO:0000313|Ensembl:ENSMGAP00000005678, ECO:0000313|Proteomes:UP000001645}; RN [1] {ECO:0000313|Ensembl:ENSMGAP00000005678, ECO:0000313|Proteomes:UP000001645} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20838655; RA Dalloul R.A., Long J.A., Zimin A.V., Aslam L., Beal K., Blomberg L.A., RA Bouffard P., Burt D.W., Crasta O., Crooijmans R.P., Cooper K., RA Coulombe R.A., De S., Delany M.E., Dodgson J.B., Dong J.J., Evans C., RA Frederickson K.M., Flicek P., Florea L., Folkerts O., Groenen M.A., RA Harkins T.T., Herrero J., Hoffmann S., Megens H.J., Jiang A., RA de Jong P., Kaiser P., Kim H., Kim K.W., Kim S., Langenberger D., RA Lee M.K., Lee T., Mane S., Marcais G., Marz M., McElroy A.P., RA Modise T., Nefedov M., Notredame C., Paton I.R., Payne W.S., RA Pertea G., Prickett D., Puiu D., Qioa D., Raineri E., Ruffier M., RA Salzberg S.L., Schatz M.C., Scheuring C., Schmidt C.J., Schroeder S., RA Searle S.M., Smith E.J., Smith J., Sonstegard T.S., Stadler P.F., RA Tafer H., Tu Z.J., Van Tassell C.P., Vilella A.J., Williams K.P., RA Yorke J.A., Zhang L., Zhang H.B., Zhang X., Zhang Y., Reed K.M.; RT "Multi-platform next-generation sequencing of the domestic turkey RT (Meleagris gallopavo): genome assembly and analysis."; RL PLoS Biol. 8:E1000475-E1000475(2010). RN [2] {ECO:0000313|Ensembl:ENSMGAP00000005678} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMGAP00000005678}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9103.ENSMGAP00000005678; -. DR Ensembl; ENSMGAT00000006419; ENSMGAP00000005678; ENSMGAG00000005748. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1N254; -. DR OMA; WHISSAP; -. DR OrthoDB; EOG7TJ3P9; -. DR Proteomes; UP000001645; Chromosome Z. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001645}; KW Reference proteome {ECO:0000313|Proteomes:UP000001645}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMGAP00000005678}. SQ SEQUENCE 129 AA; 14564 MW; EFFAD88A144A348A CRC64; PALCFQLDIS PGYCWPVKTS QSQMVFNLPT EVQPTAVTVQ HTVDTTLWHI SSAPRDFAVF GLDEKGENKV LLGKFTYDIR EELSQTFELQ TETPRAFWHI KLSVLNNWGN AGHTCIYRVQ VHGKSAGIK // ID G1NGE1_MELGA Unreviewed; 191 AA. AC G1NGE1; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMGAP00000012148}; DE Flags: Fragment; OS Meleagris gallopavo (Common turkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Meleagridinae; Meleagris. OX NCBI_TaxID=9103 {ECO:0000313|Ensembl:ENSMGAP00000012148, ECO:0000313|Proteomes:UP000001645}; RN [1] {ECO:0000313|Ensembl:ENSMGAP00000012148, ECO:0000313|Proteomes:UP000001645} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20838655; RA Dalloul R.A., Long J.A., Zimin A.V., Aslam L., Beal K., Blomberg L.A., RA Bouffard P., Burt D.W., Crasta O., Crooijmans R.P., Cooper K., RA Coulombe R.A., De S., Delany M.E., Dodgson J.B., Dong J.J., Evans C., RA Frederickson K.M., Flicek P., Florea L., Folkerts O., Groenen M.A., RA Harkins T.T., Herrero J., Hoffmann S., Megens H.J., Jiang A., RA de Jong P., Kaiser P., Kim H., Kim K.W., Kim S., Langenberger D., RA Lee M.K., Lee T., Mane S., Marcais G., Marz M., McElroy A.P., RA Modise T., Nefedov M., Notredame C., Paton I.R., Payne W.S., RA Pertea G., Prickett D., Puiu D., Qioa D., Raineri E., Ruffier M., RA Salzberg S.L., Schatz M.C., Scheuring C., Schmidt C.J., Schroeder S., RA Searle S.M., Smith E.J., Smith J., Sonstegard T.S., Stadler P.F., RA Tafer H., Tu Z.J., Van Tassell C.P., Vilella A.J., Williams K.P., RA Yorke J.A., Zhang L., Zhang H.B., Zhang X., Zhang Y., Reed K.M.; RT "Multi-platform next-generation sequencing of the domestic turkey RT (Meleagris gallopavo): genome assembly and analysis."; RL PLoS Biol. 8:E1000475-E1000475(2010). RN [2] {ECO:0000313|Ensembl:ENSMGAP00000012148} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMGAP00000012148}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9103.ENSMGAP00000012148; -. DR Ensembl; ENSMGAT00000013035; ENSMGAP00000012148; ENSMGAG00000011604. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1NGE1; -. DR OMA; WALKTMG; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001645; Chromosome 3. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001645}; KW Reference proteome {ECO:0000313|Proteomes:UP000001645}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMGAP00000012148}. SQ SEQUENCE 191 AA; 21238 MW; FB9ECBB245EEFB7B CRC64; QEILQLTQHA MEKIMENGIW RPNWALKTMG ATVDAERTSE SYGGKSWKNH CLPPLLSTAK PPETLLQPDI SPGNCWAFPR SQGHVVIRLP EEIQLTALTI WHISRAVSPS GEVSSAPREF AVSGVDEAGG ETLLGLFIYD VDGEIAQTFH LQEEPRKAFG HVKLEVWSNW GNAEHTCVYR VEIHGNSQKT A // ID G1NL78_MELGA Unreviewed; 297 AA. AC G1NL78; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 2. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMGAP00000014248}; DE Flags: Fragment; GN Name=SUN2 {ECO:0000313|Ensembl:ENSMGAP00000014248}; OS Meleagris gallopavo (Common turkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Meleagridinae; Meleagris. OX NCBI_TaxID=9103 {ECO:0000313|Ensembl:ENSMGAP00000014248, ECO:0000313|Proteomes:UP000001645}; RN [1] {ECO:0000313|Ensembl:ENSMGAP00000014248, ECO:0000313|Proteomes:UP000001645} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20838655; RA Dalloul R.A., Long J.A., Zimin A.V., Aslam L., Beal K., Blomberg L.A., RA Bouffard P., Burt D.W., Crasta O., Crooijmans R.P., Cooper K., RA Coulombe R.A., De S., Delany M.E., Dodgson J.B., Dong J.J., Evans C., RA Frederickson K.M., Flicek P., Florea L., Folkerts O., Groenen M.A., RA Harkins T.T., Herrero J., Hoffmann S., Megens H.J., Jiang A., RA de Jong P., Kaiser P., Kim H., Kim K.W., Kim S., Langenberger D., RA Lee M.K., Lee T., Mane S., Marcais G., Marz M., McElroy A.P., RA Modise T., Nefedov M., Notredame C., Paton I.R., Payne W.S., RA Pertea G., Prickett D., Puiu D., Qioa D., Raineri E., Ruffier M., RA Salzberg S.L., Schatz M.C., Scheuring C., Schmidt C.J., Schroeder S., RA Searle S.M., Smith E.J., Smith J., Sonstegard T.S., Stadler P.F., RA Tafer H., Tu Z.J., Van Tassell C.P., Vilella A.J., Williams K.P., RA Yorke J.A., Zhang L., Zhang H.B., Zhang X., Zhang Y., Reed K.M.; RT "Multi-platform next-generation sequencing of the domestic turkey RT (Meleagris gallopavo): genome assembly and analysis."; RL PLoS Biol. 8:E1000475-E1000475(2010). RN [2] {ECO:0000313|Ensembl:ENSMGAP00000014248} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMGAP00000014248}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9103.ENSMGAP00000014248; -. DR Ensembl; ENSMGAT00000015176; ENSMGAP00000014248; ENSMGAG00000013484. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1NL78; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001645; Chromosome 1. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001645}; KW Reference proteome {ECO:0000313|Proteomes:UP000001645}. FT COILED 19 39 {ECO:0000256|SAM:Coils}. FT COILED 59 79 {ECO:0000256|SAM:Coils}. FT COILED 84 104 {ECO:0000256|SAM:Coils}. FT COILED 122 149 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMGAP00000014248}. SQ SEQUENCE 297 AA; 33198 MW; 0DDB78EC12612797 CRC64; QGELDALRTQ LQADVDKRLG KMAQASQEME ARLQELNAEW QRWHRSVQEE LQGHWQRATG ELQREVSALR RELAGLRSDQ EAVSKHLEAV LEQIKATRAD VEAQMPAWIS RFLAQPRQDD STAGLLLQRE DLQAELRALE LRILAQMREE RGLAARDSIG VALRQGGAGG VTEEQVHLIV DQALKRYSED RVGMVDYALE SAGASVINTR CSETYETKTA LLSLFGIPLW YHSQSPRVIL QPDVNPGNCW AFRGSQGFAV IRLSSLIRPT AVTLEHVPKA LSPQGTIPSA PKDFTVY // ID G1NVP5_MYOLU Unreviewed; 438 AA. AC G1NVP5; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMLUP00000001353}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSMLUP00000001353}; OS Myotis lucifugus (Little brown bat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; OC Vespertilionidae; Myotis. OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000001353, ECO:0000313|Proteomes:UP000001074}; RN [1] {ECO:0000313|Ensembl:ENSMLUP00000001353} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The genome sequence of Myotis lucifugus (Bat)."; RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001074} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSMLUP00000001353} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMLUP00000001353}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAPE02019296; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 59463.ENSMLUP00000001353; -. DR Ensembl; ENSMLUT00000001474; ENSMLUP00000001353; ENSMLUG00000001475. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1NVP5; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001074; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001074}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001074}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 136 159 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 165 190 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 438 AA; 48105 MW; 6A6AA749BB4EA613 CRC64; MRRSPRPGSA ASHKHTPNFY SDNSNSSVST TSGDSSGHRS AGPGPGEPEG RRAQGSSCGE PALSSGVPGG TARAGSSRQK PALRSHSAPT AEGAATVRGG ASEPAGSPVV SEERFNLLST LDLRQEMRSP RVFKSFLNPL FQVLSVFLSL LGEVLVTVYR EVCSIRFLLT AVSLLSLFLT ALWWGLLYLV PPLESEPEML TISEYHERVR TQGQQLQQLQ AELDQLHKEV SSVRAANSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL EKTSRDYEDA NSAYFWNRFS FWNYARPPMV ILEPDVFPGN CWAFEGDQGQ VVIRLPGRVQ LSDITLQHPP PSVAHTGGAN SAPRDFAVYG LQVDDETEVF LGKFTFDMEK SEIQTFHLQN DPPTAFPKVK IQILSNWGHP HFTCLYRVRA HGMRTTEEAG DSATGGPN // ID G1NX26_MYOLU Unreviewed; 908 AA. AC G1NX26; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMLUP00000001909}; DE Flags: Fragment; GN Name=SUN1 {ECO:0000313|Ensembl:ENSMLUP00000001909}; OS Myotis lucifugus (Little brown bat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; OC Vespertilionidae; Myotis. OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000001909, ECO:0000313|Proteomes:UP000001074}; RN [1] {ECO:0000313|Ensembl:ENSMLUP00000001909} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The genome sequence of Myotis lucifugus (Bat)."; RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001074} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSMLUP00000001909} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMLUP00000001909}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAPE02057116; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 59463.ENSMLUP00000001909; -. DR Ensembl; ENSMLUT00000002097; ENSMLUP00000001909; ENSMLUG00000002094. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1NX26; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001074; Unassembled WGS sequence. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001074}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001074}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 393 411 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 418 435 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 510 530 {ECO:0000256|SAM:Coils}. FT COILED 562 589 {ECO:0000256|SAM:Coils}. FT COILED 597 617 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMLUP00000001909}. SQ SEQUENCE 908 AA; 100540 MW; 8E854FFB832CD92F CRC64; ETVDMDFSRL HIYTPPQCVP ENTGYTYALS SSYSSDALDF ETEHKLDPVF DSPRMSRRSL RLATTACSTE GGQAGDADSC LGSTASLKDR TARTAKQHRS ASKLAFSVNR PSRQVAAGQS SGLQGAACLR PPVLDESLIR EQTKVDHFWG LDDDGDLKGG NKAVVQGNGK VAADAAWSNG YTCRACTLLS ERQDTLPAHG ASSRVYSRDR SQKRGASLYV DRILWLATYA SSSFSSFLVQ LFQVVLMKLN YESENSKLKS YESKDRESES FKSESHDSQA RSDPCGRRTG TEFLRADGGL SVNGESCDDF RGTGTEHLET HTATSLWSPR TERAAGTTRP TLSRAGHVAE RALRRIGAAG RSVSQAVWSA LWLAAAAPGT AASAVFWWLG SGWYQFVTLI SWLNVFLLTR CLRDICKFLI LLIPLLLLLG ASLSLRGQDG FLSLLPVFNW THTLRTQRVD DPKDMFTPET SHLSQPPEGA AEASPWRWMS EVERQVTSLS GQCQSRDEKL RELTASLQKL QVRVDQMDDG GAGVSSLVTS VVGQHLKDAD AVTSHHEQLS RISDLEDLLG KLAGKLEAIQ RELEQTKLRT ESAPGQEQHL LSMVAQLERE LELLRSDVAA WRPLRSSCEE AHTVLGKVDA QVRETVRLLL SSDQQDGSLD WLLQKLSAQF VTEPGRSACS EPAWPHSVCQ GLGRWEPCGQ GGRPPMPFAW PRRMQTAQAR VIVNNALKLY SQDKTGMVDF ALESGGGSIL STRCSETHET KTALISLFGI PLWYFSQSPR VVIQPDIYPG NCWAFKGSQG YLVVRLSMEI HPTSFTLEHI PKTLSPTGNI TSAPKDFSVY GLENEYQEEG QLLGQFMFDQ EGESLQTFPV PKRPERAFQI VELRIFSNWG HPEYTCLYRF RVHGDPVK // ID G1P0Y4_MYOLU Unreviewed; 755 AA. AC G1P0Y4; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMLUP00000003445}; DE Flags: Fragment; GN Name=SUN2 {ECO:0000313|Ensembl:ENSMLUP00000003445}; OS Myotis lucifugus (Little brown bat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; OC Vespertilionidae; Myotis. OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000003445, ECO:0000313|Proteomes:UP000001074}; RN [1] {ECO:0000313|Ensembl:ENSMLUP00000003445} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The genome sequence of Myotis lucifugus (Bat)."; RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001074} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSMLUP00000003445} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMLUP00000003445}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAPE02056080; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPE02056081; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 59463.ENSMLUP00000003445; -. DR Ensembl; ENSMLUT00000003780; ENSMLUP00000003445; ENSMLUG00000003774. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1P0Y4; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001074; Unassembled WGS sequence. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001074}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001074}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 198 216 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 246 263 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 309 329 {ECO:0000256|SAM:Coils}. FT COILED 399 437 {ECO:0000256|SAM:Coils}. FT COILED 440 467 {ECO:0000256|SAM:Coils}. FT COILED 507 534 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSMLUP00000003445}. SQ SEQUENCE 755 AA; 84256 MW; 8A279C4E9BFBEE6A CRC64; LAQPQASPAL LSLPRLSLGE GSPSHLIMSR RSQRLTRYSQ GDDDGGSSSG GSSVAGSQST LFKDSPFRTV KRKPSSTKRL SPAPQLGPSD SHTYYSESVV RESYYGSPRA ASLARSSILD DQLHSDRYWS EDLPVRRRRG TGDTESSKIN GLVGAKLSED FPGSSSGYSS EDDYVGYSQA AQQGSGSRLR GAVSRAGSFF WTVVTFPGRL FGLLYWWVGT TWYRLTTAAS LLDVFVLTRR FSSLKAFLWF LLLLLLLTGL TYGQGLENFY PYGLHTVHPT LVSWWAAKGS SQQREVWEPR DSQPRLQAEQ RILSRVHSLE RRLEALAAEF SSAWQKEAMR LERLELRQGA AGEGGGRGLS QEDTLELLEG LVSRREAALK EDFRRDTAAR IQEELGALRA EHQQDAEDLF KKIVRASQES EDRLQQLKSE WHRMTQESFR ENSMKELGRL EGQLAGLRQE LAALALKQSS VADQVGLLPQ QLQAMRDDVE SQFPAWVSQF LLQGGGARSG LLQREEMQAR LQELEGKILR HVAEMQGKSA KEAVASMGLR LQKEGVIGVT EEEVQRIVNQ ALKRYSEDRI GMVDYALESG GSGASVISTR CSETYETKTA LLSLFGIPLW YHSQSPRVIL QPDVHPGNCW AFQGPQGFAV VRLSARIRPT AVTLEHVPKS LSPNSTISSA PKDFAVFGFD EDLQQEGTPL GQFTYNQDGE PIQTFYFQDP KMATYQVVEL RILTNWGHPE YTCIYRFRVH GEPAH // ID G1PDU0_MYOLU Unreviewed; 1257 AA. AC G1PDU0; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMLUP00000008651}; GN Name=SUCO {ECO:0000313|Ensembl:ENSMLUP00000008651}; OS Myotis lucifugus (Little brown bat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; OC Vespertilionidae; Myotis. OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000008651, ECO:0000313|Proteomes:UP000001074}; RN [1] {ECO:0000313|Ensembl:ENSMLUP00000008651} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The genome sequence of Myotis lucifugus (Bat)."; RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001074} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSMLUP00000008651} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMLUP00000008651}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAPE02018255; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 59463.ENSMLUP00000008651; -. DR Ensembl; ENSMLUT00000009494; ENSMLUP00000008651; ENSMLUG00000009448. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; G1PDU0; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000001074; Unassembled WGS sequence. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001074}; KW Reference proteome {ECO:0000313|Proteomes:UP000001074}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1257 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003416888. FT COILED 939 959 {ECO:0000256|SAM:Coils}. FT COILED 989 1009 {ECO:0000256|SAM:Coils}. FT COILED 1195 1215 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1257 AA; 139468 MW; 5B8B5A33D23181E8 CRC64; MKKYRRALAL VSCLSLCSLV WLPSWHVCCK ESSSASASSY YSQDDSCALE NEDVQFQKKD EREGPIPISA ELSGKVGSNL PVPPEEHKLK DNYVVDVQNT ESKTLSPPVI ETHSTIDLHE DSSGVVGSEN IENISSSSTS EIIPISKLDE IEKSGTIPIA KPRETEQPET DCEVGEALDA SVPTDQPSFV SPHESLVGQH IENVSSAHGK GKITKSEFAS KVSASDQGGG DPKSALNASD NLNLKNESSD YIKPEEIDPA SVTSPKEPED IPTFDEWKKK VMEVEKEKSQ SMHASSNGGL HATKKVQKNR NNYASVECGA KILAANPEAK STSAILIENM DLYMLNPCST KIWFVIELCE PIQVKQFDIA NYELFSSTPK DFLVSISDRY PTNKWVKLGT FHGRDERNVQ SFPLDEQMYA KYVKMFIKYI KVELVSHFGS EHFCPLSLIR VFGTSMVEEY EEIADSQYQS ERQELFDEEY DYPLDYNTGE DKASKNLLGS ATNAILNMVN IAANILGAKT EDLTEVGNKS ISENATAAAA PKMPESAPVS APVPSPEFIN TEEQRHDTEP SSPDTPKESP IVQLVQEDEE EASPSTVTLL GSGEQEDESS PWFEAETQIF CSELTTICCI SSFSEYIYKW CSLRVALYRQ RSRTAVSEGK DDLVSAQPSL PLPAESVDVS VLQPPGGELD SKNKEKEAET VVLGDLSSTH QGDLINHTVD VIELEPSRPQ TLSQSLHLDV TPEIHSLSKI EVSEPIKYEA GHTPSQVIPQ ESSAEVDNAI EKKSESFSSI EKPTVMIYET KHSEVIDRTV KEDINSMQII TKLSETVVPP INTAAVPDSE DGEAKMNIAD TPKQMLTPIV ESSSLPEVKE EEQSPEDALL RGLQRTATDF YAELQNSTDL GYANGNLVHG SNQKESVFMR LNNRIKALEV NMSLSGRYLE ELSQRYRKQM EEMQKAFNKT IIKLQNTSRI AEEQDQRQTE SIHLLQAQLT NMTQLVSNLS ATVAELKREV SDRQSYLITS LVLCVVLGLM LCMQRCRNTS QFDGDYSSRL PKSNQYPSPK RCFSSYDDMN LKRRTSFPLI RSKSLQLTGK EVDPNALYIV EPLKFSPEKK KKRCKYKTEK IETIKPADPS HPVANGDIKG RKPFTNQRDF SHVGEVYHSS YKGPPSEGSS ETSSQSEESY FCGISACTSL CNGQSQKTKT EKRALKRRRS KVQDQGKLIK TLIQTKSGSL PSLHDIIKGN KELTVGTFGV TAVSGHI // ID G1PLT6_MYOLU Unreviewed; 2610 AA. AC G1PLT6; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 28. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMLUP00000011833}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSMLUP00000011833}; OS Myotis lucifugus (Little brown bat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; OC Vespertilionidae; Myotis. OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000011833, ECO:0000313|Proteomes:UP000001074}; RN [1] {ECO:0000313|Ensembl:ENSMLUP00000011833} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The genome sequence of Myotis lucifugus (Bat)."; RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001074} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSMLUP00000011833} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMLUP00000011833}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAPE02011293; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPE02011294; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPE02011295; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPE02011296; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAPE02011297; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_006085911.1; XM_006085849.1. DR STRING; 59463.ENSMLUP00000011833; -. DR Ensembl; ENSMLUT00000013007; ENSMLUP00000011833; ENSMLUG00000013003. DR GeneID; 102433744; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; G1PLT6; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000001074; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001074}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001074}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289480 MW; B3E60AA988748D6B CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTISGP SSACKPGRST TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVVILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTA SQPILSTPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DVKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FIFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENIPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NMMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGGGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKEKEK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAIST LQSSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EEGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFDTR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GTWLCDDNFP DDESRQVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNISKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKALS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMVTMD NAEEYVDLIF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID G1PUJ5_MYOLU Unreviewed; 376 AA. AC G1PUJ5; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSMLUP00000014955}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSMLUP00000014955}; OS Myotis lucifugus (Little brown bat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; OC Vespertilionidae; Myotis. OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000014955, ECO:0000313|Proteomes:UP000001074}; RN [1] {ECO:0000313|Ensembl:ENSMLUP00000014955} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The genome sequence of Myotis lucifugus (Bat)."; RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001074} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSMLUP00000014955} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSMLUP00000014955}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAPE02019219; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 59463.ENSMLUP00000014955; -. DR Ensembl; ENSMLUT00000016411; ENSMLUP00000014955; ENSMLUG00000016408. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1PUJ5; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001074; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001074}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001074}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 66 85 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 106 127 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 160 187 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 376 AA; 43265 MW; 3EDAF5E52011CB0C CRC64; MPRTSRSPRD LGDPLDDVAH LCRDPRPRSP RAVQRGRNPC RMTEESSRNM NDPFLFPLRL NGPSPGLIQF VMGCMSWITC LACILRTQVH QVLFNTFRCK LLFQKLMEKT GVLVLFLFGF WVFMMHLPSK VEVWQDDSIN TPLQTVRMYQ EKVRHHTGEI QVLRGTINQL VAKLQEMEAM SDEQKMAQKI MKMIQGDYIE KPDFALKSIG ASIDFEQTSA TYNYNKARSY WNWIRLWNYA QPPDVILEPN VTPGNCWAFS GDRGQVTIRL AQKVYLSNLT LQHIPKTISL SGSLDTAPKD FVIYGMEGSP REEVFLGAFQ FQPENIIQTF QLQNQPTRTF GAVKVKISSN WGNPRFTCLY RVRVHGSVTP PREQPN // ID G1QX32_NOMLE Unreviewed; 389 AA. AC G1QX32; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 01-MAY-2013, sequence version 2. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSNLEP00000005503}; DE Flags: Fragment; GN Name=SUN3 {ECO:0000313|Ensembl:ENSNLEP00000005503}; OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates OS leucogenys). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hylobatidae; Nomascus. OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000005503, ECO:0000313|Proteomes:UP000001073}; RN [1] {ECO:0000313|Ensembl:ENSNLEP00000005503} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Gibbon Genome Sequencing Consortium; RL Submitted (JAN-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSNLEP00000005503} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSNLEP00000005503}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADFV01047111; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01047112; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 61853.ENSNLEP00000005503; -. DR Ensembl; ENSNLET00000005787; ENSNLEP00000005503; ENSNLEG00000004522. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1QX32; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001073; Unassembled WGS sequence. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001073}; KW Reference proteome {ECO:0000313|Proteomes:UP000001073}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSNLEP00000005503}. SQ SEQUENCE 389 AA; 44334 MW; B7FE86C141FCF54E CRC64; TEDSEAFQNS LFKKKHPWRK TTLNKTNLKA GIMSGKTKAR RAAMFFRRCS EDASGRASGN ALLSEDENPD ANGVTRSWKI ILSTMLTLTF LLVGLLNHQW LKETDVPQKS RQLYSIIAEY GSRLYKYQAR LRMPKEQLEL LTKESQTLEN NFRQILFLIE QIDVLKALLR DMKDGMDNNH NWNTHGDPVK DPDHTEEMSN LVNYVLKKLR EDQVQMADYA LKSAGASIIE AGTSESYKNN KAKLYWHGIG FLNHEMPPDI ILQPDVYPGK CWAFPGSQGH TLIKLATKII PTAVTMEHIS EKVSPSGNIS SAPKEFSVYG ITKKCEGEEI FLGQFIYNKT GTTIQTFELQ HAVSEYLLCV KLNIFSNWGH PKYTCLYRFR VHGTPGKYN // ID G1RAI6_NOMLE Unreviewed; 918 AA. AC G1RAI6; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 01-MAY-2013, sequence version 2. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSNLEP00000010233}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSNLEP00000010233}; OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates OS leucogenys). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hylobatidae; Nomascus. OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000010233}; RN [1] {ECO:0000313|Ensembl:ENSNLEP00000010233} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. RN [2] {ECO:0000313|Ensembl:ENSNLEP00000010233} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Gibbon Genome Sequencing Consortium; RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSNLEP00000010233}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADFV01109213; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01109214; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01109215; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01109216; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 61853.ENSNLEP00000010233; -. DR Ensembl; ENSNLET00000010735; ENSNLEP00000010233; ENSNLEG00000008323. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1RAI6; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001073; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0031965; C:nuclear membrane; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001073}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001073}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 392 415 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 422 441 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 516 536 {ECO:0000256|SAM:Coils}. FT COILED 561 595 {ECO:0000256|SAM:Coils}. FT COILED 608 628 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 918 AA; 102464 MW; 8E8FE4EE20F1A02B CRC64; MNMDFSRLHM YSPPQCVPEN TGYTYALSSS YSSDALDFET EHKLDPVFDS PRMSRRSLRL ATTACTLGDG EAVGANNGTS SAVSLKNRAA RTAKQRRSAN KSAFSINHVS RQVTSSGISH GGTDSLQDAV TRRPSVLDES WIREQTTVDH FWGLDDDGDL KGGNKAAIQG NGDVGAATAT AHNGFSCSNC SMLSERKDVL TAHPAAPGPV SRVYSRDRNQ KRGASFYVNR ILWLARYTAS SFSSFLVQLF QVVLMKLNYE SENYKLKTHE SKDCESESYK SKSHESKAHA SYYGRMNVRE VLREDGHLSV NGEALCNDCK GKRHLDAHTA IHLQSPRPPG RAGTLQHIWA CAGYFLLQIL RRIGAAGWAV SRTAWLALWL AVVAPGKAAS GVFWWLGIGW YQLVTLISWL NVFLLTRCLR TICKFLVLLI PLFLLLAGLS LRGQGDFFSF LPVLNWASMH RTQRVDDPQD AFKPATSRLN QPLQGDNEAL PWHWMSGVEQ QVASLSGRCH HHGENLRELT ALLQKLQARV DQMDGGAARP SASVRDAVGQ PLRETDFMAF HQEHEVRISH LEDILGKLRE KSEAIQKELE QTKQKTISAV GEQLLPTVEH LQLELDQLKS ELSSWRHVKS GCETVDAVRE RVDVQVREMV KLLFSEDQQG SSLEQLLQRF SSQFVSKGDL RTRLRDLELQ ILRNVTHHVS VTKQLPTSEA MVSAVSEVGA SGITEAQARA IVNNALKLYS QDKTGMVDFA LESGGGSILS TRCSETYETK TALMSLFGIP LWYFSQSPRV VIQPDIYPGN CWAFKGSQGY LVVRLSMMIH PAAFTLEHIP KTLSPTGNIS SAPKDFAVYG LENEYQEEGQ LLGQFTYDQD GESLQMFQAL KRPDDTAFQI VELRIFSNWG HPEYTCLYRF RVHGEPVK // ID G1RFJ6_NOMLE Unreviewed; 379 AA. AC G1RFJ6; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSNLEP00000011996}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSNLEP00000011996}; OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates OS leucogenys). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hylobatidae; Nomascus. OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000011996, ECO:0000313|Proteomes:UP000001073}; RN [1] {ECO:0000313|Ensembl:ENSNLEP00000011996} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Gibbon Genome Sequencing Consortium; RL Submitted (JAN-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSNLEP00000011996} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSNLEP00000011996}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADFV01077242; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_003273570.1; XM_003273522.2. DR STRING; 61853.ENSNLEP00000011996; -. DR Ensembl; ENSNLET00000012581; ENSNLEP00000011996; ENSNLEG00000009839. DR GeneID; 100593141; -. DR KEGG; nle:100593141; -. DR CTD; 140732; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1RFJ6; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001073; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001073}; KW Reference proteome {ECO:0000313|Proteomes:UP000001073}. FT COILED 155 175 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 379 AA; 42889 MW; D9B2B19E1E4F8046 CRC64; MPRSSRSPGD PGALLEDVAH NPRPRRIAQR GRNTSRMAED TSSNMNDNFL LPVRINAQAP GLTQCMLGCV SWFTCFACSL KTQAQQFLFN TCRCKLLCQK LMEKTGILLL CAFGFWMVSI HLPSKMKVWQ DDSINGPLQS LRLYQEKVRH HSGEIQDLRG SMNLLIAKLQ EMEAMSDEQK MAQKIMKMIH GDYIEKPDFA LKSTGASIDF EHTSATYNHE KAHSYWNWIQ LWNYAQPPDV ILEPNVTPGN CWAFEGDRGQ VTIQLAQKVY LSNLTLQHIP KTISLSGSLD TAPKDLVIYG MEGSPKEEVF LGAFQFQPEN IIQMFPLQNQ PALAFSAVKV KISSNWGNSA FTCLYRVRVH GSVAPPREQP HQNPYPERD // ID G1RH11_NOMLE Unreviewed; 438 AA. AC G1RH11; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSNLEP00000012511}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSNLEP00000012511}; OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates OS leucogenys). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hylobatidae; Nomascus. OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000012511, ECO:0000313|Proteomes:UP000001073}; RN [1] {ECO:0000313|Ensembl:ENSNLEP00000012511} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Gibbon Genome Sequencing Consortium; RL Submitted (JAN-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSNLEP00000012511} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSNLEP00000012511}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADFV01077522; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01077523; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01077524; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_003273631.1; XM_003273583.2. DR STRING; 61853.ENSNLEP00000012511; -. DR Ensembl; ENSNLET00000013122; ENSNLEP00000012511; ENSNLEG00000010261. DR GeneID; 100582984; -. DR KEGG; nle:100582984; -. DR CTD; 6676; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1RH11; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001073; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001073}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001073}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 438 AA; 48159 MW; 75D1E5DD7EB01B0B CRC64; MRRSSRPGSA ASSRKHTPDF FSENSSMSIT SEDSKGLRSA GPGPGEPEGR RARGPSCGEP ALSAGVPGGT TWAGSSQQKP APRSHNWQTA CGAATVRGGA SEPTGSPVVS EEPLDLLPTL DMRQEMPPPR VFKSFLSLLF QGLSVLLSLA GDVLVSMYRE VCSIRFLFTA VSLLSLFLSA FWLGLLYLVS PLENEPKEML TLSEYHQRVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQKLN EDFVRKPDYA LSSVGASIDL QKTSHDYADR NTAYFWNRFS FWNYARPPTV ILEPHVFPGN CWAFEGDQGQ VVIQLPGRVQ LSDITLQHPP PRVEHTGGAN SAPRDFAVFG LQVDDETEVF LGKFTFDVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP HFTCLYRVRA HGVRTSEGAE GSATGGPH // ID G1RYX5_NOMLE Unreviewed; 767 AA. AC G1RYX5; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 01-MAY-2013, sequence version 2. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSNLEP00000018452}; DE Flags: Fragment; GN Name=SUN2 {ECO:0000313|Ensembl:ENSNLEP00000018452}; OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates OS leucogenys). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hylobatidae; Nomascus. OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000018452, ECO:0000313|Proteomes:UP000001073}; RN [1] {ECO:0000313|Ensembl:ENSNLEP00000018452} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Gibbon Genome Sequencing Consortium; RL Submitted (JAN-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSNLEP00000018452} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSNLEP00000018452}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADFV01024598; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01024599; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 61853.ENSNLEP00000018452; -. DR Ensembl; ENSNLET00000019377; ENSNLEP00000018452; ENSNLEG00000015173. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1RYX5; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001073; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0031965; C:nuclear membrane; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001073}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001073}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 230 251 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 263 284 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 323 343 {ECO:0000256|SAM:Coils}. FT COILED 402 451 {ECO:0000256|SAM:Coils}. FT COILED 461 481 {ECO:0000256|SAM:Coils}. FT COILED 528 548 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSNLEP00000018452}. SQ SEQUENCE 767 AA; 85440 MW; DFDB689045C996F8 CRC64; LRGVPVWAAG AFRFSSGEES TSHLIMSRRS QRLTRYSQGD DDGSSSSGGS SVAGSQSTLF KDSPLRTLKR KSSNMKRLSP APQLGPSSDA HTSYYSESLV RESYIGTGFP PRSSLEELHG DANWGEDLRV RRRRGTGGSE SSRASGLVGR KAAEDFLGSS SGYSSEDDYV EDSEGRGSKV TETEPVSSFP AGYSDADQQS SSSRLRSAVS RAGSLLWMVA TSPGRLFRLL YWWAGTTWYR LTTAASLLDV FVLTRRFSSL KTFLWFLLPL LLLTCLTYGA WYFYPYGLQT FHPALVSWWA AKDSRRQDEG WESRDSSPHF QAEQRVMSRV HSLERRLEAL AAEFSSNWQK EAMRLERLEL RQGAPGPGGG GGLTHEDTLA LLEGLVSRRE AALKEDFRRE TAARIQEELS TLRAEHQQDS EDLFKKIVQA SQESEARIQQ LKSEWQSMTQ ESFPESSVKE LRRLEDQLAG LQQELAALAL KQSSVADEVD LLPQQIQAVR DDVESQFPAW ISQFLARGGG GRVGLLQREE MQAQLRELES KILTHVAEMQ GKSAREAAAS LGLTLQKEGV IGVTEEQVHH IVKQALQRYS EDRIGLADYA LESGGASVIS TRCSETYETK TALLSLFGIP LWYHSQSPRV ILQPDVHPGN CWAFQGPQGF AVVRLSARIR PTAVTLEHVP KALSPNSTIS SAPKDFAIFG FDEDLQQEGT LLGKFTYDQD GEPIQTFHFQ APTMATYQVV ELRILTNWGH PEYTCIYRFR VHGEPAH // ID G1RZ51_NOMLE Unreviewed; 1407 AA. AC G1RZ51; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSNLEP00000018528}; GN Name=SUCO {ECO:0000313|Ensembl:ENSNLEP00000018528}; OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates OS leucogenys). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hylobatidae; Nomascus. OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000018528}; RN [1] {ECO:0000313|Ensembl:ENSNLEP00000018528} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. RN [2] {ECO:0000313|Ensembl:ENSNLEP00000018528} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Gibbon Genome Sequencing Consortium; RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSNLEP00000018528}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADFV01184647; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01184648; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01184649; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 61853.ENSNLEP00000018528; -. DR Ensembl; ENSNLET00000019456; ENSNLEP00000018528; ENSNLEG00000015249. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; G1RZ51; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000001073; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001073}; KW Reference proteome {ECO:0000313|Proteomes:UP000001073}. FT COILED 1089 1109 {ECO:0000256|SAM:Coils}. FT COILED 1139 1159 {ECO:0000256|SAM:Coils}. FT COILED 1345 1365 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1407 AA; 156712 MW; 0B65CC5633F7B86C CRC64; MRGFLARPFL STNQHLAQWG SPLTQGKGFK LVQFPSQHPR HSRPFHELCS KEENSATVPK LISLVVSSET IDFSNKTMDS RRDREREKRV LEGKLQLPKA LARTQRARDE GRRAWTSRWP QQRRSPESCE APLSAPLWGP QRGLPGREPL RSRSAIAIAL RTIGPILALL LRLLHLGLGS GGCREDVPPS GRGKKEEKMK KHRRALALVS CLFLCSLVWL PSWHVCCKES SSASASSYYS QDDNCALENE DVQFQKKNTE SKKLSPPVVE TLPTVDLHEE SSNAVVDSET VENISSSSTS EITPISKLDE IEKSGTIPIA KPSETEQSET DCDVGEALDA NAPIEQPSFV SPPDSLVGQH IENVSSSHGK GKITKSEFES KVSATEQGGS DPKSALNASD NLKNESSDYT KPGDIDPTSV ASPKDPEDIP TFDEWKKKVM EVEKEKSQSM HASSNGGSHA TKKVQKNRNN YASVECGAKI LAANPEAKST SAILIENMDL YMLNPCSTKI WFVIELCEPI QVKQLDIANY ELFSSTPKDF LVSISDRYPT NKWIKLGTFH GRDERNVQSF PLDEQMYAKY VKVELLSHFG SEHFCPLSLI RVFGTSMVEE YEEIADSQYH SERQELFDED YDYPLDYNTG EDKSSKNLLG SATNAILNMV NIAANILGAK TEDLTEGNKS ISENATATAA PKMPESTPVS TPVPSPEYVT TEVHTHDMEP STPDTPKESP IVQLVQEEEE EASPSTVTLL GSGEQEDESS PWFESETQIF CSELTTICCI SSFSEYIYKW CSVRVALYRQ RSRTALSKGK DYLVSAQPPL LLAESVDVSV LQPLSGEWKI RNIEREAETV VLGDLSSSMH QDDLVNHTVD AVELEPSHSQ TLSQSLLLDI TPESNPLPKI EVSESVEYEA GHIPSQVIPQ DSSVESDNEA QQKSESFSSI EKPSITYETN KVNELMDNII KEDVNSMQIF TKLSETIVPP INTATVPDNE DGEAKMNIAD TAKQTLISVV DSSSLPEVKE EEQSPEDALL RGLQRTATDF YAELQNSTDL GYANGNLVHG SNQKESVFMR LNNRIKALEV NMSLSGRYLE ELSQRYRKQM EEMQKAFNKT IVKLQNTSRI AEEQDQRQTE AIQLLQAQLT NMTQLVSNLS ATVAELKREV SDRQSYLVIS LVLCVVLGLM LCMQRCRNTS QFDGDYISKL PKSNQYPSPK RCFSSYDDMN LKRRTSFPLM RSKSLQLTGK EVDPNDLYIV EPLKFSPEKK KKRCKYKIEK IETIKPAEPL HPIANGDIKG RKPFTNQRDF SNMGEVYHSS YKGPPSEGSS ETSSQSEESY FCGISACTSL CNGQSQKTKT EKRALKRRRS KVQDQGKLIK TLIQTKSGSL PSLHDIIKGN KEITVGTFGV TAVSGHI // ID G1S1H4_NOMLE Unreviewed; 2610 AA. AC G1S1H4; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 01-MAY-2013, sequence version 2. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSNLEP00000019362}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSNLEP00000019362}; OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates OS leucogenys). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hylobatidae; Nomascus. OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000019362, ECO:0000313|Proteomes:UP000001073}; RN [1] {ECO:0000313|Ensembl:ENSNLEP00000019362} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. RN [2] {ECO:0000313|Ensembl:ENSNLEP00000019362} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG Gibbon Genome Sequencing Consortium; RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSNLEP00000019362}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADFV01192463; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01192464; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01192465; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01192466; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01192467; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01192468; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01192469; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01192470; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01192471; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ADFV01192472; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_012353571.1; XM_012498117.1. DR STRING; 61853.ENSNLEP00000019362; -. DR Ensembl; ENSNLET00000020341; ENSNLEP00000019362; ENSNLEG00000015912. DR GeneID; 100596490; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; G1S1H4; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000001073; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001073}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001073}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289444 MW; 7C2FB71534C6C99B CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAMST LQSSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EDGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RDDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID G1SKG9_RABIT Unreviewed; 2610 AA. AC G1SKG9; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 13-NOV-2013, sequence version 2. DT 11-NOV-2015, entry version 29. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOCUP00000003310}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSOCUP00000003310}; OS Oryctolagus cuniculus (Rabbit). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; OC Oryctolagus. OX NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000003310, ECO:0000313|Proteomes:UP000001811}; RN [1] {ECO:0000313|Ensembl:ENSOCUP00000003310} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000003310}; RG The Genome Sequencing Platform; RA Di Palma F., Heiman D., Young S., Gnerre S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "Genome Sequence of Oryctolagus cuniculus (European rabbit)."; RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001811} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Proteomes:UP000001811}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSOCUP00000003310} RP IDENTIFICATION. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000003310}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOCUP00000003310}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAGW02023936; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAGW02023937; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAGW02023938; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002718172.1; XM_002718126.2. DR RefSeq; XP_008267698.1; XM_008269476.1. DR ProteinModelPortal; G1SKG9; -. DR STRING; 9986.ENSOCUP00000003310; -. DR Ensembl; ENSOCUT00000003817; ENSOCUP00000003310; ENSOCUG00000003816. DR GeneID; 100352700; -. DR KEGG; ocu:100352700; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; G1SKG9; -. DR KO; K12231; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000001811; Chromosome 17. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001811}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001811}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289344 MW; 2ED73460921B383B CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLCRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE GENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPTGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGSNQGAIST LQSSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRMSQ EDGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNISKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID G1STD5_RABIT Unreviewed; 1248 AA. AC G1STD5; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOCUP00000006564}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSOCUP00000006564}; OS Oryctolagus cuniculus (Rabbit). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; OC Oryctolagus. OX NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000006564, ECO:0000313|Proteomes:UP000001811}; RN [1] {ECO:0000313|Ensembl:ENSOCUP00000006564} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000006564}; RG The Genome Sequencing Platform; RA Di Palma F., Heiman D., Young S., Gnerre S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "Genome Sequence of Oryctolagus cuniculus (European rabbit)."; RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001811} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Proteomes:UP000001811}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSOCUP00000006564} RP IDENTIFICATION. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000006564}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOCUP00000006564}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAGW02000003; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAGW02000004; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAGW02000005; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAGW02000006; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAGW02000007; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAGW02000008; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9986.ENSOCUP00000006564; -. DR Ensembl; ENSOCUT00000007593; ENSOCUP00000006564; ENSOCUG00000007590. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; G1STD5; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000001811; Chromosome 13. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001811}; KW Reference proteome {ECO:0000313|Proteomes:UP000001811}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 28 {ECO:0000256|SAM:SignalP}. FT CHAIN 29 1248 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003422987. FT COILED 930 950 {ECO:0000256|SAM:Coils}. FT COILED 980 1000 {ECO:0000256|SAM:Coils}. FT COILED 1186 1206 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSOCUP00000006564}. SQ SEQUENCE 1248 AA; 139297 MW; 08BBBC9CC6FA1571 CRC64; LVKFRMSLAS VSILSVIVVM LPSWRVCCKE SSSTSASSYY SQDDNCALEN EDVQFQKKDE REGPINAELL GKSGSNLPVP PEEHKLKDDH IVDSQTLCKM IKCSDPSTKH SPAVDLHEDS SSVVVGGELL FNSSPKNKVK CSLLYQMSEI EKSGTIPVAK PSETEQSETD CDVGEALEAN APVEQPSFVN PSESLVGQHI ENVSSSHSKE KITKSEFESE VSVSEQDDGN PKSALNASEI LKNESSDYTK PGEIDPTSVG NPKDPEDIPT FDEWKKKVME VEKEKSQSMH PSSNGGPHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LVSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYQSERQ ELFDEDYVDY PLDYNTVEDK SSKNLLGSAT NAILNMVNIA ANILGAKTED LTEGNKSVSE NATTTTAPKM PESTPVSTPV PSPEYITTEA IQDTEPSSPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWFE SETQIFCSEL TTICCISSFS EYIYRWCSVR IARYRQRSRI AVSKGKDLAQ PQLLFPAETV DVSVLHPLSG ELDRKSVDKE AETIVLGDLS NMHQADLMNH TVDAIELEPS HPQTLSQSLL LDITPEINPL SKIEVSMSVK HETEHIPSQV IPQESSVEVD NEIEKKSESF SSLEKPSVIF ETKVHEMMDN IVNEDMSSVQ IITKLTETVV PPMNTATVQD SEDGEAKMNV ADTPKLVLTP VVDSSLPEVK EEEQSPEDAI LRGLQRTATD FYAELQNSTD LGYANGNLVH GSNQKESVFM RLNNRIKALE VNMSLSGRYL EELSQRYRKQ MEEMQKAFNK TIVKLQNTSR IAEEQDQRQT EAIQLLQAQL TNMTQLVSNL SVTVAELKRE VSDRQSYLVI SLVLCVVLGL MLCMQRCRNT SQFDGDYISK LPKSNQYPSP KRCFSSYDDM SLKRRTSFPL IRSKSLQLTG KEVDPNDLYI VEPLKFSPEK KKKRCKYKTE KIETIKPADP LHPIANGDIK ARKPFTNQRD FSNMGEVYHS SYKGPPSEGS SETSSQSEES YFCGISACTS LCNGQSQKTK TEKRALKRRR SKVQDQGKLI KTLIQTKSGS LPSLHDIIKG NKEITVGTFG VTTVSGHI // ID G1SZU7_RABIT Unreviewed; 442 AA. AC G1SZU7; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 19-OCT-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOCUP00000009226}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSOCUP00000009226}; OS Oryctolagus cuniculus (Rabbit). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; OC Oryctolagus. OX NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000009226, ECO:0000313|Proteomes:UP000001811}; RN [1] {ECO:0000313|Ensembl:ENSOCUP00000009226} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000009226}; RG The Genome Sequencing Platform; RA Di Palma F., Heiman D., Young S., Gnerre S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "Genome Sequence of Oryctolagus cuniculus (European rabbit)."; RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001811} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Proteomes:UP000001811}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSOCUP00000009226} RP IDENTIFICATION. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000009226}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOCUP00000009226}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAGW02060162; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9986.ENSOCUP00000016279; -. DR Ensembl; ENSOCUT00000010714; ENSOCUP00000009226; ENSOCUG00000010709. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; KHTPNFY; -. DR Proteomes; UP000001811; Chromosome 4. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001811}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001811}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 136 159 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 165 190 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 203 237 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 442 AA; 48317 MW; 80AE456D608654AC CRC64; MRRSPRPGSA ASPHKHKPNF YSDNSNSSES AISGNSRGHR SAGSGPGELE GRRARGSSCG EPALSAGVPG GATWAGSSRP KPAPRSHNGQ TACGAATVRG GASEPDGAPV VPEEQLDLST LDLRQEMPPR PVFKSFLSLL FQVLSLLLSL TGDALVSVYR EVCSIRFLLT AVSLLGFFLA VLWWGLLYLV PPLENEPKEM LTLSEYHERV RSQGQQLQQL QAELDRLHKE VSSVRSANSE RVAKLVFQRL NEDFVRKPDY ALSSVGASID LEKTSQDYED ANTAYFWKRF SFWNYARPPT VILEPDVFPG NCWAFEGDQG QVVIRLPGRV QLSDITLQHP PPSVAHTGGA NSAPRDFAVF GLQVDDETEV FLGKFTFDVK KSEIQTFHLQ NEPPAAFPKV KIQILSNWGH PRFTCLYRVR AHGVRTSEGA GDSATGVTGG PH // ID G1THH7_RABIT Unreviewed; 586 AA. AC G1THH7; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 13-NOV-2013, sequence version 2. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOCUP00000016388}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSOCUP00000016388}; OS Oryctolagus cuniculus (Rabbit). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; OC Oryctolagus. OX NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000016388, ECO:0000313|Proteomes:UP000001811}; RN [1] {ECO:0000313|Ensembl:ENSOCUP00000016388} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000016388}; RG The Genome Sequencing Platform; RA Di Palma F., Heiman D., Young S., Gnerre S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "Genome Sequence of Oryctolagus cuniculus (European rabbit)."; RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001811} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Proteomes:UP000001811}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSOCUP00000016388} RP IDENTIFICATION. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000016388}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOCUP00000016388}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAGW02071774; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9986.ENSOCUP00000016388; -. DR Ensembl; ENSOCUT00000028486; ENSOCUP00000016388; ENSOCUG00000024234. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1THH7; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001811; Chromosome 4. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001811}; KW Reference proteome {ECO:0000313|Proteomes:UP000001811}. FT COILED 180 200 {ECO:0000256|SAM:Coils}. FT COILED 317 337 {ECO:0000256|SAM:Coils}. FT COILED 377 404 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 586 AA; 63907 MW; 51BF2B20B972A02B CRC64; MSPKARSGVT TGSRAGAGPA SQLLQLLEPA LSGGVARCHV FQAAWVILMD SPAKNHLGRA PGSTPRSSPV PGSCCLPSFT VCSNTARAGA KNDSPGSPLP LTGRRPQRPG EVRLPPPGVC PGPGPANPAS PPPPPGAWYF YPFGLQSIHP AVVSWWAARD GRRQQEVWES RDAAPHFQAE QRLLSHVHSL ERRLEALAAE FSSTWQKEAV RLERLELRQG AAGQAGGGLS QEDTLALLDG LVSRREAALK EDLRKDAAAR LQEELAVLRA EQHRDVEDLF HKIVQAAQES EARVQQLKSE WQSSGTRTAQ ESTLKELGRL EGQLVSLRHE LEALALKQSS VADEVGLLPQ RIQAVRDDVE SQVPAWITQF LHRGGGARTG LLQREELQAQ LRELESKILA QVAETQGTSA REAAAALSLT LQREGVIGVT EEQVHRIVKQ ALQRYSEDRI GMVDYALESG EASVISTRCS ETYETKTALL SLFGIPLWYH SQSPRVILQP DVHPGNCWAF QGPQGFAVVR LSARIRPTAV TLEHVPKALS PNSTISSAPR DFAVFGFDED LQQEGTLLGK FTYDQDGEPI QTFYFQ // ID G1TQE5_RABIT Unreviewed; 360 AA. AC G1TQE5; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 13-NOV-2013, sequence version 2. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOCUP00000019240}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSOCUP00000019240}; OS Oryctolagus cuniculus (Rabbit). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; OC Oryctolagus. OX NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000019240, ECO:0000313|Proteomes:UP000001811}; RN [1] {ECO:0000313|Proteomes:UP000001811} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke inbred {ECO:0000313|Proteomes:UP000001811}; RG The Genome Sequencing Platform; RG The Genome Assembly Team; RA Lindblad-Toh K., Chang J.L., Gnerre S., Clamp M., Lander E.S.; RL Submitted (MAY-2005) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSOCUP00000019240} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000019240}; RG The Genome Sequencing Platform; RA Di Palma F., Heiman D., Young S., Gnerre S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "Genome Sequence of Oryctolagus cuniculus (European rabbit)."; RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|Proteomes:UP000001811} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke inbred {ECO:0000313|Proteomes:UP000001811}; RG The Genome Sequencing Platform; RG The Assembly Computation and Development Core Team; RA Di Palma F., Heiman D., Young S., Gnerre S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "Version 2 of The Genome Sequence of Oryctolagus cuniculus (European RT rabbit)."; RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases. RN [4] {ECO:0000313|Ensembl:ENSOCUP00000019240} RP IDENTIFICATION. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000019240}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. RN [5] {ECO:0000313|Proteomes:UP000001811} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke inbred {ECO:0000313|Proteomes:UP000001811}; RG RefSeq; RL Submitted (SEP-2015) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOCUP00000019240}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9986.ENSOCUP00000019240; -. DR Ensembl; ENSOCUT00000021590; ENSOCUP00000019240; ENSOCUG00000023106. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1TQE5; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001811; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001811}; KW Reference proteome {ECO:0000313|Proteomes:UP000001811}. SQ SEQUENCE 360 AA; 40624 MW; E8BB7A0413D1BCBA CRC64; MMSGRPQLRR GVGLFRGSPE GACCSASFQG LLSEPENPEA HGIPRSWKIT LGITFTLIFL LIGLRNQQWL QEAEFSQTSR QLYAAVAEFG SRLYNYQARL RMPKEHLELL KKESQTLENN FREILFLIEQ INALKVLLRE LQEGAHNRSW RAAQDQDGAA DPDEEMSHLV NYVLKKKLRE DQVQMADYAL KSAVVGASII ESGTSESYKN DKAKLYWHGI GFLNYEMPPD VILQPDVHPG KCWAFPGSQG HALIKLARKI VPSAVTLEHI SEKVSPSGNI SSAPKEFSVY GVMKKCEGEG IFLGQFIYNK TETTVQTFEL QHEVPESISC VKLKVLSNWG HPQYTCLYRF RVHGTPSDHT // ID G1TZM8_RABIT Unreviewed; 372 AA. AC G1TZM8; DT 19-OCT-2011, integrated into UniProtKB/TrEMBL. DT 13-NOV-2013, sequence version 2. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOCUP00000022552}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSOCUP00000022552}; OS Oryctolagus cuniculus (Rabbit). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Lagomorpha; Leporidae; OC Oryctolagus. OX NCBI_TaxID=9986 {ECO:0000313|Ensembl:ENSOCUP00000022552, ECO:0000313|Proteomes:UP000001811}; RN [1] {ECO:0000313|Ensembl:ENSOCUP00000022552} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000022552}; RG The Genome Sequencing Platform; RA Di Palma F., Heiman D., Young S., Gnerre S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "Genome Sequence of Oryctolagus cuniculus (European rabbit)."; RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001811} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Thorbecke {ECO:0000313|Proteomes:UP000001811}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [3] {ECO:0000313|Ensembl:ENSOCUP00000022552} RP IDENTIFICATION. RC STRAIN=Thorbecke {ECO:0000313|Ensembl:ENSOCUP00000022552}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOCUP00000022552}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAGW02060241; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002710880.1; XM_002710834.2. DR STRING; 9986.ENSOCUP00000022552; -. DR Ensembl; ENSOCUT00000024351; ENSOCUP00000022552; ENSOCUG00000022223. DR GeneID; 100358243; -. DR KEGG; ocu:100358243; -. DR CTD; 140732; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G1TZM8; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001811; Chromosome 4. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001811}; KW Reference proteome {ECO:0000313|Proteomes:UP000001811}. FT COILED 156 183 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 372 AA; 42387 MW; B00756369BF4F7ED CRC64; MPRSARSPHG DPCALPDDAA HNARQRRVIQ RGRNTCRTAE DSLPNTSDAL LFPVRLNTPT LGLTQSVLGC VSWFTCLACF LRTQAQQVLF NTCRCKLIFQ RLMEKTGVLV LCAFGFWMFS MHLPSKMEVW QDDSINSPLQ SLRMYQEKVR HHTGEIQDLR GSMNQLLARL QEMEAMSDEQ KMTQKILKMI QGDYIEKPDF ALKSIGASID FEHTSATYNH DKARSYWNWI RLWNYAQPPD VILEPNVTPG NCWAFVGDRG QVTIRLAQKV YLSNLTLQHI PKTISLSGSL DTAPKDFVIY GMESSPREEV FLGAFQFQPE NTIQMFPLQN QPPRAFGSVK VKISSNWGNP RFTCLYRVRV HGSVTPPRTQ PS // ID G1WZI0_ARTOA Unreviewed; 909 AA. AC G1WZI0; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGX53703.1}; GN ORFNames=AOL_s00006g31 {ECO:0000313|EMBL:EGX53703.1}; OS Arthrobotrys oligospora (strain ATCC 24927 / CBS 115.81 / DSM 1491) OS (Nematode-trapping fungus) (Didymozoophaga oligospora). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Orbiliomycetes; OC Orbiliales; Orbiliaceae; Orbilia. OX NCBI_TaxID=756982 {ECO:0000313|EMBL:EGX53703.1, ECO:0000313|Proteomes:UP000008784}; RN [1] {ECO:0000313|EMBL:EGX53703.1, ECO:0000313|Proteomes:UP000008784} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 24927 / CBS 115.81 / DSM 1491 RC {ECO:0000313|Proteomes:UP000008784}; RX PubMed=21909256; DOI=10.1371/journal.ppat.1002179; RA Yang J., Wang L., Ji X., Feng Y., Li X., Zou C., Xu J., Ren Y., Mi Q., RA Wu J., Liu S., Liu Y., Huang X., Wang H., Niu X., Li J., Liang L., RA Luo Y., Ji K., Zhou W., Yu Z., Li G., Liu Y., Li L., Qiao M., Feng L., RA Zhang K.-Q.; RT "Genomic and proteomic analyses of the fungus Arthrobotrys oligospora RT provide insights into nematode-trap formation."; RL PLoS Pathog. 7:E1002179-E1002179(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EGX53703.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADOT01000009; EGX53703.1; -; Genomic_DNA. DR RefSeq; XP_011117642.1; XM_011119340.1. DR EnsemblFungi; EGX53703; EGX53703; AOL_s00006g31. DR GeneID; 22888831; -. DR InParanoid; G1WZI0; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000008784; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008784}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008784}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 333 354 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 63 90 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 909 AA; 99721 MW; C169C7C4A5213696 CRC64; MAGRRSNRAG PAFSGNNLIQ SNPLPPLTAI PSFSYGSPQS ALPKPMSARD TKVNMTDALD RADAAAKARL EAAAKERKLA EQRAADAAAA AESARQSARQ ISEEPPMTAR KRSLRSQSVV SQDDNFYEGD ETSLEIHRII ESARNPVPEA SMLRARSANA GTKEPRSAMK GSREEALRPS AKTRAKYTAG SRRTVPMTPV PIDGERDVTF DEENQALGIA PRLASPSFVS PSPAPQPLHE DVPPAGNDVF NNRQNSPPDT VGYNEGISNL PMSPASPNDD PSRPKTSLLI SQQASVTTSW FSPAKTGLNS IMGGIQKTAV ILGIFFRNLF GPLLYYLLLG FFIFGITTLS YNFIQRSSYS TSTPPSSSDE LIYRLMALEK QVVEFQKGQE LEKKSYKTME QQLVSIRGAM ATYSSVSSQL DKHTRNYQTD RRASSAAMST MSAQIDQFDR SIKRNDEAAK EQQGNLKIVA SQVTEIQGEI EGINKGIQIL QKSQELSERA FQRIEESLPK QLAARVDPQT GKLIVAPELL RYLQTVLRED IQNEMIRFAQ THGGSSAAVR SNSDSYNWQD FLKTNAAKLQ GYIGDISEEK WRKAISDGIV VTREDMMKVI REQLDSARGV AERNNNDLMR KLVLEAEGVA NNAASRAATS ISSAALAAVT NYMRNFQGSS STSSRYGDAL IQAALHQYSA TILQKPDYAL LAHGTLIDPR LTSSTYDPYD TPGLLGKLTA FLRPGPNEPA HILTESTNVG DCWSFPQASG QVSLLLAEPI YPTDVTIDHV PRGISGDVSS APQEIEVWVK IEDEFLRDQA GKAASVAIGE VSDNASTRHY LANGYVRVAS FIYDINSQYP IQTFALPIEL EKLGVSVRSV SFRILSNWGR KEYTSIYRLR VHGITLKDQF EGQETAARI // ID G1XUL9_ARTOA Unreviewed; 930 AA. AC G1XUL9; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGX43221.1}; GN ORFNames=AOL_s00215g677 {ECO:0000313|EMBL:EGX43221.1}; OS Arthrobotrys oligospora (strain ATCC 24927 / CBS 115.81 / DSM 1491) OS (Nematode-trapping fungus) (Didymozoophaga oligospora). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Orbiliomycetes; OC Orbiliales; Orbiliaceae; Orbilia. OX NCBI_TaxID=756982 {ECO:0000313|EMBL:EGX43221.1, ECO:0000313|Proteomes:UP000008784}; RN [1] {ECO:0000313|EMBL:EGX43221.1, ECO:0000313|Proteomes:UP000008784} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 24927 / CBS 115.81 / DSM 1491 RC {ECO:0000313|Proteomes:UP000008784}; RX PubMed=21909256; DOI=10.1371/journal.ppat.1002179; RA Yang J., Wang L., Ji X., Feng Y., Li X., Zou C., Xu J., Ren Y., Mi Q., RA Wu J., Liu S., Liu Y., Huang X., Wang H., Niu X., Li J., Liang L., RA Luo Y., Ji K., Zhou W., Yu Z., Li G., Liu Y., Li L., Qiao M., Feng L., RA Zhang K.-Q.; RT "Genomic and proteomic analyses of the fungus Arthrobotrys oligospora RT provide insights into nematode-trap formation."; RL PLoS Pathog. 7:E1002179-E1002179(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EGX43221.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ADOT01000320; EGX43221.1; -; Genomic_DNA. DR RefSeq; XP_011128181.1; XM_011129879.1. DR EnsemblFungi; EGX43221; EGX43221; AOL_s00215g677. DR GeneID; 22899131; -. DR InParanoid; G1XUL9; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008784; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008784}; KW Reference proteome {ECO:0000313|Proteomes:UP000008784}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 930 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003427733. SQ SEQUENCE 930 AA; 102509 MW; 22F08B95DFDEA8DC CRC64; MKGKCSRAWL FQLHLLGLAT TTAAALITGS SSEPKQPTVT IGSSLGVTTC PFRTANYINH GLPQQCLRTS RVVSAGGGAV YALNATGRLE VSDDNPRASA SPSVEEVGLA TTTSIDLGQK FERPITSPSE DTPPPTEDAE TPIDTANFLS FEEWKEQNLA KAGQSTENFG ERPPKEARRR PGPLSSALDA LGVDDEIELN FEGETEEQGE VRVRDEAKPS ETLEPTKHVK SKDAGKTCKE RFNYASFDCA ATVHKTNPES KGASSILVEN KDSYMLNKCG AKNKFIIVEL CDDILVDTVV LANYEFFSSM FRTFKVSVSD RYPVKSAGWK ELGLFEAKNS REIQAFLVEN PLIWARYIRI EFLTHFGKEF YCPISLLRVH GTTMMEEFRH QEEQSRGYTD ESEEMEAEAI AEPIQEATKE EQAAHEAESS SRIIAEESSA PTRDYAPPIQ STASVPPDSV GPALAEPTEK QDHTISEASN TPAVGSEHQE KDVKSSSSIS QSPPSTPTPS AQKERRGHSS PETDPTITSV PVSEKAPPKA ASRSSRPNSN DSYQAPNSKE NNIPRNASST VSDPSRQPSA VSSTAPTPTT QESFFKTIHK RLQLLEANAT LSLQYIEEQS KLMRDTFSKM EKRQFNKSAS YLDQLNTTVF KELREFRTQY DQLWQSTVIA LDTQREVSER EILAISARLT MLADEVVFQK RMTIAQSFLM LVIVCLIVFS KTQHLETGIV RTVFSGRSQD HLGLESPPTS PSPPRNRPKS GRAASHKRRL REYVHQRRDS SDLDIPPVAT LKRNPIPAQF YRGARNVSVD SISDVIQLDD DGSIVSPPLL PQLTQSSPAT PSGLRAVKER GWGSSASDGE YMTPDMRERW RPTSPLSKVS GAEEESLDGE AGERVPDTDR ATPVGDREAS QPYLESQNND SDGAVESFFL // ID G2HHI0_PANTR Unreviewed; 752 AA. AC G2HHI0; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Sad1/unc-84 protein-like 1 {ECO:0000313|EMBL:BAK63188.1}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. OX NCBI_TaxID=9598 {ECO:0000313|EMBL:BAK63188.1}; RN [1] {ECO:0000313|EMBL:BAK63188.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Testis {ECO:0000313|EMBL:BAK63188.1}; RX PubMed=21484476; DOI=10.1007/s10142-011-0220-9; RA Kim R.N., Kim D.W., Choi S.H., Chae S.H., Nam S.H., Kim D.W., Kim A., RA Kang A., Park K.H., Lee Y.S., Hirai M., Suzuki Y., Sugano S., RA Hashimoto K., Kim D.S., Park H.S.; RT "Major chimpanzee-specific structural changes in sperm development- RT associated genes."; RL Funct. Integr. Genomics 11:507-517(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK306194; BAK63188.1; -; mRNA. DR RefSeq; NP_001267341.1; NM_001280412.1. DR GeneID; 472263; -. DR KEGG; ptr:472263; -. DR CTD; 23353; -. DR KO; K19347; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 227 250 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 257 275 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 350 370 {ECO:0000256|SAM:Coils}. FT COILED 395 429 {ECO:0000256|SAM:Coils}. FT COILED 442 462 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 752 AA; 83532 MW; 28411CAF53E31F1C CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGANSGTSSA VSLKNRAART AKQRRSTNKS AFSINHVSRQ VTSSGVSHGG TVSLQDAVTR RPPVLDESWI REQTTVDHFW GLDDDGDLKG GNKAAIQGNG DVGAAAAASA HNGFSCSNCS MLSERKDVLT AHPAAPGPVS RVYSKDRNQK WKAASGVFWW LGIGWYQFVT LISWLNVFLL TRCLRNICKF LVLLIPLFLL LGLSLRGQGN FFSFLPVLNW ASMHRTQRVD DPQDVFKPTT SRLKQPLQGD SEALPWHWMS GVEQQVASLS GQCHHHGENL RELTALLQKL QARVDQMDGG AAGPSASVRD AVGQPPRETD FKAFHQEHEV RISHLEDILG KLREKSEAIQ KELEQTKQKT ISAVGEQLLP TVEHLQLELD QLKSELSSWR HLKTGCETVD AVQERVDVQV REMVKLLFSE DQQGGSLEQL LQRFSSQFVS KGDLHTMLRD LQLQILRNVT HHVSVTKRLP TSEAVVSAVS EAGASGITEA QARAIVNNAL KLYSQDKTGM VDFALESGGG SILSTRCSET YETKTALMSL FGIPLWYFSQ SPRVVIQPDI YPGNCWAFKG SQGYLVVRLS MMIHPAAFTL EHIPKTLSPT GNISSAPKDF AVYGLENEYQ EEGQLLGQFT YDQDGESLQM FQALKRPDDT AFQIVELRIF SNWGHPEYTC LYRFRVHGEP VK // ID G2QCE5_MYCTT Unreviewed; 982 AA. AC G2QCE5; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AEO58121.1}; GN ORFNames=MYCTH_2305218 {ECO:0000313|EMBL:AEO58121.1}; OS Myceliophthora thermophila (strain ATCC 42464 / BCRC 31852 / DSM 1799) OS (Sporotrichum thermophile). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae; OC Myceliophthora. OX NCBI_TaxID=573729 {ECO:0000313|EMBL:AEO58121.1, ECO:0000313|Proteomes:UP000007322}; RN [1] {ECO:0000313|EMBL:AEO58121.1, ECO:0000313|Proteomes:UP000007322} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 42464 / BCRC 31852 / DSM 1799 RC {ECO:0000313|Proteomes:UP000007322}; RX PubMed=21964414; DOI=10.1038/nbt.1976; RA Berka R.M., Grigoriev I.V., Otillar R., Salamov A., Grimwood J., RA Reid I., Ishmael N., John T., Darmond C., Moisan M.-C., Henrissat B., RA Coutinho P.M., Lombard V., Natvig D.O., Lindquist E., Schmutz J., RA Lucas S., Harris P., Powlowski J., Bellemare A., Taylor D., Butler G., RA de Vries R.P., Allijn I.E., van den Brink J., Ushinsky S., Storms R., RA Powell A.J., Paulsen I.T., Elbourne L.D.H., Baker S.E., Magnuson J., RA LaBoissiere S., Clutterbuck A.J., Martinez D., Wogulis M., RA de Leon A.L., Rey M.W., Tsang A.; RT "Comparative genomic analysis of the thermophilic biomass-degrading RT fungi Myceliophthora thermophila and Thielavia terrestris."; RL Nat. Biotechnol. 29:922-927(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP003004; AEO58121.1; -; Genomic_DNA. DR RefSeq; XP_003663366.1; XM_003663318.1. DR STRING; 573729.XP_003663366.1; -. DR EnsemblFungi; AEO58121; AEO58121; MYCTH_2305218. DR GeneID; 11509244; -. DR KEGG; mtm:MYCTH_2305218; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; G2QCE5; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007322; Chromosome 3. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007322}; KW Reference proteome {ECO:0000313|Proteomes:UP000007322}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 27 {ECO:0000256|SAM:SignalP}. FT CHAIN 28 982 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003435459. FT COILED 653 680 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 982 AA; 105460 MW; 558B1D44FC047E3C CRC64; MRAPADCLKT FAPALVLLGL HALGVHGSRA GSETAATAAP AATDEVCESR TINYITHSLP QQCLRTSWTS PSPAATDEST SHATITATAT AVGESPATDP DHGTAQGQAQ DTQEELAASS FMSFEEWKEM MLRKSGQDPA NIRSHRQREH RERDPSMQSG DVYSFGEEGE ISLEFDALAE KVSEIASSTD KATPKAKEVV KEEQVLYDDG KTQYYRSKDA GKTCKERFSY SSFDAGATVL KTSPGAKNAK AILVENKDSY MLLECRAKNK FVIVELSDDI LVDTVVLANF EFFSSMIRKF RVSVSDRYPV KMDKWVELGT FEARNSRDMQ AFLIEHPQIY TKYIRIEFLS HWGNEFYCPI SLLRVHGTRM LDTWKEPSHD DEPEQIEPPP GSTAETQQVQ KPAGSDNTSS VADEEKAAPR TPSTETGLTP WSPLFQGNFS LQVCELPSPT AAEPTPIDSG LNGLPKEPAA ASDSATPRPS AARTVDERIQ ASNSSPAEPV GSAEASASHR QSAGSASSGV YSTPSQASNN GTVSSTGQRQ SDSRTNATDN TSSATPTTPR NKTSSASSAS ASPTVQESFF KAITKRLQLL ESNTSLSLQY IEEQSRFLQE VLLKMERKQI TRVDSFLDTL NKTVLSELHN VRTQYDQIWQ STVLALETQR EQSQREIVAL TSRLNVLADE VVFQKRMAIL QSVLLLSCLV LVIFSSRGGL AALDSAPFPP PWASSPTGYR RYGHAHSDSF SGMSMPGSPP LQGQQTGGAA AAAAAAAATP TTSAFQRQTY PTTTSYKDKS LPLTPPSEYS RESTPATHPN RSARPSYYYS GNQEEGAEGE GEEGEEGDEA GGEGGTASRQ FRRHVTTPTT TPAAAAAAAA AAGPSAQAAA STAQAQQTPS SGGEAEVASI RSDPGLLQRD DAASSISKGK EQQGDDDDDD DEGEEEEEEE EKGGCRVSPA LLRARSASSH SASSQPNGGL RKPLPALPED PS // ID G2RB29_THITE Unreviewed; 1013 AA. AC G2RB29; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AEO69000.1}; GN ORFNames=THITE_2118928 {ECO:0000313|EMBL:AEO69000.1}; OS Thielavia terrestris (strain ATCC 38088 / NRRL 8126) (Acremonium OS alabamense). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae; OC Thielavia. OX NCBI_TaxID=578455 {ECO:0000313|EMBL:AEO69000.1, ECO:0000313|Proteomes:UP000008181}; RN [1] {ECO:0000313|EMBL:AEO69000.1, ECO:0000313|Proteomes:UP000008181} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 38088 / NRRL 8126 {ECO:0000313|Proteomes:UP000008181}; RX PubMed=21964414; DOI=10.1038/nbt.1976; RA Berka R.M., Grigoriev I.V., Otillar R., Salamov A., Grimwood J., RA Reid I., Ishmael N., John T., Darmond C., Moisan M.-C., Henrissat B., RA Coutinho P.M., Lombard V., Natvig D.O., Lindquist E., Schmutz J., RA Lucas S., Harris P., Powlowski J., Bellemare A., Taylor D., Butler G., RA de Vries R.P., Allijn I.E., van den Brink J., Ushinsky S., Storms R., RA Powell A.J., Paulsen I.T., Elbourne L.D.H., Baker S.E., Magnuson J., RA LaBoissiere S., Clutterbuck A.J., Martinez D., Wogulis M., RA de Leon A.L., Rey M.W., Tsang A.; RT "Comparative genomic analysis of the thermophilic biomass-degrading RT fungi Myceliophthora thermophila and Thielavia terrestris."; RL Nat. Biotechnol. 29:922-927(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP003012; AEO69000.1; -; Genomic_DNA. DR RefSeq; XP_003655336.1; XM_003655288.1. DR STRING; 578455.XP_003655336.1; -. DR EnsemblFungi; AEO69000; AEO69000; THITE_2118928. DR GeneID; 11524339; -. DR KEGG; ttt:THITE_2118928; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008181; Chromosome 4. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008181}; KW Reference proteome {ECO:0000313|Proteomes:UP000008181}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 30 {ECO:0000256|SAM:SignalP}. FT CHAIN 31 1013 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003437092. FT COILED 659 686 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1013 AA; 109540 MW; 0F6C8DA7BCB85215 CRC64; MRSLGNWAKR IAPVLAVLVL LGLHAGSAHA SRSAPGATTA ASAPAPAVTE VCESRTINYI THTLPQQCLR TSWTSPTPAA TETDNSTVHP TITTVAPSDL SDSPASAQDN GTAAQEQDAQ EELSASSFMS FEEWKEMMLR KSGQDPASIK AHKQREHRER DPGAGTGDSD SFGEEGEISL DFDALAAKVS EITSPSPGAT VPDTGKDVKE EQILYDDGKT QYYRSKDAGK TCKERFSYAS FDAGATVLKT SPGAKNAKAV LVENKDSYML MECRTKNKFI IVELSDDILV DTVVLANFEF FSSMIRKFRV SVSDRYPVKM DKWVDLGTFE ARNSRDIQAF LIEHPQIYTK YIRIEFLSHW GNEFYCPVSL LRVHGTRMLD TWKEPSHDDE PDHIEGSGEE QVTEVHNVQG TTAGNDDPSA VEQEESIPQA TIETGLTPWR PIFFSNFSLE MCELRSPTTP EHVKSDSNKP ANKSVGAPDS VTPRPSPVQN ADEASQPSSI SASEPASSHT PVASPGQATS VPPSAVASPP QTGNNSTANG SSQKQADGRV DTADSSASAT TTTSRNKTSS VSSSPSASPT VQESFFKTVT KRLQLLESNT SLSLQYIEEQ SRFLQDVLLK MERKQITRVD AFLDTLNKTV LSELRNVRTQ YDQIWQSTVI ALETQREQSE REIVALTSRL NILADEVVFQ KRMAIIQSVM LLCCLVLVIF ARGGLSSAVD STSFSAFPQS TTPYRRHGSS YDYGYSSESM AGSPSPRPGS SPPSRYAAAG SDASASLAAS ALPRRLYTTS YRDKTLPLTP PSEYSRESTP VTRLHTSPEP RPPTSSYDQD PDRVASEEAG QPLRHADPTP SPSPGPESNG APAQANAADA AAAASASTKV PSTLSGTAPQ PSRQPAPTAA PLREAEREHD HPQPQPQPQP GRHHQQKQHR QNKPDEEQDE ASAAQGQESN DHDDDNDDTN TPPRPVRSHS TTSHLVSSSS SGGGSSGGLR KPLPALPEDA SPNDDIWQLV QPT // ID G2WN26_YEASK Unreviewed; 587 AA. AC G2WN26; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=K7_Yor154wp {ECO:0000313|EMBL:GAA26469.1}; GN Name=K7_YOR154W {ECO:0000313|EMBL:GAA26469.1}; GN ORFNames=SYK7_065391 {ECO:0000313|EMBL:GAA26469.1}; OS Saccharomyces cerevisiae (strain Kyokai no. 7 / NBRC 101557) (Baker's OS yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=721032 {ECO:0000313|Proteomes:UP000001608}; RN [1] {ECO:0000313|Proteomes:UP000001608} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Kyokai no. 7 / NBRC 101557 {ECO:0000313|Proteomes:UP000001608}; RX PubMed=21900213; DOI=10.1093/dnares/dsr029; RA Akao T., Yashiro I., Hosoyama A., Kitagaki H., Horikawa H., RA Watanabe D., Akada R., Ando Y., Harashima S., Inoue T., Inoue Y., RA Kajiwara S., Kitamoto K., Kitamoto N., Kobayashi O., Kuhara S., RA Masubuchi T., Mizoguchi H., Nakao Y., Nakazato A., Namise M., Oba T., RA Ogata T., Ohta A., Sato M., Shibasaki S., Takatsume Y., Tanimoto S., RA Tsuboi H., Nishimura A., Yoda K., Ishikawa T., Iwashita K., Fujita N., RA Shimoi H.; RT "Whole-genome sequencing of sake yeast Saccharomyces cerevisiae Kyokai RT no. 7."; RL DNA Res. 18:423-434(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DG000051; GAA26469.1; -; Genomic_DNA. DR EnsemblFungi; GAA26469; GAA26469; SYK7_065391. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001608; Chromosome XV. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001608}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 587 AA; 67396 MW; E882D6830F68BDAB CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGIHDTTV ITTGRTTNVQ KEHSSPLSTG SLRTHDFRQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGAERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNEWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWTILG EFEAGNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKQMISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IVQQAIR // ID G2X8W3_VERDV Unreviewed; 955 AA. AC G2X8W3; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EGY15431.1}; GN ORFNames=VDAG_06595 {ECO:0000313|EMBL:EGY15431.1}; OS Verticillium dahliae (strain VdLs.17 / ATCC MYA-4575 / FGSC 10137) OS (Verticillium wilt). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Glomerellales; OC Plectosphaerellaceae; mitosporic Plectosphaerellaceae; Verticillium. OX NCBI_TaxID=498257 {ECO:0000313|Proteomes:UP000001611}; RN [1] {ECO:0000313|Proteomes:UP000001611} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=VdLs.17 / ATCC MYA-4575 / FGSC 10137 RC {ECO:0000313|Proteomes:UP000001611}; RX PubMed=21829347; DOI=10.1371/journal.ppat.1002137; RA Klosterman S.J., Subbarao K.V., Kang S., Veronese P., Gold S.E., RA Thomma B.P.H.J., Chen Z., Henrissat B., Lee Y.-H., Park J., RA Garcia-Pedrajas M.D., Barbara D.J., Anchieta A., de Jonge R., RA Santhanam P., Maruthachalam K., Atallah Z., Amyotte S.G., Paz Z., RA Inderbitzin P., Hayes R.J., Heiman D.I., Young S., Zeng Q., Engels R., RA Galagan J., Cuomo C.A., Dobinson K.F., Ma L.-J.; RT "Comparative genomics yields insights into niche adaptation of plant RT vascular wilt pathogens."; RL PLoS Pathog. 7:E1002137-E1002137(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS572708; EGY15431.1; -; Genomic_DNA. DR RefSeq; XP_009657594.1; XM_009659299.1. DR EnsemblFungi; EGY15431; EGY15431; VDAG_06595. DR GeneID; 20708058; -. DR KEGG; vda:VDAG_06595; -. DR InParanoid; G2X8W3; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001611; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001611}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001611}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 733 754 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 698 718 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 955 AA; 105727 MW; E559119D0BBB6E75 CRC64; MVTIDHYSIC YIVVNVGAEY EASFRVPRKR DYPTRLSEEA MFWHAQGWAV LARLMLVTIP AGSCESDTPI PSANEPTSTC HARDINYVTD TLPEQCYRVR WQHAESIVFN DGDQTAEYIG HDPSFLDTRP VDREENDRQP STASVETTNS GDDHTEATTF MSFEDWKDLK AREAEREAQD TNPDSEKALP PPQGHDSREE SDIALKVEAV SEELSSIAAP PRQSLAGAGE TEQPSEPVLY DDGKAQYYRS KDAGKTCKER FSYSSFDAGA TVLKTNTGAK NAKAILVENK DSYMLLECAA DNKFAIVELT DDILIDTVVL ANFEFFSSMI RHFKVSVSDR YPVKVDKWKD LGVFEAKNSR DIQPFLVENP LIWAKYVRIE FLTHYGNEYY CPVSLLRVHG TRMLDSWKDT EAPPDEDDAE DEPVDAVLDL IQDLDIDQAQ RVPPPDNSEA DEVTRQGQSI WSDVGNSVPS SPLQASLVQE NPLDFTCPVN TAVLEEAASK PRSTVDRLPH ETESQKADVY PAAHSDDLLL SQRLKEHWTG PLDRVDDPSD LVTGITSTDS TSIIPVTVAT SRPTSVPHIS SQNASRIASG SPKRPSPVAQ APTRSKNGTS ASVPPASPTV QESFHKAVSK RLQLLESNVT LSLEYLEEQS RFLQQSQRAS ERRQLAKVDL LLDSLNHTVL SELRHVRQQY DQMWQSTVMA LENQRDQSQR ELVALGSRLN VLADEVVFQK RMAILQAILL LSCLVMVIFS RTIVSVSPNQ MDMPFSSSRQ YQHSLPRALH SSGLGRSHSG HLSTSIVEYD HEGDAQQQGQ GHGTDPFIRV HQHSHGGSLA DARRTSGPRL ERARLATSPI SLSEDDSWVT RQSEGVPISP SISATPPTAN ERHISRSFNP HSSEQILTPS SSDAQDTQSD GIGSDPSSED GSRTPPSPLG RLANTHDLGP GFRKPLPALP EHLPS // ID G2YIQ0_BOTF4 Unreviewed; 1059 AA. AC G2YIQ0; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 16-SEP-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCD51587.1}; GN ORFNames=BofuT4_P018930.1 {ECO:0000313|EMBL:CCD51587.1}; OS Botryotinia fuckeliana (strain T4) (Noble rot fungus) (Botrytis OS cinerea). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Leotiomycetes; OC Helotiales; Sclerotiniaceae; Botrytis. OX NCBI_TaxID=999810 {ECO:0000313|EMBL:CCD51587.1, ECO:0000313|Proteomes:UP000008177}; RN [1] {ECO:0000313|Proteomes:UP000008177} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=T4 {ECO:0000313|Proteomes:UP000008177}; RX PubMed=21876677; DOI=10.1371/journal.pgen.1002230; RA Amselem J., Cuomo C.A., van Kan J.A.L., Viaud M., Benito E.P., RA Couloux A., Coutinho P.M., de Vries R.P., Dyer P.S., Fillinger S., RA Fournier E., Gout L., Hahn M., Kohn L., Lapalu N., Plummer K.M., RA Pradier J.-M., Quevillon E., Sharon A., Simon A., ten Have A., RA Tudzynski B., Tudzynski P., Wincker P., Andrew M., Anthouard V., RA Beever R.E., Beffa R., Benoit I., Bouzid O., Brault B., Chen Z., RA Choquer M., Collemare J., Cotton P., Danchin E.G., Da Silva C., RA Gautier A., Giraud C., Giraud T., Gonzalez C., Grossetete S., RA Gueldener U., Henrissat B., Howlett B.J., Kodira C., Kretschmer M., RA Lappartient A., Leroch M., Levis C., Mauceli E., Neuveglise C., RA Oeser B., Pearson M., Poulain J., Poussereau N., Quesneville H., RA Rascle C., Schumacher J., Segurens B., Sexton A., Silva E., Sirven C., RA Soanes D.M., Talbot N.J., Templeton M., Yandava C., Yarden O., RA Zeng Q., Rollins J.A., Lebrun M.-H., Dickman M.; RT "Genomic analysis of the necrotrophic fungal pathogens Sclerotinia RT sclerotiorum and Botrytis cinerea."; RL PLoS Genet. 7:E1002230-E1002230(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FQ790337; CCD51587.1; -; Genomic_DNA. DR EnsemblFungi; CCD51587; CCD51587; BofuT4_P018930.1. DR InParanoid; G2YIQ0; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008177; Unplaced contigs. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008177}; KW Reference proteome {ECO:0000313|Proteomes:UP000008177}. SQ SEQUENCE 1059 AA; 118011 MW; 1BDD17884862E22A CRC64; MSASRSSFNG EFDQTFEQVT NSREKQGIRK ICFALLYFTH RTIESTKLLG RALSQICGCI TITSGTPWHF IVANSLIHSS QHPAFDPKHY AAEPEWSSHV IDCLRSSWSN TDSPPTTKTN AIGAAGTLES QEIDVVTDTI SNAPFSESVA QTNDETKITD ETSRTSVIQL KSSSPSPGIP SPTASPTTVD EGELNDASFL SFEEWKKQTL EQAGQQDLNL GKRRSAEAAR KRESEAFQNN LESLGDDGEI DLDFGAFRNG GAEQTSRTTK DKGVGSSQDS QEEKSGSGHR KEHRSKDAGK TCKERFSYAS FDAGATVLKT HQGAKNSKAV LIENKDSYML SECKTQNKFL VIELSEDIWI DTLVLANYEF FSSMLRTFRV SVSDRWPVKT DKWKDLGVYE ARNSREIQAF LIENPQIWAR YIRIEFLTHY GKEYYCPLSL VRVHGTRMLE SWKDTEANND DDEEADEDPE EGFVPEAVAE IIQTKSTVMQ AVHVTVGSQT EPTGLYTTRD VQREEMPMET HLKPPPTPTS LWKKPIAREF EMLSIRPLDL CYPSDIPEHI LTSQAAVENE SYNFKTTMKV PPSPEMISTA FTDNGITSSS LSFTGPSAQQ TLSEVKASLA STSLTQETHE SSKVIQDSSH STIPTSSTTI IVNKSQDATS TNKTRGTNTS SGSASLPTIQ ESFFKAVSRR LQLLETNSTL SLKYIEEQSK MLREAFLRVE KRQLQKTTDF LENLNSTVLT ELRVFRQQYD EIWQSTVISL ESQREESRRE ILAISARLNI LADEVVFQKR MSIIQSVLLL LCLGLVIFSR VSTAEPLSFS LHNRRSRVTS NMTNIESPLD TPGYTSRERE DYIGDAASPV NAWSGHHRRQ PSDESVNSRS RSRGWGPPTP ISTYSRSDNE LTPPRSFDET TTNTMTGATT GTFSRLRRSI TMKYQSSNPL LSASREHELL RTSSFGPSLR SHNSSPASFL SVSDAKERGV RDRNPTALAS PPPSDIESHD PMDELNSTPT GIQHDALEGQ SVDDIRESRS LLTPKQEHVN EQINEQKPLP ALPDGGPSP // ID G3ATI7_SPAPN Unreviewed; 694 AA. AC G3ATI7; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGW30950.1}; GN ORFNames=SPAPADRAFT_156436 {ECO:0000313|EMBL:EGW30950.1}; OS Spathaspora passalidarum (strain NRRL Y-27907 / 11-Y1). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; Spathaspora. OX NCBI_TaxID=619300 {ECO:0000313|Proteomes:UP000000709}; RN [1] {ECO:0000313|EMBL:EGW30950.1, ECO:0000313|Proteomes:UP000000709} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=NRRL Y-27907 / 11-Y1 {ECO:0000313|Proteomes:UP000000709}; RX PubMed=21788494; DOI=10.1073/pnas.1103039108; RA Wohlbach D.J., Kuo A., Sato T.K., Potts K.M., Salamov A.A., RA LaButti K.M., Sun H., Clum A., Pangilinan J.L., Lindquist E.A., RA Lucas S., Lapidus A., Jin M., Gunawan C., Balan V., Dale B.E., RA Jeffries T.W., Zinkel R., Barry K.W., Grigoriev I.V., Gasch A.P.; RT "Comparative genomics of xylose-fermenting fungi for enhanced biofuel RT production."; RL Proc. Natl. Acad. Sci. U.S.A. 108:13212-13217(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL996504; EGW30950.1; -; Genomic_DNA. DR RefSeq; XP_007376983.1; XM_007376921.1. DR EnsemblFungi; EGW30950; EGW30950; SPAPADRAFT_156436. DR GeneID; 18871050; -. DR KEGG; spaa:SPAPADRAFT_156436; -. DR InParanoid; G3ATI7; -. DR OMA; EHESSSF; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000709; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000709}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000709}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 637 654 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 437 462 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 694 AA; 76100 MW; B5344AB893B51AA8 CRC64; MESSSETFAS ETSSDISSSS EVSSSEISSS SEISSSLESS SSMSSSSSSE ISSSSVTSSS EISSSSEISS SSEISSSSEI SSSSEISSTE HESSSFEPSS TEHESSSFEP SSTEAESSSA EPSSSSEIES SSITDFEASS SLEPSSSEVS STEEPSSSEA SESSETSSES SSSSSSSSSS SSSSSSSSSS NNTIIDNVHF LSFEEWKKQK IIEKNNHTSA SSISRILSSS SSSSSSISTA SSISSSKCIS ANNTCSTQSL NNTSTASNSP NGTTIEEDAS AKPIKETEGK VYKDKFNYAS VDCAATIVKT NANAKSPSAI LKENKDSYLL NQCSIPNKFV VIELCQDILV DSVVIGNFEF FSSMFKEIRI SVSDRFPTSN WKVLGEFTAK NIRDVQSFKI ENPLIWARYL RLEILSHYGS EFYCPISVVR VHGKTMMEEF KEDTEQQQQQ EQKEQEQQQE KEKPELFSKE EPQMTNILML NQTGNECPII MPHLKLNAFL KDINQTQDYC LPQSEPITST AIPTTQESIY KNIMKRLSLL ESNATLSLLY IEEQSKLLST AFSNLEKRQT ANFNALISSV NVTLINQLTS FKEAFNSLHD QYGNLYETQL QSYHHLLQDS NKKVSTLTSE LTFQKRVTVF NSIIIICLLV YVILTRDTYI DVQQDDETEG KVSTPRSTPF KKFTPTKRNK YKKA // ID G3BEK4_CANTC Unreviewed; 371 AA. AC G3BEK4; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGV60566.1}; GN ORFNames=CANTEDRAFT_96012 {ECO:0000313|EMBL:EGV60566.1}; OS Candida tenuis (strain ATCC 10573 / BCRC 21748 / CBS 615 / JCM 9827 / OS NBRC 10315 / NRRL Y-1498 / VKM Y-70) (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; Yamadazyma; OC Yamadazyma/Candida clade. OX NCBI_TaxID=590646 {ECO:0000313|Proteomes:UP000000707}; RN [1] {ECO:0000313|EMBL:EGV60566.1, ECO:0000313|Proteomes:UP000000707} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 10573 / BCRC 21748 / CBS 615 / JCM 9827 / NBRC 10315 / RC NRRL Y-1498 / VKM Y-70 {ECO:0000313|Proteomes:UP000000707}; RX PubMed=21788494; DOI=10.1073/pnas.1103039108; RA Wohlbach D.J., Kuo A., Sato T.K., Potts K.M., Salamov A.A., RA LaButti K.M., Sun H., Clum A., Pangilinan J.L., Lindquist E.A., RA Lucas S., Lapidus A., Jin M., Gunawan C., Balan V., Dale B.E., RA Jeffries T.W., Zinkel R., Barry K.W., Grigoriev I.V., Gasch A.P.; RT "Comparative genomics of xylose-fermenting fungi for enhanced biofuel RT production."; RL Proc. Natl. Acad. Sci. U.S.A. 108:13212-13217(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL996528; EGV60566.1; -; Genomic_DNA. DR RefSeq; XP_006689780.1; XM_006689717.1. DR EnsemblFungi; EGV60566; EGV60566; CANTEDRAFT_96012. DR GeneID; 18250568; -. DR KEGG; cten:CANTEDRAFT_96012; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000707; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000707}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000707}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 321 339 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 371 AA; 43374 MW; A6DCFF68A75F12F4 CRC64; MSFESWKKLK IKSDPTLKNT SQISPVKNDE NRFNFASIDC AAKVVKTNEG AQESKSILME NKDSYLVNKC STKDQFLIIE LCQDILIDLI EIGNFEFFSS NFKRFKVSVN ERYEDTNWKS LGEFEASNSR TLQKFKIINP LIWAKFIKIE ILEHYGSEFY CPISLVKVYG KTMLEEFKEQ TIEQPSVETD ECKINSTLPY LGLNEFLNSI PEYCEVVEEE STATNNPQDS IFKNIIKRLS LLESNASLSL LYIEEQSRLL SDSFRKLQME QSHDLDKLLW NFNQTFNQQM LKINQFNQFK LFESNKVISN LANDLSFYKK LLLVNFVLLT VLVCFLLLSK DLPVEIPIPR NYQFKVGSTK KKYKKKSKYR F // ID G3GZK4_CRIGR Unreviewed; 1150 AA. AC G3GZK4; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Protein C1orf9-like {ECO:0000313|EMBL:EGV93284.1}; GN ORFNames=I79_003278 {ECO:0000313|EMBL:EGV93284.1}; OS Cricetulus griseus (Chinese hamster) (Cricetulus barabensis griseus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Cricetidae; Cricetinae; Cricetulus. OX NCBI_TaxID=10029 {ECO:0000313|Proteomes:UP000001075}; RN [1] {ECO:0000313|Proteomes:UP000001075} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CHO K1 cell line {ECO:0000313|Proteomes:UP000001075}; RX PubMed=21804562; DOI=10.1038/nbt.1932; RA Xu X., Nagarajan H., Lewis N.E., Pan S., Cai Z., Liu X., Chen W., RA Xie M., Wang W., Hammond S., Andersen M.R., Neff N., Passarelli B., RA Koh W., Fan H.C., Wang J., Gui Y., Lee K.H., Betenbaugh M.J., RA Quake S.R., Famili I., Palsson B.O., Wang J.; RT "The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell RT line."; RL Nat. Biotechnol. 29:735-741(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH000079; EGV93284.1; -; Genomic_DNA. DR InParanoid; G3GZK4; -. DR Proteomes; UP000001075; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001075}; KW Reference proteome {ECO:0000313|Proteomes:UP000001075}. FT COILED 832 852 {ECO:0000256|SAM:Coils}. FT COILED 882 902 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1150 AA; 127593 MW; 0BB96C981B72FFCE CRC64; METTKLNLPV VEALPTVDLH EDSSSVVVGS ENIENSSSSS TSETTPVSKL DEIEKSGTLS LAKPGETEQS EADCDAGEAP DADTPVEQHA FVSPPESLVG QHIENVSSSH GKEKVTKSEF ESKVSVSGQD GDDPKSGLNA SDSLKNKSSD YADYKKPGET DPAPITGPKD PEDIPTFDEW KKKVMEVEKE KSQSLHPSSN GGPHATKKVQ KNRNNYASVE CGAKILAANP EAKSTSAILI ENMDLYMLNP CSTKIWFVIE LCEPIQVKQF DIANYELFSS TPKDFLVSIS DRYPTNKWIK LGTFHGRDER TVQSFPLDEQ MYAKYVKVEL VSHFGSEHFC PLSLIRVFGT SMVEEYEEIA DSQYQSERQE LFDEDYDYPL DYNTVEDKSS KNLLGSATNA ILNMVNIAAN ILGAKTEDLT EGNKSISENA TATTAPKMPE STGVSTPVPS PEYVINEVRT RDTDPSTSDT PKESPIVQLV QEEEEEASQS TVTLLGSGEQ EDESSSWFES ETQILCSELT SICCISSFSE YIYKWCSVRI ALYRQRSRTA MSKGKEFVSA QPSLLLPVES VEVSLSQPPS GDVDNENMER EAETVVLDDL SSVHQGDLMN HTVDAIEIEP SHPQSLSQSL LLDITPEMNS LPKVEGSESV KYERGHTPLQ VMPQESSVES DDEMGKKPES FSSVEKPSVI YETSKVNEIV DSAVKEDISS IEIITKVSET VPPPLNTAIV PDSEDGETKM SIADTPKQTV TPVMDPSLPE VKEEDQSPED ALLRGLQRTA TDFYAELQNS TDLGYGNGNL VHGSNQKESV FMRLNNRIKA LEVNMSLSGR YLEELSQRYR KQMEEMQKAF NKTIVKLQNT SRIAEEQDQR QTEAIHLLQA QLTNMTQLVS NLSTTVTELK REVSDRQSYL VMSLILCVIL GLMLCMQRCR NTSQFDGDYI SKLPKSNQYP SPKRCFSSYD DMNLKRRTSF PLIRSKSLQF TGKEVDPNDL YIVEPLKFSP EKKKKRCKYK TEKIETIKPA DPLHPIANGD IKGRKPFTNQ RDFSNMGEVY HSSYKGPPSE GSSETSSQSE ESYFCGISAC TSLCNGQTQK TKTEKRALKR RRSKVQDQGK LIKALIQTKS GSLPSLHDII KGNKEITVGA FGVTAVSGHI // ID G3H4J1_CRIGR Unreviewed; 2610 AA. AC G3H4J1; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=E3 ubiquitin-protein ligase {ECO:0000313|EMBL:ERE72331.1}; DE EC=6.3.2.- {ECO:0000313|EMBL:ERE72331.1}; DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:EGV97528.1}; GN ORFNames=H671_5g15090 {ECO:0000313|EMBL:ERE72331.1}, GN I79_005194 {ECO:0000313|EMBL:EGV97528.1}; OS Cricetulus griseus (Chinese hamster) (Cricetulus barabensis griseus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Cricetidae; Cricetinae; Cricetulus. OX NCBI_TaxID=10029 {ECO:0000313|Proteomes:UP000001075}; RN [1] {ECO:0000313|Proteomes:UP000001075} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CHO K1 cell line {ECO:0000313|Proteomes:UP000001075}; RX PubMed=21804562; DOI=10.1038/nbt.1932; RA Xu X., Nagarajan H., Lewis N.E., Pan S., Cai Z., Liu X., Chen W., RA Xie M., Wang W., Hammond S., Andersen M.R., Neff N., Passarelli B., RA Koh W., Fan H.C., Wang J., Gui Y., Lee K.H., Betenbaugh M.J., RA Quake S.R., Famili I., Palsson B.O., Wang J.; RT "The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell RT line."; RL Nat. Biotechnol. 29:735-741(2011). RN [2] {ECO:0000313|EMBL:EGV97528.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Xu X., Nagarajan H., Lewis N.E., Pan S., Cai Z., Liu X., Chen W., RA Xie M., Wang W., Hammond S., Andersen M.R., Neff N., Passarelli B., RA Koh W., Fan C.H., Wang J., Gui Y., Lee K.H., Betenbaugh M.J., RA Quake S.R., Famili I., Palsson B.O., Wang J.; RT "The genomic sequence of the Chinese hamster ovary CHO-K1 cell line."; RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|Proteomes:UP000030759} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17A/GY {ECO:0000313|Proteomes:UP000030759}; RX PubMed=23929341; DOI=10.1038/nbt.2645; RA Brinkrolf K., Rupp O., Laux H., Kollin F., Ernst W., Linke B., RA Kofler R., Romand S., Hesse F., Budach W.E., Galosy S., Muller D., RA Noll T., Wienberg J., Jostock T., Leonard M., Grillari J., Tauch A., RA Goesmann A., Helk B., Mott J.E., Puhler A., Borth N.; RT "Chinese hamster genome sequenced from sorted chromosomes."; RL Nat. Biotechnol. 31:694-695(2013). RN [4] {ECO:0000313|EMBL:ERE72331.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=17A/GY {ECO:0000313|EMBL:ERE72331.1}; RA Brinkrolf K., Rupp O., Laux H., Kollin F., Ernst W., Linke B., RA Kofler R., Romand S., Hesse F., Budach W.E., Galosy S., Muller D., RA Noll T., Wienberg J., Jostock T., Leonard M., Grillari J., Tauch A., RA Goesmann A., Helk B., Mott J.E., Puehler A., Borth N.; RT "Chinese hamster genome sequenced from sorted chromosomes."; RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH000139; EGV97528.1; -; Genomic_DNA. DR EMBL; KE680492; ERE72331.1; -; Genomic_DNA. DR RefSeq; XP_003499219.1; XM_003499171.2. DR GeneID; 100762999; -. DR KEGG; cge:100762999; -. DR CTD; 25831; -. DR KO; K12231; -. DR Proteomes; UP000001075; Unassembled WGS sequence. DR Proteomes; UP000030759; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001075}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:EGV97528.1}; KW Reference proteome {ECO:0000313|Proteomes:UP000001075}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289196 MW; 8067AC6ED6EEF939 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PLVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTTLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSALAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE GENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSIDL DMKQDCSQLV ERINVFKTAF SESEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGVWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPAKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGV DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RAPGETSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPSGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLSQGTISA LQSSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EDGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GTWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSISVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILGNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID G3HAR1_CRIGR Unreviewed; 377 AA. AC G3HAR1; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 10. DE SubName: Full=Sperm-associated antigen 4 protein {ECO:0000313|EMBL:EGW02779.1}; GN ORFNames=I79_007519 {ECO:0000313|EMBL:EGW02779.1}; OS Cricetulus griseus (Chinese hamster) (Cricetulus barabensis griseus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Cricetidae; Cricetinae; Cricetulus. OX NCBI_TaxID=10029 {ECO:0000313|Proteomes:UP000001075}; RN [1] {ECO:0000313|Proteomes:UP000001075} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CHO K1 cell line {ECO:0000313|Proteomes:UP000001075}; RX PubMed=21804562; DOI=10.1038/nbt.1932; RA Xu X., Nagarajan H., Lewis N.E., Pan S., Cai Z., Liu X., Chen W., RA Xie M., Wang W., Hammond S., Andersen M.R., Neff N., Passarelli B., RA Koh W., Fan H.C., Wang J., Gui Y., Lee K.H., Betenbaugh M.J., RA Quake S.R., Famili I., Palsson B.O., Wang J.; RT "The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell RT line."; RL Nat. Biotechnol. 29:735-741(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH000254; EGW02779.1; -; Genomic_DNA. DR InParanoid; G3HAR1; -. DR Proteomes; UP000001075; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001075}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001075}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 87 112 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 139 159 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 377 AA; 41984 MW; 0398145008FA8603 CRC64; MRRSHRPGSA ASSHNHAPDF YSENSNSSHS VTSGDSNGRR SPGPELEQPE GRRARGSSCD LLFQVLSVLL SVAGDALVSV YREVCSIRFL LTAVTLLSVF LAALWWGLLY LVPALENEPK EMLTLSQYHQ RVYSQGQQLQ QLQTELNKLH KEVSSVRAAH SERVAKLVFQ RLNEDFVRKP DYALSSVGAS IDLEKTSSDY EDTNTVYFWN RLSFWNYARP PSVILEFVIC SDGLPKPLQP DVFPGNCWAF EGDQGQVVIR LPGHVQLSDV TLQHPPPTVA HTGGASSAPR DFAVFGLQAD DETEVFLGKF IFDVQKSEIQ TFHLQNDPPS AFPKVKIQIL SNWGHPRFTC LYRVRAHGVR ISEWAEDNAT GVAGGPH // ID G3HE83_CRIGR Unreviewed; 245 AA. AC G3HE83; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Sperm-associated antigen 4-like protein {ECO:0000313|EMBL:EGW00270.1}; GN ORFNames=I79_008860 {ECO:0000313|EMBL:EGW00270.1}; OS Cricetulus griseus (Chinese hamster) (Cricetulus barabensis griseus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Cricetidae; Cricetinae; Cricetulus. OX NCBI_TaxID=10029 {ECO:0000313|Proteomes:UP000001075}; RN [1] {ECO:0000313|Proteomes:UP000001075} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CHO K1 cell line {ECO:0000313|Proteomes:UP000001075}; RX PubMed=21804562; DOI=10.1038/nbt.1932; RA Xu X., Nagarajan H., Lewis N.E., Pan S., Cai Z., Liu X., Chen W., RA Xie M., Wang W., Hammond S., Andersen M.R., Neff N., Passarelli B., RA Koh W., Fan H.C., Wang J., Gui Y., Lee K.H., Betenbaugh M.J., RA Quake S.R., Famili I., Palsson B.O., Wang J.; RT "The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell RT line."; RL Nat. Biotechnol. 29:735-741(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH000311; EGW00270.1; -; Genomic_DNA. DR InParanoid; G3HE83; -. DR Proteomes; UP000001075; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001075}; KW Reference proteome {ECO:0000313|Proteomes:UP000001075}. FT COILED 29 49 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 245 AA; 27949 MW; 1AF5093741D415E9 CRC64; MLHQDDSING PLQSLRMYQE KVRHHTGEIQ DLRGSMNLLI AKLQEMQAMS DKQKMAQKIM KMIHGDYIEK PDFALKSIGA SIDFEHTSAT YNHDKARSYW NWIRLWNYAQ PPDPNVTPGN CWAFAGDRGQ VTIRLAQKVY LSNITLQHIP KTISLSGSLD TAPKDFVIYG MESPPREEVF LGAFQFQPEN TIQMFPLQNQ PPRGFAAVKV KISSNWGNPR FTCLYRVRVH GSVTPPRDSN LESLS // ID G3HUQ1_CRIGR Unreviewed; 916 AA. AC G3HUQ1; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Protein unc-84-like A {ECO:0000313|EMBL:EGW02830.1}; GN ORFNames=I79_014665 {ECO:0000313|EMBL:EGW02830.1}; OS Cricetulus griseus (Chinese hamster) (Cricetulus barabensis griseus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Cricetidae; Cricetinae; Cricetulus. OX NCBI_TaxID=10029 {ECO:0000313|Proteomes:UP000001075}; RN [1] {ECO:0000313|Proteomes:UP000001075} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CHO K1 cell line {ECO:0000313|Proteomes:UP000001075}; RX PubMed=21804562; DOI=10.1038/nbt.1932; RA Xu X., Nagarajan H., Lewis N.E., Pan S., Cai Z., Liu X., Chen W., RA Xie M., Wang W., Hammond S., Andersen M.R., Neff N., Passarelli B., RA Koh W., Fan H.C., Wang J., Gui Y., Lee K.H., Betenbaugh M.J., RA Quake S.R., Famili I., Palsson B.O., Wang J.; RT "The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell RT line."; RL Nat. Biotechnol. 29:735-741(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH000745; EGW02830.1; -; Genomic_DNA. DR RefSeq; XP_003507117.2; XM_003507069.2. DR GeneID; 100769590; -. DR KEGG; cge:100769590; -. DR CTD; 23353; -. DR InParanoid; G3HUQ1; -. DR KO; K19347; -. DR Proteomes; UP000001075; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR InterPro; IPR015880; Znf_C2H2-like. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00355; ZnF_C2H2; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001075}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001075}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 387 407 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 419 438 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 566 600 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 916 AA; 102766 MW; EC2E77E632EF988E CRC64; MDFSRLHTYT PPQCVPENTG YTYALSSSYS SDALDFETEH RLEPVFDSPR MSRRSLRLVT TTATYSSGDS QAVDTHISTS RATPSKEKET RTVKQRRSTS KPAFSINHLS GKGVSSSASH DISCSLRSAT TLRHPVLDES LIREQTKVDH FWGLDDDGDL KGGNKTATQG NGELAAEVTA SNGYTCRDCR MLSARTDALT AHPATHGTTS RVYSRDRTLK HRGASFYLDR TLWLAKYTSS SFASFLVQLF QVFLMKLSFE SETYKLKGYE SRAYESQSYE TKSHGSEAHL GHCGRMTAGE LSRVDGESLC DDCKGKKHLE THTVTHSQLL QLHRVAGAMG RLCTYTGDLL VQALHRTRAA GWSVTKAMWS VLWLAVTAPG KAASGTFWWL GSGWYQFVTL ISWLNVFLLT RCLRNICKVF VLLLPLLLLL GAGLSLWGQS NFLSLLPVLN WTTMQPAQRV DDPEGVHRPG PVPPSPPLKV DYEASQWPRE SDVGQKVASL SAQCHNHDER LAELTFLLQK LQMRVDQVDD GREGLSLWVK DVVGQHLQEM GTTEPLSAKT DLMTFHHEHQ VRLSNLEDIL RKLTEKSEAI QKELEESKLR AGSRAEEQPL LDRVQHLELE LNLLKSQLSD WHHLRTSCEQ ADARIQETVR LMFSEDQHNG SLEWLLQKLS SRFVSKDELQ VLLHDLELKV LQNITHHVTV TGQAPTSEAI VSAMSEAGIS GITEAQAHII VNNALKLYSQ DKTGMVDFAL ESGGGSILST RCSETYETKT ALLSLFGIPL WYFSQSPRVV IQPDIYPGNC WAFKGSQGYL VVRLSMKIYP TTFTMEHIPK TLSPTGNISS APRDFAVYGL ETEYQEEGQP LGRFTYNQEG DSLQMFHTLE RPDQAFQIVE LRVLSNWGHP EYTCLYRFRV HGEPAQ // ID G3I2J5_CRIGR Unreviewed; 730 AA. AC G3I2J5; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Protein unc-84-like B {ECO:0000313|EMBL:EGW01684.1}; GN ORFNames=I79_017631 {ECO:0000313|EMBL:EGW01684.1}; OS Cricetulus griseus (Chinese hamster) (Cricetulus barabensis griseus). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Cricetidae; Cricetinae; Cricetulus. OX NCBI_TaxID=10029 {ECO:0000313|Proteomes:UP000001075}; RN [1] {ECO:0000313|Proteomes:UP000001075} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CHO K1 cell line {ECO:0000313|Proteomes:UP000001075}; RX PubMed=21804562; DOI=10.1038/nbt.1932; RA Xu X., Nagarajan H., Lewis N.E., Pan S., Cai Z., Liu X., Chen W., RA Xie M., Wang W., Hammond S., Andersen M.R., Neff N., Passarelli B., RA Koh W., Fan H.C., Wang J., Gui Y., Lee K.H., Betenbaugh M.J., RA Quake S.R., Famili I., Palsson B.O., Wang J.; RT "The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell RT line."; RL Nat. Biotechnol. 29:735-741(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH001135; EGW01684.1; -; Genomic_DNA. DR RefSeq; XP_007615912.1; XM_007617722.1. DR RefSeq; XP_007648246.1; XM_007650056.1. DR GeneID; 100773056; -. DR CTD; 25777; -. DR InParanoid; G3I2J5; -. DR Proteomes; UP000001075; Unassembled WGS sequence. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001075}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001075}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 198 214 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 226 247 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 417 451 {ECO:0000256|SAM:Coils}. FT COILED 491 511 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 730 AA; 81759 MW; F89E9A4C7AEA7C45 CRC64; MSRRSQRLTR YSQDDNDGSS SSGASSVAGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDSHTSYY SESVVRESYI GSPRAASLAR SALLDDRLHS EPYWSGDLRG RRRRGTGGSE SSKANGLTTE SKVSEDFFGS SSGYSSEDDF AGYMESDQHG SGSGLRSAAS RAGSFVWTLV TFPGRLFGLL YWWVGTTWYR LTTAASLLDV FVLTRSRHFS LNLKTFLWFL LFLLLLTVLT YGAWYFYPFG LHTLQPTLAS WWAAKESKRQ PEVWESRDAS PHFQAEQRIL SRVYSLERRL EALAAEFSSN WQKEAIRLER LELRQGATGH GGGSGLSHED TLTLLEGLVS RREAALKEDL RRDTMSRIQE ELATLRAEHH QDSEDLFRKI VQASQESEAH VQQLKTEWQR MTQEAFQESS VKELGQLEAQ LASLRQELAA LTLRQNSVAD EVGLLPQKIQ AARADVESQF PDWVSRFLLR DKGASSGLLQ RDELHAQLQE LESKILASMA EMQGKSAREA AASLGQTLQK EGVVGVTEEQ VHQIVKQALQ RYSEDRIGMV DYALESGGAS VISTRCSETY ETKTALLSLF GIPLWYHSQS PRVILQPDVH PGNCWAFQGP QGFAVVRLSA RIRPTAVTLE HVPKALSPNS TISSAPKDFA IFGFDEDLQQ EGTLLGTFAY DQDGEPIQTF YFQTSKMATY QVVELRILTN WGHPEYTCIY RFRVHGEPAH // ID G3JHA0_CORMM Unreviewed; 852 AA. AC G3JHA0; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 16-SEP-2015, entry version 13. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EGX91656.1}; GN ORFNames=CCM_05814 {ECO:0000313|EMBL:EGX91656.1}; OS Cordyceps militaris (strain CM01) (Caterpillar fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Cordycipitaceae; OC Cordyceps. OX NCBI_TaxID=983644 {ECO:0000313|EMBL:EGX91656.1, ECO:0000313|Proteomes:UP000001610}; RN [1] {ECO:0000313|EMBL:EGX91656.1, ECO:0000313|Proteomes:UP000001610} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CM01 {ECO:0000313|EMBL:EGX91656.1, RC ECO:0000313|Proteomes:UP000001610}; RX PubMed=22112802; DOI=10.1186/gb-2011-12-11-r116; RA Zheng P., Xia Y., Xiao G., Xiong C., Hu X., Zhang S., Zheng H., RA Huang Y., Zhou Y., Wang S., Zhao G.-P., Liu X., St. Leger R.J., RA Wang C.; RT "Genome sequence of the insect pathogenic fungus Cordyceps militaris, RT a valued traditional Chinese medicine."; RL Genome Biol. 12:R116-R116(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH126402; EGX91656.1; -; Genomic_DNA. DR RefSeq; XP_006671021.1; XM_006670958.1. DR EnsemblFungi; EGX91656; EGX91656; CCM_05814. DR GeneID; 18167832; -. DR KEGG; cmt:CCM_05814; -. DR InParanoid; G3JHA0; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001610; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001610}; KW Reference proteome {ECO:0000313|Proteomes:UP000001610}. SQ SEQUENCE 852 AA; 93455 MW; FDD86573BF43CC06 CRC64; MPPRQKPGQI ISKQRQLPSI CRGQDGLQGS RTAVANGSSK NGKMLISANA RRSVATALLV LATAFGTHAA EQATQATQTG TKPRSNHDGH LTGTSQCEAR TINYITHTLP QSCLTSSWSS TAAVVESTPL TATVPATARA NATSTVNDSE SVENQTNIIE ASETPESADT DSTTAPFMSF EDWKAMMLQK TGQDPQDLKQ VKRSEAQGDE RQRLQSYSGG GLGDEGEISL DFGGHSDQAH AVHGYDGDEE HADESSAEVE QAAIHRSKDA GKTCKERFSF ASFDAGATIL KTSSGAKNSK AILVENKDTY MLLECTTPNK YVIIELTDDI WVDTIVLANF EFFSSMIRHF RVSVSDRYPV KMDKWKELGT FEARNSRDIQ AFLVENPQIW AKYLRLEFLT HYSNEYYCPV SLVRVHGSRM LDKWKDSETG TDDDPVGEID GIAENAIVPQ ENSSEATVFQ ESNTTTADAN DICLLMELTP LQLPLFMCPV DRASALTNNS AVVAEDVSNK QYKKAIQEQI TPEASKSASQ PSTDSEDERA SDNISSKHTK YSTTSTAAAL SPAAAAGKVP AASASNSRSR GNATNIATPS PPSVQEGFFN SVTKRLQQVE TNLTLSLKYV EEQSRHVQDA FQRSEQKQLV KISSVLMDLN QTVLAELRNF RDQYDQIWQS TVLALESQKD QSQRDILALS SRLHVLADEV VFQKRMAIVQ AVLLLSCLLL VIFSRGVPIP YLGLPSDQGM GTSEAFAEIL ASMQRRDMYP SGSPRFQDAR TPTTPEPHAA RPPLENPMAA SRLNVDDYSY QRPSPPLTPN DELDDNVPDW QASGHLSAIQ ATGYDKGSRY TNSRRRVHIN DI // ID G3JQ82_CORMM Unreviewed; 1039 AA. AC G3JQ82; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Spindle pole body-associated protein sad1 {ECO:0000313|EMBL:EGX89333.1}; GN ORFNames=CCM_07584 {ECO:0000313|EMBL:EGX89333.1}; OS Cordyceps militaris (strain CM01) (Caterpillar fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Cordycipitaceae; OC Cordyceps. OX NCBI_TaxID=983644 {ECO:0000313|EMBL:EGX89333.1, ECO:0000313|Proteomes:UP000001610}; RN [1] {ECO:0000313|EMBL:EGX89333.1, ECO:0000313|Proteomes:UP000001610} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CM01 {ECO:0000313|EMBL:EGX89333.1, RC ECO:0000313|Proteomes:UP000001610}; RX PubMed=22112802; DOI=10.1186/gb-2011-12-11-r116; RA Zheng P., Xia Y., Xiao G., Xiong C., Hu X., Zhang S., Zheng H., RA Huang Y., Zhou Y., Wang S., Zhao G.-P., Liu X., St. Leger R.J., RA Wang C.; RT "Genome sequence of the insect pathogenic fungus Cordyceps militaris, RT a valued traditional Chinese medicine."; RL Genome Biol. 12:R116-R116(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH126404; EGX89333.1; -; Genomic_DNA. DR RefSeq; XP_006672788.1; XM_006672725.1. DR EnsemblFungi; EGX89333; EGX89333; CCM_07584. DR GeneID; 18169595; -. DR KEGG; cmt:CCM_07584; -. DR InParanoid; G3JQ82; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001610; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001610}; KW Reference proteome {ECO:0000313|Proteomes:UP000001610}. FT COILED 547 570 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1039 AA; 116823 MW; 8DB12935682B1745 CRC64; MRPPGTPYRP RRPGAVTRER PRATPVPEDN ANEFRKPNLP SLEGTPSARR QYTYGAAEEP SPARPVLRGD IVDLSGAVQG ALHRHEQRRV ADERRQSSPN LFLEEAERQD DDQESRQKAD QLSMPPPSFK PAQDPPSTRV DFVPLTEPGD DSDADDIRSF ATESEFFDDA SIVSAPASTT ATEARQKFSA MLQRKTAEPQ LPGSPELPAL TPVRPRPSSI LENERPSSPS TQGTPGLRSN PRRPKQSKRT AIASGFQQPK DATSPAESRR PPQMSFAPRL NGTKSLPTDR KMIPVESTVR AGKRLSARLE STQRTPLRQR SGNSDHGIGE LEDETEARPS LFAQAKSFAA SVSPFSTRSY YAADHSEMDD AMQREIESNE SESVEGEHGW TWLRPLTSLP HIRRRMPHGS DDMLDNINWW QLLNPYTYFK ASWWFAREAY LSTLGSLRNP FPQRLMDRLV SSLGPMLYFT AAVIVLITLV SVGHAALFGK ASGDLSMDRL LSMPEIRWPN LGSMTGKAHE FLPAFSWPTW GRSSLLPDLT QLDNDGLARL DEYLKQYQRE FERIQQASKL HDSSLKKLEA VVPKLVHIQL ENGKPVVAQE FWHALRDLIH RDGDFLTFEQ KGNKYEVASE AHWKAIASRI NKDPTFTKQI NITMDSTVKS MEQRVKQGAA GFWEAWIKNN DAKISDMLGS ALDEIQTAGS QREFDKRLQR IVKEHIDESN KDSSVISREE FLRHFKNEFA THRAEVRSEV AELQPQLENM VRQAAELVGK EAPESMSKAE IVTLVHGMVN KAVADMNLEA MARGQIHSHW DSVLRHQINY FGVGAGATID AQHVSPTFDP PKDSSYVKQK GLRGVQTPIP RVAIEPWSDE GDCWCAARSE NPRGNPHGVI LPVQLGHRIV PQHIVVEHIV AGATTDPDAR PKEIEVYADI DADLRELVRD FSAIHFPDIY PLDEEGLGWN VSPVKLPERF VKIGQFVYED IQPHDGVQVH RLSDELLNLG VATDHVIVRA VSNYGSKTHT CFYRVRLYGK RMDEHDDFP // ID G3PC00_GASAC Unreviewed; 1001 AA. AC G3PC00; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGACP00000015124}; OS Gasterosteus aculeatus (Three-spined stickleback). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae; OC Gasterosteus. OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000015124, ECO:0000313|Proteomes:UP000007635}; RN [1] {ECO:0000313|Ensembl:ENSGACP00000015124} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.; RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGACP00000015124} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGACP00000015124}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 69293.ENSGACP00000015124; -. DR Ensembl; ENSGACT00000015152; ENSGACP00000015124; ENSGACG00000011433. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3PC00; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007635; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007635}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007635}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 420 443 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 450 470 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 633 663 {ECO:0000256|SAM:Coils}. FT COILED 672 692 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1001 AA; 111041 MW; B666FF369DA3E8AD CRC64; MDFSQLHTYT PPQCAPENTG YTYSLSSSYS TAALDFEKEH QIAAVYESPR MSRRSLRLQA GAGHHGNETL ADFSQSHSSS YTSTRRETRS YCSRSLRSKK QQSSSTSLSL PLSQAATPRK TLCFSGNTAS DGSLLTSTLD QSHLRQRTVT TTTTSTTVDG HWGSNSSNDH RSSKLNGGAS ASKSHDSVNG YICNDCSFHS QKTDSPITQS SSSSLSSKAA GASSEGLFSS STSSPFTSIY SRDRTAHSSF CGSMNVKDLR TEDASHLKLN GSLCKQTNTH FLPSSSTWWT NFYLNINDIN LTLIKPYPWV CLTFLRQKIT SSDPFPPLHK YSVCLLSIIP AHTGDDCKGK QHSETHTVLL THSSRCRRLL AGLWSAVAYT GHCVAKAGQA LGSGVGRVVQ RLLSLCWMLL AAPVKAVRGL VWFLATGWYQ LVSLMSLLNV FFLTRCLPKL WRLLLLLLLL FLLLALWLWG PSTAVLLAYL PAINLTEWRP ASAFTLLSNL VPVPAPYPAS VPAAETPPSP PSLPQPSLPP VVVSSVDLER LERVERQLTL LWEQVQQGDQ RQQQRHGDVL GLYSSLRETL HTQTDRESLG LWVSTLLEQR LGVLRGELEQ ENADRAQSAE QQKQQQAGQA TRLADLELQL NALAAMTEEV QQKQQHQQQH EHKNVGVKQE DHDALLVEVQ RLELELAGIR QDLQGVVGCK GKCEQLDTLQ ETVSSQVRKE LQALFFGSGE PGVVPESLIL WLSQRYVTTP DLHASLSSME LSILRNVSQQ LELNRAQTLG EAESQAQTIV KTVTGTVQHA AAAEGLTEEQ VKLIVQNALR LYSQDRTGLV DYALESGGGS ILSTRCSETY ETKTALMSLF GLPLWYFSQS PRVVIQPDVY PGNCWAFKGS QGYLVIRLSL RILPTSFCME HIPRAMSPTR NITSAPRDFT VLGLDDEYQE EGKLLGHYTY EEDGESLQSF PVMEQNDRAF QIIEVRVLSN WGHPEYTCMY RFRVHGEPRP Q // ID G3PHC9_GASAC Unreviewed; 2549 AA. AC G3PHC9; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGACP00000017004}; OS Gasterosteus aculeatus (Three-spined stickleback). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae; OC Gasterosteus. OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000017004, ECO:0000313|Proteomes:UP000007635}; RN [1] {ECO:0000313|Ensembl:ENSGACP00000017004} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.; RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGACP00000017004} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGACP00000017004}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 69293.ENSGACP00000017004; -. DR Ensembl; ENSGACT00000017038; ENSGACP00000017004; ENSGACG00000012853. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; G3PHC9; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000007635; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007635}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007635}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1244 1264 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2549 AA; 279914 MW; 66E6B74053EAECB3 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLSF IRDSGHLVHK DTLHSAMAVV SRLCSKMEPQ DPSLETCVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGSGSGP PSSCKPGRTS TGAAPSAPDS KLSNQVSTIV SLLSTLCRGS PLVTHDLLRS ALPDSMESAL GGDERCVLDT MRLVDLLLVL LFEGRKALPK STAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCDRGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDV NKEEEEGGEP KGDPEMAPIY LRRLLPVFAQ TFQQTMLPSI RKASLALIRK MVHYSSEVLL KEVCDSETGH HLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDV FLDQLARLGV INKVSTLAGP ASDDENEDEA KPEKEEEAQE DAREVQQGKP YHWKDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPVTS SQPILSSVGP SKLTVGNWSL TCLKDGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVKSTAREL YDDHFKAVES MPRGVVVTLR NISTQLESAW ELHTNRQCVE GENTWRDLMK TALENLIVVL KDENTISPYE MCSSGLVQAL FTVLNNVSRT GSHDCKPLME RINVFKAAFS ENEDGESRPA VALIRKLIAV LESIERLPLH LYDTPGSTYN LQILTRRLRF RLERAPGETA LIDRTGRMLK MEPLATVESL EQYLLKMVAK QWYDFERSSF VFVRKLREGQ TFTFRHQHDF DENGIIYWVG TNAKTAYEWV NPAAYGLVVV TSSEGRNLPY GRLEDILSRD SSALNCHTND DKNAWFAVDL GLWVIPSAYT LRHARGYGRS ALRNWVFQVS KDGQNWTSLY THVDDCSLNE PGSTATWPLD PSKEEKQGWR HIRIKQMGKN ASGQTHYLSL SGLEVYGTVT AVCEDQLGKA VKEAEANLRR QRRLFRSQVM KYIVPGARVV RGIDWKWRDQ DGNPSGEGAV TGEAHNGWID VTWDAGGSNS YRMGAEGKFD LKLAPGFDPE SAASAPSPKP VSSTVSGPAS STQSWSSLVK NNCPDKGGAA SLGGAGSSSR KGSSSSVCSV ASSSDISLSS SAGPPGAGAL RLERRAEGLL LDQGPGAGGP ACIGADGHEP LVVLSSAAHG GSGSASSTGT LTADAPPGAE DDGRNKDASS DPAAAICMGL VSVSSPDVSS VSESSGKDAP SQRPLCSAAN ARLSVSSLLA AGAPMSSSAS VPNLSSREAS LMESFVRRAP NMSRTNATNN MNLSRSSSDN NTNTLGRNVM STATSPLMGA QSFPNLTTTG TTSTVTMSTS IVTSSNNVAT ATTGLSVGQL LSNTLTTSLT STSSESDTGQ EAEFSLYDFL DSCRANTLLA ELDDEEDLPE PDDDDDENED DNQEDQEYEE VLGVMQVCVC FSQEEEEYET KGGRRRTWDD DFVLKRQFSA LVPAFDPRPG RTNVQQTTDL EIPQPGTPRS EVQEEVECAP SPHLSLTLKV AGLGTTREVE LPLSNYKSTI FYYVQRLLQL SCSGAVKTDK LRRIWEPTYT IMYRELKDSD KEKESMKMDL CEHGISVSGG RSGGLSPGSV SANQSSEILC VARETAQAKA GCSQNACGVE DVLQLLRILY IIGGDGASNA RTLQEDFDEL QFNASPEEFT SKKITTKILQ QIEEPLALAS GALPDWCEQL TSKCPFLIPF ETRQLYFTCT AFGASRAIVW LQNRREATME RSRPSTTVRR DDPGEFRVGR LKHERVKVPR GEAMMEWAES VMQLHADRKS VLEVEFQGEE GTGLGPTLEF YALVAAEFQR TSLGIWLCDD DFPDDESRQV DLGGGLKPPG FYVQRSCGLF PAPFPQDSEE LERISKLFHF LGVFLAKCIQ DNRLVDLPVS QPFFKLLCMG DIKSTMSKLL YHSRGAPPGH GADRPLPFLL LSEASTEESQ ETYSLGSFDE DSKSEFIMDP PKPKPPAWYH GILTWDDFQL VNPHRASFLK EVKELAVKRR QILSSKSLCE DEKNTRLQDL MLRNPLGSGP PLSIEDLGLN FQFCPSSKVH GFSAVDLKTN GDDEMVTMEN AEEYVELMLD FCMHTGIQKQ MEAFREGFNR VFPMEKLSSF SHKEVQMILC GNQSPSWTAD DIINYTEPKL GFTRDSPGFL RFVRVLCGMS SDERKAFLQF TTGCSTLPPG GLANLHPRLT IVRKVDATDS SYPSVNTCVH YLKLPEYSSE DIMRERLLAA TMEKGFHLN // ID G3PV90_GASAC Unreviewed; 872 AA. AC G3PV90; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGACP00000021527}; DE Flags: Fragment; OS Gasterosteus aculeatus (Three-spined stickleback). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae; OC Gasterosteus. OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000021527, ECO:0000313|Proteomes:UP000007635}; RN [1] {ECO:0000313|Ensembl:ENSGACP00000021527} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.; RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGACP00000021527} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGACP00000021527}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 69293.ENSGACP00000021527; -. DR Ensembl; ENSGACT00000021568; ENSGACP00000021527; ENSGACG00000016309. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; G3PV90; -. DR OMA; KIWFIIE; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000007635; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007635}; KW Reference proteome {ECO:0000313|Proteomes:UP000007635}. FT COILED 686 706 {ECO:0000256|SAM:Coils}. FT COILED 729 756 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSGACP00000021527}. SQ SEQUENCE 872 AA; 96387 MW; 441B6FD99E884DB7 CRC64; DTSVSTKDPE DIPTFDEWTR KMMEVENEKT QSTHTSNNVA PPLVKKVQKN FNNYASVECG AKILGSNPEA KSTSAILMEN MDMYMLNPCS NKIWFIIELC EPIQVKQLDI ANFELFSSTP KDFLVSISDR YPTNKWLKLG TFHGRDERTV QSFPLDEHLY AKYVKVELLS HFGSEHFCPL SLIRVFGTSM VEEYEEIADP AERPDDQDDD LDYPPGYAPG EDRLSNNLIG SAKDVILNMV NNIAVNVLGG GSEIPGNLSS PGVNVTEPSA QHSEVEEVFP DPSTLTTEVR SLETSVSETS TADTSKQELP RVKGDIVIPL DREEEESIGS TITLLEKEEF DGEKETRDQH EQRLQIQKYC PQLSSLSCCC AASLQEYLHQ QCSTLLSKKR KCQAMKGKQV IIPILTPTCH ISLSPSACTA PRQHYSEVHR PRDQEQASEQ ETKSEARPSE TPPPPPGGPV QSHTESPSEP PLLEPSQTSN LPRPSATDSS SAKPAPIMET PQLSAEEPKP EKSQDVLAED AHVEPSASLS SSVNVNPEVS AAVDDAAVAQ KENSDTDASK QETKASIHSP DKTDEYPVLH PTASPQSEPH PDPPAVPESS TPSPDVSHPD ADTPTELEPS PVTETKTDFA EDGSTSAGDV YAEAPNGTEP NGNTVHGSSQ KESVFMRLNN RIKALEMNMS LSGRYLEQLS QRYRKQMEEM QKAFNKTIIK LQNTSTIAEE QDQRQTESIQ LLQGQLQNLT QLVVNLSVRV SQLQDEVSDR QNYLLLSLAL CLCLGLLLCA NHRRVPTGPP TTEPEPPTPK SYTYCCPERH FTSCDETGLK RSASYPLIHS LGTSEGPEML QAEETQSLCP ANRKRRRRKM KANEKVETLK PS // ID G3PV91_GASAC Unreviewed; 282 AA. AC G3PV91; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGACP00000021528}; DE Flags: Fragment; OS Gasterosteus aculeatus (Three-spined stickleback). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae; OC Gasterosteus. OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000021528, ECO:0000313|Proteomes:UP000007635}; RN [1] {ECO:0000313|Ensembl:ENSGACP00000021528} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.; RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGACP00000021528} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGACP00000021528}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 69293.ENSGACP00000021527; -. DR Ensembl; ENSGACT00000021569; ENSGACP00000021528; ENSGACG00000016309. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000007635; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007635}; KW Reference proteome {ECO:0000313|Proteomes:UP000007635}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSGACP00000021528}. SQ SEQUENCE 282 AA; 31749 MW; 30D580D849946440 CRC64; DTSVSTKDPE DIPTFDEWTR KMMEVENEKT QSTHTSNNVA PPLVKKVQKN FNNYASVECG AKILGSNPEA KSTSAILMEN MDMYMLNPCS NKIWFIIELC EPIQVKQLDI ANFELFSSTP KDFLVSISDR YPTNKWLKLG TFHGRDERTV QSFPLDEHLY AKYVKMFTKY IKVELLSHFG SEHFCPLSLI RVFGTSMVEE YEEIADPAER PDDQDDDLDY PPGYAPGEDR LSNNLIGSAK DVILNMVNNI AVNVLGGGSE IPGMTCTGSR PLPPVSSYRS QP // ID G3QNI6_GORGO Unreviewed; 1404 AA. AC G3QNI6; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGGOP00000004102}; GN Name=SUCO {ECO:0000313|Ensembl:ENSGGOP00000004102}; OS Gorilla gorilla gorilla (Western lowland gorilla). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Gorilla. OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000004102, ECO:0000313|Proteomes:UP000001519}; RN [1] {ECO:0000313|Ensembl:ENSGGOP00000004102, ECO:0000313|Proteomes:UP000001519} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Scally A.; RT "Insights into the evolution of the great apes provided by the gorilla RT genome."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGGOP00000004102} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGGOP00000004102}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSGGOT00000004203; ENSGGOP00000004102; ENSGGOG00000004171. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; G3QNI6; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000001519; Chromosome 1. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001519}; KW Reference proteome {ECO:0000313|Proteomes:UP000001519}. FT COILED 1086 1106 {ECO:0000256|SAM:Coils}. FT COILED 1136 1156 {ECO:0000256|SAM:Coils}. FT COILED 1342 1362 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1404 AA; 156033 MW; 02C9EED679A8E49E CRC64; MRGFLARPFL STNQHLAWGS PLPQGNGLVQ LPSQPSRHSR PFHELCSKEE NSATVPKLLS LVVSSETIDF SNKTMDSRRD WEREKRILEG KLQLPKALAR TQRAGDEGRR AWTSRWPQQR RSPESCEAPL SAPLWGPQRG LPGREPLRSR SASAIALRTI GHILALLLRL LHLGLGSGGC REDVPPSGRG KKEEKMKKHR RALALVSCLF LCSLVWLPSW RVCCKESSSA SASSYYAQDD NCALENEDVQ FQKKNTESKK LSPPVVETLP TVDLHEESSN AVVDSETVEN ISSSSTSEIT PISKLDEIEK SGTIPIAKPS ETEQSETDCD VGEALDASAP IEQPSFVSPP DSLVGQHIEN VSSSHGKGKI TKSEFESKVS ASEQGGGDPK SALNASDNLK NESSDYTKPG DIDPTSVASP KDPEDIPTFD EWKKKVMEVE KEKSQSMHAS SNGGSHATKK VQKNRNNYAS VECGAKILAA NPEAKSTSAI LIENMDLYML NPCSTKIWFV IELCEPIQVK QLDIANYELF SSTPKDFLVS ISDRYPTNKW IKLGTFHGRD ERNVQSFPLD EQMYAKYVKV ELLSHFGSEH FCPLSLIRVF GTSMVEEYEE IADSQYHSER QELFDEDYDY PLDYNTGEDK SSKNLLGSAT NAILNMVNIA ANILGAKTED LTEGNKSISE NATATAAPKM PESTPVSTPV PSPEYVTTEV HTHDMEPSTP DTPKESPIVQ LVQEEEEEAS PSTVTLLGSG EQEDESSPWF ESETQIFCSE LTTICCISSF SEYIYKWCSV RVALYRQRSR TALSKGKDYL VSAQPPLLLP AESVDVSVLQ PLSGLENKNI EREAETVVLG DLSSSMHQDD LVNHTVDAVE LEPSHSQTLS QSLLLDITPA INPLPKIEVS ESVEYEAGHI PSQVIPQESS VEIDNEAEQK SESFSSIEKP SITYETNKVN ELMDNIIKED VNSMQIFTKL SETIVPPINT ATVPDNEDGE AKMNIADTAK QTLISVVDSS SLPEVKEEEQ SPEDALLRGL QRTATDFYAE LQNSTDLGYA NGNLVHGSNQ KESVFMRLNN RIKALEVNMS LSGRYLEELS QRYRKQMEEM QKAFNKTIVK LQNTSRIAEE QDQRQTEAIQ LLQAQLTNMT QLVSNLSATV AELKREVSDR QSYLVISLVL CVVLGLMLCM QRCRNTSQFD GDYISKLPKS NQYPSPKRCF SSYDDMNLKR RTSFPLMRSK SLQLTGKEVD PNDLYIVEPL KFSPEKKKKR CKYKIEKIET IKPEEPLHPI ANGDIKGRKP FTNQRDFSNM GEVYHSSYKG PPSEGSSETS SQSEESYFCG ISACTSLCNG QSQKTKTEKR ALKRRRSKVQ DQGKLIKTLI QTKSGSLPSL HDIIKGNKEI TVGTFGVTAV SGHI // ID G3QY67_GORGO Unreviewed; 437 AA. AC G3QY67; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGGOP00000007797}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSGGOP00000007797}; OS Gorilla gorilla gorilla (Western lowland gorilla). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Gorilla. OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000007797, ECO:0000313|Proteomes:UP000001519}; RN [1] {ECO:0000313|Ensembl:ENSGGOP00000007797, ECO:0000313|Proteomes:UP000001519} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Scally A.; RT "Insights into the evolution of the great apes provided by the gorilla RT genome."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGGOP00000007797} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGGOP00000007797}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_004062121.1; XM_004062073.1. DR Ensembl; ENSGGOT00000008006; ENSGGOP00000007797; ENSGGOG00000007969. DR GeneID; 101127395; -. DR KEGG; ggo:101127395; -. DR CTD; 6676; -. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3QY67; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001519; Chromosome 20. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001519}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001519}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 437 AA; 48033 MW; 4ED6CFC7E3ECB8BF CRC64; MRRSSRPGSA SSSRKHTPNF FSENSSMSIT SEDSKGLRSA GPGPGEPEGR RARGPSCGEP ALSAGVPGGT TWAGSSQQKP APRSHNWQTA CGAATVRGGA SEPTGSPVVS EEPLDLLPTL DLRQEMPPPR VFKSFLSLLF QGLSVLLSLA GDVLVSMYRE VCSIRFLFTA VSLLSLFLSA FWLGLLYLVS PLENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL QKTSHDYADR NTAYFWNRFS FWNYARPPTV ILEPHVFPGN CWAFEGDQGQ VVIQLPGRVQ LSDITLQHPP PSVEHTGGAN SSPRDFAVFG LQVDDETEVS LGKFTFDVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP RFTCLYRVRA HGVQTSEGAE GSAQGPH // ID G3QZ06_GORGO Unreviewed; 379 AA. AC G3QZ06; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGGOP00000008103}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSGGOP00000008103}; OS Gorilla gorilla gorilla (Western lowland gorilla). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Gorilla. OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000008103, ECO:0000313|Proteomes:UP000001519}; RN [1] {ECO:0000313|Ensembl:ENSGGOP00000008103, ECO:0000313|Proteomes:UP000001519} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Scally A.; RT "Insights into the evolution of the great apes provided by the gorilla RT genome."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGGOP00000008103} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGGOP00000008103}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_004062040.1; XM_004061992.1. DR Ensembl; ENSGGOT00000008328; ENSGGOP00000008103; ENSGGOG00000008288. DR GeneID; 101125941; -. DR KEGG; ggo:101125941; -. DR CTD; 140732; -. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3QZ06; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001519; Chromosome 20. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001519}; KW Reference proteome {ECO:0000313|Proteomes:UP000001519}. SQ SEQUENCE 379 AA; 43070 MW; B7959C3E1D282500 CRC64; MPRSSRSPGD PGALLEDVAH NPRPRRIAQR GRNTSRMAED TSPNMNDNIL LPVRNNDQAL GLTQCMLGCV SWFTCFACSL RTQAQQVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HLPSKMKVWQ DESINGPLQS LRLYQEKVRH HSGEIQDLRG SMNQLIAKLQ EMEAMSDEQK MAQKIMKMIH GDYIEKPDFA LKSIGASIDF EHTSATYNHE KAHSYWNWIQ LWNYAQPPDV ILEPNMTPGN CWAFEGDRGQ VTIQLAQKVY LSNLTLQHIP KTISLSGSLD TAPKDFVIYG MEGSPKEEVF LGAFQFQPEN IIQMFPLQNQ PARAFGAVKV KISSNWGNPG FTCLYRVRVH GSVAPPREQP HQNPYPERD // ID G3RA70_GORGO Unreviewed; 905 AA. AC G3RA70; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGGOP00000012328}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSGGOP00000012328}; OS Gorilla gorilla gorilla (Western lowland gorilla). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Gorilla. OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000012328, ECO:0000313|Proteomes:UP000001519}; RN [1] {ECO:0000313|Ensembl:ENSGGOP00000012328, ECO:0000313|Proteomes:UP000001519} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Scally A.; RT "Insights into the evolution of the great apes provided by the gorilla RT genome."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGGOP00000012328} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGGOP00000012328}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSGGOT00000012685; ENSGGOP00000012328; ENSGGOG00000012630. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3RA70; -. DR OMA; CEEITTH; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001519; Chromosome 7. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001519}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001519}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 221 254 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 371 394 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 401 420 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 550 584 {ECO:0000256|SAM:Coils}. FT COILED 597 617 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 905 AA; 100948 MW; 35C3AC199A83A3DD CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADSGTSSA VSLKNRAASS LRWCTPRAVP ALRLHMLSWT ISVSSCKNKS FRSSRLSLKK HHNHLGSSLC KEFDSQPHPW RSEDKQIPLG SSVSSTQLPG SYSSPVSVSQ WRGEQVCCRC TATPRSCGRH PPSAGCHCCL WPPPLPSKGR ILFIVLGSLY LMVVTFTGCF HLTAFLLSFV KLFFDYHSHK PYFFACQPCC NYIVAGINKQ VLSHCEEITT HSENWNKRDT CILHLKYLQA EETLDPPMQA LVKGHTRLLC TGYFLLQILR RIGAAGQAVS RTAWSAVWLA VVAPGKAASG VFWWLGIGWY QFVTLISWLN VFLLTRCLRN ICKFLVLLIP VFLFLAGLSL RGQGDFFSFL PVLNWASMHR TQRVDDPQDV FKPTTSRLKQ PLQGDSEAFP WHWMSGVEQQ VASLSGQCHH HGENLRELTT LLQKLQARVD QMDGGAAGPS ASVRDTVGQP PRKVGAAGLP GSTTDFMAFH QEHEVRISHL EDILGKLREK SEAIQKELEQ TKQKTISAVG EQLLPTVEHL QLELDQLKSE LSSWQHVKTG CETVDAVQVD VQVREMVKLL FSEDQQGGSL EQLLQRFSSQ FVSKGDLHTM LRDLQLQILR NVTHHVSVTK QLPTSEAVVS AVSEAGASGI TEAQARAIVN NALKLYSQDK TGMVDFALES GGGSILSTRC SETYETKTAL MSLFGIPLWY FSQSPRVVIQ PDIYPGNCWA FKGSQGYLVV RLSMMIHPAA FTLEHIPKTL SPTGNISSAP KDFAVYGLEN EYQEEGQLLG QFTYDQDGES LQMFQALKRP DDTAFQIVEL RIFSNWGHPE YTCLYRFRVH GEPVK // ID G3RLS7_GORGO Unreviewed; 357 AA. AC G3RLS7; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGGOP00000016737}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSGGOP00000016737}; OS Gorilla gorilla gorilla (Western lowland gorilla). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Gorilla. OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000016737, ECO:0000313|Proteomes:UP000001519}; RN [1] {ECO:0000313|Ensembl:ENSGGOP00000016737, ECO:0000313|Proteomes:UP000001519} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Scally A.; RT "Insights into the evolution of the great apes provided by the gorilla RT genome."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGGOP00000016737} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGGOP00000016737}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_004045470.1; XM_004045422.1. DR Ensembl; ENSGGOT00000017208; ENSGGOP00000016737; ENSGGOG00000017150. DR GeneID; 101133429; -. DR KEGG; ggo:101133429; -. DR CTD; 256979; -. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3RLS7; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001519; Chromosome 7. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001519}; KW Reference proteome {ECO:0000313|Proteomes:UP000001519}. SQ SEQUENCE 357 AA; 40522 MW; 6561CDF4623592A0 CRC64; MSGKTKARRA AMFFRRCSED ASGSASGNAL LSEDENPDAN GVTRSWKIIL STMLTLTFLL VGLLNHQWLK ETDVPQKSRQ LYAIVAEYGS RLYKYQARLR MPKEQLELLK KESQTLENNF RQILFLIEQI DVLKALLRDM KDGMDNNHNW NTHGDPVEDP DHTEEMSNLV NYILKKLRED QVEMADYALK SAGASIIEAG TSESYKNNKA KLYWHGIGFL NHEMPPDIIL QPDVYPGKCW AFPGSQGHTL IKLATKIIPT AVTMEHISEK VSPSGNISSA PKEFSVYGIT KKCEGEEIFL GQFIYNKTGT TVQTFELQHA VSEYLLCVKL NIFSNWGHPK YTCLYRFRVH GTPGKHI // ID G3RWZ3_GORGO Unreviewed; 1249 AA. AC G3RWZ3; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGGOP00000020328}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSGGOP00000020328}; OS Gorilla gorilla gorilla (Western lowland gorilla). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Gorilla. OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000020328, ECO:0000313|Proteomes:UP000001519}; RN [1] {ECO:0000313|Ensembl:ENSGGOP00000020328, ECO:0000313|Proteomes:UP000001519} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Scally A.; RT "Insights into the evolution of the great apes provided by the gorilla RT genome."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGGOP00000020328} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGGOP00000020328}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSGGOT00000027656; ENSGGOP00000020328; ENSGGOG00000004171. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000001519; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001519}; KW Reference proteome {ECO:0000313|Proteomes:UP000001519}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 25 {ECO:0000256|SAM:SignalP}. FT CHAIN 26 1249 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003453255. FT COILED 931 951 {ECO:0000256|SAM:Coils}. FT COILED 981 1001 {ECO:0000256|SAM:Coils}. FT COILED 1187 1207 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSGGOP00000020328}. SQ SEQUENCE 1249 AA; 138826 MW; 535C11BD26261820 CRC64; ELQIILLSFP FLCLFYLLPS WRVCCKESSS ASASSYYAQD DNCALENEDV QFQKKDEREG PINAESLGKS GSNLPISPKE HKLKDDSIVD VQNTESKKLS PPVVETLPTV DLHEESSNAV VDSETVENIS SSSTSEITPI SKLDEIEKSG TIPIAKPSET EQSETDCDVG EALDASAPIE QPSFVSPPDS LVGQHIENVS SSHGKGKITK SEFESKVSAS EQGGGDPKSA LNASDNLKNE SSDYTKPGDI DPTSVASPKD PEDIPTFDEW KKKVMEVEKE KSQSMHASSN GGSHATKKVQ KNRNNYASVE CGAKILAANP EAKSTSAILI ENMDLYMLNP CSTKIWFVIE LCEPIQVKQL DIANYELFSS TPKDFLVSIS DRYPTNKWIK LGTFHGRDER NVQSFPLDEQ MYAKYVKMFI KYIKVELLSH FGSEHFCPLS LIRVFGTSMV EEYEEIADSQ YHSERQELFD EDYDYPLDYN TGEDKSSKNL LGSATNAILN MVNIAANILG AKTEDLTEGN KSISENATAT AAPKMPESTP VSTPVPSPEY VTTEVHTHDM EPSTPDTPKE SPIVQLVQEE EEEASPSTVT LLGSGEQEDE SSPWFESETQ IFCSELTTIC CISSFSEYIY KWCSVRVALY RQRSRTALSK GKDYLVSAQP PLLLPAESVD VSVLQPLSGL ENKNIEREAE TVVLGDLSSS MHQDDLVNHT VDAVELEPSH SQTLSQSLLL DITPAINPLP KIEVSESVEY EAGHIPSQVI PQESSVEIDN EAEQKSESFS SIEKPSITYE TNKVNELMDN IIKEDVNSMQ IFTKLSETIV PPINTATVPD NEDGEAKMNI ADTAKQTLIS VVDSSSLPEV KEEEQSPEDA LLRGLQRTAT DFYAELQNST DLGYANGNLV HGSNQKESVF MRLNNRIKAL EVNMSLSGRY LEELSQRYRK QMEEMQKAFN KTIVKLQNTS RIAEEQDQRQ TEAIQLLQAQ LTNMTQLVSN LSATVAELKR EVSDRQSYLV ISLVLCVVLG LMLCMQRCRN TSQFDGDYIS KLPKSNQYPS PKRCFSSYDD MNLKRRTSFP LMRSKSLQLT GKEVDPNDLY IVEPLKFSPE KKKKRCKYKI EKIETIKPEE PLHPIANGDI KGRKPFTNQR DFSNMGEVYH SSYKGPPSEG SSETSSQSEE SYFCGISACT SLCNGQSQKT KTEKRALKRR RSKVQDQGKL IKTLIQTKSG SLPSLHDIIK GNKEITVGTF GVTAVSGHI // ID G3RYC9_GORGO Unreviewed; 2617 AA. AC G3RYC9; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGGOP00000020816}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSGGOP00000020816}; OS Gorilla gorilla gorilla (Western lowland gorilla). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Gorilla. OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000020816, ECO:0000313|Proteomes:UP000001519}; RN [1] {ECO:0000313|Ensembl:ENSGGOP00000020816, ECO:0000313|Proteomes:UP000001519} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Scally A.; RT "Insights into the evolution of the great apes provided by the gorilla RT genome."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGGOP00000020816} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGGOP00000020816}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSGGOT00000026268; ENSGGOP00000020816; ENSGGOG00000026552. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; G3RYC9; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000001519; Chromosome 14. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001519}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001519}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2617 AA; 289933 MW; A42BECC3218F5562 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGGFEIS VLLPEGDVIN SKVFSRLIGT DQLLPYLGQY GAAFKVLSAK AEIWIATCLI GCQRNKTLLR HGANPDLRDE DGKTPLDKAR ERGHSEVVAI LQSPGDWMCP VNKGDDKKKK DTNKDEEECN EPKGDPEMAP IYLKRLLPVF AQTFQQTMLP SIRKASLALI RKMIHFCSEA LLKEVCDSDV GHNLPTILVE ITATVLDQED DDDGHLLALQ IIRDLVDKGG DIFLDQLARL GVISKVSTLA GPSSDDENEE ESKPEKEDEP QEDAKELQQG KPYHWRDWSI IRGRDCLYIW SDAAALELSN GSNGWFRFIL DGKLATMYSS GSPEGGSDSS ESRSEFLEKL QRARGQVKPS TSSQPILSAP GPTKLTVGNW SLTCLKEGEI AIHNSDGQQA TILKEDLPGF VFESNRGTKH SFTAETSLGS EFVTGWTGKR GRKLKSKLEK TKQKVRTMAR DLYDDHFKAV ESMPRGVVVT LRNIATQLES SWELHTNRQC IESENTWRDL MKTALENLIV LLKDENTISP YEMCSSGLVQ ALLTVLNNVS LSSSKEQKSK VISLPKHIIK LFKNNSDSVF RPAVALIRKL IAVLESIERL PLHLYDTPGS TYNLQILTRR LRFRLERAPG ETALIDRTGR MLKMEPLATV ESLEQYLLKM VAKQWYDFDR SSFVFVRKLR EGQNFIFRHQ HDFDENGIIY WIGTNAKTAY EWVNPAAYGL VVVTSSEGRN LPYGRLEDIL SRDNSALNCH SNDDKNAWFA IDLGLWVIPS AYTLRHARGY GRSALRNWVF QVSKDGQNWT SLYTHVDDCS LNEPGSTATW PLDPPKDEKQ GWRHVRIKQM GKNASGQTHY LSLSGFELYG TVNGVCEDQL GKAAKEAEAN LRRQRRLVRS QVLKYMVPGA RVIRGLDWKW RDQDGSPQGE GTVTGELHNG WIDVTWDAGG SNSYRMGAEG KFDLKLAPGY DPDTVASPKP VSSTVSGTTQ SWSSLVKNNC PDKTSAAAGS SSRKGSSSSV CSVASSSDIS LGSTKTERRS EIVMEHSIVS GADVHEPIVV LSSAENVPQT EVGSSSSAST STLTAETGSE NAERKLGPDS SVRTPGESSA ISMGIVSVSS PDVSSVSELT NKEAASQRPL SSSASNRLSV SSLLAAGAPM SSSASVPNLS SRETSSLESF VRRVANIART NATNNMNLSR SSSDNNTNTL GRNVMSTATS PLMGAQSFPN LTTPGTTSTV TMSTSSVTSS SNVATATTVL SVGQSLSNTL TTSLTSTSSE SDTGQEAEYS LYDFLDSCRA STLLAELDDD EDLPEPDEED DENEDDNQED QEYEEVMILR RPSLQRRAGS RSDVTHHAVT SQLPQVPAGA GSRPIGEQTE CKVPFKGGKF LLWDYSTCIK YERKYTRLYK CSLGRIDRLN QGVLRSLQKP SSKFLGTPHS ELLEEVECTP SPRLALTLKV TGLGTTREVE LPLTNFRSTI FYYVQKLLQL SCNGSVKSDK LRRIWEPTYT IMYREMKDSD KEKENGKMGC WSIEHVEQYL GTDELPKNDL ITYLQKNADA AFLRHWKLTG TNKSIRKNRN CSQLIAAYKD FCEHGTKSGL NQGAISALQS SDILNLTKEQ PQAKAGNGQN SCGVEDVLQL LRILYIVASD PYSRISQEDG DEQPQFTFPP DEFTSKKITT KILQQIEEPL ALASGALPDW CEQLTSKCPF LIPFETRQLY FTCTAFGASR AIVWLQNRRE ATVERTRTTS SVRRDDPGEF RVGRLKHERV KVPRGESLME WAENVMQIHA DRKSVLEVEF LGEEGTGLGP TLEFYALVAA EFQRTDLGAW LCDDNFPDDE SRHVDLGGGL KPPGYYVQRS CGLFTAPFPQ DSDELERITK LFHFLGIFLA KCIQDNRLVD LPISKPFFKL MCMGDIKSNM SKLIYESRGD RDLHCTESQS EASTEEGHDS LSVGSFEEDS KSEFILDPPK PKPPAWFNGI LTWEDFELVN PHRARFLKEI KDLAIKRRQI LSNKGLSEDE KNTKLQELVL KNPSGSGPPL SIEDLGLNFQ FCPSSRIYGF TAVDLKPSGE DEMITMDNAE EYVDLMFDFC MHTGIQKQME AFRDGFNKVF PMEKLSSFSH EEVQMILCGN QSPSWAAEDI INYTEPKLGY TRDSPGFLRF VRVLCGMSSD ERKAFLQFTT GCSTLPPGGL ANLHPRLTVV RKVDATDASY PSVNTCVHYL KLPEYSSEEI MRERLLAATM EKGFHLN // ID G3SH67_GORGO Unreviewed; 379 AA. AC G3SH67; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGGOP00000027455}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSGGOP00000027455}; OS Gorilla gorilla gorilla (Western lowland gorilla). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Gorilla. OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000027455, ECO:0000313|Proteomes:UP000001519}; RN [1] {ECO:0000313|Ensembl:ENSGGOP00000027455, ECO:0000313|Proteomes:UP000001519} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Scally A.; RT "Insights into the evolution of the great apes provided by the gorilla RT genome."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGGOP00000027455} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGGOP00000027455}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSGGOT00000025411; ENSGGOP00000027455; ENSGGOG00000008288. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000001519; Chromosome 20. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001519}; KW Reference proteome {ECO:0000313|Proteomes:UP000001519}. FT COILED 155 182 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 379 AA; 43113 MW; 303BCD87FB282519 CRC64; MPRSSRSPGD PGALLEDVAH NPRPRRIAQR GRNTSRMAED TSPNMNDNIL LPVRNNDQAL GLTQCMLGCV SWFTCFACSL RTQAQQVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HLPSKMKVWQ DESINGPLQS LRLYQEKVRH HSGEIQDLRG SMNQLIAKLQ EMEAMSDEQK MAQKIMKMIH GDYIEKPDFA LKSIGASIDF EHTSATYNHE KAHSYWNWIQ LWNYAQPPDV IREPNMTPGN CWAFEGDRGQ VTIQLAQKVY LSNLTLQHIP KTISLSGSLD TAPKDFVIYG MEGSPKEEVF LGAFQFQPEN IIQMFPLQNQ PARAFGAVKV KISSNWGNPG FTCLYRVRVH GSVAPPREQP HQNPYPERD // ID G3SHR7_GORGO Unreviewed; 2484 AA. AC G3SHR7; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 14-OCT-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGGOP00000027657}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSGGOP00000027657}; OS Gorilla gorilla gorilla (Western lowland gorilla). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Gorilla. OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000027657, ECO:0000313|Proteomes:UP000001519}; RN [1] {ECO:0000313|Ensembl:ENSGGOP00000027657, ECO:0000313|Proteomes:UP000001519} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Scally A.; RT "Insights into the evolution of the great apes provided by the gorilla RT genome."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSGGOP00000027657} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGGOP00000027657}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSGGOT00000032938; ENSGGOP00000027657; ENSGGOG00000026552. DR GeneTree; ENSGT00530000063470; -. DR Proteomes; UP000001519; Chromosome 14. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001519}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001519}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1172 1192 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2484 AA; 274891 MW; 823D2FBB84CD2741 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGTLLRH GANPDLRDED GKTPLDKARE RGHSEVVAIL QSPGDWMCPV NKGDDKKKKD TNKDEEECNE PKGDPEMAPI YLKRLLPVFA QTFQQTMLPS IRKASLALIR KMIHFCSEAL LKEVCDSDVG HNLPTILVEI TATVLDQEDD DDGHLLALQI IRDLVDKGGD IFLDQLARLG VISKVSTLAG PSSDDENEEE SKPEKEDEPQ EDAKELQQGK PYHWRDWSII RGRDCLYIWS DAAALELSNG SNGWFRFILD GKLATMYSSG SPEGGSDSSE SRSEFLEKLQ RARGQVKPST SSQPILSAPG PTKLTVGNWS LTCLKEGEIA IHNSDGQQAT ILKEDLPGFV FESNRGTKHS FTAETSLGSE FVTGWTGKRG RKLKSKLEKT KQKVRTMARD LYDDHFKAVE SMPRGVVVTL RNIATQLESS WELHTNRQCI ESENTWRDLM KTALENLIVL LKDENTISPY EMCSSGLVQA LLTVLNNVSL SSSKEQKYSF TPSVDRPAVA LIRKLIAVLE SIERLPLHLY DTPGSTYNLQ ILTRRLRFRL ERAPGETALI DRTGRMLKME PLATVESLEQ YLLKMVAKQW YDFDRSSFVF VRKLREGQNF IFRHQHDFDE NGIIYWIGTN AKTAYEWVNP AAYGLVVVTS SEGRNLPYGR LEDILSRDNS ALNCHSNDDK NAWFAIDLGL WVIPSAYTLR HARGYGRSAL RNWVFQVSKD GQNWTSLYTH VDDCSLNEPG STATWPLDPP KDEKQGWRHV RIKQMGKNAS GQTHYLSLSG FELYGTVNGV CEDQLGKAAK EAEANLRRQR RLVRSQVLKY MVPGARVIRG LDWKWRDQDG SPQGEGTVTG ELHNGWIDVT WDAGGSNSYR MGAEGKFDLK LAPGYDPDTV ASPKPVSSTV SGTTQSWSSL VKNNCPDKTS AAAGSSSRKG SSSSVCSVAS SSDISLGSTK TERRSEIVME HSIVSGADVH EPIVVLSSAE NVPQTEVGSS SSASTSTLTA ETGSENAERK LGPDSSVRTP GESSAISMGI VSVSSPDVSS VSELTNKEAA SQRPLSSSAS NRLSVSSLLA AGAPMSSSAS VPNLSSRETS SLESFVRRVA NIARTNATNN MNLSRSSSDN NTNTLGRNVM STATSPLMGA QSFPNLTTPG TTSTVTMSTS SVTSSSNVAT ATTVLSVGQS LSNTLTTSLT STSSESDTGQ EAEYSLYDFL DSCRASTLLA ELDDDEDLPE PDEEDDENED DNQEDQEYEE VMILRRPSLQ RRAGSRSDVT HHAVTSQLPQ VPAGAGSRPI GEQTPHSELL EEVECTPSPR LALTLKVTGL GTTREVELPL TNFRSTIFYY VQKLLQLSCN GSVKSDKLRR IWEPTYTIMY REMKDSDKEK ENGKMGCWSI EHVEQYLGTD ELPKNDLITY LQKNADAAFL RHWKLTGTNK SIRKNRNCSQ LIAAYKDFCE HGTKSGLNQG AISALQSSDI LNLTKEQPQA KAGNGQNSCG VEDVLQLLRI LYIVASDPYS RISQEDGDEQ PQFTFPPDEF TSKKITTKIL QQIEEPLALA SGALPDWCEQ LTSKCPFLIP FETRQLYFTC TAFGASRAIV WLQNRREATV ERTRTTSSVR RDDPGEFRVG RLKHERVKVP RGESLMEWAE NVMQIHADRK SVLEVEFLGE EGTGLGPTLE FYALVAAEFQ RTDLGAWLCD DNFPDDESRH VDLGGGLKPP GYYVQRSCGL FTAPFPQDSD ELERITKLFH FLGIFLAKCI QDNRLVDLPI SKPFFKLMCM GDIKSNMSKL IYESRGDRDL HCTESQSEAS TEEGHDSLSV GSFEEDSKSE FILDPPKPKP PAWFNGILTW EDFELVNPHR ARFLKEIKDL AIKRRQILSN KGLSEDEKNT KLQELVLKNP SGSGPPLSIE DLGLNFQFCP SSRIYGFTAV DLKPSGEDEM ITMDNAEEYV DLMFDFCMHT GIQKQMEAFR DGFNKVFPME KLSSFSHEEV QMILCGNQSP SWAAEDIINY TEPKLGYTRD SPGFLRFVRV LCGMSSDERK AFLQFTTGCS TLPPGGLANL HPRLTVVRKV DATDASYPSV NTCVHYLKLP EYSSEEIMRE RLLAATMEKG FHLN // ID G3SWB8_LOXAF Unreviewed; 2617 AA. AC G3SWB8; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLAFP00000004605}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSLAFP00000004605}; OS Loxodonta africana (African elephant). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta. OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000004605, ECO:0000313|Proteomes:UP000007646}; RN [1] {ECO:0000313|Ensembl:ENSLAFP00000004605} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000004605}; RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The Genome Sequence of Loxodonta africana (African elephant)."; RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSLAFP00000004605} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLAFP00000004605}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9785.ENSLAFP00000004605; -. DR Ensembl; ENSLAFT00000005491; ENSLAFP00000004605; ENSLAFG00000005491. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; G3SWB8; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000007646; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007646}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007646}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1249 1269 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2617 AA; 289902 MW; 895EA6C431D225DA CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTASGP SSACKPGRST TGAPPTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAAFEV NFMDDVGQTL LNWASAFGTQ EMVEFLCERG ADVNRGQRSS SLHYAACFGR PQVAKTLLRH GANPDLRDED GKTPLDKARE RGHSEVVAIL QSPGDWMCPV NKGDDKKKKD TNKDEEECNE PKGDPEMAPI YLKRLLPVFA QTFQQTMLPS IRKASLALIR KMIHFCSEAL LKEVCDSDVG HNLPTILVEI TATVLDQEDD DDGHLLALQI IRDLVDKGGD IFLDQLARLG VISKVSTLAG PSSDDENEEE SKPEKEDEPQ EDAKELQQGK PYHWRDWSII RGRDCLYIWS DAAALELSNG SNGWFRFILD GKLATMYSSG SPEGGSDSSE SRSEFLEKLQ RARGQVKPST SSQPILSAPG PTKLTVGNWS LTCLKEGEIA IHNSDGQQAT ILKEDLPGFV FESNRGTKHS FTAETSLGSE FVTGWTGKRG RKLKSKLEKT KQKVRTMARD LYDDHFKAVE SMPRGVVVTL RNIATQLESS WELHTNRQCI EGENTWRDLM KTALENLIVL LKDENTISPY EMCSSGLVQA LLTVLNNVSL SSNMKQDCSQ LVERINVFKT AFSENEDDES HSRPAVALIR KLIAVLESIE RLPLHLYDTP GSTYNLQILT RRLRFRLERA PGETALIDRT GRMLKMEPLA TVESLEQYLL KMVAKQWYDF DRSSFVFVRK LREGQNFIFR HQHDFDENGI IYWIGTNAKT AYEWVNPAAY GLVVVTSSEG RNLPYGRLED ILSRDNSALN CHSNDDKNAW FAIDLGLWVI PSAYTLRHAR GYGRSALRNW VFQVSKDGQN WTSLYTHVDD CSLNEPGSTA TWPLDPPKDE KQGWRHVRIK QMGKNASGQT HYLSLSGFEL YGTVNGVCED QLGKAAKEAE ANLRRQRRLV RSQVLKYMVP GARVIRGLDW KWRDQDGSPQ GEGTVTGELH NGWIDVTWDA GGSNSYRMGA EGKFDLKLAP GYDPDTVASP KPVSSTVSGT TQSWSSLVKN NCPDKTSAAA GSSSRKGSSS SVCSVASSSD ISLGSTKTER RSEIVMEHSI VSGADVHEPI VVLSSAENVP QAEVGSSSSA STSTLTAETG SENAERKLGP DSSVRTPGES SAISMGIVSV SSPDVSSVSD LTNKEAASQR PLSSSASNRL SVSSLLAAGA PMSSSASVPN LSSRETSSLE SFVRRVANIA RTNATNNMNL SRSSSDNNTN TLGRNVMSTA TSPLMGAQSF PNLTTPGTTS TVTMSTSSVT SSSNVATATT VLSVGQTLSN TLTTSLTSTS SESDTGQEAE YSLYDFLDSC RASTLLAELD DDEDLPEPDE EDDENEDDNQ EDQEYEEVMI LRRPSLQRRA GSRSDVTHHA VTSQLPQVPA GAGSRPIGEQ EEEEYETKGG RRRTWDDDYV LKRQFSALVP AFDPRPGRTN VQQTTDLEIP PPGTPHSELL EEVECTPSPR LALTLKVTGL GTTREVELPL TNFRSTIFYY VQKLLQLSCN GNVKSDKLRR IWEPTYTIMY REMKDSDKEK ESGKMGCWSI EHVEQYLGTD ELPKNDLITY LQKNADAAFL RHWKLTGTNK SIRKNRNCSQ LIAAYKDFCE HGTKSGLNQG AISTLQHSDI LNLTKEQPQA KAGNGQNSCG VEDVLQLLRI LYIVASDPCS RISQEEGDEQ LQFTFPPDEF TSKKITTKIL QQIEEPLALA SGALPDWCEQ LTSKCPFLIP FETRQLYFTC TAFGASRAIV WLQNRREATV ERTRTTSSVR RDDPGEFRVG RLKHERVKVP RGESLMEWAE NVMQIHADRK SVLEVEFLGE EGTGLGPTLE FYALVAAEFQ RTDLGAWLCD DNFPDDESRH VDLGGGVKPP GYYVQRSCGL FTAPFPQDSD ELERITKLFH FLGIFLAKCI QDNRLVDLPI SKPFFKLMCM GDIKSNMSKL IYESRGDRDL HCTESQSEAS TEEGHDSLSV GSFEEDSKSE FILDPPKPKP PAWFNGILNW EDFELVNPHR ARFLKEIKDL AIKRRQILSN KDLSEDEKNT KLQELVLKNP SGSGPPLSIE DLGLNFQFCP SSRIYGFTAV DLKPSGEDEM ITMDNAEEYV DLMFDFCMHT GIQKQMEAFR GNVDGFNKVF PMEKLSSFSH EEVQMILCGN QSPSWAAEDI INYTEPKLGY TRDSPGFLRF VRVLCGMSSD ERKAFLQFTT GCSTLPPGGL ANLHPRLTVV RKVDATDASY PSVNTCVHYL KLPEYSSEEI MRERLLAATM EKGFHLN // ID G3T5D1_LOXAF Unreviewed; 739 AA. AC G3T5D1; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLAFP00000008636}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSLAFP00000008636}; OS Loxodonta africana (African elephant). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta. OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000008636}; RN [1] {ECO:0000313|Ensembl:ENSLAFP00000008636} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000008636}; RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The Genome Sequence of Loxodonta africana (African elephant)."; RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSLAFP00000008636} RP IDENTIFICATION. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000008636}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLAFP00000008636}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9785.ENSLAFP00000008636; -. DR Ensembl; ENSLAFT00000010308; ENSLAFP00000008636; ENSLAFG00000010302. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3T5D1; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007646; Unassembled WGS sequence. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007646}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007646}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 233 253 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 292 312 {ECO:0000256|SAM:Coils}. FT COILED 374 423 {ECO:0000256|SAM:Coils}. FT COILED 426 453 {ECO:0000256|SAM:Coils}. FT COILED 493 520 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 739 AA; 82930 MW; 9DD8D5289D2E5C50 CRC64; MSRRSQRLTR YTQGDDDGGS SSSGGSSITG SQSTLFKDSP LRTLKRKSSS MKRLSPAPQL APSSDPHTSY YSESVIKESF VGSPRAASLA RSALLDRCSI LDDQLHSDPY WSEDLRVRRR RGTGSTESSR INGLVESKRS EDFLGSSSGY SSEDDYVGYS AEMDQQNSGS RLRNAVSRAG SFLWMVVSSP GRLFGLLYWW VGTTWYRLTT AASLLDVFVL TRSRRVSSLR TFLWFLLLLL LLTCLTYGAW YFYPYGLQTF QPAVVSWWAA KGHSRQHEVW EPRDSSSHFQ AEQRILSQVH SLERRLEALA AEFSSNWQKE ALRLERLELR QGAAGEGGDG GGGGLSHEDT LVLLEGLVSR REAALKEDFR RDMAARIQEE LVALRAEHQQ DSEDLFKKIV QASQESEARL QQLKSEWQRM TQESFQENSL KELGRLEGQL AGLQQELAAL ALKQGSVEDR VDQLPQQIQA VRDDVESQFP AWIAQFLLRG GGARAGLLQQ EEIEARLQEL ESRILAHMAE TQGKSVKEAA ASLGLMLQKE GMIGVTEEQV HHIVSQALKR YSEDRIGMVD YALESGGASV ISTRCSETYE TKTALLSLFG IPLWYHSQSP RVILQPDVHP GNCWAFQGPQ GFAVVRLSAR IRPTAVTLEH VPKSLSPNST ISSAPKDFAI FGFEEDLQQE GRLLGKFTYD QDGEPIQTFY FQDPTMATYQ VVELRILTNW GHPEYTCIYR FRVHGDPAH // ID G3TFP9_LOXAF Unreviewed; 443 AA. AC G3TFP9; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLAFP00000013174}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSLAFP00000013174}; OS Loxodonta africana (African elephant). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta. OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000013174}; RN [1] {ECO:0000313|Ensembl:ENSLAFP00000013174} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000013174}; RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The Genome Sequence of Loxodonta africana (African elephant)."; RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSLAFP00000013174} RP IDENTIFICATION. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000013174}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLAFP00000013174}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9785.ENSLAFP00000013174; -. DR Ensembl; ENSLAFT00000015707; ENSLAFP00000013174; ENSLAFG00000015701. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3TFP9; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007646; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007646}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007646}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 166 191 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 204 238 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 443 AA; 48619 MW; C1FAEF0BFCA68BFD CRC64; MRRSPRPSSA AAPHKHTPNF YSDNDNNSVS ATSGDSSGHR STGPGPGEPE GRRARGSSCG EPALSAGVPG GTTWAGSSRQ KPAPRSHKGQ TACGAATVPW APAELAGTSV VSEEQLDLLP TLDLRQEMPP PRVSKSFLNQ LFQVLSVLLS LLGDVLVSAS REVCSIRFLL TAVSLLSLFL AALWWGLLYL VPPLENEPKE MLTMSEYHER VRSQGQQLQQ LQAELDKLHK EMSSVRAANS ERVAKLVFQR LNEDFVRKPD YALSSVGASI DLEKTSRDYE DADTAYFWNR FSFWNYARPP TVILEPDVFP GNCWAFEGDQ GQVVIRLPGR VQLSDITLQH PPPSVAHTRG ANSAPRDFAV YGLQVDDETE VFLGKFTFDV EKSEIQTFHL QNDPPAAFPK VKIQILSNWG HPRFTCLYRV RAHGERTSEG AGDSTTGVTG GLH // ID G3TG49_LOXAF Unreviewed; 766 AA. AC G3TG49; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLAFP00000013423}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSLAFP00000013423}; OS Loxodonta africana (African elephant). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta. OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000013423}; RN [1] {ECO:0000313|Ensembl:ENSLAFP00000013423} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000013423}; RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The Genome Sequence of Loxodonta africana (African elephant)."; RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSLAFP00000013423} RP IDENTIFICATION. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000013423}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLAFP00000013423}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9785.ENSLAFP00000013423; -. DR Ensembl; ENSLAFT00000015999; ENSLAFP00000013423; ENSLAFG00000015996. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3TG49; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007646; Unassembled WGS sequence. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007646}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007646}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 231 252 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 264 283 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 336 370 {ECO:0000256|SAM:Coils}. FT COILED 415 442 {ECO:0000256|SAM:Coils}. FT COILED 457 484 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 766 AA; 86364 MW; 2E8BFDF7967CE37A CRC64; MDFSRLHIYT PPQCVPENTG YTYALSSSYS SEALDFETEH RLDPVFDSPR MSRRSLRLAT PAFTAGDGPD TEGHAYARNT ASVKDRASRT VKQHRSTSKP AFNINHMSRK ATSSAVSQSS CHSLQGNMAL RPPVLDESLI REQTKVDHFW GLDDDSDLKE PLGGSKAALQ GNGDLATVAD TATVNGYTCS NCSLLSERKD VLTAHPTARG PTSRVYSRDR SQKRRKAASG VFWWLGFGWY QFVTLMSWLN VFLLTRCLRN ICKFLVLLIP LLLLLGAGLA FWGQGDFLSF LPVLNWTNVY KAQRADDPKS ILTPDASHLH LPLEGDGKEA FHWHRLSEVE QQVTALSRRC QLHEEKLREL TLLLQKLQVH ADQADSDRDG VVSLLRSLQT RADQADGDRE DLKPPELKTD FMTFHQEHEL RLSNLEDMIG KLADKSEVIQ KELEQTKLRT TSGTAEEQHL LSTVRQLELQ LEHLKSELAD WRNLKTSCEK VDTQVRETIR LMFSEDQQDS SLEWLLQKFS SQFVSKGDLQ ILLRDLELQV LKNITHHISV TKQAPTSAAV VSAMHEAGVP GITEAQARII VNNALKLYSQ DKTGMVDFAL ESGGGSILST RCSETYETKT ALISLFGIPL WYFSQSPRVV IQPDIYPGNC WAFKGSQGYL VVRLSMAIYP TTFTLEHIPK TLSPTGNISS APKDFAVYGL ESEYQEEGQL LGQFTYDQDG ESLQMFHTLK RPEKAFQIVE LRIFSNWGHL EYTCLYRFRV HGDPIK // ID G3TGX6_LOXAF Unreviewed; 979 AA. AC G3TGX6; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLAFP00000013766}; GN Name=SUCO {ECO:0000313|Ensembl:ENSLAFP00000013766}; OS Loxodonta africana (African elephant). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta. OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000013766}; RN [1] {ECO:0000313|Ensembl:ENSLAFP00000013766} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000013766}; RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The Genome Sequence of Loxodonta africana (African elephant)."; RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSLAFP00000013766} RP IDENTIFICATION. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000013766}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLAFP00000013766}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9785.ENSLAFP00000013766; -. DR Ensembl; ENSLAFT00000016397; ENSLAFP00000013766; ENSLAFG00000016383. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; G3TGX6; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000007646; Unassembled WGS sequence. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007646}; KW Reference proteome {ECO:0000313|Proteomes:UP000007646}. FT COILED 661 681 {ECO:0000256|SAM:Coils}. FT COILED 704 731 {ECO:0000256|SAM:Coils}. FT COILED 917 937 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 979 AA; 108846 MW; 4A4D4043F73875BA CRC64; MEVEKEKNLS AGQSMHPSSN GGQQATKKVQ KNRNNYASVE CGAKILAANP EAKSTSAILI ENMDLYMLNP CSAKIWFVIE LCEPIQVKQL DIANYELFSS TPKDFLVSIS DRYPINKWIK LGTFHGRDER NVQSFPLDEQ MYAKYVKMFI KYIKVEFISH FGSEHFCPLS LIRVFGTSMV EEYEEIADSQ YQSERQELFD EDYDYPLDYN TGEDKSSKNL LGSATNAILN MVNIAANILG AKTEDLTEGN KSISENATVT AAPKMPESAP GSTPVPSPEF VTTGVHIHEI EPSTPDTPKE SPIVQLVQEE EEEASPSTVT LLGTGEQEDE SSPWFESETQ IFCSELTTIC CISSFSEYIY KWCSARVALY RQRSGSAVRK GKDYILSPQP PSLLPTASVE PPSGELDSKS VEREAETVIL DDLSSMQQSD LVNHTVDAIE LEPSYPQTLS QSLLLDITPE INTLSKIEVT ESVKHETGHT PSQIIPQESS VEIANKTEKK PESSSSIEKP PVIYETSKLS EVTDNIVKED TNSMQIITKL SETIVPPINT ATVPGSEDGE AKMTIADAPK QILTPVVDSS SLPEVKEEEQ SPEDALLAIP GSSGLQRTAT DFYAELQNST DLGYTNGNLV HGSNQKESVF MRLNNRIKAL EVNMSLSGRY LEELSQRYRK QMEEMQKAFN KTIVKLQNTS RIAEEQDQRQ TEAIHQLQAQ LANMTQLVSN LSTTVAGLKR EVSDRQSYLV MSLVLCVVLG LMLCMQRCRS PSQCDGDYIC KPPKNNPYPS PKRCFSSYDD MNLKRRTSFP LIRSKSLQLT GKEVNPNDLY IVEPLKFSPG KKKKRCKYKT EKIETIKPAD PLHPIPNGDI KGRKPFTNQR DFSNLGEVYH SSYKGPPSEG SSETSSQSEE SYFCGISACT SLCNGQSQKT KTEKRALKRR RSKVQDQGKL IKTLIQTKSG SLPSLHDIIK GNKELTVGTL GVTAVSGHI // ID G3TJN8_LOXAF Unreviewed; 262 AA. AC G3TJN8; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLAFP00000014962}; DE Flags: Fragment; GN Name=SUN3 {ECO:0000313|Ensembl:ENSLAFP00000014962}; OS Loxodonta africana (African elephant). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta. OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000014962}; RN [1] {ECO:0000313|Ensembl:ENSLAFP00000014962} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000014962}; RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The Genome Sequence of Loxodonta africana (African elephant)."; RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSLAFP00000014962} RP IDENTIFICATION. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000014962}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLAFP00000014962}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9785.ENSLAFP00000014962; -. DR Ensembl; ENSLAFT00000017851; ENSLAFP00000014962; ENSLAFG00000017851. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3TJN8; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007646; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007646}; KW Reference proteome {ECO:0000313|Proteomes:UP000007646}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSLAFP00000014962}. SQ SEQUENCE 262 AA; 30021 MW; 2B208B246821A8DB CRC64; QARLRMPKEQ LELLKRESQT LENNFREILF LIEQIDVLKA LLREMKDGMY NHSWNEDPVE EQDKGILDEE MSNLVSYVLK KLREDQVQMA DYALKSAGAS IIEAGTSESY KNDKAKLYWH GIGFLNYEMP PDIILQPDVH PGKCWAFAGS QGHTLIKLAK KIVPTAVTME HISEKISPSG NISSAPREFS VYGISKECKG EEIFLGHFMY NKKETTVQTF GLQHEVSEYL LCVKLKILSN WGHPNYTCLY RFRVHGNPGN HT // ID G3U268_LOXAF Unreviewed; 442 AA. AC G3U268; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLAFP00000021926}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSLAFP00000021926}; OS Loxodonta africana (African elephant). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta. OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000021926}; RN [1] {ECO:0000313|Ensembl:ENSLAFP00000021926} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000021926}; RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The Genome Sequence of Loxodonta africana (African elephant)."; RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSLAFP00000021926} RP IDENTIFICATION. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000021926}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLAFP00000021926}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9785.ENSLAFP00000013174; -. DR Ensembl; ENSLAFT00000035358; ENSLAFP00000021926; ENSLAFG00000015701. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000007646; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007646}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007646}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 137 157 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 169 191 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 203 237 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 442 AA; 48206 MW; E14E3A92D70A98E6 CRC64; MRRSPRPSSA AAPHKHTPNF YSDNDNNSVS ATSGDSSGHR STGPGPGEPE GRRARGSSCG EPALSAGVPG GTTWAGSSRQ KPAPRSHKGQ TACGAATVRG GASELAGTSV VSEEQLDLLP TLDLRQEMPP PRVSKSFLNQ LFQVLSVLLS LLGDVLVSAS REVCSIRFLL TAVSLLSLFL AALWWGLLYL VPPLENVSGD TVLGEYHERV RSQGQQLQQL QAELDKLHKE MSSVRAANSE RVAKLVFQRL NEDFVRKPDY ALSSVGASID LEKTSRDYED ADTAYFWNRF SFWNYARPPT VILEPDVFPG NCWAFEGDQG QVVIRLPGRV QLSDITLQHP PPSVAHTRGA NSAPRDFAVY GLQVDDETEV FLGKFTFDVE KSEIQTFHLQ NDPPAAFPKV KIQILSNWGH PRFTCLYRVR AHGERTSEGA GDSTTGVTGG LH // ID G3U920_LOXAF Unreviewed; 306 AA. AC G3U920; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLAFP00000024328}; DE Flags: Fragment; GN Name=SUN5 {ECO:0000313|Ensembl:ENSLAFP00000024328}; OS Loxodonta africana (African elephant). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta. OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000024328}; RN [1] {ECO:0000313|Ensembl:ENSLAFP00000024328} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000024328}; RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The Genome Sequence of Loxodonta africana (African elephant)."; RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSLAFP00000024328} RP IDENTIFICATION. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000024328}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLAFP00000024328}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9785.ENSLAFP00000016808; -. DR Ensembl; ENSLAFT00000031931; ENSLAFP00000024328; ENSLAFG00000021496. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000007646; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007646}; KW Reference proteome {ECO:0000313|Proteomes:UP000007646}. FT COILED 84 104 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSLAFP00000024328}. SQ SEQUENCE 306 AA; 35072 MW; D8CE2EFF3D8F9C43 CRC64; WLTCLACFLR TRAQRVLFNT CRCKLFFQKL LEKTGILVLC MFGFWVCSMH LPSKMEVWQD DNPNGPLQSL RIYQEKVRHH TGEIQDLRGS MNQLIAKLQE VEAMSDEQRM AQKIMKMIQG DYIEKPDFAL KSIGKGDKWE RGLSTYNHNK ARSYWNWIRL WNYAQPPDVI LQAGAPNMTP GNCWAFAGDR GQVTIRLAQK IYLSNLTLQH IPKTISLSGS LDTAPKDFVI YGMEGTPKEE VFLGAFQFQP ENIIQMFPLQ NQPARPFGAV KVKISSNWGN PRFTCLYRVR IHGSVAPPGD ELEPLS // ID G3V8K3_RAT Unreviewed; 444 AA. AC G3V8K3; D3Z9U4; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Protein LOC100911109 {ECO:0000313|Ensembl:ENSRNOP00000026578}; DE SubName: Full=Sperm associated antigen 4, isoform CRA_b {ECO:0000313|EMBL:EDL85876.1}; GN Name=LOC100911109 {ECO:0000313|Ensembl:ENSRNOP00000026578, GN ECO:0000313|RGD:6495088}; GN Synonyms=Spag4 {ECO:0000313|EMBL:EDL85876.1, ECO:0000313|RGD:6495088}; GN ORFNames=rCG_37284 {ECO:0000313|EMBL:EDL85876.1}; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116 {ECO:0000313|Ensembl:ENSRNOP00000026578, ECO:0000313|Proteomes:UP000002494}; RN [1] {ECO:0000313|Ensembl:ENSRNOP00000026578, ECO:0000313|Proteomes:UP000002494} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000026578, RC ECO:0000313|Proteomes:UP000002494}; RX PubMed=15057822; DOI=10.1038/nature02426; RG Rat Genome Sequencing Project Consortium; RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J., RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G., RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G., RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G., RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S., RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T., RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., RA Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A., RA Doucette-Stamm L., Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., RA Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., RA Zhu B., Marra M., Schein J., Bosdet I., Fjell C., Jones S., RA Krzywinski M., Mathewson C., Siddiqui A., Wye N., McPherson J., RA Zhao S., Fraser C.M., Shetty J., Shatsman S., Geer K., Chen Y., RA Abramzon S., Nierman W.C., Havlak P.H., Chen R., Durbin K.J., Egan A., RA Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J., RA D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R., RA Kalafus K.J., McLeod M.P., Milosavljevic A., Virk D., Volkov A., RA Wheeler D.A., Zhang Z., Bailey J.A., Eichler E.E., Tuzun E., RA Birney E., Mongin E., Ureta-Vidal A., Woodwark C., Zdobnov E., RA Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J., RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., RA Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., RA Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., RA Poliakov A., Huebner N., Ganten D., Goesele C., Hummel O., RA Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H., RA Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S., RA Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J., RA Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M., RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., RA Webber C., Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., RA Elnitski L., Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., RA Miller W., Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., RA Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., RA Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper G.M., RA Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A., RA Bourque G., Lopez-Otin C., Puente X.S., Chakrabarti K., Chatterji S., RA Dewey C., Pachter L., Bray N., Yap V.B., Caspi A., Tesler G., RA Pevzner P.A., Haussler D., Roskin K.M., Baertsch R., Clawson H., RA Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R., RA Trumbower H., Weirauch M., Cooper D.N., Stenson P.D., Ma B., Brent M., RA Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H., RA Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S., RA Collins F.S.; RT "Genome sequence of the Brown Norway rat yields insights into RT mammalian evolution."; RL Nature 428:493-521(2004). RN [2] {ECO:0000313|EMBL:EDL85876.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=BN {ECO:0000313|EMBL:EDL85876.1}; RX PubMed=15632090; DOI=10.1101/gr.2889405; RA Florea L., Di Francesco V., Miller J., Turner R., Yao A., Harris M., RA Walenz B., Mobarry C., Merkulov G.V., Charlab R., Dew I., Deng Z., RA Istrail S., Li P., Sutton G.; RT "Gene and alternative splicing annotation with AIR."; RL Genome Res. 15:54-66(2005). RN [3] {ECO:0000313|EMBL:EDL85876.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=BN {ECO:0000313|EMBL:EDL85876.1}; RA Mural R.J., Li P.W., Adams M.D., Amanatides P.G., Baden-Tillson H., RA Barnstead M., Chin S.H., Dew I., Evans C.A., Ferriera S., Flanigan M., RA Fosler C., Glodek A., Gu Z., Holt R.A., Jennings D., Kraft C.L., RA Lu F., Nguyen T., Nusskern D.R., Pfannkoch C.M., Sitter C., RA Sutton G.G., Venter J.C., Wang Z., Woodage T., Zheng X.H., Zhong F.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. RN [4] {ECO:0000313|Ensembl:ENSRNOP00000058827} RP IDENTIFICATION. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000058827}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [5] {ECO:0000313|Ensembl:ENSRNOP00000026578} RP IDENTIFICATION. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000026578}; RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. RN [6] {ECO:0000213|PubMed:22673903} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=22673903; RA Lundby A., Secher A., Lage K., Nordsborg N.B., Dmytriyev A., RA Lundby C., Olsen J.V.; RT "Quantitative maps of protein phosphorylation sites across 14 RT different rat organs and tissues."; RL Nat. Commun. 3:876-876(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AABR07054396; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AC118414; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CH474050; EDL85876.1; -; Genomic_DNA. DR RefSeq; XP_003749659.1; XM_003749611.3. DR UniGene; Rn.28620; -. DR STRING; 10116.ENSRNOP00000058827; -. DR Ensembl; ENSRNOT00000026578; ENSRNOP00000026578; ENSRNOG00000019566. DR Ensembl; ENSRNOT00000065052; ENSRNOP00000058827; ENSRNOG00000048056. DR GeneID; 100911109; -. DR KEGG; rno:100911109; -. DR RGD; 6495088; LOC100911109. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 34072438; -. DR Proteomes; UP000002494; Chromosome 3. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002494}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002494}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 137 157 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 169 191 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 211 238 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 444 AA; 48705 MW; B821EE71615FC726 CRC64; MRRSPRPGSA ASSHNHTPNF YSENSNSSHS ATSGDSNGRR SAGPELGEPE GRRARGSSCG EPALSSGVPG GDTWAGSSRP KLAPRSHNGQ TACGAATVRG GASEPSGSPA VLEEQLNLLP ILDLRQEMPP PPVSKSFLSL FFQVLSVFLS LVADGLVCVY REICSIRFLF TAVSLLSIFL AALWWGLLYL IPPLENEPKE MLTLSQYHHR VHSQGQQLQQ LQAELSKLHK EVTSVRAAHS ERVAKLVFQR LNEDFVRKPD YALSSVGASI DLEKTSSDYE DRNTAYFWNR LSFWNYARPP SVILEPDVFP GNCWAFEGEQ GQVVIRLPGH VQLSDITLQH PPPTVAHTGG ASSAPRDFAV FGLQADDDET EVFLGKFIFE VQKSEIQTFH LQNDPPSAFP KVKIQILSNW GHPRFTCLYR VRAHGVRISE SAEDNAMGVT GGPH // ID G3VA61_SARHA Unreviewed; 241 AA. AC G3VA61; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000000065}; DE Flags: Fragment; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSSHAP00000000065}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000000065, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000000065} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000000065} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000000065}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01048487; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9305.ENSSHAP00000000065; -. DR Ensembl; ENSSHAT00000000067; ENSSHAP00000000065; ENSSHAG00000000061. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3VA61; -. DR OMA; WALKTMG; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}. FT COILED 13 33 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSSHAP00000000065}. SQ SEQUENCE 241 AA; 27390 MW; 9EF3C54AA70260D0 CRC64; EYHERVHSQG LQLQQLQAEL DKLHTDVSSI RAANSERVAQ LVFQRLNEDF VQKPDYALSS VGASIDLDKT SHDYEDRDTA YFWNRFSFWN YAKPPTVILE PDVFPGNCWA FQGAKGQVVI RLPGRVQLSD ITLQHPPPSV AHIGGASSAP KDFAVYGLQG DDKTEILLGK FTFDVEKSEI QTFHLKNEPP LAFPKVKIQI LSNWGHPRFT CLYRVRAHGL RSHDMQREDS RTKGEPVTIP H // ID G3VYT1_SARHA Unreviewed; 333 AA. AC G3VYT1; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000008336}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSSHAP00000008336}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000008336, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000008336} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000008336} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000008336}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01086444; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01086445; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_003762593.1; XM_003762545.2. DR STRING; 9305.ENSSHAP00000008336; -. DR Ensembl; ENSSHAT00000008404; ENSSHAP00000008336; ENSSHAG00000007223. DR GeneID; 100920624; -. DR KEGG; shr:100920624; -. DR CTD; 256979; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3VYT1; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 12 32 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 99 119 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 333 AA; 38146 MW; 4411CD8A4A49B915 CRC64; MERVREWLNQ EYFSWKIILS VMVLATFIFI GLHDSEKLRR TGLSHIPRQL YELSADYGSK LYNYQTRIRL SKSKMELLKK GSHYLENNSQ EILSLIKQIN ILKAILKDIK NQLDNYILNA NTDAFGEQDD SYITDEEMMT LVNYVLKKLR EDQVQMADYA LKSAGASIVE AGTSESYKND KAKLYWHGIG FLSYEMPPDV ILQPDVHPGK CWAFPGSKGH TIIKLARKII PTAVTMEHIS EKISPSGNTH SAPKNFSVYG LKDECKGEEI FLGQFMYNKK GTTVQTFQLQ NGVSESFPYV KLKILKNWGH PKYTCLYRFR VHGKPGNDTL GVP // ID G3W2J9_SARHA Unreviewed; 359 AA. AC G3W2J9; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000009654}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSSHAP00000009654}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000009654, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000009654} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000009654} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000009654}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01040125; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01040126; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01040127; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9305.ENSSHAP00000009654; -. DR Ensembl; ENSSHAT00000009739; ENSSHAP00000009654; ENSSHAG00000008355. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3W2J9; -. DR OMA; GTIDFEH; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 88 106 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 137 157 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 359 AA; 41071 MW; 454A3038AC7D6D27 CRC64; MNESPEDSYL ITRTSEQSTS EEDSPQIVTA IIPSSCRNLL SSLIQIICFI SNLRNTVHKL HLFPKYIGRT QTGEATLKNL LHKLVEKLSV VFFCVFAFWC IIIYLPTKLD TDREESGFDS PLSLYIHSSK RLYQEKVRKH AEEIQALHNL MKQISAKIQE VKVMSNEDLV AQNIMKKIQG DYIEKPDFAL KSIGGTIDFE HTSATYSCDK ARSYWSWLRL WNYAHPPDVI LEPNVTPGNC WAFRGDRGQV VIRLARKIFL TNITIQHIPK TISLSGNLDT APKDFVVYGI SDQSREETFL GAFMFQPENA IQMFPLQNTL SRPFNCIKLK ILTNWGNPHF TCIYRVRAHG TVTPSASDY // ID G3W2K0_SARHA Unreviewed; 353 AA. AC G3W2K0; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000009655}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSSHAP00000009655}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000009655, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000009655} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000009655} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000009655}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01040125; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01040126; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01040127; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9305.ENSSHAP00000009654; -. DR Ensembl; ENSSHAT00000009740; ENSSHAP00000009655; ENSSHAG00000008355. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 88 107 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 131 151 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 353 AA; 40324 MW; 45DBF0219B98AF74 CRC64; MNESPEDSYL ITRTSEQSTS EEDSPQIVTA IIPSSCRNLL SSLIQIICFI SNLRNTVHKL HLFPKYIGRT QTGEATLKNL LHKLVEKLSV VFFCVFAFWC IIIYLPTKLD TVPRNSMKGT KSSKRLYQEK VRKHAEEIQA LHNLMKQISA KIQEVKVMSN EDLVAQNIMK KIQGDYIEKP DFALKSIGGT IDFEHTSATY SCDKARSYWS WLRLWNYAHP PDVILEPNVT PGNCWAFRGD RGQVVIRLAR KIFLTNITIQ HIPKTISLSG NLDTAPKDFV VYGISDQSRE ETFLGAFMFQ PENAIQMFPL QNTLSRPFNC IKLKILTNWG NPHFTCIYRV RAHGTVTPSA SDY // ID G3WIF6_SARHA Unreviewed; 926 AA. AC G3WIF6; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000015211}; DE Flags: Fragment; GN Name=SUN1 {ECO:0000313|Ensembl:ENSSHAP00000015211}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000015211, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000015211} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000015211} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000015211}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01075928; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01075929; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01075930; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01075931; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9305.ENSSHAP00000015211; -. DR Ensembl; ENSSHAT00000015337; ENSSHAP00000015211; ENSSHAG00000012971. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G3WIF6; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 402 423 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 435 453 Helical. {ECO:0000256|SAM:Phobius}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSSHAP00000015211}. SQ SEQUENCE 926 AA; 106021 MW; 7B6083D0412442D4 CRC64; FEILNMDFSR LHTYTPPQSV PENTGYTYGL SSSYSSDALD FETEHKLDPV FDFPRLSRRS LHSVPTTYTN DDGQVENTHS YTRNASHKDK ISKTSKHHRN TNKQSLALNH TTRKATSNSS SLLSQSTFDS HASDTSLRSS VLDESLIRKQ TKVVHFWGLD DDGDLKEKLG GTKTVIQGNG DLATGETDTT LNNGYICSDC SMLSERKDVL TAYSTSHVPS SRIYFRDGSQ KRGASIHMNR ILRLAKHTAA SFSSLLVQLF QVVLMKLGYE SENYKLKNYE SKDCESKSYK TKSHESKAHS NYCGCVNVKE FLREDGHLSV NGESLCDDCK GKKHLETYTT THLQSSRSKR VARTIWHTFS YTGYFLMQTL QRIGATGWFV SKKVLSFLWL AIVSPGKAAS GVFWWLGTGW YQFVTLISWL NVFLLTRCLP KICKLLLLLI PLLLLSGIGL YLWNMESFLS LLPIFNWTRI HKTQRIDESR YFFKPDSSHN NQPTEGFNNG FSYLNYFLVV SHTKERTVDC VFKGSNIYNN NLNNQVFKEQ TEKDLFIYFF SSSVRDLLDQ NRMDFLSFQQ ENEFRILKLE DLLGKLFEKD KLIQEELDQT KSRIISGIDE RQHLISKVKH LELELGHLKS ELLTWQDLKT SCDKIEAVHE KVDTQIRETI RLMFSGDHQD GSLEWLLQWL SSKFVSKGDL QILLRDLERQ ILKNITHYVS ERKQIPTPET LLNADSVRIS GITELQARVI VNNALKLYSQ DKTGMVDFAL ESGGGSILST RCSETYETKT ALISVFGIPL WYHSQSPRIV IQPDIYPGNC WAFKGSQGYL VVRLSMMIYP SAFTMEHIPK TLSPTGNITS APKDFSVYGL DNEYQEEGML LGQFVYDQEG ESLQIFQAMK SPGKAFQIVE LRIFSNWGHP EYTCLYRFRV HGELTK // ID G3WIF7_SARHA Unreviewed; 851 AA. AC G3WIF7; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000015212}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSSHAP00000015212}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000015212, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000015212} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000015212} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000015212}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01075928; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01075929; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01075930; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01075931; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_012398246.1; XM_012542792.1. DR RefSeq; XP_012398247.1; XM_012542793.1. DR RefSeq; XP_012398248.1; XM_012542794.1. DR Ensembl; ENSSHAT00000015338; ENSSHAP00000015212; ENSSHAG00000012971. DR GeneID; 100927321; -. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 394 415 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 427 445 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 851 AA; 97112 MW; E17F6CBC0A6C80E6 CRC64; MDFSRLHTYT PPQSVPENTG YTYGLSSSYS SDALDFETEH KLDPVFDFPR LSRRSLHSVP TTYTNDDGQV ENTHSYTRNA SHKDKISKTS KHHRNTNKQS LALNHTTRKA TSNSSSLLSQ STFDSHASDT SLRSSVLDES LIRKQTKVVH FWGLDDDGDL KGGTKTVIQG NGDLATGETD TTLNNGYICS DCSMLSERKD VLTAYSTSHV PSSRIYFRDG SQKRGASIHM NRILRLAKHT AASFSSLLVQ LFQVVLMKLG YESENYKLKN YESKDCESKS YKTKSHESKA HSNYCGCVNV KEFLREDGHL SVNGESLCDD CKGKKHLETY TTTHLQSSRS KRVARTIWHT FSYTGYFLMQ TLQRIGATGW FVSKKVLSFL WLAIVSPGKA ASGVFWWLGT GWYQFVTLIS WLNVFLLTRC LPKICKLLLL LIPLLLLSGI GLYLWNMESF LSLLPIFNWT RIHKTQRIDE SRYFFKPDSS HNNQPTEMDF LSFQQENEFR ILKLEDLLGK LFEKDKLIQE ELDQTKSRII SGIDERQHLI SKVKHLELEL GHLKSELLTW QDLKTSCDKI EAVHEKVDTQ IRETIRLMFS GDHQDGSLEW LLQWLSSKFV SKGDLQILLR DLERQILKNI THYVSERKQI PTPETLLNAD SVRISGITEL QARVIVNNAL KLYSQDKTGM VDFALESGGG SILSTRCSET YETKTALISV FGIPLWYHSQ SPRIVIQPDI YPGNCWAFKG SQGYLVVRLS MMIYPSAFTM EHIPKTLSPT GNITSAPKDF SVYGLDNEYQ EEGMLLGQFV YDQEGESLQI FQAMKSPGKA FQIVELRIFS NWGHPEYTCL YRFRVHGELT K // ID G3WIP7_SARHA Unreviewed; 2624 AA. AC G3WIP7; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000015302}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSSHAP00000015302}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000015302, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000015302} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000015302} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000015302}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01010813; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010814; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010815; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010816; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010817; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9305.ENSSHAP00000015302; -. DR Ensembl; ENSSHAT00000015429; ENSSHAP00000015302; ENSSHAG00000013045. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; G3WIP7; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1254 1274 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2624 AA; 291031 MW; E3659340BE1C873F CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGVIAAF EVNFMDDVGQ TLLNWASAFG TQEMVEFLCE RGADVNRGQR SSSLHYAACF GRPQVAKTLL RHGANPDLRD EDGKTPLDKA RERGHSEVVA ILQSPGDWMC PVNKGDDKKK KDANKDEEEC NEPKGDPEMA PIYLKRLLPV FAQTFQQTML PSIRKASLAL IRKMIHFCSE ALLKEVCDSD AGHNLPTILV EITATVLDQE DDDDGHLLAL QIIRDLVDKG GDLFLDQLAR LGVISKVSTL AGPSSDDENE EESKPEKEDE PQEDAKELQQ GKPYHWRDWS IIRGRDCLYI WSDAAALELS NGSNGWFRFI LDGKLATMYS SGSPEGGSDS SESRSEFLEK LQRARSQVKP STSSQPILSV PGPTKLTVGN WSLTCLKEGE IAIHNSDGQQ ATILKEDLPG FVFESNRGTK HSFTAETSLG SEFVTGWTGK RGRKLKSKLE KTKQKSKKMI RDLYDDHFKA VESMPRGVVV TLRNIATQLE SSWELHTNRQ CIESENTWRD LMKTALENLI VLLKDENTIS PYEMCSSGLV QALLTVLNNV SLFLNIRNSM DLELVERINV FKTAFSENED DERELHSRPA VALIRKLIAV LESIERLPLH LYDTPGSTYN LQILTRRLRF RLERASGETS LIDRTGRMLK MEPLATVESL EQYLLKMVAK QWYDFDRSSF VFVRKLREGQ NFVFRHQHDF DENGIIYWIG TNAKTAYEWV NPAAYGLVVV TSSEGRNLPY GRLEDILSRD SSALNCHSND DKNAWFAIDL GLWVIPSAYT LRHARGYGRS ALRNWVFQVS KDGQNWTTLY THVDDCSLNE PGSTATWPLD PPKDEKQGWR HVRIKQMGKN ASGQTHYLSL SGFELYGTVN GVCEDQLGKA AKEAEANLRR QRRLVRSQVL KYMVPGARVI RGIDWKWRDQ DGSPQGEGTV TGELHNGWID VTWDAGGSNS YRMGAEGKFD LKLAPGYDPD TAASPKPVSS TVSGTTQSWS SLVKNNCPDK TSAAAGSSSR KGSSSSVCSV ASSSDISLGS TKMERRSESV MEQSIVSGTD VHEPIVVLSS AENMPQAEVG SSSSASTSTL TADTGSENAE RKLGPDSSVR TAGESSAISM GIVSVSSPDV SSVSELTNKE AASQRPLSSS ASNRLSVSSL LAAGAPMSSS ASVPNLSSRE TSSLESFVRR VANIARTNAT NNMNLSRSSS DNNTNTLGRN VMSTATSPLM GAQSFPNLTT TGTTSTVTMS TSSVTSSSNV ATATTVLSVG QSLSNTLTTS LTSTSSESDT GQEAEYSLYD FLDSCRASTL LAELDDDEDL PEPDEEDDEN EDDNQEDQEY EEVMVQESGA PRKHAKLKYI FSVHLLSTLF LPACVVGGGG DLFGDREEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQFSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSVEHVE QYLGTDELPK NDLITYLQKN ADSAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGSK SGLSQGAIST FQNCDILSLA KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRSSQ EEGDEQLQFN FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSTVRRDDP GEFRVGRLKH ERVKVPRGDS LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTEL GTWLCDDDFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILGNKSLS EDEKNTKLQD LMLKNPSGSG PPLSIEDLGL NFQFCPSSRV YGFTAVDLKP RGEDEMITMD NAEEYVDLMF DFCMQTGIQK QMEAFRGNDK DGFNKVFPME KLSSFSHEEV QMILCGNQSP SWAAEDIINY TEPKLGYTRD SPGFLRFVRV LCGMSSDERK AFLQFTTGCS TLPPGGLANL HPRLTVVRKV DATDASYPSV NTCVHYLKLP EYSSEEIMRE RLLAATMEKG FHLN // ID G3WIP8_SARHA Unreviewed; 2611 AA. AC G3WIP8; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000015303}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSSHAP00000015303}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000015303, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000015303} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000015303} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000015303}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01010813; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010814; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010815; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010816; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010817; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9305.ENSSHAP00000015302; -. DR Ensembl; ENSSHAT00000015430; ENSSHAP00000015303; ENSSHAG00000013045. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2611 AA; 289530 MW; 79320497242C0964 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDA NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDAGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDL FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPSTS SQPILSVPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKSKKMIRDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL EVKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERASGET SLIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFVFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DSSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTTL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGIDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTAASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKMERRSES VMEQSIVSGT DVHEPIVVLS SAENMPQAEV GSSSSASTST LTADTGSENA ERKLGPDSSV RTAGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TTGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMVQESG APRKHAKLKY IFSVHLLSTL FLPACVVGGG GDLFGDREEE EYETKGGRRR TWDDDYVLKR QFSALVPAFD PRPGRTNVQQ TTDLEIPPPG TPHSELLEEV ECTPSPRLAL TLKVTGLGTT REVELPLTNF RSTIFYYVQK LLQFSCNGNV KSDKLRRIWE PTYTIMYREM KDSDKEKENG KMGCWSVEHV EQYLGTDELP KNDLITYLQK NADSAFLRHW KLTGTNKSIR KNRNCSQLIA AYKDFCEHGS KSGLSQGAIS TFQNCDILSL AKEQPQAKAG NGQNSCGVED VLQLLRILYI VASDPYSRSS QEEGDEQLQF NFPPDEFTSK KITTKILQQI EEPLALASGA LPDWCEQLTS KCPFLIPFET RQLYFTCTAF GASRAIVWLQ NRREATVERT RTTSTVRRDD PGEFRVGRLK HERVKVPRGD SLMEWAENVM QIHADRKSVL EVEFLGEEGT GLGPTLEFYA LVAAEFQRTE LGTWLCDDDF PDDESRHVDL GGGLKPPGYY VQRSCGLFTA PFPQDSDELE RITKLFHFLG IFLAKCIQDN RLVDLPISKP FFKLMCMGDI KSNMSKLIYE SRGDRDLHCT ESQSEASTEE GHDSLSVGSF EEDSKSEFIL DPPKPKPPAW FNGILTWEDF ELVNPHRARF LKEIKDLAIK RRQILGNKSL SEDEKNTKLQ DLMLKNPSGS GPPLSIEDLG LNFQFCPSSR VYGFTAVDLK PRGEDEMITM DNAEEYVDLM FDFCMQTGIQ KQMEAFRDGF NKVFPMEKLS SFSHEEVQMI LCGNQSPSWA AEDIINYTEP KLGYTRDSPG FLRFVRVLCG MSSDERKAFL QFTTGCSTLP PGGLANLHPR LTVVRKVDAT DASYPSVNTC VHYLKLPEYS SEEIMRERLL AATMEKGFHL N // ID G3WIP9_SARHA Unreviewed; 2522 AA. AC G3WIP9; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000015304}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSSHAP00000015304}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000015304, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000015304} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000015304} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000015304}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01010813; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010814; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010815; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010816; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01010817; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_012402407.1; XM_012546953.1. DR STRING; 9305.ENSSHAP00000015302; -. DR Ensembl; ENSSHAT00000015431; ENSSHAP00000015304; ENSSHAG00000013045. DR GeneID; 100917544; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2522 AA; 280148 MW; 298DB74FBC8FB897 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTAADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDA NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDAGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDL FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPSTS SQPILSVPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL EVKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERASGET SLIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFVFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DSSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTTL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGIDWKWRD QDGSPQGEGT VTGELHNGTT QSWSSLVKNN CPDKTSAAAG SSSRKGSSSS VCSVASSSDI SLGSTKMERR SESVMEQSIV SGTDVHEPIV VLSSAENMPQ AEVGSSSSAS TSTLTADTGS ENAERKLGPD SSVRTAGESS AISMGIVSVS SPDVSSVSEL TNKEAASQRP LSSSASNRLS VSSLLAAGAP MSSSASVPNL SSRETSSLES FVRRVANIAR TNATNNMNLS RSSSDNNTNT LGRNVMSTAT SPLMGAQSFP NLTTTGTTST VTMSTSSVTS SSNVATATTV LSVGQSLSNT LTTSLTSTSS ESDTGQEAEY SLYDFLDSCR ASTLLAELDD DEDLPEPDEE DDENEDDNQE DQEYEEVMEE EEYETKGGRR RTWDDDYVLK RQFSALVPAF DPRPGRTNVQ QTTDLEIPPP GTPHSELLEE VECTPSPRLA LTLKVTGLGT TREVELPLTN FRSTIFYYVQ KLLQFSCNGN VKSDKLRRIW EPTYTIMYRE MKDSDKEKEN GKMGCWSVEH VEQYLGTDEL PKNDLITYLQ KNADSAFLRH WKLTGTNKSI RKNRNCSQLI AAYKDFCEHG SKSGLSQGAI STFQNCDILS LAKEQPQAKA GNGQNSCGVE DVLQLLRILY IVASDPYSRS SQEEGDEQLQ FNFPPDEFTS KKITTKILQQ IEEPLALASG ALPDWCEQLT SKCPFLIPFE TRQLYFTCTA FGASRAIVWL QNRREATVER TRTTSTVRRD DPGEFRVGRL KHERVKVPRG DSLMEWAENV MQIHADRKSV LEVEFLGEEG TGLGPTLEFY ALVAAEFQRT ELGTWLCDDD FPDDESRHVD LGGGLKPPGY YVQRSCGLFT APFPQDSDEL ERITKLFHFL GIFLAKCIQD NRLVDLPISK PFFKLMCMGD IKSNMSKLIY ESRGDRDLHC TESQSEASTE EGHDSLSVGS FEEDSKSEFI LDPPKPKPPA WFNGILTWED FELVNPHRAR FLKEIKDLAI KRRQILGNKS LSEDEKNTKL QDLMLKNPSG SGPPLSIEDL GLNFQFCPSS RVYGFTAVDL KPRGEDEMIT MDNAEEYVDL MFDFCMQTGI QKQMEAFRDG FNKVFPMEKL SSFSHEEVQM ILCGNQSPSW AAEDIINYTE PKLGYTRDSP GFLRFVRVLC GMSSDERKAF LQFTTGCSTL PPGGLANLHP RLTVVRKVDA TDASYPSVNT CVHYLKLPEY SSEEIMRERL LAATMEKGFH LN // ID G3X1W4_SARHA Unreviewed; 1195 AA. AC G3X1W4; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000021669}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSSHAP00000021669}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021669, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000021669} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000021669} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000021669}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01155810; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01155811; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01155812; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01155813; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9305.ENSSHAP00000021670; -. DR Ensembl; ENSSHAT00000021843; ENSSHAP00000021669; ENSSHAG00000018351. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR OrthoDB; EOG7MPRDC; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}. FT COILED 927 947 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSSHAP00000021669}. SQ SEQUENCE 1195 AA; 132534 MW; 6E927A424FCF9F99 CRC64; PSWRVCCKES SSPSSYYSQN DNCVLENEDE HVQKEDTQSR ILSSPVVETL PSIDINGDSS SIAANIENVE NISTSSTSEI TPVSKPNEIE NSSADIPLAT LTEIEQSETD CTIGGSLFSD PHVEKHGTLG FHIHSLVGQH IENASSSQDK GITKSEFESV STSEQDADHQ KSALNASENL KEQVADYIKA GDIDPTSVVS PKDPGDIPTF DEWKKKVMEV EKEKSQSMHP SSNGGQHSTK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILIENMDLYM LNPCSTKIWF VIELCEPIQV KQFDIANYEL FSSTPKDFLV SISDRYPTSK WIKLGTFHGR DERTVQSFPL DEQMYAKYVK VELVSHFGSE HFCPLSLIRV FGTSMVEEYE EIADSQYQSE RQELFDEDYD YPLDYGTGED KSSKNLLGSA TNAILNMVNI AANILGAKTE DLAEIGNKSV SENAPATTAS QMSDSEPSPI PSPEFVTAEG HLPDIEPPIP DFPKEGPIVQ LVQEEEEEPS PSTVTLLGND EQEEESPAWS ELETQAYCSE LPPACCVSSF SEYLLRWCSV RVALSRRRSR TMGSREPRSP VPAQTPLPLS PEPVETLLPQ PPSEELDSKG MEKDTETAVA HNLSGAFHEE LVNHTRDAIE LEPSHPPAVS QSVLLDATPE IKSSSKAEIP DPIKNEVGQT VSQLFPQESI IEVYTETEKK SESIVATEKH AVIHETSTVG EVKDSSLRDD LSSIPMILKP SESVLPPEHT PSVADDEDEE AKVTTTTDTY KPLTPPVGEP SPVSDMRDEE QAAEDVLLAI PVHGGLQRTA PDFYAELQNS TDLGYANGNL VHGSNQKESV FMRLNNRIKA LEVNMSLSGR YLEELSQSYM ARHLHPEREI WRLFSPFFVC LFFGGKIRGD KKDQRQTEAI QLLQAQLTNM TQLVSNLSTT VADLKREVSD RQSYLVISLV LCVILGLMLC MQRCRNTSQF DGDYISKLPK NNHYPSPKRC FSSYDDLNLK RRTSFPLMRS KSLQLAGKEV DPDDLYIVEP LKFSPEKKKK RCKYKTEKIE TVKPADTSHP IANGDIKAKK PFTNQRDFSN MGEVYHSSYK GPPSEGSSET SSQSEESYFC GISACTSLCN GQTQKTKTEK RALKRRRSRV QDQGKLIKTL IQTKSGSMPS LHDFIKGNKE ITVGTFGVTA VSGHI // ID G3X1W5_SARHA Unreviewed; 1262 AA. AC G3X1W5; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000021670}; GN Name=SUCO {ECO:0000313|Ensembl:ENSSHAP00000021670}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021670, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000021670} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000021670} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000021670}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01155810; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01155811; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01155812; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01155813; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9305.ENSSHAP00000021670; -. DR Ensembl; ENSSHAT00000021844; ENSSHAP00000021670; ENSSHAG00000018351. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; G3X1W5; -. DR OMA; SSPWFES; -. DR TreeFam; TF105817; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 1262 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003459433. FT COILED 994 1014 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1262 AA; 140203 MW; 7DE40C5E1C3BF9BB CRC64; MLSPPRNVLH MLLVLWIDGL TEFPSWRVCC KESSSPSSYY SQNDNCVLEN EDEHVQKEEE TDKSINTELF GNIDSTMPSA PEHDTLVDDC STDEQDTQSR ILSSPVVETL PSIDINGDSS SIAANIENVE NISTSSTSEI TPVSKPNEIE NSSADIPLAT LTEIEQSETD CTIGGSLFSD PHVEKHGTLG FHIHSLVGQH IENASSSQDK GITKSEFESV STSEQDADHQ KSALNASENL KEQVADYIKA GDIDPTSVVS PKDPGDIPTF DEWKKKVMEV EKEKSQSMHP SSNGGQHSTK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILIENMDLYM LNPCSTKIWF VIELCEPIQV KQFDIANYEL FSSTPKDFLV SISDRYPTSK WIKLGTFHGR DERTVQSFPL DEQMYAKYVK MFIKYIKVEL VSHFGSEHFC PLSLIRVFGT SMVEEYEEIA DSQYQSERQE LFDEDYDYPL DYGTGEDKSS KNLLGSATNA ILNMVNIAAN ILGAKTEDLA EIGNKSVSEN APATTASQMS DSEPSPIPSP EFVTAEGHLP DIEPPIPDFP KEGPIVQLVQ EEEEEPSPST VTLLGNDEQE EESPAWSELE TQAYCSELPP ACCVSSFSEY LLRWCSVRVA LSRRRSRTMG SREPRSPVPA QTPLPLSPEP VETLLPQPPS EELDSKGMEK DTETAVAHNL SGAFHEELVN HTRDAIELEP SHPPAVSQSV LLDATPEIKS SSKAEIPDPI KNEVGQTVSQ LFPQESIIEV YTETEKKSES IVATEKHAVI HETSTVGEVK DSSLRDDLSS IPMILKPSES VLPPEHTPSV ADDEDEEAKV TTTTDTYKPL TPPVGEPSPV SDMRDEEQAA EDVLLAIPVH GGLQRTAPDF YAELQNSTDL GYANGNLVHG SNQKESVFMR LNNRIKALEV NMSLSGRYLE ELSQSYMARH LHPEREIWRL FSPFFVCLFF GGKIRGDKKD QRQTEAIQLL QAQLTNMTQL VSNLSTTVAD LKREVSDRQS YLVISLVLCV ILGLMLCMQR CRNTSQFDGD YISKLPKNNH YPSPKRCFSS YDDLNLKRRT SFPLMRSKSL QLAGKEVDPD DLYIVEPLKF SPEKKKKRCK YKTEKIETVK PADTSHPIAN GDIKAKKPFT NQRDFSNMGE VYHSSYKGPP SEGSSETSSQ SEESYFCGIS ACTSLCNGQT QKTKTEKRAL KRRRSRVQDQ GKLIKTLIQT KSGSMPSLHD FIKGNKEITV GTFGVTAVSG HI // ID G3X1W6_SARHA Unreviewed; 1107 AA. AC G3X1W6; DT 16-NOV-2011, integrated into UniProtKB/TrEMBL. DT 16-NOV-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000021671}; GN Name=SUCO {ECO:0000313|Ensembl:ENSSHAP00000021671}; OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus. OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021671, ECO:0000313|Proteomes:UP000007648}; RN [1] {ECO:0000313|Ensembl:ENSSHAP00000021671} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21709235; DOI=10.1073/pnas.1102838108; RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., RA Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q., RA Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., RA Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., RA Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y., RA Kreiss A., Woods G.M., Jones M.E., Schuster S.C.; RT "Genetic diversity and population structure of the endangered RT marsupial Sarcophilus harrisii (Tasmanian devil)."; RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011). RN [2] {ECO:0000313|Ensembl:ENSSHAP00000021671} RP IDENTIFICATION. RG Ensembl; RL Submitted (SEP-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSSHAP00000021671}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AEFK01155810; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01155811; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01155812; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AEFK01155813; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9305.ENSSHAP00000021670; -. DR Ensembl; ENSSHAT00000021845; ENSSHAP00000021671; ENSSHAG00000018351. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000007648; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007648}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007648}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 1107 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003459664. FT TRANSMEM 1020 1038 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 994 1014 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1107 AA; 123142 MW; 6527806A194D41AA CRC64; MLSPPRNVLH MLLVLWIDGL TEFPSWRVCC KESSSPSSYY SQNDNCVLEN EDEHVQKEEE TDKSINTELF GNIDSTMPSA PEHDTLVDDC STDEQDTQSR ILSSPVVETL PSIDINGDSS SIAANIENVE NISTSSTSEI TPVSKPNEIE NSSADIPLAT LTEIEQSETD CTIGGSLFSD PHVEKHGTLG FHIHSLVGQH IENASSSQDK GITKSEFESV STSEQDADHQ KSALNASENL KEQVADYIKA GDIDPTSVVS PKDPGDIPTF DEWKKKVMEV EKEKSQSMHP SSNGGQHSTK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILIENMDLYM LNPCSTKIWF VIELCEPIQV KQFDIANYEL FSSTPKDFLV SISDRYPTSK WIKLGTFHGR DERTVQSFPL DEQMYAKYVK MFIKYIKVEL VSHFGSEHFC PLSLIRVFGT SMVEEYEEIA DSQYQSERQE LFDEDYDYPL DYGTGEDKSS KNLLGSATNA ILNMVNIAAN ILGAKTEDLA EIGNKSVSEN APATTASQMS DSEPSPIPSP EFVTAEGHLP DIEPPIPDFP KEGPIVQLVQ EEEEEPSPST VTLLGNDEQE EESPAWSELE TQAYCSELPP ACCVSSFSEY LLRWCSVRVA LSRRRSRTMG SREPRSPVPA QTPLPLSPEP VETLLPQPPS EELDSKGMEK DTETAVAHNL SGAFHEELVN HTRDAIELEP SHPPAVSQSV LLDATPEIKS SSKAEIPDPI KNEVGQTVSQ LFPQESIIEV YTETEKKSES IVATEKHAVI HETSTVGEVK DSSLRDDLSS IPMILKPSES VLPPEHTPSV ADDEDEEAKV TTTTDTYKPL TPPVGEPSPV SDMRDEEQAA EDVLLAIPVH GGLQRTAPDF YAELQNSTDL GYANGNLVHG SNQKESVFMR LNNRIKALEV NMSLSGRYLE ELSQSYMARH LHPEREIWRL FSPFFVCLFF GGKIRGDKKD QRQTEAIQLL QAQLTNMTQL VSNLSTTVAD LKREVSDRQS YLVISLVLCV ILGLMLCMQR CRNTSQFDGD YISKLPKNNH YPSPKRCFSS YDDLNLKRRT SFPLMRSKSL QLAGKEGRLC VTFHMYF // ID G3XNT2_ASPNA Unreviewed; 621 AA. AC G3XNT2; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHA27983.1}; GN ORFNames=ASPNIDRAFT_184562 {ECO:0000313|EMBL:EHA27983.1}; OS Aspergillus niger (strain ATCC 1015 / CBS 113.46 / FGSC A1144 / LSHB OS Ac4 / NCTC 3858a / NRRL 328 / USDA 3528.7). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=380704 {ECO:0000313|EMBL:EHA27983.1, ECO:0000313|Proteomes:UP000009038}; RN [1] {ECO:0000313|EMBL:EHA27983.1, ECO:0000313|Proteomes:UP000009038} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 1015 / CBS 113.46 / FGSC A1144 / LSHB Ac4 / NCTC 3858a / RC NRRL 328 / USDA 3528.7 {ECO:0000313|Proteomes:UP000009038}; RX PubMed=21543515; DOI=10.1101/gr.112169.110; RA Andersen M.R., Salazar M.P., Schaap P.J., van de Vondervoort P.J.I., RA Culley D., Thykaer J., Frisvad J.C., Nielsen K.F., Albang R., RA Albermann K., Berka R.M., Braus G.H., Braus-Stromeyer S.A., RA Corrochano L.M., Dai Z., van Dijck P.W.M., Hofmann G., Lasure L.L., RA Magnuson J.K., Menke H., Meijer M., Meijer S.L., Nielsen J.B., RA Nielsen M.L., van Ooyen A.J.J., Pel H.J., Poulsen L., Samson R.A., RA Stam H., Tsang A., van den Brink J.M., Atkins A., Aerts A., RA Shapiro H., Pangilinan J., Salamov A., Lou Y., Lindquist E., Lucas S., RA Grimwood J., Grigoriev I.V., Kubicek C.P., Martinez D., RA van Peij N.N.M.E., Roubos J.A., Nielsen J., Baker S.E.; RT "Comparative genomics of citric-acid-producing Aspergillus niger ATCC RT 1015 versus enzyme-producing CBS 513.88."; RL Genome Res. 21:885-897(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHA27983.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACJE01000002; EHA27983.1; -; Genomic_DNA. DR EnsemblFungi; EHA27983; EHA27983; ASPNIDRAFT_184562. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000009038; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009038}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009038}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 303 322 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 621 AA; 68698 MW; 5019D09A7E27C060 CRC64; MPARRGATRR AGSTRSDIGS ASTYFQSKLG PEARTQALPN LPTKQSFAYG SAETPILPRE LKIQPHMDLT EMADAIDKGI EDAKDRQMKE KETTQDKSRR QKSPSITRSP VRRSRREPTP DELQLLDNLR EATKSPTPVR GNYSNNDQST ATPTPPIPHT LSTASSPAQS LPVPRYPHVP AENLYPSPMG RFGPQLHDGP PLGSSPLPDD SSLYSFTVER AINSDELTRT LSDGKNIKAP PRRFSGLAFA NEPIHEEEEP DSRLLKTKSR SPSLQPSYED FQIEPSPEPE PQSEPDVSTV ARILAGIALA AATVYLVAFG GIPSLSRPPQ YIPMDENNML AVSSLTDQMS RIGAQVSSLA KEMRTVKWDV NEVQSEVRSS PTPIMPPSRG STDLGPPTEQ KTNFLSIGLG VIVIPGLTSP TVGHKLSAWQ WAYVNLWRGS HYRPASPPLA ALVPWEDYGD CWCSTPRDGM SQIGIDLGQK IVPEEVAVEH MPKTATLKPE NAPREMELWA QYVLVQKGTS RPARTQAERF SIHKPIMDAL RSAWPTEDPT AYSDDPLLGP TYYRVGKFTY DIHGSHHVQR FELDAVIDSP EVRVDRVVFR ATSNWGGNHT CIYRLKLFGH V // ID G3Y4J9_ASPNA Unreviewed; 1320 AA. AC G3Y4J9; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 14-OCT-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHA22376.1}; GN ORFNames=ASPNIDRAFT_128861 {ECO:0000313|EMBL:EHA22376.1}; OS Aspergillus niger (strain ATCC 1015 / CBS 113.46 / FGSC A1144 / LSHB OS Ac4 / NCTC 3858a / NRRL 328 / USDA 3528.7). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=380704 {ECO:0000313|EMBL:EHA22376.1, ECO:0000313|Proteomes:UP000009038}; RN [1] {ECO:0000313|EMBL:EHA22376.1, ECO:0000313|Proteomes:UP000009038} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 1015 / CBS 113.46 / FGSC A1144 / LSHB Ac4 / NCTC 3858a / RC NRRL 328 / USDA 3528.7 {ECO:0000313|Proteomes:UP000009038}; RX PubMed=21543515; DOI=10.1101/gr.112169.110; RA Andersen M.R., Salazar M.P., Schaap P.J., van de Vondervoort P.J.I., RA Culley D., Thykaer J., Frisvad J.C., Nielsen K.F., Albang R., RA Albermann K., Berka R.M., Braus G.H., Braus-Stromeyer S.A., RA Corrochano L.M., Dai Z., van Dijck P.W.M., Hofmann G., Lasure L.L., RA Magnuson J.K., Menke H., Meijer M., Meijer S.L., Nielsen J.B., RA Nielsen M.L., van Ooyen A.J.J., Pel H.J., Poulsen L., Samson R.A., RA Stam H., Tsang A., van den Brink J.M., Atkins A., Aerts A., RA Shapiro H., Pangilinan J., Salamov A., Lou Y., Lindquist E., Lucas S., RA Grimwood J., Grigoriev I.V., Kubicek C.P., Martinez D., RA van Peij N.N.M.E., Roubos J.A., Nielsen J., Baker S.E.; RT "Comparative genomics of citric-acid-producing Aspergillus niger ATCC RT 1015 versus enzyme-producing CBS 513.88."; RL Genome Res. 21:885-897(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHA22376.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ACJE01000012; EHA22376.1; -; Genomic_DNA. DR EnsemblFungi; EHA22376; EHA22376; ASPNIDRAFT_128861. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000009038; Unassembled WGS sequence. DR GO; GO:0003995; F:acyl-CoA dehydrogenase activity; IEA:InterPro. DR GO; GO:0050660; F:flavin adenine dinucleotide binding; IEA:InterPro. DR Gene3D; 1.10.540.10; -; 1. DR InterPro; IPR006091; Acyl-CoA_Oxase/DH_cen-dom. DR InterPro; IPR009075; AcylCo_DH/oxidase_C. DR InterPro; IPR013786; AcylCoA_DH/ox_N. DR InterPro; IPR009100; AcylCoA_DH/oxidase_NM_dom. DR InterPro; IPR007727; Spo12. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00441; Acyl-CoA_dh_1; 1. DR Pfam; PF02770; Acyl-CoA_dh_M; 1. DR Pfam; PF02771; Acyl-CoA_dh_N; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR Pfam; PF05032; Spo12; 1. DR SUPFAM; SSF47203; SSF47203; 1. DR SUPFAM; SSF56645; SSF56645; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009038}; KW Reference proteome {ECO:0000313|Proteomes:UP000009038}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 25 {ECO:0000256|SAM:SignalP}. FT CHAIN 26 1320 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003460232. SQ SEQUENCE 1320 AA; 144036 MW; 4EBE905F14424C05 CRC64; MTSWTASQWI PWMTLISTWI DGTTADPSQT ICPAPRWQVA EAEFIQWPQC PETRWEAEPA TPIPAEQQQP LLKAPEETLS AVSVSMASSE SSARPDHELD TESPLDNANF LSFEDWKKQN LAKVGQSAEN VGGRGAAAAA AGKEGRRRPT GINNALDSLG EDVEIELDFG GFGADTPEAA KPTSWGARVS TGVTGGEAGS AGDVDSLAHG VPPAGGVPRS KDAGTTCKER FNYASFDCAA TVLKTNPECT GSSSVLIENK DSYMLNECRA NNKFLILELC DDILVDTVVL ANYEFFSSIF HTFRVSVSDR YPAKLDQWRE LGVYEARNTR EVQAFAVENP LIWARYVKIE FLTHYGNEFF CPLSLIRVHG TTMLEEYKHD GEVSRTDDVV ADEELEPAPV AAEIETIPTV DAAAAAGPIE QKVDEQTPET CPNPGPVVDE AVMMQLWGVP WTCSIHDSPA AGDEGTQASL NRPSATDATP PKGDDAAPLG NEAPVKEAGE QKMTVSPNVD SAPSSATTAG PETTSHGEAD SRSTGFTKEE QSVAAETTRS TATQPPSANP TTQESFFKSV NKRLQMLESN SSLSLLYIEE QSRILRDAFN KVEKRQLAKT STFLEQLNVT VLHELKQFRE QYDNVWKSVA LEFEHQRIQY HQEVHSLSAQ LGVLADELVF QKRVAVIQSI MILFCFGLVL FSRGAVSSYI ELPSMQNMVS RSYSLRSSSP PFGSPSVSPT SSGRRAGGHR RNLSEDSQED GPISPTLAYS PPTPVSDVMS SSEEAENQRG NSLALPEVAP PVRSRSSPPD LKGGEESIEE SSSSGDSPVS HGRNAAINSM EYHRQVLQGK LENGDKNQAS YVSPSDDIMS PCSKKLSDLK GKRFKNSLSC QPPPASPSSL SPSSATGPSV EQFVEKECIP SEAIFRAQLG TGSQRWSTYP AIMETLKQKA REQGLWNMFL PKNHFAQGAG FSNLEYGLMA EYLGKSTIAS EATNNAAPDT GNMEVLAKYG NEQQKKQWLE PLLEGKIRSA FLMTEPEVAS SDATNIQLDI KREGDEWVLN GSKWWSSGAG DPRCAIYLVM GKTDPTNSDP YKQQSVILVP AHNTPGITVH RMLTVYGYDD APHGHGHITF KNVRVPVSNI VLGEGRGFEI IQGRLGPGRI HHAMRTIGAA EKALEWMIAR INDERKKPFG QSLSSHGVIL EWVAKSRIEI DAARLIVLNA AIKIDQGDAK SALKEIAQAK VMVPSMACGV IDRAVQAYGA MGVCQDTPLA YMWAWVRTLR IADGPDEVHL LQMGRRENKS RKEEVRRKLK WQAEETEKLL GVTAGGRSRL // ID G4NC08_MAGO7 Unreviewed; 975 AA. AC G4NC08; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 14-OCT-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHA49011.1}; GN ORFNames=MGG_00469 {ECO:0000313|EMBL:EHA49011.1}; OS Magnaporthe oryzae (strain 70-15 / ATCC MYA-4617 / FGSC 8958) (Rice OS blast fungus) (Pyricularia oryzae). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; Magnaporthaceae; OC Magnaporthe. OX NCBI_TaxID=242507 {ECO:0000313|EMBL:EHA49011.1, ECO:0000313|Proteomes:UP000009058}; RN [1] {ECO:0000313|EMBL:EHA49011.1, ECO:0000313|Proteomes:UP000009058} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=70-15 / ATCC MYA-4617 / FGSC 8958 RC {ECO:0000313|Proteomes:UP000009058}; RX PubMed=15846337; DOI=10.1038/nature03449; RA Dean R.A., Talbot N.J., Ebbole D.J., Farman M.L., Mitchell T.K., RA Orbach M.J., Thon M.R., Kulkarni R., Xu J.-R., Pan H., Read N.D., RA Lee Y.-H., Carbone I., Brown D., Oh Y.Y., Donofrio N., Jeong J.S., RA Soanes D.M., Djonovic S., Kolomiets E., Rehmeyer C., Li W., RA Harding M., Kim S., Lebrun M.-H., Bohnert H., Coughlan S., Butler J., RA Calvo S.E., Ma L.-J., Nicol R., Purcell S., Nusbaum C., Galagan J.E., RA Birren B.W.; RT "The genome sequence of the rice blast fungus Magnaporthe grisea."; RL Nature 434:980-986(2005). RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=70-15; RG The Broad Institute Genome Sequencing Platform; RA Ma L.-J., Dead R., Young S.K., Zeng Q., Gargeya S., Fitzgerald M., RA Haas B., Abouelleil A., Alvarado L., Arachchi H.M., Berlin A., RA Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E., Gearin G., RA Gellesch M., Goldberg J., Griggs A., Gujja S., Heiman D., Howarth C., RA Larson L., Lui A., MacDonald P.J.P., Mehta T., Montmayeur A., RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., RA Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Yandava C., RA Wortman J., Nusbaum C., Birren B.; RT "The Genome Sequence of Magnaporthe oryzae 70-15."; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001235; EHA49011.1; -; Genomic_DNA. DR RefSeq; XP_003718595.1; XM_003718547.1. DR EnsemblFungi; MGG_00469T0; MGG_00469T0; MGG_00469. DR GeneID; 2674226; -. DR KEGG; mgr:MGG_00469; -. DR InParanoid; G4NC08; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000009058; Chromosome 5. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009058}; KW Reference proteome {ECO:0000313|Proteomes:UP000009058}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 975 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003466337. FT COILED 130 150 {ECO:0000256|SAM:Coils}. FT COILED 648 668 {ECO:0000256|SAM:Coils}. FT COILED 703 730 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 975 AA; 105198 MW; B8D5693D6C741D95 CRC64; MRLKATSCNR FCISLVWGLA LLGRHGSAAQ PGRDAQAAAA ASAIGSCEAR TINYITHTLP QQCLRTSWSS ASNDAAAAPA GPIVTSTTGE PTAAPETEGQ NNDQKTDSET GGADDGQDQE LATSSFMSFE EWKELQLRKA EQEADDLKAR NNDDGSGQQQ QQGGEGGRDE GETNLDFDAL SEKVSEITAS GAGSTGEATG KQLEQQKQQQ DSKGEDLAIY DGLDYTRSKD AGKTCKERFS YASFDAGATV LKTGPRAKNA KAILVENKDS YMLLECAQPN KFVIIELSDD VLVDTVVIAN FEFFSSMIRT FRASVSDRYP VKLEKWKVIG TFEARNQRDI QAFLVEHPQI WAKYIRIEFL NHYGSEFYCP ISLVRVHGTR MMDSWKEVEG GRDDDDEAID QGMPTISQQP SEPQPQPAVE EPSPPEPVPA DNATSAPPVV TEMGLTPWEP IFREFSSFEM CEMPEPTATD SGLAGQSSVV SGPETQPMHD GRPHDSVDGN RSTAQKPVSS LTSAIFKETA APRGNVTMPD PAASVYNIEN LVPPPLASTE SPSGAKDPAS TSSTSTQASS SKHVSNSSKH KTLVSKPSSA SAKPAMPKPS STVPSSARNR TGTNTNSAAA ASPTVQESFF KTVAKRLQLL ESNTSLSMQY IEDQSRFLQD ALARMERKQI SRVDTFLDTL NRTVLAELRS VRSQYDQIWQ STVIALESQR DQSQREIVAL SDRLSVLAEE VVFQKRMAIL QSILLLACLV LIIFSRAFGG AASTVTFSGT PRHSRWAFTM PMSPPPSAGP PTGRRYGPGP PSPSPSPSRL GSIPPRSPPE DDDRLAAIDT RHRYADKTLP LTPTSEYDAG GREVTPVIHV VDETGEPSYF DDTASFSSPQ RLGSEASDAT PDLGYGSDTH VEPADPVVIS SEPADEENVS RAEQAKYPPG GEENDGFEAD AADKEVEGSG PPLTRTTLPD FGDSRKPLPA LPESP // ID G4TBV2_PIRID Unreviewed; 919 AA. AC G4TBV2; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCA68778.1}; GN ORFNames=PIIN_02640 {ECO:0000313|EMBL:CCA68778.1}; OS Piriformospora indica (strain DSM 11827). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Sebacinales; Sebacinales group B; Piriformospora. OX NCBI_TaxID=1109443 {ECO:0000313|EMBL:CCA68778.1, ECO:0000313|Proteomes:UP000007148}; RN [1] {ECO:0000313|EMBL:CCA68778.1, ECO:0000313|Proteomes:UP000007148} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=DSM 11827 {ECO:0000313|EMBL:CCA68778.1, RC ECO:0000313|Proteomes:UP000007148}; RX PubMed=22022265; DOI=10.1371/journal.ppat.1002290; RA Zuccaro A., Lahrmann U., Guldener U., Langen G., Pfiffi S., RA Biedenkopf D., Wong P., Samans B., Grimm C., Basiewicz M., Murat C., RA Martin F., Kogel K.H.; RT "Endophytic Life Strategies Decoded by Genome and Transcriptome RT Analyses of the Mutualistic Root Symbiont Piriformospora indica."; RL PLoS Pathog. 7:e1002290-e1002290(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:CCA68778.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAFZ01000040; CCA68778.1; -; Genomic_DNA. DR EnsemblFungi; CCA68778; CCA68778; PIIN_02640. DR InParanoid; G4TBV2; -. DR OMA; DNKAYRP; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000007148; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007148}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007148}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 443 460 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 596 616 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 919 AA; 100250 MW; 5D28529216F1479B CRC64; MNRTATGDSS PAKRRSQQPT TSTSRGYDQS SQQHYQVSSI TQHTVEKPKS KSNLTGDNLR DTSVNVASAF SQAVQDFAMS NGAVTTNGYP RTNGVAERGL GAPPPSRARK PVSRGSNRVL AVQEESDDDF EVNRLRGKSP LVEAASNFVR ALSPGGSLYL RQRAGGAGDA DPSGYQSLTG AFTGIKPQSS QGSSRLVPTQ SQTSHGNVSS DYNYSREESM VNRMEPPPPG KPMSSASASA AAKKKLGPRH SSTIALDKQA YKPPIDEEED DEDDWSEDEK GQRRKRSKHG PAQKKLDHLP TVGAQAKHRR VRKRKGKDGE EESESETVPR DSAVRSLRGS MPPPEPYDDQ MEQSNEHFAD EYTISDAQAH PSQSAFSIGG LLGKMVNMAF HLFGATVALS VNTLTSASIL LLRIIASVFD ICLIQPASFV MDRAQKIFGS IDWSSIGKGV LGLVIAWLFF NSLLGPNANT STGRAWIPGW GQPSPLPSSI PTGDAPAALL EIARRLQDME NKVIDMQYAQ RRAFDRLDSQ ARITDESASK LDSLGSAISK QNLARIESEE KLRSSSAAAI TSLRTELSGL MNQLGHIDTS GADEKLQFFE KRLAVAEANV KDAVEVSKQA LNTANTKAVS TGGTRSGGSI WELFGTGEGK SSLTIKSTDG RDVTNLISAL VDNAVNMRSK DDIAKPDYAS YFAGGRVIPQ LTSQTFRIPA KSYWGSWGFG MFGQQTVEGR PPVTAIHPDI HVGNCWPFKG QQGQLGVVLA RSIIVTDITI DHAPKEVAFD VRSAPKNMEV WGLVEGAENI KKVTEYHRRR EQRYRDLVAA ANREGRKPPP PEDPYPANLP PDGNYIRLAQ FKYDVNAPSH IQTFSVPQDI QDLGVDVGIV VLMVRSNWGE KNWTCLYRFR VSGHDLDRRP YPLEEIDGE // ID G4TG85_PIRID Unreviewed; 1029 AA. AC G4TG85; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCA70331.1}; GN ORFNames=PIIN_04270 {ECO:0000313|EMBL:CCA70331.1}; OS Piriformospora indica (strain DSM 11827). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Sebacinales; Sebacinales group B; Piriformospora. OX NCBI_TaxID=1109443 {ECO:0000313|EMBL:CCA70331.1, ECO:0000313|Proteomes:UP000007148}; RN [1] {ECO:0000313|EMBL:CCA70331.1, ECO:0000313|Proteomes:UP000007148} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=DSM 11827 {ECO:0000313|EMBL:CCA70331.1, RC ECO:0000313|Proteomes:UP000007148}; RX PubMed=22022265; DOI=10.1371/journal.ppat.1002290; RA Zuccaro A., Lahrmann U., Guldener U., Langen G., Pfiffi S., RA Biedenkopf D., Wong P., Samans B., Grimm C., Basiewicz M., Murat C., RA Martin F., Kogel K.H.; RT "Endophytic Life Strategies Decoded by Genome and Transcriptome RT Analyses of the Mutualistic Root Symbiont Piriformospora indica."; RL PLoS Pathog. 7:e1002290-e1002290(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:CCA70331.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAFZ01000079; CCA70331.1; -; Genomic_DNA. DR EnsemblFungi; CCA70331; CCA70331; PIIN_04270. DR InParanoid; G4TG85; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007148; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007148}; KW Reference proteome {ECO:0000313|Proteomes:UP000007148}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 1029 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003468728. SQ SEQUENCE 1029 AA; 112769 MW; F4011A84B8D7D6B6 CRC64; MPPLLSITAI LFVLCVFSHT VFSEDVFEPA LNHSSICSLP PSPLLTFNPV ALNPTCSLSS RPADPAPEFL SFEEWKAKQL AVQESANGKD KDSHAPTTKS NTGNKQDNKN QEEEVASTSN GHKDADVARD VVASVEANDT EHATPSPSAP KYHIPLIDRF NYASNECNAR IHSSHKSAKS SSSILSRKKD RYMLSPCQLN KEKHFVVVEL CEDIRIDTVQ LANFEFFSGV FKDIRVSAAE TYTSDGKGWT VVGEYTAKNI RGIQSFHPQK ELLRFYRYLR VEFLSYYGKE YFCPVSLLRV YGLTQMEEWK SDLWKAEWEA SQAASIDAVT KEPEDTVIIL GAGSLSASAS SGYHIPSTDS APSTTSESSQ TSQESKDPPQ RVESASSENK TAGSRAITES LHEIPSTSHP IPATGGTLQS QPTESRETSE SQETSLRQTQ SSDGHHVVEE VPTSSDEKVS STPPPTTSTP ATSAENGYNE PTTSVVTRTV STTIILSAAT PSSSTAIPNG ESVYRMIMNR ISSLEANQTL YARYVEEQGR SVNIRLELIE EDIGRLGAVM TSQQQRIKKM FERHRIDTER AHNQLAAQVE HLAHEVLMEK RLGILQLVLL LTVLVFMALT RGSRGEPIRL RKAGLVWHKN LRNSADWVTG WRGSASRLPS PDGEHVDSQP NDAIEVAFNL EDSPMRRLQD PPRRPRRSSS QRSARYDDVF QTPPRKSRTV IRSRQNSTHS RNHTPLGAFR RSSNNNMRAT DSATEMPSAS KRAPLGVIDR NGQEDPKQPY HKETWRASGR SVSLSGANPV GLGLGAGGST GNLPDSRNHG SMRRLAKTAH LHEVKNATQR SRGNTVDELR PPLVVSVLSP MTEASTPGFT PNPQTLSHED KSGPTRITGF ESSELRGVGG PPAARSEGVL SDLFSIPTSS SLVSPQAIPF PSPGNRGISE GEQDSEDDDV WVDDEQPGDG DPDEREIVQR SPRTKRSRFS FGSPRKPFVF SSSPPAGTER GSGMMGGGGR GRFRFSKRSD VSMGSIPRG // ID G4UKC8_NEUT9 Unreviewed; 1020 AA. AC G4UKC8; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGZ73477.1}; GN ORFNames=NEUTE2DRAFT_108096 {ECO:0000313|EMBL:EGZ73477.1}; OS Neurospora tetrasperma (strain FGSC 2509 / P0656). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Neurospora. OX NCBI_TaxID=510952 {ECO:0000313|EMBL:EGZ73477.1, ECO:0000313|Proteomes:UP000008513}; RN [1] {ECO:0000313|EMBL:EGZ73477.1, ECO:0000313|Proteomes:UP000008513} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=strain FGSC 2509 / P0656 {ECO:0000313|Proteomes:UP000008513}; RX PubMed=21750257; DOI=10.1534/genetics.111.130690; RA Ellison C.E., Stajich J.E., Jacobson D.J., Natvig D.O., Lapidus A., RA Foster B., Aerts A., Riley R., Lindquist E.A., Grigoriev I.V., RA Taylor J.W.; RT "Massive changes in genome architecture accompany the transition to RT self-fertility in the filamentous fungus Neurospora tetrasperma."; RL Genetics 189:55-69(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL891217; EGZ73477.1; -; Genomic_DNA. DR EnsemblFungi; EGZ73477; EGZ73477; NEUTE2DRAFT_108096. DR OMA; EPPRIAR; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000008513; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008513}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008513}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 496 519 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 111 135 {ECO:0000256|SAM:Coils}. FT COILED 164 320 {ECO:0000256|SAM:Coils}. FT COILED 363 383 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1020 AA; 115457 MW; C811BADE3138DF68 CRC64; MPPRRITRRS VVSPSPALSD TGTPKPGKRG TLIPVEQVRN QTPTRFSLSY GSSLVAMPDR NKTAAGTDLE TAFAEIHETV RTDNIKAEAR RRELDARRGS TTPGPRRPDP IEEETEEEEE EDEEVQEEKE EDDDDNQGGY YNDEEEEEPE PDPPRRATKP AQTLNKLQDQ IEKAKLLEKQ RAEERAAEER EKAEKDAAER ERKRVEKDKK DKEEREKREK AEREKKEQAA KAQQEAKAKA AREAQERAER EAKKRARDEE DQEQAELERA ERNARLKRER SEDARRQAEQ KHAAEAARKK EEQRQAREAS EAEMASLEEA KRQAMRPPPP PSKQLLSTPP TSRTRELVVP DTGNSYVEES DVYTDSEKMR EVLEEEVVRM AQQRRLARYT PEPPEPPRIA RRPASTLSNS FQHAPHQVDQ HQDLFDTEAK SMSDKQYPSF GKVSKPTAAR PNQTSRPRAE QSNTTNGETP PPPYTTAPPT FMQRLLKLIR RSTWGVWKLF TFLVPVLLIG LIVLTASSYG SPDSNTSIRW YGWKHWRSNV GQFIPSHPQL TDDQFNDLKD FILEQSSSTE SAVKNIQSLL PRMVHVKRGP NGDLIIQDDF WHALLDKMLK DSSVLTLDGT GDISEEHWDA LRPRLIKAGL FEKGPSDEHI LQIAEGTVSK SWERWVTKNG EKVAQVVKKH LPGDKGDGVT RDAAISRDEF VGLLKKRIAE HKEEIDGQLD SVKKGLETLI DTTVKAAISN SEGSLSKSEI TTLVRNIVKK EIPRAQLEAA AKDGIMRNYH DYVETQVNHF GLGNEAGIVL SESSPVYRLD SQALPGNKHL SKLLGKPKPI SSKDQVTLEA EYMLALSAWN DVGQCWCAGI TASRGAELAV EMANHVIPQA IVVEHVHPNA TNDPGSMPKD IEIWGYYPDA DDSKRLLAWM DELYPGEREA DMKRVDADNK KSLSLINRKY VKIGELEYDY AKTSGSHGMF VHKLSEELLD LDAATYKVLV RAKTNHGALD HTCIYRLKLF GEELEFEGEE // ID G4UR13_NEUT9 Unreviewed; 1098 AA. AC G4UR13; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 14-OCT-2015, entry version 10. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGZ71913.1}; GN ORFNames=NEUTE2DRAFT_110981 {ECO:0000313|EMBL:EGZ71913.1}; OS Neurospora tetrasperma (strain FGSC 2509 / P0656). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Neurospora. OX NCBI_TaxID=510952 {ECO:0000313|EMBL:EGZ71913.1, ECO:0000313|Proteomes:UP000008513}; RN [1] {ECO:0000313|EMBL:EGZ71913.1, ECO:0000313|Proteomes:UP000008513} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=strain FGSC 2509 / P0656 {ECO:0000313|Proteomes:UP000008513}; RX PubMed=21750257; DOI=10.1534/genetics.111.130690; RA Ellison C.E., Stajich J.E., Jacobson D.J., Natvig D.O., Lapidus A., RA Foster B., Aerts A., Riley R., Lindquist E.A., Grigoriev I.V., RA Taylor J.W.; RT "Massive changes in genome architecture accompany the transition to RT self-fertility in the filamentous fungus Neurospora tetrasperma."; RL Genetics 189:55-69(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; GL891236; EGZ71913.1; -; Genomic_DNA. DR EnsemblFungi; EGZ71913; EGZ71913; NEUTE2DRAFT_110981. DR OMA; LLECHAK; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000008513; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008513}; KW Reference proteome {ECO:0000313|Proteomes:UP000008513}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 1098 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003469897. FT COILED 665 685 {ECO:0000256|SAM:Coils}. FT COILED 720 747 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1098 AA; 119228 MW; ED6494339DD943F3 CRC64; MRTPPTLFLG LLGLHAVAAA LPEPASVCES RTVNYITHTL PQQCLRTAWT TPTAVTSAIA ADTTSSEVPS NETAAPAQAK ETQQQHPDQP KPSAEHTQEQ TKEEQEDEDL AASTFMSFEE WKEMMLRKSG DPANTKGGQK QPAQQRTGGE HDQNGPNSDT DSHRPGDDGE NPLNFDALSE KVSELTSSPS GDPSTDYGSD KARTDDQVVH EDGKTQYYRS KDAGKTCKER FSYSSFDAGA IVKKTSPGAK NAKAILVENK DSYMLLECHA KSKFVIVQLS DDILVDTVVL ANFEFFSSMI RQFKVSVSDR YPVKLDKWVE LGTFEARNSR DIQAFSVEHP QIYTKYIRIE FLSHYGNEYY CPVSLLRVHG TRMLDTWKEP DDRHDDEQET IEAPPVQEQL PQTPEPEQPS PQVGQPSVAS EPAPSTVTEL EEEAHQETEP VQAVELGFTP WEPVFYRDFS FEICDLRSRT TGQSTATSPE ADNKQGRNSD TAKEQASTGS AVHETLVPKA SSTASKPQEI AKAQPASSAA SHTPVPPQVS GTITGSPSNK APLSRSNTAS NETAPSVSPA AKPSGSSNST AGTTSRSDSK DYGNNASANA GTGGSPLNNS SQNNKNNQPR KPASGAGHGG SPTSSAPPLP TIQESFFKTV HKRLTHLESN TSLSLQYIEQ QSRFLQDVLS KLERRQLTRV DTFLDTLNKT VLTELRNVRQ QYDQIWQSTV IALETQREQT EREVVALSGR LNVLADEVVF QKRMAILQSV LLLSCLILVI FNRTGGGGGG GGVNGGGGIA LNSNRGTGGR PGSRGGGGGG GGWFDSPIQA VQRRSMKPGS GWISNMSMSM GMSSPFPFST TVSTSGVQQQ VTAVATAEAR SGSGEDADSV GTSTGVDIAA AQQRNQQQLH PNDNHNLGQR QHQHMLQTQQ HSYAYPRNND KALPLTPTSE YDSREGTPLV HTSPLRQTST TIDEVLAAED ADDDSQLYTQ SSFGPESECV PDQEESSRSS SSEFESGGLT QERTLEIYQE STEPNRNGVT NVPVRSNSAE ESSERIEEDN INLMPVDSIE YHQQQTLRPR ARPSRTHLGS ETVKPLPAVP ETSKFIIT // ID G4V5B4_SCHMA Unreviewed; 1568 AA. AC G4V5B4; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CCD74599.1}; GN ORFNames=Smp_092950 {ECO:0000313|EMBL:CCD74599.1}; OS Schistosoma mansoni (Blood fluke). OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; Strigeidida; OC Schistosomatoidea; Schistosomatidae; Schistosoma. OX NCBI_TaxID=6183 {ECO:0000313|Proteomes:UP000008854}; RN [1] {ECO:0000313|Proteomes:UP000008854} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Puerto Rican {ECO:0000313|Proteomes:UP000008854}; RX PubMed=22253936; DOI=10.1371/journal.pntd.0001455; RA Protasio A.V., Tsai I.J., Babbage A., Nichol S., Hunt M., Aslett M.A., RA De Silva N., Velarde G.S., Anderson T.J., Clark R.C., Davidson C., RA Dillon G.P., Holroyd N.E., LoVerde P.T., Lloyd C., McQuillan J., RA Oliveira G., Otto T.D., Parker-Manuel S.J., Quail M.A., Wilson R.A., RA Zerlotini A., Dunne D.W., Berriman M.; RT "A systematically improved high quality genome and transcriptome of RT the human blood fluke Schistosoma mansoni."; RL PLoS Negl. Trop. Dis. 6:E1455-E1455(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE601624; CCD74599.1; -; Genomic_DNA. DR STRING; 6183.Smp_092950__mRNA; -. DR EnsemblMetazoa; Smp_092950.1; Smp_092950.1:pep; Smp_092950. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; G4V5B4; -. DR Proteomes; UP000008854; Chromosome 1. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008854}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008854}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 1158 1184 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 889 909 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1568 AA; 178121 MW; 06BF871A9077111E CRC64; MPPRTNQESA GLDHKNVRES ILKELPSLDS VAAEGINTHE IPTFNEFHAM VSERSQSRKG QDCEVVENVG VANPDYDINI EQKIKASQIP QGSEKVLQTH IDVNPSVKSS RNQIIQDVPE KSVFTHHSSF SDNIKKDSAS EQHQSLQTNS VVTSNSETNS KSSSHVTSQP VITTTTSATT TITNNDAVHT PGKIDLTLRR NVASVACGAK MLGFSRAIKN PEAVLNENND EYMNVPCSEE KWLVLEVCQP VQLRTIELAN YELFSSRLKS FRVYANDRYP AKSWELIGTF TARDVKGIQS FSVVSGKMIK YVKDKNQHLE SHENDIITKD NAKNERKDDQ TSLNTTSSND GMYDTTGITE DNNHHIVNIS TYDPEIKLTN SDLTSEDYPN NQNDDHDHKH IVDKLSNIDS TNSDSSVDSS TMYHTDDTAS SDHHHKLLGD RIKKYSERTT TGDSLKNHND NYKNKFCEKI DPVKALHKPF CPAGSILYRS INHINDHSTT TNCHTSKVTN SDSPTPHKSH FYPTVNIHAK VPAIMKLPPP IDKIEQNNKS LQNMNTVNLT SKLHNKLFNQ VFNPMNLFTR LANAIKHTIL GIFLQFFPST EDTNNPYLIL NDIVPTLENY HTINYLFNPI GLYNSLCNHS MIYLTMNQIK GLWYLRKCLR LLDVQYSSYS LKSLNVIHPM KIEECSRAIH QLNLTMISHS NDLQDKQSYY YHWPLYELDK SIGHLFIAYQ MMQHNQTLLR SSSSSSYCIE CEKSFSYSSL YDHEKLFIDN YHHSADIVIQ NRNSKSSNSL TSSTSSTSSS MKSLKINRPH SHHDEALVVP AALGGSHRST AYMRLNNRVR IIERNVSVSM RYLEELSQSY RRQMERLSRS FNLTYAWLKV TAHSAEERDR QQQHRISQLE LQLNDLTARI KSRLLNSLPA SSSSSGQPDL TLTSSPLSST TPSSSSSESI ATTTNTEGTI SSSSNSKSSN LVTSLTSPPP LPDFESSLDW NPWLKSQHDD WYMIVDGDMV VTNNNEDVDD TEFESDDSYQ TNDDVYLSRL DGTTGTLSKN YKEKFKQTTT GTTNSKSSDN NIVIPDSSSV SNNPNEYNAK SLFDPYLSTR NYKHQHYHME SSSSSSSSSL FSEWKQFILD LLWLEFSWPV WIYNFGVFIH NFCQMNTMIL NIFSLVLLHL ILASIVHFLI YWIWLRPKNL MLSKFDTELL LSSLYQVLYC RKNHPNFNSC IVYFNSMNST TPTHHLAHLL PSININTTTT TNSNYDNDNP HTTDNVTEHH DDEYDRKRSF DKASNLILTN TMITATNDDD NNNLHTTDNL TEHHDDEYDK KSFGKVSNII LTNTMITTTT TNGNNSSNNN NTNNITVPKV LTCYQMNELS HIYKPTIEMT YDHLIIKTDL IDNDCSIKEI HGNPIVTNVN DQCMNNNNND DDDVHSCCLI KEIQVNKIED NGISSSQGKF DPQLKHVPSL LQLPTIYSSS DNSQFNKDEL KCGENDCSQS SICQLNEFLP QECNKPSCRP SLNQNNNDLE PHLSTSTHLV MNSNLTPPTL IKKISKSMNN NHKRKHKRKR ELDNNGFM // ID G4VTU7_SCHMA Unreviewed; 237 AA. AC G4VTU7; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Sad1/unc-84-like protein {ECO:0000313|EMBL:CCD82101.1}; DE Flags: Fragment; GN ORFNames=Smp_210590 {ECO:0000313|EMBL:CCD82101.1}; OS Schistosoma mansoni (Blood fluke). OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; Strigeidida; OC Schistosomatoidea; Schistosomatidae; Schistosoma. OX NCBI_TaxID=6183 {ECO:0000313|EMBL:CCD82101.1, ECO:0000313|Proteomes:UP000008854}; RN [1] {ECO:0000313|Proteomes:UP000008854} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Puerto Rican {ECO:0000313|Proteomes:UP000008854}; RX PubMed=22253936; DOI=10.1371/journal.pntd.0001455; RA Protasio A.V., Tsai I.J., Babbage A., Nichol S., Hunt M., Aslett M.A., RA De Silva N., Velarde G.S., Anderson T.J., Clark R.C., Davidson C., RA Dillon G.P., Holroyd N.E., LoVerde P.T., Lloyd C., McQuillan J., RA Oliveira G., Otto T.D., Parker-Manuel S.J., Quail M.A., Wilson R.A., RA Zerlotini A., Dunne D.W., Berriman M.; RT "A systematically improved high quality genome and transcriptome of RT the human blood fluke Schistosoma mansoni."; RL PLoS Negl. Trop. Dis. 6:E1455-E1455(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE601631; CCD82101.1; -; Genomic_DNA. DR STRING; 6183.Smp_179730__mRNA; -. DR EnsemblMetazoa; Smp_210590.1; Smp_210590.1:pep; Smp_210590. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; KOG2967; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G4VTU7; -. DR Proteomes; UP000008854; Chromosome W. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008854}; KW Reference proteome {ECO:0000313|Proteomes:UP000008854}. FT NON_TER 1 1 {ECO:0000313|EMBL:CCD82101.1}. SQ SEQUENCE 237 AA; 26510 MW; 83335F80B0012E82 CRC64; TIINELRDSD SVLFTYFRQT VSTLAKESVD KLIYNRHSEL VPHSLENKAT LAKMIDEALH LFAADRTGLT DYALESSGGS IVGTRCTKTY TEGASLFSIF GLPLARLSNS PRTILQPGNN PGDCWPFHGS KGQAIIRLSS PIIISSVTLE HLPRELAPNG RLDSAPRDFL VKALQTEYDD GVVLGEFTYD VNSRPIQNFP IKASFITYHV RHETYRHVSR ISISFNRYLK KILGILK // ID G4YZ67_PHYSP Unreviewed; 658 AA. AC G4YZ67; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGZ23925.1}; GN ORFNames=PHYSODRAFT_481289 {ECO:0000313|EMBL:EGZ23925.1}; OS Phytophthora sojae (strain P6497) (Soybean stem and root rot agent) OS (Phytophthora megasperma f. sp. glycines). OC Eukaryota; Stramenopiles; Oomycetes; Peronosporales; Phytophthora. OX NCBI_TaxID=1094619 {ECO:0000313|Proteomes:UP000002640}; RN [1] {ECO:0000313|EMBL:EGZ23925.1, ECO:0000313|Proteomes:UP000002640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=P6497 {ECO:0000313|EMBL:EGZ23925.1}; RX PubMed=16946064; DOI=10.1126/science.1128796; RA Tyler B.M., Tripathy S., Zhang X., Dehal P., Jiang R.H.Y., Aerts A., RA Arredondo F.D., Baxter L., Bensasson D., Beynon J.L., Chapman J., RA Damasceno C.M.B., Dorrance A.E., Dou D., Dickerman A.W., Dubchak I.L., RA Garbelotto M., Gijzen M., Gordon S.G., Govers F., Grunwald N.J., RA Huang W., Ivors K.L., Jones R.W., Kamoun S., Krampis K., Lamour K.H., RA Lee M.-K., McDonald W.H., Medina M., Meijer H.J.G., Nordberg E.K., RA Maclean D.J., Ospina-Giraldo M.D., Morris P.F., Phuntumart V., RA Putnam N.H., Rash S., Rose J.K.C., Sakihama Y., Salamov A.A., RA Savidor A., Scheuring C.F., Smith B.M., Sobral B.W.S., Terry A., RA Torto-Alalibo T.A., Win J., Xu Z., Zhang H., Grigoriev I.V., RA Rokhsar D.S., Boore J.L.; RT "Phytophthora genome sequences uncover evolutionary origins and RT mechanisms of pathogenesis."; RL Science 313:1261-1266(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH159152; EGZ23925.1; -; Genomic_DNA. DR RefSeq; XP_009519213.1; XM_009520918.1. DR GeneID; 20655350; -. DR KEGG; psoj:PHYSODRAFT_481289; -. DR InParanoid; G4YZ67; -. DR KO; K19347; -. DR Proteomes; UP000002640; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002640}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002640}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 175 197 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 250 284 {ECO:0000256|SAM:Coils}. FT COILED 338 358 {ECO:0000256|SAM:Coils}. FT COILED 388 408 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 658 AA; 73712 MW; A7B47028A9948D9D CRC64; MADGNYSLRS RRSRRSASTS SEDSEDLEPF PRRSTRHYGD YTPEPVQRTL ELRSGDLEDD DEEDEDSDFE ELDDYRGTVY RSTTYRPPQV DAGDVNEPES VHEEEEEEDV DEHEHETSGP DVRRSELKRR AAGAAAYFKQ PQNAGGLWQK ITSSKVVKTT LKLLRRLWRF VLRNSFMAVN VLWLLAPLCC FVIAITVPQY LTTAIQYVDV LSSKVIGVRG TADAGLEKGA MRSLVQEIVD VKLVGMNEEI GVLRQTVQTQ EREIEALKLL HDTLRHSHDE AQQKFSLAES DSAITVHIEK VVAKHTDELW EKFMDTTARL QQDVRIATKQ QSVLSSVVKE QEEKMDSVEN IVKKTVEATA DAADDHARER DMQREFIAWR DSFERDLKSE MKSKVQDIED RMSKVLQEEK QALSSSADAL RGLDATDPGI LRVIEVAVQA VEIKKTGRVD HAALANGASV IHSERDLLYQ ESSSPVQLLT QLFGLYNVGD DGRFTSPSFR HAPAPFLGQL LSSGEIPWWL SRHNGRPETA LSETMEMGSC WGISGSSGRL SVKFAQQIVA DAITIDHIPA QIASDFSSAP NEFRVLGISG HPLRETVEFV PFGNFSYASN GPASQTFKLT SPLSQRSAID GITLEVLSNH GNPEYTCLYR FRVHGQPA // ID G4Z3M1_PHYSP Unreviewed; 583 AA. AC G4Z3M1; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EGZ20090.1}; DE Flags: Fragment; GN ORFNames=PHYSODRAFT_460966 {ECO:0000313|EMBL:EGZ20090.1}; OS Phytophthora sojae (strain P6497) (Soybean stem and root rot agent) OS (Phytophthora megasperma f. sp. glycines). OC Eukaryota; Stramenopiles; Oomycetes; Peronosporales; Phytophthora. OX NCBI_TaxID=1094619 {ECO:0000313|Proteomes:UP000002640}; RN [1] {ECO:0000313|EMBL:EGZ20090.1, ECO:0000313|Proteomes:UP000002640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=P6497 {ECO:0000313|EMBL:EGZ20090.1}; RX PubMed=16946064; DOI=10.1126/science.1128796; RA Tyler B.M., Tripathy S., Zhang X., Dehal P., Jiang R.H.Y., Aerts A., RA Arredondo F.D., Baxter L., Bensasson D., Beynon J.L., Chapman J., RA Damasceno C.M.B., Dorrance A.E., Dou D., Dickerman A.W., Dubchak I.L., RA Garbelotto M., Gijzen M., Gordon S.G., Govers F., Grunwald N.J., RA Huang W., Ivors K.L., Jones R.W., Kamoun S., Krampis K., Lamour K.H., RA Lee M.-K., McDonald W.H., Medina M., Meijer H.J.G., Nordberg E.K., RA Maclean D.J., Ospina-Giraldo M.D., Morris P.F., Phuntumart V., RA Putnam N.H., Rash S., Rose J.K.C., Sakihama Y., Salamov A.A., RA Savidor A., Scheuring C.F., Smith B.M., Sobral B.W.S., Terry A., RA Torto-Alalibo T.A., Win J., Xu Z., Zhang H., Grigoriev I.V., RA Rokhsar D.S., Boore J.L.; RT "Phytophthora genome sequences uncover evolutionary origins and RT mechanisms of pathogenesis."; RL Science 313:1261-1266(2006). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH159153; EGZ20090.1; -; Genomic_DNA. DR RefSeq; XP_009522807.1; XM_009524512.1. DR GeneID; 20653244; -. DR KEGG; psoj:PHYSODRAFT_460966; -. DR InParanoid; G4Z3M1; -. DR Proteomes; UP000002640; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002640}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002640}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 464 486 Helical. FT COILED 425 445 {ECO:0000256|SAM:Coils}. FT COILED 576 583 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EGZ20090.1}. FT NON_TER 583 583 {ECO:0000313|EMBL:EGZ20090.1}. SQ SEQUENCE 583 AA; 63847 MW; 8F3450CB6286657C CRC64; LALSSVRGIV TPDAPSDPTP SASADAPQRI AEPDEPKADT ELADEDVDPL LDVPSGLFEV VDADSVDNRK RQNYASLDAG ATILDAAPDT KSPTNLLVPD KDRYMLTPCS NPRKWVVISL SEDVHADAIA IANYEKFSSP VKDFIVLGSV NYPTDTWLVL GNFTAAHSNG EQIFQLDAQQ HVRYIKFRFL SHYGSEYYCT LSQLRVFGRT FTQVISQLEK SIDAEVEALD AQAAIPAPQL SALPDSAEIS VPRIPDPTEL TSQCLMEKNN TVVAVFYDEP QRIEHYRSHG MCCLVDYTPE KIEAEVAANL ITNEHAASTT ADPIDADVVD GASLSSGSSL HNGASANTSA SVNGSSANAT APSSAATPAS LLPAAHNAAA TSTQGLGRLE SIFVRITKKI QALEVNQSVM GRQLEEFHTH QWAAIKMLQA NQESLNEQLK EIRTMIIDLN EILIVREVIT TMKAGILCAI VLSGFIILFY LLRLLFRCVS KCKERADLRE WFWRMENEES NADEAVKNIP SVDMVAGALR VNRKAQFGSS WDDSAIERKT LVSDMVGDGP QKFRRHKAKR SSQPSTSLKR SRK // ID G5AYT8_HETGA Unreviewed; 733 AA. AC G5AYT8; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Unc-84-like protein B {ECO:0000313|EMBL:EHB02208.1}; GN ORFNames=GW7_02585 {ECO:0000313|EMBL:EHB02208.1}; OS Heterocephalus glaber (Naked mole rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Bathyergidae; Heterocephalus. OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB02208.1, ECO:0000313|Proteomes:UP000006813}; RN [1] {ECO:0000313|EMBL:EHB02208.1, ECO:0000313|Proteomes:UP000006813} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993625; DOI=10.1038/nature10533; RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L., RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X., RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A., RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., RA Bronson R.T., Buffenstein R., Wang B., Han C., Li Q., Chen L., RA Zhao W., Sunyaev S.R., Park T.J., Zhang G., Wang J., Gladyshev V.N.; RT "Genome sequencing reveals insights into physiology and longevity of RT the naked mole rat."; RL Nature 479:223-227(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH167581; EHB02208.1; -; Genomic_DNA. DR RefSeq; XP_004845652.1; XM_004845595.1. DR GeneID; 101703115; -. DR KEGG; hgl:101703115; -. DR CTD; 25777; -. DR InParanoid; G5AYT8; -. DR KO; K19347; -. DR Proteomes; UP000006813; Unassembled WGS sequence. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006813}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006813}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 223 244 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 384 411 {ECO:0000256|SAM:Coils}. FT COILED 420 454 {ECO:0000256|SAM:Coils}. FT COILED 494 514 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 733 AA; 82472 MW; F81E0ECBBFAB2DDF CRC64; MSRRSQRLTR YSQGDDDGGS SSSGGSSVAG SQSTLFKDSP LRTLKKKSSN MKRMSPAPQL GPSSDTHTSY YSESVIQESY IGSPRASSLA RSALLDDHLR SEPYWDEDLR VRRRRGTGGT ESSKANGLVE SKATEDFLGS SSGYSSEDDF AGYSDMDQHS SGSSLGSVVS RAGSFVWTVI TFPGRLFRLL YWWVGTTWYR LTTAASLLDV FVLTRRFSML RTFLWFLLLL LLLTGLTYGA WYFYPFGLQT SHPAVVSWWA ARDSRRQPEV WDSRDTTPHF QGEQHILSRV HSLERRLEAL AAEFSSSWQK QSIRLERLEL WQGSTGHRGG GGLNHEDTLA LLEGLVSRRE AALKEDFLRD TATRIQEELA TVRAEHHHDS EDLFKKIVQA SQESEARLQQ LKSEWQRMTQ ENSVLWENIR KNSVEELGRM EAQLTGLRQE LAALSLKQSE VEDEVGLLPQ KIQAVREDVE SQFPAWVGHF LLHGGGTRAG LLQREEVQAQ LQELESKILT HVTEMQGKST QEAAALLGQT LQKEGMVGVT EEQVHQIVKQ ALQRYSEDRI GMVDYALESG GASVISTRCS EPYETKTALL SLFGIPLWYH SQSPRVILQP DVHPGNCWAF QGPQGFAVVR LSARIRPTAV TLEHVPKALS PNSTISSAPK DFSIFGFDED LQQEGTLLGT FTYNQDGEPI QTFHFQTPKM AEYQVVELRI LTNWGHPEYT CIYRFRVHGE PAH // ID G5B3U6_HETGA Unreviewed; 349 AA. AC G5B3U6; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 7. DE SubName: Full=Sad1/unc-84 domain-containing protein 1 {ECO:0000313|EMBL:EHB03957.1}; GN ORFNames=GW7_10272 {ECO:0000313|EMBL:EHB03957.1}; OS Heterocephalus glaber (Naked mole rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Bathyergidae; Heterocephalus. OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB03957.1, ECO:0000313|Proteomes:UP000006813}; RN [1] {ECO:0000313|EMBL:EHB03957.1, ECO:0000313|Proteomes:UP000006813} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993625; DOI=10.1038/nature10533; RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L., RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X., RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A., RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., RA Bronson R.T., Buffenstein R., Wang B., Han C., Li Q., Chen L., RA Zhao W., Sunyaev S.R., Park T.J., Zhang G., Wang J., Gladyshev V.N.; RT "Genome sequencing reveals insights into physiology and longevity of RT the naked mole rat."; RL Nature 479:223-227(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH168365; EHB03957.1; -; Genomic_DNA. DR InParanoid; G5B3U6; -. DR Proteomes; UP000006813; Unassembled WGS sequence. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006813}; KW Reference proteome {ECO:0000313|Proteomes:UP000006813}. FT COILED 109 129 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 349 AA; 39791 MW; 30EECFDBFB36511E CRC64; MSGNGKLRQG AGSFLGGSDD PSSSSSSTSI PVLLPEHRNP GANGLTRSWK VILSIMLTLT FLLVGFRNHQ WLKETQFPQK YRQLYAVVAE YGTRLYDYQA RMRMPRGQLE LLKKESQALE NNFREILLLI EQMDLLRALL RDMKDGADHD SGSRPGDVTQ DQNGAEEMFT LVNYVLKKLR EDQVQMADYA LKSAESYTNN KTKLYWHGIG LLNHEMPPDI ILQPDVHPGK CWAFPGSQGH ILIRLARKII PMSVTMEHIS EKVSPSRNTS SAPKEFSVHG LMKRCEGQEI FLGQFVYNKT ETTIQTFDLQ HEISESLLCV RLKILSNWGH PNYTCLYRFR VHGIPRDHT // ID G5B848_HETGA Unreviewed; 428 AA. AC G5B848; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 14-OCT-2015, entry version 6. DE SubName: Full=Sperm-associated antigen 4 protein {ECO:0000313|EMBL:EHB05459.1}; GN ORFNames=GW7_21273 {ECO:0000313|EMBL:EHB05459.1}; OS Heterocephalus glaber (Naked mole rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Bathyergidae; Heterocephalus. OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB05459.1, ECO:0000313|Proteomes:UP000006813}; RN [1] {ECO:0000313|EMBL:EHB05459.1, ECO:0000313|Proteomes:UP000006813} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993625; DOI=10.1038/nature10533; RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L., RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X., RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A., RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., RA Bronson R.T., Buffenstein R., Wang B., Han C., Li Q., Chen L., RA Zhao W., Sunyaev S.R., Park T.J., Zhang G., Wang J., Gladyshev V.N.; RT "Genome sequencing reveals insights into physiology and longevity of RT the naked mole rat."; RL Nature 479:223-227(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH168905; EHB05459.1; -; Genomic_DNA. DR InParanoid; G5B848; -. DR Proteomes; UP000006813; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006813}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006813}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 156 178 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 198 225 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 428 AA; 47340 MW; 0B78A9A265493EC9 CRC64; MQRSPHPRTS TCPTSTERTA IAQTASLRGT AAGTGLLGLG PGSLRAEGPG ARAAVSPPCA QDCPEIPHGL EASPGACASK PQRADRTRRG NQPAASPPVS EEQRSLLATL DLRREMPAPR ATKSFLSLLF QVLKVLLSLV RDVLLGVCRE VSSVRFLFAA SLLSVFLAAL SWGFVHLLPP LENEPKEMLT PSEYHERVRS HGQQLQQLQA ELNKLRKEVA RVRAAHSERV AKLVFQRLNE DFVQKPDYAL SSVGASIDLE KTSHDYEDTN MAYFWNRFSF WNYARPPTVI LEPDVFPGNC WAFEGDQGQV VIRLAGHVQL SDITLQHPPP SVAHMGDASS TPRDFAVFGL WVDNETEVFL GKFTFDVKKS PLQTFHLQND PPSAFPKVKI QILSNWGHPR FTCLYRVRAH GLQISEKGED SATVFNPH // ID G5BPW1_HETGA Unreviewed; 323 AA. AC G5BPW1; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 9. DE SubName: Full=Sperm-associated antigen 4-like protein {ECO:0000313|EMBL:EHB11319.1}; GN ORFNames=GW7_12317 {ECO:0000313|EMBL:EHB11319.1}; OS Heterocephalus glaber (Naked mole rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Bathyergidae; Heterocephalus. OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB11319.1, ECO:0000313|Proteomes:UP000006813}; RN [1] {ECO:0000313|EMBL:EHB11319.1, ECO:0000313|Proteomes:UP000006813} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993625; DOI=10.1038/nature10533; RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L., RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X., RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A., RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., RA Bronson R.T., Buffenstein R., Wang B., Han C., Li Q., Chen L., RA Zhao W., Sunyaev S.R., Park T.J., Zhang G., Wang J., Gladyshev V.N.; RT "Genome sequencing reveals insights into physiology and longevity of RT the naked mole rat."; RL Nature 479:223-227(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH171305; EHB11319.1; -; Genomic_DNA. DR InParanoid; G5BPW1; -. DR Proteomes; UP000006813; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006813}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006813}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 34 51 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 88 115 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 323 AA; 37017 MW; A7DAB87664E864FC CRC64; MPRSSRNPRD PCILSEDMAY NTRPRRCKLL CQKLVEKTVI LVLCAFGFWL FSMHIPSKME IWQDNSINSP LQSLRLYQEK VRHHTGEIQD LRGSVNQLIA KLKEMEAMSD EQKMAQKIMK MIQGDYIEKP DFALKSIGKW PPRPPQPTSS ILRGATIDFE HTSATYNHDK ARSYWNWIRL WNYAQPPDVI LEPNMTPGNC WAFVGDRGQV TIRLAQKVYL SNLTLQHIPK TISLSGSLDT APKDFIIYGM ESSPGEEVFL GAFQFQPENT IQMFPLQNHQ PRAFGAVKVK ISSNWGNPRF TCLYRVRVHG SVAPPTQNPA LKE // ID G5BUE9_HETGA Unreviewed; 871 AA. AC G5BUE9; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Unc-84-like protein A {ECO:0000313|EMBL:EHB12910.1}; GN ORFNames=GW7_14513 {ECO:0000313|EMBL:EHB12910.1}; OS Heterocephalus glaber (Naked mole rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Bathyergidae; Heterocephalus. OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB12910.1, ECO:0000313|Proteomes:UP000006813}; RN [1] {ECO:0000313|EMBL:EHB12910.1, ECO:0000313|Proteomes:UP000006813} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993625; DOI=10.1038/nature10533; RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L., RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X., RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A., RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., RA Bronson R.T., Buffenstein R., Wang B., Han C., Li Q., Chen L., RA Zhao W., Sunyaev S.R., Park T.J., Zhang G., Wang J., Gladyshev V.N.; RT "Genome sequencing reveals insights into physiology and longevity of RT the naked mole rat."; RL Nature 479:223-227(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH171957; EHB12910.1; -; Genomic_DNA. DR InParanoid; G5BUE9; -. DR Proteomes; UP000006813; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006813}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006813}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 317 339 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 345 366 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 378 397 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 459 479 {ECO:0000256|SAM:Coils}. FT COILED 528 555 {ECO:0000256|SAM:Coils}. FT COILED 569 589 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 871 AA; 98118 MW; 4947BBA9C5E8A9E1 CRC64; MDFSRLHTYT PPQCVPENTG YTYALSSSYS SDALDFETEH RLDPVFDSPR MSRRSLRLVT AAYSSGDSQA MDAHSCASST ASFKDRVTRT VKQRRSASKQ ASFSINHLSG KATSSSVSQG GSGSLQGTVS LQPPVLDESL IQEQTKVDHF WGLDDDGDLK GGNKAAAQGN GDLAADTAAR NGYTCRDCSM LSERTDVLTA HPATHGPSSR IYSRDRNLKR GVSFYMDRTL WLARYTTSSF ASFLAQLFQV VLMKLNYESD NYKLKNYESK DCESESSHRS YCGRKTVTEL PREDGRPSVH GESLCYFLMQ MLRRARAAGW FVAETVWSVL WLATVAPGGK VASGALWWFG IGWYQFATLI SWLNVFLLTR CLRNTCKFLI LLIPLLLLLG AGLSLWGQSD FFSLLPVLNW TDMRTAERVD DPKDTFRPGS SHLQVDGQAS WWLWENDMRQ QVASFSTQCH NHEEKLRELT VLLQKLQLRV DQMDEGKEGL LLWMKDVVGQ HLQEMSAAGF RGTKTDFMTY HHENEVRLSN MEDILRKLTE KSEAIQKELE HMKLRTTSGA EEQPLLLRME RLEQELGLLR SQLSGWQQLK AGCEKVDAQV KETVRLMFSE DQQGGSLEWL LQKLSSHYVS RDDLQVLLRD LELQVLKNIT HHLVVTGQKL TSETVVSAVS GAGISGITEA QARVIVNNAL KLYSQDKTGM VDFALESGGG SILSTRCSET YETKTALLSL FGIPLWYFSQ SPRVVIQPDI YPGNCWAFKG SQGYLVVRLS MKIHPTMFTV EHIPKTLSPM GNISSAPKDF AVYGLENEYQ EEGQPLGQFT YDQEGESLQM FQALERPDKA FQIVELRILS NWGHPEYTCL YRFRVHGQPA Q // ID G5C6K8_HETGA Unreviewed; 379 AA. AC G5C6K8; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 07-JAN-2015, entry version 6. DE SubName: Full=Sperm-associated antigen 4 protein {ECO:0000313|EMBL:EHB17169.1}; GN ORFNames=GW7_20907 {ECO:0000313|EMBL:EHB17169.1}; OS Heterocephalus glaber (Naked mole rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Bathyergidae; Heterocephalus. OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB17169.1, ECO:0000313|Proteomes:UP000006813}; RN [1] {ECO:0000313|EMBL:EHB17169.1, ECO:0000313|Proteomes:UP000006813} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993625; DOI=10.1038/nature10533; RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L., RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X., RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A., RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., RA Bronson R.T., Buffenstein R., Wang B., Han C., Li Q., Chen L., RA Zhao W., Sunyaev S.R., Park T.J., Zhang G., Wang J., Gladyshev V.N.; RT "Genome sequencing reveals insights into physiology and longevity of RT the naked mole rat."; RL Nature 479:223-227(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH173563; EHB17169.1; -; Genomic_DNA. DR InParanoid; G5C6K8; -. DR Proteomes; UP000006813; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006813}; KW Reference proteome {ECO:0000313|Proteomes:UP000006813}. SQ SEQUENCE 379 AA; 42138 MW; 8B4811D4CAFD073D CRC64; MRRSPRPGLA ASPHEHLPNF YRENSNSSDS VPSGHCSGHR SPGPGPGEPE GRRAGPGWYG KGNLRPGLGE GGRGNLGLGC ARPQWSPWAP AEPAASPPVS EDQRSLLDLR REMPAPRTTK RFLSLLFQVL KVLLSLVRDV LLGVCREVCS VRFLFAASLL SVFLAAFSSG LVDLLPPLEN RVHAAHSERL AKLVFQRLNE DFVQKPDYAL SSVGASIDLE KTSRDYEDKN TSYFWNQFSF WNYARPPTVI LEPDVFPGNC WAFEGDQGQV VIRLAGRIQL SDITLEHPPP RVARTRDASS APRDFAVFGL RVDDETEVFL GKFTFDVKKS ALQTFHLQND PPSAFPKVKI QILSNWGHPR FTCLYRVRAH GLQISRGPH // ID G5C7M6_HETGA Unreviewed; 1189 AA. AC G5C7M6; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 14-OCT-2015, entry version 9. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHB17537.1}; GN ORFNames=GW7_06262 {ECO:0000313|EMBL:EHB17537.1}; OS Heterocephalus glaber (Naked mole rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Bathyergidae; Heterocephalus. OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB17537.1, ECO:0000313|Proteomes:UP000006813}; RN [1] {ECO:0000313|EMBL:EHB17537.1, ECO:0000313|Proteomes:UP000006813} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993625; DOI=10.1038/nature10533; RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L., RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X., RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A., RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., RA Bronson R.T., Buffenstein R., Wang B., Han C., Li Q., Chen L., RA Zhao W., Sunyaev S.R., Park T.J., Zhang G., Wang J., Gladyshev V.N.; RT "Genome sequencing reveals insights into physiology and longevity of RT the naked mole rat."; RL Nature 479:223-227(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH173694; EHB17537.1; -; Genomic_DNA. DR InParanoid; G5C7M6; -. DR Proteomes; UP000006813; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006813}; KW Reference proteome {ECO:0000313|Proteomes:UP000006813}. FT COILED 871 891 {ECO:0000256|SAM:Coils}. FT COILED 921 941 {ECO:0000256|SAM:Coils}. FT COILED 1127 1147 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1189 AA; 132318 MW; CBED6C201D0B16E1 CRC64; MPKRAVSRFL DERKGPINAE LPGKPSSDLP NPPEENKLKD DHIVDVRNAE SEELRQSVIE TLPTVDLHED SSSVVMSNEN VENTSSLSTS EITPVAKLDE IEKSSTIPIA KSSETEQSET DCDVGDASVQ QPSFVSPPES LVGQHIENVS SSHGKGKITK SEFESKVSAN EQGDDSPKSA LNASDNLKNE SSDFIKPGET DPTPPANPKD PEDIPTFDEW KKKVMEVEKE KSQSLHPSSN GGPHAAKKVQ KNRNNYASVE CGAKILAANP EAKSTSAILI ENMDLYMLNP CSTKIWFVIE LCEPIQVKQL DIANYELFSS TPKDFLVSIS DRYPTNKWIK LGTFHGRDER NVQSFPLDEQ MYAKYVKVEL VSHFGSEHFC PLSLIRVFGT SMVEEYEEIA DSQYQSERQE LFDEDYDYPL DYNTVEDKSS KNLLGSATNA ILNMVNIAAN ILGAKTEDLT EGNKSLSENA TATTAPTMPE STPVSTPVPS PEDVTTEVRA TEPSTPDTPK DSPIVQLVQE EEEEASPSTV TLLGSGEQED ESSPWFELET QIFCSELTTI CCISSFSEYI YQWCSVRIAF YRQRSRAAVS KGKDCRVSAQ PSILLPAQSV DVSVLQPPSG ELDSKNMERE SETVILDDLS NAHHGDLINH TVEVIELEPS LFQTLSQSLL LDITPEINSL SKIEGSESVK HETGHTPSQV ITQESSVEFD NETEKKFESF SSTEKLSMIY ETNQVNEVTD NTVKEDVTSI EIITKLSETV VPPINTAIVS DSDSGEAKMN IAHTPKHIVT PAVDSSLPEV KEDEQSPEDA LLRGLQRTAT DFYAELQNST DLGYANGNLI HGSNQKESVF MRLNNRIKAL EVNMSLSGRY LEELSQRYRK QMEEMQKAFN KTIVKLQNTS RIAEEQDQRQ TEAIQLLQAQ LTNMTQLVSN LSTTVAELKQ EVSDRQSYLV IALVLCVVLG LMLCMQRCRN TSQFDGDYIS KLRKSNQYLS PKRCFSSYDD MNLKRRTSFP LIRSKSLQLT DKEVDPNDLY IVEPLKFSPE KKKKRCKYKT EKIETIKPAD PLHPIANGDI KGRKPFTNQR DFSNMGEVYH SSYKGPPSEG SSETSSQSEE SYFCGISACT SLCNGQSQKT KTEKRALKRR RSKVQDQGKL LKTLIQTKSG SLPSLHDIIK GNKEITVGTF GVTAVSGHI // ID G5CAM0_HETGA Unreviewed; 2609 AA. AC G5CAM0; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:EHB18581.1}; GN ORFNames=GW7_02432 {ECO:0000313|EMBL:EHB18581.1}; OS Heterocephalus glaber (Naked mole rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Bathyergidae; Heterocephalus. OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB18581.1, ECO:0000313|Proteomes:UP000006813}; RN [1] {ECO:0000313|EMBL:EHB18581.1, ECO:0000313|Proteomes:UP000006813} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21993625; DOI=10.1038/nature10533; RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L., RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X., RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A., RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., RA Bronson R.T., Buffenstein R., Wang B., Han C., Li Q., Chen L., RA Zhao W., Sunyaev S.R., Park T.J., Zhang G., Wang J., Gladyshev V.N.; RT "Genome sequencing reveals insights into physiology and longevity of RT the naked mole rat."; RL Nature 479:223-227(2011). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH174175; EHB18581.1; -; Genomic_DNA. DR CTD; 25831; -. DR InParanoid; G5CAM0; -. DR Proteomes; UP000006813; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006813}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:EHB18581.1}; KW Reference proteome {ECO:0000313|Proteomes:UP000006813}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2609 AA; 289186 MW; 9AD949998154A098 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPSRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP SKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE GENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DLKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVVPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGLELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN AATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAIST LQNSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EDGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GTWLCDDNFP DDESRHVDLG GGVKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKSLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKALQF TTGCSTLPPG GLANLHPRLT VVRKVDATDA SYPSVNTCVH YLKLPEYSSE EIMRERLLAA TMEKGFHLN // ID G5E798_LOXAF Unreviewed; 342 AA. AC G5E798; DT 14-DEC-2011, integrated into UniProtKB/TrEMBL. DT 14-DEC-2011, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLAFP00000016808}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSLAFP00000016808}; OS Loxodonta africana (African elephant). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Afrotheria; Proboscidea; Elephantidae; Loxodonta. OX NCBI_TaxID=9785 {ECO:0000313|Ensembl:ENSLAFP00000016808}; RN [1] {ECO:0000313|Ensembl:ENSLAFP00000016808} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000016808}; RA Di Palma F., Heiman D., Young S., Johnson J., Lander E.S., RA Lindblad-Toh K.; RT "The Genome Sequence of Loxodonta africana (African elephant)."; RL Submitted (JUN-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSLAFP00000016808} RP IDENTIFICATION. RC STRAIN=Isolate ISIS603380 {ECO:0000313|Ensembl:ENSLAFP00000016808}; RG Ensembl; RL Submitted (OCT-2011) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLAFP00000016808}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 9785.ENSLAFP00000016808; -. DR Ensembl; ENSLAFT00000022967; ENSLAFP00000016808; ENSLAFG00000021496. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; G5E798; -. DR OMA; GNPRFTC; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007646; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007646}; KW Reference proteome {ECO:0000313|Proteomes:UP000007646}. FT COILED 127 147 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 342 AA; 39089 MW; B6B8DA6CC73435E4 CRC64; MPRSSRSPGD PGVLPEDLAH GARDARPRSY LETITEEPLP NTTWLTCLAC FLRTRAQRVL FNTCRCKLFF QKLLEKTGIL VLCMFGFWVC SMHLPSKMEV WQDDNPNGPL QSLRIYQEKV RHHTGEIQDL RGSMNQLIAK LQEVEAMSDE QRMAQKIMKM IQGDYIEKPD FALKSIEGGF HFCQSHNNAT YNHNKARSYW NWIRLWNYAQ PPDVILQPNM TPGNCWAFAG DRGQVTIRLA QKIYLSNLTL QHIPKTISLS GSLDTAPKDF VIYGMEGTPK EEVFLGAFQF QPENIIQMFP LQNQPARPFG AVKVKISSNW GNPRFTCLYR VRIHGSVAPP GD // ID G6CNG5_DANPL Unreviewed; 306 AA. AC G6CNG5; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHJ77238.1}; GN ORFNames=KGM_02790 {ECO:0000313|EMBL:EHJ77238.1}; OS Danaus plexippus (Monarch butterfly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Papilionoidea; Nymphalidae; Danainae; Danaini; Danaina; Danaus; OC Danaus. OX NCBI_TaxID=13037 {ECO:0000313|EMBL:EHJ77238.1, ECO:0000313|Proteomes:UP000007151}; RN [1] {ECO:0000313|EMBL:EHJ77238.1, ECO:0000313|Proteomes:UP000007151} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=F-2 {ECO:0000313|EMBL:EHJ77238.1}; RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052; RA Zhan S., Merlin C., Boore J.L., Reppert S.M.; RT "The monarch butterfly genome yields insights into long-distance RT migration."; RL Cell 147:1171-1185(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHJ77238.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AGBW01002187; EHJ77238.1; -; Genomic_DNA. DR STRING; 13037.EHJ77238; -. DR EnsemblMetazoa; EHJ77238; EHJ77238; KGM_02790. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G6CNG5; -. DR OMA; DFGTFEY; -. DR Proteomes; UP000007151; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007151}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007151}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 18 37 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 53 80 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 306 AA; 34739 MW; EAF81403B7282612 CRC64; MDYELEHDLC FRNAFRSFVC VVLSMLLGLQ VYTYFWASQD SFDGDFSDIK YVVMQLTRGL TDVNRKHEKL QNEMERISAA LPAVAAAAGR ARDALEPRRS PRQLFDVHDY DRQIVDFALE TAGARVIDTG DTLEHFIHES PVGWVLHSIS ALVCRDCLGA NAIIKPGTLP GECWAFKGSK GEATIRLLGT VRITGLSLEH IPAHISPTKE ISSAPRLFQL EGLEFRGDPY PYDFGTFEYE KDGKPIQYFE VLHQPSKGYN LVRLKIFSNW GHPVYTCVYR VRIHGDLAPG QQQHNSNEEE MRIETE // ID G6DJS1_DANPL Unreviewed; 376 AA. AC G6DJS1; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHJ66274.1}; GN ORFNames=KGM_13169 {ECO:0000313|EMBL:EHJ66274.1}; OS Danaus plexippus (Monarch butterfly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Papilionoidea; Nymphalidae; Danainae; Danaini; Danaina; Danaus; OC Danaus. OX NCBI_TaxID=13037 {ECO:0000313|EMBL:EHJ66274.1, ECO:0000313|Proteomes:UP000007151}; RN [1] {ECO:0000313|EMBL:EHJ66274.1, ECO:0000313|Proteomes:UP000007151} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=F-2 {ECO:0000313|EMBL:EHJ66274.1}; RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052; RA Zhan S., Merlin C., Boore J.L., Reppert S.M.; RT "The monarch butterfly genome yields insights into long-distance RT migration."; RL Cell 147:1171-1185(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHJ66274.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AGBW01009754; EHJ66274.1; -; Genomic_DNA. DR STRING; 13037.EHJ66274; -. DR EnsemblMetazoa; EHJ66274; EHJ66274; KGM_13169. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; G6DJS1; -. DR OMA; WVHTSPR; -. DR Proteomes; UP000007151; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007151}; KW Reference proteome {ECO:0000313|Proteomes:UP000007151}. SQ SEQUENCE 376 AA; 43379 MW; F2ECFEFDB84151D6 CRC64; MYKENVPVDA YKPPEISQDS NINERVAALE SWALKVDNRL NYFDKKISVV YNLEARIEEY SVKHLQRNLI RILSDDVNSD AVAEKLKRHF DRNYVSNEQI SLISQEIHER LLNSWQTEMN EDKIRQIVQD YLYDVEKKQM EIIVTKIREY VGEVESRGVM RSSQMDLEEV KNMVMGMLQV YDADRTGKVD YALESAGGQI LSTKCTELYQ IKTKQYSILG IPVWWVHTSP RHALTPGAMP AECWAFQGFP GYLVIRTYAV IEVTGFTLEH MSRLLAVEGK IESAPKNFSV YGLHNEMDAE PHLFGDYMYD ANGTSIQHFP VKYPKTTNIG GVQYPVAYDI IELRIESNHG NPTYTCVYRF RVHGNPLTDI RKADNI // ID G6DKZ4_DANPL Unreviewed; 2449 AA. AC G6DKZ4; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Putative hect E3 ubiquitin ligase {ECO:0000313|EMBL:EHJ65831.1}; GN ORFNames=KGM_08715 {ECO:0000313|EMBL:EHJ65831.1}; OS Danaus plexippus (Monarch butterfly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Papilionoidea; Nymphalidae; Danainae; Danaini; Danaina; Danaus; OC Danaus. OX NCBI_TaxID=13037 {ECO:0000313|EMBL:EHJ65831.1, ECO:0000313|Proteomes:UP000007151}; RN [1] {ECO:0000313|EMBL:EHJ65831.1, ECO:0000313|Proteomes:UP000007151} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=F-2 {ECO:0000313|EMBL:EHJ65831.1}; RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052; RA Zhan S., Merlin C., Boore J.L., Reppert S.M.; RT "The monarch butterfly genome yields insights into long-distance RT migration."; RL Cell 147:1171-1185(2011). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHJ65831.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AGBW01010246; EHJ65831.1; -; Genomic_DNA. DR STRING; 13037.EHJ65831; -. DR EnsemblMetazoa; EHJ65831; EHJ65831; KGM_08715. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; G6DKZ4; -. DR OMA; NRQCIEG; -. DR Proteomes; UP000007151; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 1. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000007151}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:EHJ65831.1}; KW Reference proteome {ECO:0000313|Proteomes:UP000007151}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2449 AA; 266797 MW; 88B8AB54E926B5FB CRC64; MFVVNLYAVE VGVGRKSRAL LKLLSLQRHG PGNFVSYQII LQRYGSLSVS YNIKMAEVDP ETLLEWLLTG QGDERDMQLI ALEQLCMLLL MSDNVDRCFE SCPPRTFLPA LCKIFLDECA PDNVLEVTAR AITYYLDVSA ECTRRIVAIE GAVKAICSRL LTVDPNNRTS KDLAEQCIKV LELVCTREAG AVWEGGGLPS VLHFITHHGT SVHKDTLHSA MAVVSRVCGK MEPGDARVGD AVSSLSTLLR HSDARVSDAA LRCFASLADR FARAHADPAP LAQHGLIEEL VRRLGTTENS DDKCLMPSVS TTVSLLSTLC RGSEQITHDL VRLDLCSAIE TAVQADERWC LECMRLVDLL LVLLCEGRHA IQTSNRMGSA SSSGAGGEGS SNATATGSRG DKSHRQLIDC IRGKDTDALI TSVTSGSVDV NFTDDVGQTL LNWAAAFGTR EMVEFLCEKG ADVNRGQRSS SLHYAACFGR PAIAKVLLRY GANADLRDED GKTPLDKARE RHDQGHREVA AILQCPGEWL VVSNQDSPAS SNDEDFPETG DKEMAAIYLE RLIPVFCARY IGAGGAGVRR ACLSLVRKMV HYAPAARLRA LSTPRAASLL TRLLAHVLDT QGESPRRSSR VLRSRYIRPP ADDDDGHLTV LGIAEELMVK AADIYLEQFA RLGVFSKVEA LAAAPANEMN ADGETVITTG VSEDATSLSS GCAYWWGEWS LCRGRDALYC WSDAAALELS TGSNGWFRFL LDGKLATMYS SGSPEHQTDN TENRGEFIDK LQRARASVKN CVPQSILSKP GPTKLVLGNW VISCKKEKEL HIHNTDGQQQ TTILREDLPG FIFESNRGTK HSFTAETFLG PELASGWADR RPVVSNGQSH SRSRLSAKSE AQKAQVSERA RALYTRHLAS AAGRQPRAPV ARLRALLSKL QTLATNPNGD WQQELKSSLE QLTELLCGEE LLSAYELQSS GLAPALLQVL SPQPNDKPGQ LSDREGVVRG WAWSGNPEGS CGAALAGRLV AVLESVERLP VLAPAPDAPP HQPPTLHHLT KRIRLRVERA NEETSEESAA NNNAGRSLKV EALTTIRQLE RFLAKSVARQ WYDMDRSTFH FVQKIKSEAP MTFTYDHDFD ENGVLYFIGS NGGTCEWVNP GAHGLVSVWS SDGKQLPYGR AEDALSRSPE PLNVHTNDDR RAFIAVDLGL HLVPSAYTLR HARGYGRSAL RNWLFQMSMD GVSWTTLVAH CDEQALQEPG STATWRVRVD AHYRYLRIQQ NGKNASGQSH YLSLSGLEIY GKVVSVVDTP PRQVGSTSTS SSCSGARARR WSRGARGLCA GARVMRGVDW KWRDQDGPHP SVGTVTSDLH NGWVDVRWDH GGRNSYRMGA EGKFDLKVVG GGATGACGGE GARASRKSHS TPSLPDATGT EQQVSVASTE QASSADNISS EVGGNMARPR GNAADLSAIN TSTHHINTDL ATIVESLTLG AESNNCMSDL GNTSFTNMEM GPTSITDITK PYPAKEPLPD STSQEMGSLR CDGEAMRNSA NALLSSELLA LPASLLHTLR NNANRLHIQC EENEGGEAQF GEHKKDAQAA AGAMSASEPD LTQQGAARLL ESLGVGRGAG AGRGAATQRS SRSNHSALLF PSLVRLALSS NFPGGLLSAA QSYPSLAPNA QNALTLSLTS TSSESEQVSL EDFLESCRAP ALLTELEDDD EGDDALDSDK ENEPTYQDVV SRNLLSLMEE EALEAVRGSG GGGSGGGSAG AGGARSRKPW DDDFVLKRQF SALIPAFDPR PGRTNLNQTV DLDIPLNDDS DTESTEATTS TERNGNVPES NGNVIATKLP ALRLVLSAGG LSLPLEKPSW TARQILMKIY SFGNLVIFFY RLTYKEIEGV ETFSSSCDSD EDEPGDPDTS SVYSEGGSSE GMVTWCVRAL RRLRTVAPGL PPASFLSTKL TNKLHHQLQD PLTLAAAATP RWCQQLNDWC PFLFPLETRQ MFFACTAFGT SRTIVWLQAQ RDRALDRQRT GNTVSPRRAE LEATEFRMGR LRHERVRIPR HPNLLRSAMQ VMRVHAARKS VLEVEFAGEE GTGLGPTLEF YALVAAELQR ADLSMWLSDS PAPTLDAELA PLPLTLPDEK PPGYYVTRAG GLFPAPLPQD SPICDKVCKY FWFLGVFLAK VLQDGRLVDL PLSEPFLRIM CGEELTNADL EEIDPIRHRF LASVLAAAEQ YEALKQESSL SESEVQERAA ALTVDGATFE ELSLTMTHVG GRDAYPLCDG GEHVEVGPSN ARLYAEASAR YMVRDGVANQ TEAFRRGFGG VFPPRRLRAF TPPELRLLLC GERGPAWTRE HLLQYTEPKL GYTRDSPGFL RLVDVLVEMS IRERKAFLQF ATGCSSLPPG GLANLHPRLT VVRKVDAGDG SYPSVNTCVH YLKLPEYSCK EVLRERLLAA TNERGFHLN // ID G6DR16_DANPL Unreviewed; 818 AA. AC G6DR16; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHJ64063.1}; GN ORFNames=KGM_19724 {ECO:0000313|EMBL:EHJ64063.1}; OS Danaus plexippus (Monarch butterfly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Papilionoidea; Nymphalidae; Danainae; Danaini; Danaina; Danaus; OC Danaus. OX NCBI_TaxID=13037 {ECO:0000313|EMBL:EHJ64063.1, ECO:0000313|Proteomes:UP000007151}; RN [1] {ECO:0000313|EMBL:EHJ64063.1, ECO:0000313|Proteomes:UP000007151} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=F-2 {ECO:0000313|EMBL:EHJ64063.1}; RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052; RA Zhan S., Merlin C., Boore J.L., Reppert S.M.; RT "The monarch butterfly genome yields insights into long-distance RT migration."; RL Cell 147:1171-1185(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHJ64063.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AGBW01012449; EHJ64063.1; -; Genomic_DNA. DR STRING; 13037.EHJ64063; -. DR EnsemblMetazoa; EHJ64063; EHJ64063; KGM_19724. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; G6DR16; -. DR OMA; DAVMSIM; -. DR Proteomes; UP000007151; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007151}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007151}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 560 583 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 493 513 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 818 AA; 92990 MW; 61822B661C37DCE9 CRC64; MLNTCNSRIW FVVELCEAVQ AQKLEIANFE LFSSTPKDIA VYFSDRFPTR EWASVGQFTA EEMRDVQSFD LYPHLFGKFI KVEMLSHHGS EHYCPISLFK VYGTSEFEVL EKESSQHSAH VDDDEDDEII DVPDIPAAET EPSKNLFGSA RDAVMSIMKK AAQALVKTEV PKNISSERND TSTDNMYKKC CSPSHIIVCD NCSETLYNDV YELLSCSTDR LTSLLRQVFL KDTLKCTSVC QMYGLDFKST KTIEFHEERV AYINALFPPK YLAALCNILA IKEKKVVLNT SFETETNVTS NITVEESAQN VNSNINTEQD LTPIRSNETD DHKDNDATHV EDKNVAPEQA PEYVTEATID DSKIEICKDD IQPTLDEKQT TPEPQEDALA EEKQGDALSE NETKDSSNGK DATEKSKETA NNADEISDQV IMDSDSFISD LDQIAVDPTP AGNTAAVSHN QAQATLQKES VFLRLSNRVK TLERNMSLSG QYLEELSRRY KKQVEEMQRS FEKTMLQVTE ERRKSNEREQ KYLEQMTVLQ EQLSQVTLAI TILMEEKDGW FGNITFVKFI IYQAVIFALA FYYMTKRKQE TVVAPVPKKI KKKQDRFRRK SVEGVSGHST PSVKKRRPSE EAFQIARLSC DDNECDEAPG EWQVAKKNRR RKTSIVHRNL ELEVKTTHNQ EGLIQLQENT ITLDEAEFLA PVSEPKEFTE NEVKEKELPK TNGSFFNNLK NKTMKTRRLS SPAFLRTFNR QSVRSTPSPD VRNIEPIFNG KISKKAASES PTGSLWSEST DISQNGHSEN GNGNKKKKSL KNILRKVF // ID G7E548_MIXOS Unreviewed; 651 AA. AC G7E548; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:GAA97958.1}; GN Name=Mo04638 {ECO:0000313|EMBL:GAA97958.1}; GN ORFNames=E5Q_04638 {ECO:0000313|EMBL:GAA97958.1}; OS Mixia osmundae (strain CBS 9802 / IAM 14324 / JCM 22182 / KY 12970). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Mixiomycetes; Mixiales; Mixiaceae; Mixia. OX NCBI_TaxID=764103 {ECO:0000313|EMBL:GAA97958.1, ECO:0000313|Proteomes:UP000009131}; RN [1] {ECO:0000313|Proteomes:UP000009131} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 9802 / IAM 14324 / JCM 22182 / KY 12970 RC {ECO:0000313|Proteomes:UP000009131}; RX PubMed=21478649; DOI=10.2323/jgam.57.63; RA Nishida H., Nagatsuka Y., Sugiyama J.; RT "Draft genome sequencing of the enigmatic basidiomycete Mixia RT osmundae."; RL J. Gen. Appl. Microbiol. 57:63-67(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:GAA97958.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BABT02000148; GAA97958.1; -; Genomic_DNA. DR EnsemblFungi; GAA97958; GAA97958; E5Q_04638. DR InParanoid; G7E548; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000009131; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009131}; KW Reference proteome {ECO:0000313|Proteomes:UP000009131}. SQ SEQUENCE 651 AA; 71491 MW; A0661DFC161DF750 CRC64; MTEHEPANYI RVETVDLASL VHFVRDGREQ DCNLALSFIS LHSHTGPGTH PDIVPTQDTA TSKGLANSGM TAPEIRITAV SPAPSESLHH IPKRSAIGTS AAGALDSASQ IRLSGQDDHI RMTTCSPSLS INHPRGKVTA ATRRLDEDLS AYQAEHLDSK ARQTQRNLER SPGSPRFEAT SGDNLDYSLT ESNASESDDA EAYPVPGRYG IIAAQALKRA YASLDDPMVT AALIAIAFAM LSMLYSDKLI QLKNPAQTEA RDLHLLSSRV ASLTDAHDLL SAQIKQTYDA FADELLEMND TLSRVQSQSE TLSKMMVTIV EKNGQLKQSF EHLSEQVQER CASDERGALS GVATQQVRDM IKEAMKEQMG ESLAASGEVQ TAASIDVSAV FETVKRSLVE ELQADIYKNS NGNGSLSHDD GMRTVLQKAV QREVAIAQDG VAMSDVALAS AGARVIKKWT GKSFIKTTNP IKKWYATRHP FRPPEYAMDI DRTAGHCWAF NGNRTTLGIQ FGGPVYITHI SLDHAPRSVW PSVAIAPKDF EVWGILNDDT DRPYWNLVRQ SEEQRLAVLK YGDDRGKMVY LTNGTYTPAD DYFGATQTWP IVASAKELNL AYEMILLKIT STHGAENGCL YRFRVHGGLK YREGFNRTSI V // ID G7E7C3_MIXOS Unreviewed; 869 AA. AC G7E7C3; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:GAA98733.1}; GN Name=Mo05421 {ECO:0000313|EMBL:GAA98733.1}; GN ORFNames=E5Q_05421 {ECO:0000313|EMBL:GAA98733.1}; OS Mixia osmundae (strain CBS 9802 / IAM 14324 / JCM 22182 / KY 12970). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Mixiomycetes; Mixiales; Mixiaceae; Mixia. OX NCBI_TaxID=764103 {ECO:0000313|EMBL:GAA98733.1, ECO:0000313|Proteomes:UP000009131}; RN [1] {ECO:0000313|Proteomes:UP000009131} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 9802 / IAM 14324 / JCM 22182 / KY 12970 RC {ECO:0000313|Proteomes:UP000009131}; RX PubMed=21478649; DOI=10.2323/jgam.57.63; RA Nishida H., Nagatsuka Y., Sugiyama J.; RT "Draft genome sequencing of the enigmatic basidiomycete Mixia RT osmundae."; RL J. Gen. Appl. Microbiol. 57:63-67(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:GAA98733.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BABT02000163; GAA98733.1; -; Genomic_DNA. DR EnsemblFungi; GAA98733; GAA98733; E5Q_05421. DR InParanoid; G7E7C3; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000009131; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009131}; KW Reference proteome {ECO:0000313|Proteomes:UP000009131}. FT COILED 506 526 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 869 AA; 93330 MW; 3B9ADCBFF73C1E0B CRC64; MFSSSTPVGG RRRSTAVFNA QLSSTVSSLE PAAGDHQPVG PRSRSTVSER ARSRGRSAHR EILSASLVSS QESHNRDPEA PSTSAAGRHS SEASAHPVAP ASLEYSLVAD SAASGSGTAP QPPITEMLSQ AESKRKRSSA GSPVGDEQRN PNRLTDSANA DDHDLASDDE ATSQLDVGRK GIRMNSYGQL RDTVASPAQP LHSETRVHGT PTRGGSQSRS QASPRRARNA ASHYPALSPA STIQLSPAVA LTTARRHVQV TPLSTPDSLL SRHGGDIEIG VPSPIKPTAE PSNPESLRRN DSLSMSGDDE LSAYEDEFHL RNVRSNRPGG NALGAAAPHA GLASASAAED AQSIVAALEQ SQGDPEHNAG STGPVVTSGD RPGHTTTKLA VPRLPAKKQQ QHHSPSSGGL RASESDDAHA QPVPGRYGIK LAQLLKRAYA SLNGVVGAAA FIAIAFAVLS MLYPDKLIQL KNPAQTEARD FHLLSARVAS LKDAHDLLKA HVKQTHEAFE DELRAMNDTL SRVQSQSEIL SKMMGTTVEE NGQLKQSFEH LNEQVQERCT SDERGALSGV ATQQVSDMIK QAMKEQMGGS PGASGEMQTA ASMDVSAVFE TVKSSLVEEL QADIYKNSNG NGSLSHDDGM RTVLQNAVQR EVAIAQDGVA MSDVALASAG ARVIKKWTGK SFIKTTNPIK KWYATRHPFR PPEYAMDIDR TAGHCWAFKG NRTTLGIQFG GPVFITHISL DHAPRSVWPS VAIAPKDFEV WGILNDDTDR PYWNLVRQSE EQRLAVLKYG DDRGKMVYLT NGTYTPADDY FGATQTWPIV ASAKELNLAY EMILLKITST HGAENGCLYR FRVHGGLKYR EGFNRTSIV // ID G7E831_MIXOS Unreviewed; 932 AA. AC G7E831; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:GAA98991.1}; GN Name=Mo05680 {ECO:0000313|EMBL:GAA98991.1}; GN ORFNames=E5Q_05680 {ECO:0000313|EMBL:GAA98991.1}, GN L969DRAFT_51228 {ECO:0000313|EMBL:KEI38591.1}; OS Mixia osmundae (strain CBS 9802 / IAM 14324 / JCM 22182 / KY 12970). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Mixiomycetes; Mixiales; Mixiaceae; Mixia. OX NCBI_TaxID=764103 {ECO:0000313|EMBL:GAA98991.1, ECO:0000313|Proteomes:UP000009131}; RN [1] {ECO:0000313|Proteomes:UP000009131} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 9802 / IAM 14324 / JCM 22182 / KY 12970 RC {ECO:0000313|Proteomes:UP000009131}; RX PubMed=21478649; DOI=10.2323/jgam.57.63; RA Nishida H., Nagatsuka Y., Sugiyama J.; RT "Draft genome sequencing of the enigmatic basidiomycete Mixia RT osmundae."; RL J. Gen. Appl. Microbiol. 57:63-67(2011). RN [2] {ECO:0000313|EMBL:GAA98991.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=IAM 14324 {ECO:0000313|EMBL:GAA98991.1}; RX PubMed=22724063; DOI=10.1098/rsob.120043; RA Nishida H., Kondo S., Matsumoto T., Suzuki Y., Yoshikawa H., RA Taylor T.D., Sugiyama J.; RT "Characteristics of nucleosomes and linker DNA regions on the genome RT of the basidiomycete Mixia osmundae revealed by mono- and dinucleosome RT mapping."; RL Open Biol. 2:120043-120043(2012). RN [3] {ECO:0000313|EMBL:KEI38591.1, ECO:0000313|Proteomes:UP000027399} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 9802 / IAM 14324 / JCM 22182 / KY 12970 RC {ECO:0000313|Proteomes:UP000027399}, and RC IAM 14324 {ECO:0000313|EMBL:KEI38591.1}; RX PubMed=24372469; DOI=10.1111/nph.12653; RA Toome M., Ohm R.A., Riley R.W., James T.Y., Lazarus K.L., RA Henrissat B., Albu S., Boyd A., Chow J., Clum A., Heller G., RA Lipzen A., Nolan M., Sandor L., Zvenigorodsky N., Grigoriev I.V., RA Spatafora J.W., Aime M.C.; RT "Genome sequencing provides insight into the reproductive biology, RT nutritional mode and ploidy of the fern pathogen Mixia osmundae."; RL New Phytol. 202:554-564(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BABT02000165; GAA98991.1; -; Genomic_DNA. DR EMBL; KL411548; KEI38591.1; -; Genomic_DNA. DR EnsemblFungi; GAA98991; GAA98991; E5Q_05680. DR EnsemblFungi; KEI38591; KEI38591; L969DRAFT_51228. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000009131; Unassembled WGS sequence. DR Proteomes; UP000027399; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009131}; KW Reference proteome {ECO:0000313|Proteomes:UP000009131}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 16 {ECO:0000256|SAM:SignalP}. FT CHAIN 17 932 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5005349734. FT COILED 624 644 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 932 AA; 101085 MW; C42D8084C0C14176 CRC64; MLWLLWTMCL VATCQAAHLS TCRPPTSSTL EHARSHRDVC LLTDYCKESG SLAATSESVL NADATTSTSV TAPSDETASM SLNAASEAPS ASAEPASITS RPPEDKATPA ADEITRSKTD DTVSPTPTAV PPFVPFDEWR AKNEPKPRTV KTTASSLGAS AVIQESRTVS SSRSEATSDS LTSAEQATGT AQAIITPSAI VPIDRTSSEP GAKSEEYAMA IASDLVHPLP DLPTTDPLSP LRKLASRTNY ASFDCAAAVH RSSKQTKGAS SILREAKDRY MLTPCKTPNK FVIIELCEAI EIDTLVLANY EFFSSMFKLF SVKVTDRLAV STGDTDSEQW INLGTFRARN VRGLQIFKPH KLKGFYRYLR IDFVTHYGTE HFCPVSLLRV YGLTEIASWR EEEMRMQAAA NALDASAEDI EYAVPTNEEL WNGPARVAPR TVDVNIVATG SSAADQATHT SDNVSRTSVE HPSAIPTSVL LASEVRQSAS DKMASQRDDS TPSSASVTKQ PVSNLPPANQ TRSEDVANAD GAQGEARNVT RAAAVSSSQP AHTQPGESIY GTIMKRLSAL EHNASLSMRY IEEQGKMLRE AFARVETRFE YYDAARAKQD AMLRKIMLDV DLHRAKVEED RIALASQLNH LSQEVIFERR VGLAQLVVLM AIVVFVGLTR GSPNVPLLQM LPEHPTRHWR GRSQLFADIH DHKGEHARER TSSDRKHLAQ GLHRVASTGR NSRKAHLNAP TSRKTYSVSG NINPRLTPVL ATGDLPGPRR RKRDEERRSE SPMLSGPTGR RRPASIHEST AKSGNGLEMT THRPLPIRMP PSPEPSSDAN TWQSASGTDD ELSASASADE SSDAGSATAE AFTPYLLETR SSLDDSDEDN GASDGTRLMT DANGLGFKLH NASSATLVSP ANLVSQSNPP SSPRDQMEET LQ // ID G7JKT4_MEDTR Unreviewed; 526 AA. AC G7JKT4; A0A0C3WUD2; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 01-OCT-2014, sequence version 2. DT 11-NOV-2015, entry version 21. DE SubName: Full=Galactose-binding protein {ECO:0000313|EMBL:AES87600.2}; DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:AES87600}; GN OrderedLocusNames=MTR_4g030960 {ECO:0000313|EMBL:AES87600.2}; OS Medicago truncatula (Barrel medic) (Medicago tribuloides). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; OC Trifolieae; Medicago. OX NCBI_TaxID=3880 {ECO:0000313|EMBL:AES87600.2, ECO:0000313|Proteomes:UP000002051}; RN [1] {ECO:0000313|EMBL:AES87600.2, ECO:0000313|EnsemblPlants:AES87600, ECO:0000313|Proteomes:UP000002051} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=A17 {ECO:0000313|EMBL:AES87600.2}, and RC cv. Jemalong A17 {ECO:0000313|EnsemblPlants:AES87600, RC ECO:0000313|Proteomes:UP000002051}; RX PubMed=22089132; DOI=10.1038/nature10625; RA Young N.D., Debelle F., Oldroyd G.E.D., Geurts R., Cannon S.B., RA Udvardi M.K., Benedito V.A., Mayer K.F.X., Gouzy J., Schoof H., RA Van de Peer Y., Proost S., Cook D.R., Meyers B.C., Spannagl M., RA Cheung F., De Mita S., Krishnakumar V., Gundlach H., Zhou S., RA Mudge J., Bharti A.K., Murray J.D., Naoumkina M.A., Rosen B., RA Silverstein K.A.T., Tang H., Rombauts S., Zhao P.X., Zhou P., RA Barbe V., Bardou P., Bechner M., Bellec A., Berger A., Berges H., RA Bidwell S., Bisseling T., Choisne N., Couloux A., Denny R., RA Deshpande S., Dai X., Doyle J.J., Dudez A.-M., Farmer A.D., RA Fouteau S., Franken C., Gibelin C., Gish J., Goldstein S., RA Gonzalez A.J., Green P.J., Hallab A., Hartog M., Hua A., RA Humphray S.J., Jeong D.-H., Jing Y., Jocker A., Kenton S.M., RA Kim D.-J., Klee K., Lai H., Lang C., Lin S., Macmil S.L., RA Magdelenat G., Matthews L., McCorrison J., Monaghan E.L., Mun J.-H., RA Najar F.Z., Nicholson C., Noirot C., O'Bleness M., Paule C.R., RA Poulain J., Prion F., Qin B., Qu C., Retzel E.F., Riddle C., RA Sallet E., Samain S., Samson N., Sanders I., Saurat O., Scarpelli C., RA Schiex T., Segurens B., Severin A.J., Sherrier D.J., Shi R., Sims S., RA Singer S.R., Sinharoy S., Sterck L., Viollet A., Wang B.-B., Wang K., RA Wang M., Wang X., Warfsmann J., Weissenbach J., White D.D., RA White J.D., Wiley G.B., Wincker P., Xing Y., Yang L., Yao Z., Ying F., RA Zhai J., Zhou L., Zuber A., Denarie J., Dixon R.A., May G.D., RA Schwartz D.C., Rogers J., Quetier F., Town C.D., Roe B.A.; RT "The Medicago genome provides insight into the evolution of rhizobial RT symbioses."; RL Nature 480:520-524(2011). RN [2] {ECO:0000313|EMBL:AES87600.2} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=A17; RX PubMed=24767513; DOI=10.1186/1471-2164-15-312; RA Tang H., Krishnakumar V., Bidwell S., Rosen B., Chan A., Zhou S., RA Gentzbittel L., Childs K.L., Yandell M., Gundlach H., Mayer K.F., RA Schwartz D.C., Town C.D.; RT "An improved genome release (version Mt4.0) for the model legume RT Medicago truncatula."; RL BMC Genomics 15:312-312(2014). RN [3] {ECO:0000313|EnsemblPlants:AES87600} RP IDENTIFICATION. RC STRAIN=cv. Jemalong A17 {ECO:0000313|EnsemblPlants:AES87600}; RG EnsemblPlants; RL Submitted (APR-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001220; AES87600.2; -; Genomic_DNA. DR RefSeq; XP_003605403.2; XM_003605355.2. DR UniGene; Mtr.21714; -. DR EnsemblPlants; AES87600; AES87600; MTR_4g030960. DR GeneID; 11425714; -. DR Proteomes; UP000002051; Chromosome 4. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002051}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002051}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 34 56 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 526 AA; 58789 MW; 738872CB0344B839 CRC64; MHRSRKALLE TRASLLHNHP IDISSSSSGK SFNFYELSLV FVLWGLLILF SLWISYTDGS EELSVGLSKW NEVNHGFCEI SDTADKYFIK EIDACFPSEA LIYSKAGDAE ANGLVNESHN GRESGAYAVP ADINKENTDS ANREDHVVEN SEYAVKHEND VKKSDILSRA VPLGLNEFKS RAISSKVKSG TGQSRSVIHR LEPGGAEYNY ASASKGAKVL GSNKEGKGAS NILSRDKDKY LRNPCSVVGK FVIMELSEET LVDTIEIANF EHHSSNLKDF EIHGSLNFPT NVWDLLGNFT ASNVRHAQRF VLKEPKWVRY LKLNLQSHYG SEFYCTLSVV EVFGVDAVER MLEDLINTQD NLLASGEGNA DKTILPHPDP AVIEHVHKKP LEGINSVPAS DISSSKHETA NIKVPDPVEE IRQQVGRMPG DTVLKILMQK VRTLDVNLFV LERYMEDLNS RYVNIFKDYS KDTGEKDIVL QKIKEDIKNL IDHQDVSAKD ASDLISWKSQ VSSQLNHLIQ DNAVLR // ID G7L6W7_MEDTR Unreviewed; 462 AA. AC G7L6W7; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Sad1/UNC-like protein {ECO:0000313|EMBL:AET02575.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:AET02575}; GN OrderedLocusNames=MTR_8g043510 {ECO:0000313|EMBL:AET02575.1}; OS Medicago truncatula (Barrel medic) (Medicago tribuloides). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; OC Trifolieae; Medicago. OX NCBI_TaxID=3880 {ECO:0000313|EMBL:AET02575.1, ECO:0000313|Proteomes:UP000002051}; RN [1] {ECO:0000313|EMBL:AET02575.1, ECO:0000313|EnsemblPlants:AET02575, ECO:0000313|Proteomes:UP000002051} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=A17 {ECO:0000313|EMBL:AET02575.1}, and RC cv. Jemalong A17 {ECO:0000313|EnsemblPlants:AET02575, RC ECO:0000313|Proteomes:UP000002051}; RX PubMed=22089132; DOI=10.1038/nature10625; RA Young N.D., Debelle F., Oldroyd G.E.D., Geurts R., Cannon S.B., RA Udvardi M.K., Benedito V.A., Mayer K.F.X., Gouzy J., Schoof H., RA Van de Peer Y., Proost S., Cook D.R., Meyers B.C., Spannagl M., RA Cheung F., De Mita S., Krishnakumar V., Gundlach H., Zhou S., RA Mudge J., Bharti A.K., Murray J.D., Naoumkina M.A., Rosen B., RA Silverstein K.A.T., Tang H., Rombauts S., Zhao P.X., Zhou P., RA Barbe V., Bardou P., Bechner M., Bellec A., Berger A., Berges H., RA Bidwell S., Bisseling T., Choisne N., Couloux A., Denny R., RA Deshpande S., Dai X., Doyle J.J., Dudez A.-M., Farmer A.D., RA Fouteau S., Franken C., Gibelin C., Gish J., Goldstein S., RA Gonzalez A.J., Green P.J., Hallab A., Hartog M., Hua A., RA Humphray S.J., Jeong D.-H., Jing Y., Jocker A., Kenton S.M., RA Kim D.-J., Klee K., Lai H., Lang C., Lin S., Macmil S.L., RA Magdelenat G., Matthews L., McCorrison J., Monaghan E.L., Mun J.-H., RA Najar F.Z., Nicholson C., Noirot C., O'Bleness M., Paule C.R., RA Poulain J., Prion F., Qin B., Qu C., Retzel E.F., Riddle C., RA Sallet E., Samain S., Samson N., Sanders I., Saurat O., Scarpelli C., RA Schiex T., Segurens B., Severin A.J., Sherrier D.J., Shi R., Sims S., RA Singer S.R., Sinharoy S., Sterck L., Viollet A., Wang B.-B., Wang K., RA Wang M., Wang X., Warfsmann J., Weissenbach J., White D.D., RA White J.D., Wiley G.B., Wincker P., Xing Y., Yang L., Yao Z., Ying F., RA Zhai J., Zhou L., Zuber A., Denarie J., Dixon R.A., May G.D., RA Schwartz D.C., Rogers J., Quetier F., Town C.D., Roe B.A.; RT "The Medicago genome provides insight into the evolution of rhizobial RT symbioses."; RL Nature 480:520-524(2011). RN [2] {ECO:0000313|EMBL:AET02575.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=A17; RX PubMed=24767513; DOI=10.1186/1471-2164-15-312; RA Tang H., Krishnakumar V., Bidwell S., Rosen B., Chan A., Zhou S., RA Gentzbittel L., Childs K.L., Yandell M., Gundlach H., Mayer K.F., RA Schwartz D.C., Town C.D.; RT "An improved genome release (version Mt4.0) for the model legume RT Medicago truncatula."; RL BMC Genomics 15:312-312(2014). RN [3] {ECO:0000313|EnsemblPlants:AET02575} RP IDENTIFICATION. RC STRAIN=cv. Jemalong A17 {ECO:0000313|EnsemblPlants:AET02575}; RG EnsemblPlants; RL Submitted (APR-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001224; AET02575.1; -; Genomic_DNA. DR RefSeq; XP_003628099.1; XM_003628051.2. DR UniGene; Mtr.4094; -. DR EnsemblPlants; AET02575; AET02575; MTR_8g043510. DR GeneID; 11410381; -. DR KEGG; mtr:MTR_8g043510; -. DR KO; K19347; -. DR OMA; MEIARHS; -. DR Proteomes; UP000002051; Chromosome 8. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002051}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002051}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 110 131 Helical. FT COILED 201 225 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 462 AA; 50447 MW; 0AA6F7B700F3C84A CRC64; MSASTVSITA ANPGTRRRPV ISTDKKTASN LELLANDVAA SPNTAGDGKN PSTAGNGRDL SHHSIRSDAI LSKDLAPATK RVAGGDSTRR VRKNGGKSEK QKWVTVARIF AKNFGLLVMV VGLVQLIRWF AVKSGDGVVV GGGFGGFSEY EDRISEMEGL LKKTAKMMQV QVDVVDKKIG NEVGGLKKEM DAKIEQKGAF LENEIKKLAN KGDKLERYLE ELKVEDLLTK EEFEKFVEGL KNVKGNGYEG GGLDEIREFA RGVVESEIEK HAADGLGRVD YALANGGAWV VRHSEAYDVQ RGNWFLLNAR NGVHHNADKM LKPSFGEPGQ CFPLKGSSGF VQIRLRAEIV PEAVTLEHVA KSVAYDRSSA PKDCRISGWL QGSNPNSVID TEKMFLLTEF TYDLEKSNAQ TFNVLNSAGY GVIDTIRFDF TSNHGSPSHT CIYRLRVHGY ESDSVSVMAI DS // ID G7MLB6_MACMU Unreviewed; 357 AA. AC G7MLB6; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 7. DE SubName: Full=Sad1/unc-84 domain-containing protein 1; GN ORFNames=EGK_13703; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CR-5; RX PubMed=22002653; DOI=10.1038/nbt.1992; RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., RA Li Q., Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., RA Huang Z., Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., RA Huang Y., Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., RA Li B., Liu X., Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., RA Li Y., Wang W., Katze M.G., Su B., Nielsen R., Yang H., Wang J., RA Wang X., Wang J.; RT "Genome sequencing and comparison of two nonhuman primate animal RT models, the cynomolgus and Chinese rhesus macaques."; RL Nat. Biotechnol. 29:1019-1023(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001255; EHH17320.1; -; Genomic_DNA. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 48 69 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 357 AA; 40382 MW; F094D44CCF33BD52 CRC64; MSGKAKARRA AMFFRGCSED ASGSTSGSTL LSEDENPDTN GVTRSWKIIL STMFTLTFLL VGLLSHQWLK ETEVPQKSRQ LYAIIAEYGS RLYKYQARLR MPKEQLELLK KESQTLENNF HKILLLIEQI DVLKALLRDM KDGTDNNHSW NTHGDPVEDP DHTEEMSNLV NYVLKKLRED QVQMADYALK SAGASIIEAG TSESYKNNKA KLYWHGISFL NHEMPPDIIL QPDVYPGNCW AFPGSQGHTL IKLATKIIPT AVTMEHISEK VSPSGNISSA PKEFSVYGIT KKCEGEEIFL GQFIYNKTGT TVQTFELQHA VSEYLLCVKL NIFSNWGHPK YTCLYRFRVH GTPGKHI // ID G7MNE2_MACMU Unreviewed; 810 AA. AC G7MNE2; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein; GN ORFNames=EGK_13393; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CR-5; RX PubMed=22002653; DOI=10.1038/nbt.1992; RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., RA Li Q., Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., RA Huang Z., Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., RA Huang Y., Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., RA Li B., Liu X., Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., RA Li Y., Wang W., Katze M.G., Su B., Nielsen R., Yang H., Wang J., RA Wang X., Wang J.; RT "Genome sequencing and comparison of two nonhuman primate animal RT models, the cynomolgus and Chinese rhesus macaques."; RL Nat. Biotechnol. 29:1019-1023(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001255; EHH17087.1; -; Genomic_DNA. DR STRING; 9544.ENSMMUP00000022451; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 284 307 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 314 333 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 408 428 {ECO:0000256|SAM:Coils}. FT COILED 453 487 {ECO:0000256|SAM:Coils}. FT COILED 500 520 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 810 AA; 90024 MW; AE1EDAE34E12E593 CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADGGASSA VSLKNRAART AKQRRSTNKS AFSINHVSRQ VTSFGVSHSG TDSLQDAVTR QPPVLDESWI REQTTVDHFW GLDDDGDLKG GNKAAIQGNG DLGAAAATAH NGFSCSNCSM LSERKDMLTA HPAAPGPVSR VYSRDRNQKR DDCKGKRHLD AYTAGTLRHI WACAGYFLLQ TLRRIGAAGR AVSRMAWSAL WLAVVAPGKA ASGVFWWLGI GWYQFVTLIS WLNVFLLTRC LRNICKLLVL LVPLLLLLAG LSLRGQGDFF SFLPVLNWAS THRTQRVDDP QDVFKPATSR LNQPLQGDNE AFPWHWMSGM EQQVTSLSGQ CHHHGENLRE LTTLLQKLQA RVDQMDNGAA GPSTSVRDAV GQPLKETDFM AFHQEHEVRI SHLEDILGKL REKSEAIQKE LEQTKQKTVS AVGEQLLPTV EHLQLELDQL KSELSSWRHM KTGCETVDAL QERVDVQVRE TVKLLFSEDQ QGGSLEQLLQ RFSSQCVSRG DLHTMLRDLE LQILRNVTHH ISVTKRLPAS EVVVSAVSEA GASGITEAQA RAIVNNALKL YSQDKTGMVD FALESGGGSI LSTRCSETYE TKTALMSLFG IPLWYFSQSP RVVIQPDIYP GNCWAFKGSQ GYLVVRLSMM IHPAAFTLEH IPKTLSPTGN ISSAPKDFAV YGLENEYQEE GQLLGQFTYD QDGESLQMFQ ALKTPDDRVF QIVELRIFSN WGHPEYTCLY RFRVHGEPVK // ID G7N4Y8_MACMU Unreviewed; 399 AA. AC G7N4Y8; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 14-OCT-2015, entry version 7. DE SubName: Full=Putative uncharacterized protein; GN ORFNames=EGK_02459; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CR-5; RX PubMed=22002653; DOI=10.1038/nbt.1992; RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., RA Li Q., Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., RA Huang Z., Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., RA Huang Y., Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., RA Li B., Liu X., Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., RA Li Y., Wang W., Katze M.G., Su B., Nielsen R., Yang H., Wang J., RA Wang X., Wang J.; RT "Genome sequencing and comparison of two nonhuman primate animal RT models, the cynomolgus and Chinese rhesus macaques."; RL Nat. Biotechnol. 29:1019-1023(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001262; EHH19742.1; -; Genomic_DNA. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 91 114 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 120 145 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 158 192 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 399 AA; 44186 MW; 94AEA61AFB1AFBF1 CRC64; MRRSSRPGSA SSSRKHTPNF FSENSSVSIT SEDSNGLRSA GPGPGDPEGR GARGPSCEPT GSPAVSEEPL DLLPTLDLRQ EMPPSRVFKS FLSLLFQVLS VLLSLAGDVL VSMYREVCSI RFLFTAVSLL SLFLAAIWLG LLYLVSPLEN EPKEMLTLSE YHERVRSQGQ QLQQLQAELD KLHKEVSTVR AANSERVAKL VFQRLNEDFV RKPDYALSSV GASIDLQKTS HDYADRNTAY FWNRFSFWNY ARPPTVILEP HVFPGNCWAF EGDQGQVVIQ LPGRVQLSDI TLQHPPPSVE HTGGANSAPR DFAVFVSADE GLQVDDETEV FLGKFTFDVE KSEIQTFHLQ NDPPAAFPKV KIQILSNWGH PRFTCLYRVR AHGVRTSEGA EGSATGGPH // ID G7N531_MACMU Unreviewed; 376 AA. AC G7N531; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 9. DE SubName: Full=Sperm-associated antigen 4-like protein; GN ORFNames=EGK_02512; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CR-5; RX PubMed=22002653; DOI=10.1038/nbt.1992; RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., RA Li Q., Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., RA Huang Z., Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., RA Huang Y., Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., RA Li B., Liu X., Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., RA Li Y., Wang W., Katze M.G., Su B., Nielsen R., Yang H., Wang J., RA Wang X., Wang J.; RT "Genome sequencing and comparison of two nonhuman primate animal RT models, the cynomolgus and Chinese rhesus macaques."; RL Nat. Biotechnol. 29:1019-1023(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001262; EHH19785.1; -; Genomic_DNA. DR STRING; 9544.ENSMMUP00000012321; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 155 175 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 376 AA; 42285 MW; AC64B03E6A025483 CRC64; MPRSSRSPGD PGAPLEDVAH NPRPRRIAQR GRNTSRMVED TSSNMNDNFL LPVRINAQAL GLTQCMLGCV SWFTCFACSL RTQAQQVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HLPSKMKVWQ DDSINGPLQS LRLYQEKVRH HSGEIQDLRG SMNLLIAKLQ EMEAMSDEQK VAQKIMKMIH GDYIEKPDFA LKSTGASIDF EHTSATYNHE KAHSYWNWIQ LWNYAQPPDV ILEPNVTPGN CWAFEGNRGQ VTIQLAQKVY LSNLTLQHIP KTISPSGSLD TAPKDFVIYG MEGSPKEEVF LGAFQFQPES IIQMFPLQNQ PARAFGAVKV KISSNWGNPA FTCLYRVRVH GSVAPPGEQA SPEPLP // ID G7P1S8_MACFA Unreviewed; 357 AA. AC G7P1S8; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Sad1/unc-84 domain-containing protein 1 {ECO:0000313|EMBL:EHH52136.1}; GN ORFNames=EGM_12525 {ECO:0000313|EMBL:EHH52136.1}; OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9541 {ECO:0000313|Proteomes:UP000009130}; RN [1] {ECO:0000313|EMBL:EHH52136.1, ECO:0000313|Proteomes:UP000009130} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH52136.1}; RX PubMed=22002653; DOI=10.1038/nbt.1992; RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., RA Li Q., Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., RA Huang Z., Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., RA Huang Y., Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., RA Li B., Liu X., Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., RA Li Y., Wang W., Katze M.G., Su B., Nielsen R., Yang H., Wang J., RA Wang X., Wang J.; RT "Genome sequencing and comparison of two nonhuman primate animal RT models, the cynomolgus and Chinese rhesus macaques."; RL Nat. Biotechnol. 29:1019-1023(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001278; EHH52136.1; -; Genomic_DNA. DR RefSeq; XP_005549683.1; XM_005549626.1. DR UniGene; Mfa.6592; -. DR GeneID; 102136181; -. DR CTD; 256979; -. DR Proteomes; UP000009130; Chromosome 3. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009130}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009130}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 48 69 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 357 AA; 40382 MW; F094D44CCF33BD52 CRC64; MSGKAKARRA AMFFRGCSED ASGSTSGSTL LSEDENPDTN GVTRSWKIIL STMFTLTFLL VGLLSHQWLK ETEVPQKSRQ LYAIIAEYGS RLYKYQARLR MPKEQLELLK KESQTLENNF HKILLLIEQI DVLKALLRDM KDGTDNNHSW NTHGDPVEDP DHTEEMSNLV NYVLKKLRED QVQMADYALK SAGASIIEAG TSESYKNNKA KLYWHGISFL NHEMPPDIIL QPDVYPGNCW AFPGSQGHTL IKLATKIIPT AVTMEHISEK VSPSGNISSA PKEFSVYGIT KKCEGEEIFL GQFIYNKTGT TVQTFELQHA VSEYLLCVKL NIFSNWGHPK YTCLYRFRVH GTPGKHI // ID G7P249_MACFA Unreviewed; 810 AA. AC G7P249; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EHH51957.1}; GN ORFNames=EGM_12301 {ECO:0000313|EMBL:EHH51957.1}; OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9541 {ECO:0000313|Proteomes:UP000009130}; RN [1] {ECO:0000313|EMBL:EHH51957.1, ECO:0000313|Proteomes:UP000009130} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH51957.1}; RX PubMed=22002653; DOI=10.1038/nbt.1992; RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., RA Li Q., Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., RA Huang Z., Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., RA Huang Y., Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., RA Li B., Liu X., Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., RA Li Y., Wang W., Katze M.G., Su B., Nielsen R., Yang H., Wang J., RA Wang X., Wang J.; RT "Genome sequencing and comparison of two nonhuman primate animal RT models, the cynomolgus and Chinese rhesus macaques."; RL Nat. Biotechnol. 29:1019-1023(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001278; EHH51957.1; -; Genomic_DNA. DR Proteomes; UP000009130; Chromosome 3. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009130}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009130}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 284 307 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 314 333 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 408 428 {ECO:0000256|SAM:Coils}. FT COILED 453 487 {ECO:0000256|SAM:Coils}. FT COILED 500 520 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 810 AA; 89962 MW; 94AEB2E3BD331202 CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADGGASSA VSLKNRAART AKQRRSTNKS AFSINHVSRQ VTSFGVSHSG TDSLQDAVTR QPPVLDESWI REQTTVDHFW GLDDDGDLKG GNKAAIQGNG DLGAAAATAH NGFSCSNCSM LSERKDVLTA HPAAPGPVSR VYSRDRNQKR DDCKGKRHLD AYTAGTLRHI WACAGYFLLQ TLRRIGAAGR AVSRTAWSAL WLAVVAPGKA ASGVFWWLGI GWYQFVTLIS WLNVFLLTRC LRNICKLLVL LVPLLLLLAG LSLRGQGDFF SFLPVLNWAS THRTQRVDDP QDVFKPATSR LNQPLQGDNE AFPWHWMSGM EQQVTSLSGQ CHHHGENLRE LTTLLQKLQA RVDQMDNGAA GPSTSVRDAV GQPLKETDFM AFHQEHEVRI SHLEDILGKL REKSEAIQKE LEQTKQKTVS AVGEQLLPTV EHLQLELDQL KSELSSWRHM KTGCETVDAL QERVDVQVRE TVKLLFSEDQ QGGSLEQLLQ RFSSQCVSRG DLHTMLRDLE LQILRNVTHH ISVTKRLPAS EVVVSAVSEA GASGITEAQA RAIVNNALKL YSQDKTGMVD FALESGGGSI LSTRCSETYE TKTALMSLFG IPLWYFSQSP RVVIQPDIYP GNCWAFKGSQ GYLVVRLSMM IHPAAFTLEH IPKTLSPTGN ISSAPKDFAV YGLENEYQEE GQLLGQFTYD QDGESLQMFQ ALKTPDDRVF QIVELRIFSN WGHPEYTCLY RFRVHGEPVK // ID G7PGI1_MACFA Unreviewed; 338 AA. AC G7PGI1; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 14-OCT-2015, entry version 6. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EHH65387.1}; DE Flags: Fragment; GN ORFNames=EGM_02136 {ECO:0000313|EMBL:EHH65387.1}; OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9541 {ECO:0000313|Proteomes:UP000009130}; RN [1] {ECO:0000313|EMBL:EHH65387.1, ECO:0000313|Proteomes:UP000009130} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH65387.1}; RX PubMed=22002653; DOI=10.1038/nbt.1992; RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., RA Li Q., Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., RA Huang Z., Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., RA Huang Y., Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., RA Li B., Liu X., Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., RA Li Y., Wang W., Katze M.G., Su B., Nielsen R., Yang H., Wang J., RA Wang X., Wang J.; RT "Genome sequencing and comparison of two nonhuman primate animal RT models, the cynomolgus and Chinese rhesus macaques."; RL Nat. Biotechnol. 29:1019-1023(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001285; EHH65387.1; -; Genomic_DNA. DR Proteomes; UP000009130; Chromosome 10. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009130}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009130}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 35 55 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 67 89 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 102 136 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EHH65387.1}. FT NON_TER 338 338 {ECO:0000313|EMBL:EHH65387.1}. SQ SEQUENCE 338 AA; 37909 MW; 1D67B5B62D0CFEA3 CRC64; AEPTGSPAVS EEPLDLLPTL DLRQEMPPSR VFKSFLSLLF QVLSVLLSLA GDVLVSMYRE VCSIRFLFTA VSLLSLFLAA IWLGLLYLVS PLENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL QKTSHDYADR NTAYFWNRFS FWNYARPPTV ILEPHVFPGN CWAFEGDQGQ VVIQLPGRVQ LSDITLQHPP PSVEHTGGAN SAPRDFAVFG LQVDDETEVF LGKFTFDVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP RFTCLYRVRA HGVRTSEGAE GSATGGPH // ID G7PGM2_MACFA Unreviewed; 376 AA. AC G7PGM2; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 14-OCT-2015, entry version 6. DE SubName: Full=Sperm-associated antigen 4-like protein {ECO:0000313|EMBL:EHH65428.1}; GN ORFNames=EGM_02186 {ECO:0000313|EMBL:EHH65428.1}; OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9541 {ECO:0000313|Proteomes:UP000009130}; RN [1] {ECO:0000313|EMBL:EHH65428.1, ECO:0000313|Proteomes:UP000009130} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH65428.1}; RX PubMed=22002653; DOI=10.1038/nbt.1992; RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., RA Li Q., Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., RA Huang Z., Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., RA Huang Y., Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., RA Li B., Liu X., Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., RA Li Y., Wang W., Katze M.G., Su B., Nielsen R., Yang H., Wang J., RA Wang X., Wang J.; RT "Genome sequencing and comparison of two nonhuman primate animal RT models, the cynomolgus and Chinese rhesus macaques."; RL Nat. Biotechnol. 29:1019-1023(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM001285; EHH65428.1; -; Genomic_DNA. DR Proteomes; UP000009130; Chromosome 10. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009130}; KW Reference proteome {ECO:0000313|Proteomes:UP000009130}. FT COILED 155 175 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 376 AA; 42285 MW; AC64B03E6A025483 CRC64; MPRSSRSPGD PGAPLEDVAH NPRPRRIAQR GRNTSRMVED TSSNMNDNFL LPVRINAQAL GLTQCMLGCV SWFTCFACSL RTQAQQVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HLPSKMKVWQ DDSINGPLQS LRLYQEKVRH HSGEIQDLRG SMNLLIAKLQ EMEAMSDEQK VAQKIMKMIH GDYIEKPDFA LKSTGASIDF EHTSATYNHE KAHSYWNWIQ LWNYAQPPDV ILEPNVTPGN CWAFEGNRGQ VTIQLAQKVY LSNLTLQHIP KTISPSGSLD TAPKDFVIYG MEGSPKEEVF LGAFQFQPES IIQMFPLQNQ PARAFGAVKV KISSNWGNPA FTCLYRVRVH GSVAPPGEQA SPEPLP // ID G7XSF7_ASPKW Unreviewed; 833 AA. AC G7XSF7; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 14-OCT-2015, entry version 9. DE SubName: Full=Sad1/UNC domain protein {ECO:0000313|EMBL:GAA89842.1}; GN ORFNames=AKAW_07956 {ECO:0000313|EMBL:GAA89842.1}; OS Aspergillus kawachii (strain NBRC 4308) (White koji mold) (Aspergillus OS awamori var. kawachi). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=1033177 {ECO:0000313|EMBL:GAA89842.1, ECO:0000313|Proteomes:UP000006812}; RN [1] {ECO:0000313|Proteomes:UP000006812} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=NBRC 4308 {ECO:0000313|Proteomes:UP000006812}; RX PubMed=22045919; DOI=10.1128/EC.05224-11; RA Futagami T., Mori K., Yamashita A., Wada S., Kajiwara Y., RA Takashita H., Omori T., Takegawa K., Tashiro K., Kuhara S., Goto M.; RT "Genome sequence of the white koji mold Aspergillus kawachii IFO 4308, RT used for brewing the Japanese distilled spirit shochu."; RL Eukaryot. Cell 10:1586-1587(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DF126470; GAA89842.1; -; Genomic_DNA. DR InParanoid; G7XSF7; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000006812; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006812}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006812}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 833 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003505635. FT TRANSMEM 678 699 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 833 AA; 89977 MW; 0FA21038297D1EDD CRC64; MTTSWTATQW IPWMTLISTW IDGTTADPSQ TICPAPSWQV AEAEFIQWPQ CPETRWEADP ATSLSAAQQQ QPLLKAPEET LSVVSVSMAA SSESSARPDH ELDTESPLDN ANFLSFEDWK KQNLAKVGQS AENVGARGAA AAAAGKEGRR RPTGINNALD SLGEDVEIEL DFGGFGADTP EAAKPTSWGA RVSTGGTTEE GGSVGDVESL AQGVPPAGGV SRSKDAGTTC KERFNYASFD CAATVLKTNP ECTGSSSVLI ENKDSYMLNE CRANNKFLIL ELCDDILVDT VVLANYEFFS SIFHTFRVSV SDRYPAKLDQ WRELGIYEAR NTREVQAFAV ENPLIWARYV KIEFLTHYGN EFFCPLSLIR VHGTTMLEEY KHDGEVSRTD DVVADEEPEP APAAAEIETI PTVDVAPAAG TAEQKVEEQT PETCPNPGPV VDETVMTQLL GVLETCSIHD SPAAGAEGTQ TSLNRPPATD AAPPKGDDTA SVGNEAPAKE AGEQKVTVSP NVDSAPSSAT TAGPETTSQG EADSRSTGFT KEEQSVAAET TRSTATQPPS ANPTTQESFF KSVNKRLQML ESNSSLSLLY IEEQSRILRD AFNKVEKRQL AKTSTFLEQL NVTVLHELKQ FREQYDNVWK SVALEFEHQR IQYHKEVHSL SAQLGVLADE LVFQKRVAVI QSIMILFCFG LVLFSRGAVS SYIELPSMQN MVSRSYSLRS SSPPFGSPSV SPTSSGRRAG GHRRNLSEDS QEDGPISPTL AYSPPTPVSD MMSSSEEAEN HRGNSLALPE VAPPVRSRSS PPDLKGGEES IEESSSSGES PVSHGRNAAV AEA // ID G7XWZ5_ASPKW Unreviewed; 708 AA. AC G7XWZ5; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:GAA91454.1}; GN ORFNames=AKAW_09568 {ECO:0000313|EMBL:GAA91454.1}; OS Aspergillus kawachii (strain NBRC 4308) (White koji mold) (Aspergillus OS awamori var. kawachi). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=1033177 {ECO:0000313|EMBL:GAA91454.1, ECO:0000313|Proteomes:UP000006812}; RN [1] {ECO:0000313|Proteomes:UP000006812} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=NBRC 4308 {ECO:0000313|Proteomes:UP000006812}; RX PubMed=22045919; DOI=10.1128/EC.05224-11; RA Futagami T., Mori K., Yamashita A., Wada S., Kajiwara Y., RA Takashita H., Omori T., Takegawa K., Tashiro K., Kuhara S., Goto M.; RT "Genome sequence of the white koji mold Aspergillus kawachii IFO 4308, RT used for brewing the Japanese distilled spirit shochu."; RL Eukaryot. Cell 10:1586-1587(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DF126480; GAA91454.1; -; Genomic_DNA. DR InParanoid; G7XWZ5; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000006812; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006812}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006812}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 394 413 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 708 AA; 78029 MW; C2859D5078881BCF CRC64; MPTRRGATRR AGSTPRSDIG SASTYFQSKL GPEARTQALP NLPTKQSFAY GSAETPILPR ELKIQPHMDL TEMADAIDKG IEDAKDRQLK EKETTQDKSR RQKSPSISRS PVRRSRREPT PDELQLLDNL REATKSPTPI RGNYSNNDQS TATPTPPIPH TLSTASSPTQ PLPVPRYPHV PADNLYPSPM GRFGPQLHDG PPLGSSPLPD NSSLYSFTVE RAINSDELTR TLSDGKNIKA PPRRFSGLAF NEPIHEEEEP DSRLLKTKSR SPSLQPSLEE FTIEPSPEPE PPLEPESVHE PSPEPTPTPE PEPMPELEHM PEAMPEPEVI REKSPAAQFT APTKTLIPDT YARNPSREPS VDGNQQTITQ RGQSWSWVGS LSAQLPSAGT VARILAGIAL AAVAVYLVAF GGIPSLSRPT QYIPMDESNM LAVSSLTDQM SRIGAQVSSL AKDMRTVKWD VNAVQSEVRS SPTPVMPPSR GTDFGPPTEQ KTNFLSIGLG VLVIPGLTSP TVGHKLNPLQ WAYVKLWRGS YYRPASPPLA ALAPWEDYGD CWCSTPRDGM SQIGIDLGQK IVPEEVAIEH MPKTATLKPE NAPREMELWA QYVLVQKGAS RHARTHASIH KPIMNALRSA WPTEDPTAYS DDPLLGPSYY RVGKFTYDIH GSHHVQLFQL DAVIDSPELR VDRVVFRATS NWGGNHTCIY RLKLFGHV // ID G7YBZ6_CLOSI Unreviewed; 1032 AA. AC G7YBZ6; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 14-OCT-2015, entry version 7. DE SubName: Full=SUN domain-containing protein 2 {ECO:0000313|EMBL:GAA50480.1}; GN ORFNames=CLF_104596 {ECO:0000313|EMBL:GAA50480.1}; OS Clonorchis sinensis (Chinese liver fluke). OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis. OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA50480.1, ECO:0000313|Proteomes:UP000008909}; RN [1] {ECO:0000313|Proteomes:UP000008909} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., RA Lv X., Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., RA Hu X., Xu J., Yu X.; RT "The draft genome of the carcinogenic human liver fluke Clonorchis RT sinensis."; RL Submitted (SEP-2011) to the EMBL/GenBank/DDBJ databases. RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=Henan; RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W., RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L., RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., RA Xu J., Wu Z., Yu X.; RT "The genome and transcriptome sequence of Clonorchis sinensis provide RT insights into the carcinogenic liver fluke."; RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DF143050; GAA50480.1; -; Genomic_DNA. DR InParanoid; G7YBZ6; -. DR Proteomes; UP000008909; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008909}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008909}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 302 322 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 395 418 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 529 556 {ECO:0000256|SAM:Coils}. FT COILED 664 684 {ECO:0000256|SAM:Coils}. FT COILED 703 723 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1032 AA; 115569 MW; 95F0BC916CB658BB CRC64; MVTKRLQANC QARRTDQQPP DQQPSSRAGQ HIATLGSRTL NDSDLEMDSV VTDNRLSAVI PGLGGNREEE LEDDELSSRA SVASSRSSTR VSRVRCIRVQ GEWCVHSECI ASTPGNRRMI LVAHQRNTNN TADVNHIIDG TETETPTGRH RHKQDQQQLV SRRSTRASLR PSITTTKTIT SQNRIESNHH TDRSNQLASA SSTYVYTFVW SSGTSSAGNK AQKWLARHIF GLESSAPVST SNHFVPSDTE ESDTIATGRS LARGKYVHTP SATRSRRLYH TDSSSTTWFF RPLVQGAEKL KGVTFSSIAF VLAGIFIVYD AFSSCVRNLS SAVWTFWFHP SSPLPHRSRD KLTVHPRLTQ FDNHVTPEIS SAEESDFFSR TFRASYTWFT RAGTICVRVL GCLCFLIPLL LLLAFLFAPV AVNDDEPPPV WPSFLSDADC KKALLESRPS DATVWQLARW RFRCLYYLYF LTPEFPSNTT TTSESSSVWQ KFKTWLWPST PVVPPPGILP TDLPSYVDGK LLAQLEAFRD FVNDRLDGLT NTIRRTEERV SEMEQRSDTQ FNDLNIHINN LKQHFNEHTT ALDSWHVQLQ ALQALAGRLD SSEPSKVTLN EYDRLFNAVI NAANQTIVKE LNLLRIELDE KSSSMWNRHN SSFTQLSLLI SRLREELNDR LLQTEHRLGE LRSQLNSGIH AQVVNHTSLV IQMEEIRLTL GNLSERLAQV RSANEGLDGL FMKLQEATRD CSERQLHQVE DCKQAAIQQA EIAVNAFSER FTNQISVLVK ESLLHWLNDV SVEEALDSKL SELVKQSTRE ALDRTIRETA ISGYAPSVDT SVAQTDELKS RVFVQKLIDA ALERFAADRV GMADFALESA GGSIVGTRCT RTYTERAALF TIFGIPLARL SNSARTILQP SNNPGDCWAF HGSTGQAVIR LSAPIIITSV TLEHLPRVLS PNQRVDSAPK DFVIKGLSSE TDEGVVIGTF VYDINGPAIQ TFPIEGQSSS WHLIELGILS NHGHPLYTCV YRLRVHGRTP DP // ID G8BCX7_CANPC Unreviewed; 572 AA. AC G8BCX7; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 14-OCT-2015, entry version 10. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CCE43122.1}; GN ORFNames=CPAR2_207650 {ECO:0000313|EMBL:CCE43122.1}; OS Candida parapsilosis (strain CDC 317 / ATCC MYA-4646) (Yeast) (Monilia OS parapsilosis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Candida/Lodderomyces clade; Candida. OX NCBI_TaxID=578454 {ECO:0000313|Proteomes:UP000005221}; RN [1] {ECO:0000313|Proteomes:UP000005221} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CDC 317 / ATCC MYA-4646 {ECO:0000313|Proteomes:UP000005221}; RX PubMed=19465905; DOI=10.1038/nature08064; RA Butler G., Rasmussen M.D., Lin M.F., Santos M.A.S., Sakthikumar S., RA Munro C.A., Rheinbay E., Grabherr M., Forche A., Reedy J.L., RA Agrafioti I., Arnaud M.B., Bates S., Brown A.J.P., Brunke S., RA Costanzo M.C., Fitzpatrick D.A., de Groot P.W.J., Harris D., RA Hoyer L.L., Hube B., Klis F.M., Kodira C., Lennard N., Logue M.E., RA Martin R., Neiman A.M., Nikolaou E., Quail M.A., Quinn J., RA Santos M.C., Schmitzberger F.F., Sherlock G., Shah P., RA Silverstein K.A.T., Skrzypek M.S., Soll D., Staggs R., Stansfield I., RA Stumpf M.P.H., Sudbery P.E., Srikantha T., Zeng Q., Berman J., RA Berriman M., Heitman J., Gow N.A.R., Lorenz M.C., Birren B.W., RA Kellis M., Cuomo C.A.; RT "Evolution of pathogenicity and sexual reproduction in eight Candida RT genomes."; RL Nature 459:657-662(2009). RN [2] {ECO:0000313|Proteomes:UP000005221} RP GENOME REANNOTATION. RC STRAIN=CDC 317 / ATCC MYA-4646 {ECO:0000313|Proteomes:UP000005221}; RX PubMed=22192698; DOI=10.1186/1471-2164-12-628; RA Guida A., Lindstaedt C., Maguire S.L., Ding C., Higgins D.G., RA Corton N.J., Berriman M., Butler G.; RT "Using RNA-seq to determine the transcriptional landscape and the RT hypoxic response of the pathogenic yeast Candida parapsilosis."; RL BMC Genomics 12:628-628(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE605206; CCE43122.1; -; Genomic_DNA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000005221; Chromosome 2. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005221}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005221}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 572 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003508253. FT TRANSMEM 490 507 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 300 327 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 572 AA; 65149 MW; 5A7E1E6B9F075182 CRC64; MLSTWLSIAL ICLYGCVCQA RKDDEKDPFT NTSSQEYSPI NDSLPVFLST EIPSFANAYD GRQDLYLQSP VIQPHSSNES HADSVIDDCH FMSFEEWKKQ KIETNTSLSN TSRNNTEPLK PSANVSTTNA TAFSVVAVTE QEGTVYKNKF NFASADCAAT IVKTNSQAKG APAILKENKD SYLLNECSVK NKFIVVELCQ DILVSQVVLG NYEFFSSMYK DIRVSVSDRF PTQNWRELGQ FTAQNIRDIQ TFNIDNPLIW ARYLKLEILS HYGNEFYCPI SVIRVHGKTM IDEFKEDEEV SSLQNQKDAT IKELDSKDNE LESLINDTFN ECSVVLPHLL LNEFLKDFNT THNNHCLPSD DTNSSSSITS TTATITTTQE SIYKNIIKRL TLLESNATLS LLYIEEQSKL LSIAFQNLEK RQTANFNNLL RSVNSTLLNQ LSIFKESYHE MYSQYSELFH LQDHKYKHFI SESNVRIKNI SSDLTFQKRL SFFNSVIIIC LLVYVILTRE VNVEVQTRSS GRRDKSLFGT RQQGRSSSLD GSRKNRLSIS DPILAPTKSM HDDPHPKHRK ST // ID G8BNR0_TETPH Unreviewed; 718 AA. AC G8BNR0; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCE61538.1}; GN Name=TPHA0A04650 {ECO:0000313|EMBL:CCE61538.1}; GN OrderedLocusNames=TPHA_0A04650 {ECO:0000313|EMBL:CCE61538.1}; OS Tetrapisispora phaffii (strain ATCC 24235 / CBS 4417 / NBRC 1672 / OS NRRL Y-8282 / UCD 70-5) (Yeast) (Fabospora phaffii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Tetrapisispora. OX NCBI_TaxID=1071381 {ECO:0000313|EMBL:CCE61538.1, ECO:0000313|Proteomes:UP000005666}; RN [1] {ECO:0000313|EMBL:CCE61538.1, ECO:0000313|Proteomes:UP000005666} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 24235 / CBS 4417 / NBRC 1672 / NRRL Y-8282 / UCD 70-5 RC {ECO:0000313|Proteomes:UP000005666}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE612856; CCE61538.1; -; Genomic_DNA. DR RefSeq; XP_003683972.1; XM_003683924.1. DR STRING; 1071381.XP_003683972.1; -. DR EnsemblFungi; CCE61538; CCE61538; TPHA_0A04650. DR GeneID; 11532775; -. DR KEGG; tpf:TPHA_0A04650; -. DR eggNOG; ENOG410IE9E; Eukaryota. DR eggNOG; ENOG4111CR2; LUCA. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000005666; Chromosome 1. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005666}; KW Reference proteome {ECO:0000313|Proteomes:UP000005666}. FT COILED 255 275 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 718 AA; 83137 MW; 268A6DCCF96FA776 CRC64; MSSDKKVSDD RKSRIDEDRY LPLNSSLHEH KDLLIEKMNR SNFGRDYEGH GEFSREGRSV VDRGMIMHDS EVDDNNDTDY DNFKKKILSN GSLNSNYISV DDDDWIDDLG SYSETREGLS NDEANESFIE DGDDDDYDYD YDYDDDEDYE ILTNSNGILD NHRDSKGGSK GLFRTWALVT IVFVVFSTLL SKVVLPTSIS SASNIPSGNV QRQINHLYNM VNTQNDKIQT DLDKTIKIVI TQFEKKIKSI LPKNILDFQS QLELLNTKVN KMNENQRTEK IINNQMNTEF SMKNLTIIQD LLTNQLNNTL PDKIPVIINN STSMLMIPEI HNYLKDIISG IITTLESNST NILQGNMTTN LGMQQEGFLP DLNGYIKEIL KDELQYIDKD YFVQELNRKL QLNKHEIFEE FTEKLSDLKI SSNSHYHYND MTSDKYSDIL LRKMINRIYN ANQHQWEDDL DFATFAQGTR LLNHLTSKTW KKGTQNTPLE LLSNTINNSV YWQCDSTKDC RWAIRFSEPI YLFRLSYLHG RLKNNVHMMN SAPKKISIYV KLANGNDLIK TFKKVAKTYK QGQSLNEDSS YIKIGQYDYD LTDPKVKQDF LLPSWYIKLR PLVHSMVFEI NENYGNKDFT SLKKFLIKAV TKQDLEITTN NEFPYKLGNV PEYNADNYVI ASSDTGSSHH LMNQQLRNVQ DDRNDGFKNL ADNENSKIPS FGQDELDI // ID G8BQ41_TETPH Unreviewed; 725 AA. AC G8BQ41; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCE62122.1}; GN Name=TPHA0B04530 {ECO:0000313|EMBL:CCE62122.1}; GN OrderedLocusNames=TPHA_0B04530 {ECO:0000313|EMBL:CCE62122.1}; OS Tetrapisispora phaffii (strain ATCC 24235 / CBS 4417 / NBRC 1672 / OS NRRL Y-8282 / UCD 70-5) (Yeast) (Fabospora phaffii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Tetrapisispora. OX NCBI_TaxID=1071381 {ECO:0000313|EMBL:CCE62122.1, ECO:0000313|Proteomes:UP000005666}; RN [1] {ECO:0000313|EMBL:CCE62122.1, ECO:0000313|Proteomes:UP000005666} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 24235 / CBS 4417 / NBRC 1672 / NRRL Y-8282 / UCD 70-5 RC {ECO:0000313|Proteomes:UP000005666}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE612857; CCE62122.1; -; Genomic_DNA. DR RefSeq; XP_003684556.1; XM_003684508.1. DR STRING; 1071381.XP_003684556.1; -. DR EnsemblFungi; CCE62122; CCE62122; TPHA_0B04530. DR GeneID; 11534831; -. DR KEGG; tpf:TPHA_0B04530; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000005666; Chromosome 2. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005666}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005666}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 725 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003508468. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 725 AA; 83621 MW; 118CCD163A2DFB46 CRC64; MAYFSYTLAI FLVIFNEFFY VQGSNASASL IENNVNSSCC IEKNHRQEIF KIEAEILSCV ASLTKSAGIE ASTKYALSST VFMPVLSTVA VDIALELTDD KSEKEASNIT NTIDLKNNTF KPFNEWKQQK LYNTLSDSKI RAQRTRSPVN QDLEEQDLIG GEMEIDLGFF TEKEIDNELP EVKVYKNKFN YASLDCAATI METNSDASGA NSILIENKDT YLLNPCSVAS KYVIIELCQD ILVEQIAMAN FEFFSSTFKD VRFSVSDRYP ITKDEWKVIG NFKAQNSRNI QNFMIENPKI WARYLKIETI SFFDNEYYCP ISVVRVHGKT MMDEYKMSNI DKHDREGIEY KYSEAVEDDE IEMVCDPINE ISGHNFTNVM FMNNRLNFTE SALQPLTFDD YLKEVNRTFC PPRPYKNSSA TSSLSSSGST EDSIFKNIMK RLTSLESNTN LTFSYIEEQS RLLSESFEAS ERSHVKKLTF IIDAFNITVQ RNIQSLSEFA GQLKEQSLRI LEEQKLNNDF FSTQNARKME RMEKEIAYQR RVIYFILFVL SALVLATILN KEFYFDDYNQ PDNWMESKPK PTENKSYSKN DFMMKIDAQK KEQFTLSPYS SGSAYSDIEY LQNSFDNGNE SDSKLNTNSN FINEEISNLK FTKSETFEKS NSSDFNNDTE NHLTFSKSDA FENSFDRKYN NQHGNTLTLS KSDTYGDNFR SDDQGSINSD QEWEY // ID G8JMC9_ERECY Unreviewed; 796 AA. AC G8JMC9; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AET37338.1}; GN OrderedLocusNames=Ecym_1081 {ECO:0000313|EMBL:AET37338.1}; OS Eremothecium cymbalariae (strain CBS 270.75 / DBVPG 7215 / KCTC 17166 OS / NRRL Y-17582) (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Eremothecium. OX NCBI_TaxID=931890 {ECO:0000313|EMBL:AET37338.1, ECO:0000313|Proteomes:UP000006790}; RN [1] {ECO:0000313|Proteomes:UP000006790} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 270.75 / DBVPG 7215 / KCTC 17166 / NRRL Y-17582 RC {ECO:0000313|Proteomes:UP000006790}; RX DOI=10.1534/g3.111.001032; RA Wendland J., Walther A.; RT "Genome evolution in the Eremothecium clade of the Saccharomyces RT complex revealed by comparative genomics."; RL G3 (Bethesda) 1:539-548(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP002497; AET37338.1; -; Genomic_DNA. DR RefSeq; XP_003644155.1; XM_003644107.1. DR STRING; 931890.XP_003644155.1; -. DR EnsemblFungi; AET37338; AET37338; Ecym_1081. DR GeneID; 11469559; -. DR KEGG; erc:Ecym_1081; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; G8JMC9; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000006790; Chromosome 1. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006790}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006790}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 796 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003510852. FT TRANSMEM 636 653 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 796 AA; 89880 MW; 6AD757C94D00B0F5 CRC64; MKVYNITNGV YLLLVLTLSL ALVQVERNGE IFMSTFSESK SSDIVFTTSS GHLHMPSDLV VTEDVILSER SLDPVQLRSC STVSLRQGTP VISVKGSNNI MENTSIPKLY NTRTNLNLLS TDMSFYVPKL SSVETAGIQT GDALLVSNLS RSEISSGNVV LPSTSVILNE SQLTSLKVES PRKPEGHVSD DPEVHNETEF LPFAEWKKLK LDEKQAESHT ESQLKTRTMI DYNKVETLGD DMEVDLGIFT SGDDDEPEGK LYQQKFNYAS LDCAASIVKT NSEAHGASSI LYENKDKYLL NPCSASIKFV VIELCQDILV ENIEIANYEF FSSTFKKLKF SVSDRFPVPK NGWKVLGEFI AENSRDLQTF SIPNPMIWAK YLRVDILSHY GDEFYCPISV VRAHGKTMMD EFKMTQKDNE DEDLVVLENE QELTVNSSTM DELLLCNVST YHEFFDRYNN PLIFPSSDNI TLELFWDDIS SQCLAALPAL KFEEFVKDFN NETANPSKQT KAIDFTPNMP SLSIEESIFK NIMKRISSLE ANATLSVLYI EEQSRLLSKS FSSLEKTHAK KLDSLVSAFN ETMMGNLEKL SSFARQLRES SVKILEEQKL ANDQFTTSTS QRLDMMEKDA TFQKRMMYLI LFAFSAMLVY VLLTREAYID DYMEDDGWYL DSPPLKKAKD KLMRKAALAV STPTIFKNIV DDVEQGKFER TRSLSSSTTS DDSQYLIDND DDDDSLYVGR GRSYSKLLAA DENEVDIDEI MSISSENSDL SNGMDRLLDV DTTALSSRES MDKENK // ID G8JXC5_ERECY Unreviewed; 619 AA. AC G8JXC5; DT 25-JAN-2012, integrated into UniProtKB/TrEMBL. DT 25-JAN-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AET41499.1}; GN OrderedLocusNames=Ecym_8214 {ECO:0000313|EMBL:AET41499.1}; OS Eremothecium cymbalariae (strain CBS 270.75 / DBVPG 7215 / KCTC 17166 OS / NRRL Y-17582) (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Eremothecium. OX NCBI_TaxID=931890 {ECO:0000313|EMBL:AET41499.1, ECO:0000313|Proteomes:UP000006790}; RN [1] {ECO:0000313|Proteomes:UP000006790} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CBS 270.75 / DBVPG 7215 / KCTC 17166 / NRRL Y-17582 RC {ECO:0000313|Proteomes:UP000006790}; RX DOI=10.1534/g3.111.001032; RA Wendland J., Walther A.; RT "Genome evolution in the Eremothecium clade of the Saccharomyces RT complex revealed by comparative genomics."; RL G3 (Bethesda) 1:539-548(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP002504; AET41499.1; -; Genomic_DNA. DR RefSeq; XP_003648316.1; XM_003648268.1. DR STRING; 931890.XP_003648316.1; -. DR EnsemblFungi; AET41499; AET41499; Ecym_8214. DR GeneID; 11471514; -. DR KEGG; erc:Ecym_8214; -. DR eggNOG; ENOG410IE9E; Eukaryota. DR eggNOG; ENOG4111CR2; LUCA. DR InParanoid; G8JXC5; -. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000006790; Chromosome 8. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006790}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006790}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 146 164 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 227 247 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 619 AA; 71785 MW; 5B326654FD7439E3 CRC64; MSGDSRQRLY NQSIHNAYNA LLSQKRSSGS GDFHHRSSAV VSNNNSEHEV GDMSEFSDGD QLKDEAMIQE DDSDERLGDG EDEEYSRFKK SLVNGHNDDD YRWLDDDEDT DYTDEADQSF IQDDEGDTYI YEESHYEGER IGRERWLKWL LGGLLCGLLV WYMYGGVSPG ANPDLYKRVN QLQTQLNHLS HDTEAQRKTF KSDLDSNIKM IIQQFEKNIK RILPRDVSKL ESNIAHLEAE MQQINQMLLM ENVTQWQRDL VTKLNEKLPD KIPIVMENDT NMLLIPELHE YLSILISQIV QQSVTSLPQF KFNMNHYIKE VLNNNFQFVD KQYFLNHLQE SLLSTRDEIK QELESRLSHL TSPELVPQQV SSVLLKKLVH KIYNSNLHQW ESDLNIATFA QGTKLLNHLC SKTIHGPVGP MDLLQDCNSC TSTYWKCNSE GCSWAIRLEE PMYLTKIGYL HGKFSHNLQI MTAAPKKISV FVKLYEGARG IPSNVQRWSK NNHFVSLGHW EYDIFDNKIR QDFELPLWFI QGKFLIRSIG FEINSNHGNP EYTALRKFVV NAVTPKDLKL MDQFPADWKI QVPDYSVMID DQERIRASRI AQLHNSNEVP SFGDDEVDT // ID G8Y7H8_PICSO Unreviewed; 695 AA. AC G8Y7H8; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Piso0_004105 protein {ECO:0000313|EMBL:CCE84558.1}; GN Name=Piso0_004105 {ECO:0000313|EMBL:CCE84558.1}; GN ORFNames=GNLVRS01_PISO0K09558g {ECO:0000313|EMBL:CCE83527.1}, GN GNLVRS01_PISO0L09559g {ECO:0000313|EMBL:CCE84558.1}; OS Pichia sorbitophila (strain ATCC MYA-4447 / BCRC 22081 / CBS 7064 / OS NBRC 10061 / NRRL Y-12695) (Hybrid yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; Millerozyma. OX NCBI_TaxID=559304 {ECO:0000313|EMBL:CCE84558.1, ECO:0000313|Proteomes:UP000005222}; RN [1] {ECO:0000313|EMBL:CCE84558.1} RP NUCLEOTIDE SEQUENCE. RA Genoscope - CEA; RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000005222} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC 10061 / NRRL RC Y-12695 {ECO:0000313|Proteomes:UP000005222}; RX DOI=10.1534/g3.111.001032; RA Leh Louis V., Despons L., Friedrich A., Martin T., Durrens P., RA Casaregola S., Neuveglise C., Fairhead C., Marck C., Cruz J.A., RA Straub M.-L., Kugler V., Sacerdot C., Uzunov Z., Thierry A., Weiss S., RA Bleykasten C., De Montigny J., Jacques N., Jung P., Lemaire M., RA Mallet S., Morel G., Richard G.-F., Sarkar A., Savel G., RA Schacherer J., Seret M.-L., Talla E., Samson G., Jubin C., Poulain J., RA Vacherie B., Barbe V., Pelletier E., Sherman D.J., Westhof E., RA Weissenbach J., Baret P.V., Wincker P., Gaillardin C., Dujon B., RA Souciet J.-L.; RT "Pichia sorbitophila, an interspecies yeast hybrid reveals early steps RT of genome resolution following polyploidization."; RL G3 (Bethesda) 2:299-311(2012). RN [3] {ECO:0000313|EMBL:CCE84558.1} RP NUCLEOTIDE SEQUENCE. RX DOI=10.1534/g3.111.000745; RA Leh Louis V., Despons L., Friedrich A., Martin T., Durrens P., RA Casaregola S., Neuveglise C., Fairhead C., Marck C., Cruz J.A., RA Straub M.L., Kugler V., Sacerdot C., Uzunov Z., Thierry A., Weiss S., RA Bleykasten C., De Montigny J., Jacques N., Jung P., Lemaire M., RA Mallet S., Morel G., Richard G.F., Sarkar A., Savel G., Schacherer J., RA Seret M.L., Talla E., Samson G., Jubin C., Poulain J., Vacherie B., RA Barbe V., Pelletier E., Sherman D.J., Westhof E., Weissenbach J., RA Baret P.V., Wincker P., Gaillardin C., Dujon B., Souciet J.L.; RT "Pichia sorbitophila, an interspecies yeast hybrid reveals early steps RT of genome resolution following polyploidization."; RL Genetics 0:0-0(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FO082049; CCE83527.1; -; Genomic_DNA. DR EMBL; FO082048; CCE84558.1; -; Genomic_DNA. DR RefSeq; XP_004196877.1; XM_004196829.1. DR RefSeq; XP_004197908.1; XM_004197860.1. DR EnsemblFungi; CCE83527; CCE83527; GNLVRS01_PISO0K09558g. DR EnsemblFungi; CCE84558; CCE84558; GNLVRS01_PISO0L09559g. DR GeneID; 14519907; -. DR GeneID; 14520970; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000005222; Chromosome K. DR Proteomes; UP000005222; Chromosome L. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005222}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005222}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 23 FT CHAIN 24 695 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003518995. FT TRANSMEM 621 638 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 695 AA; 77371 MW; 98A58810E69E49C8 CRC64; MFSMRRLSYS WIIVLSLTSG IISDQTNNSD NGASSVTNIC SAENDNNVFL SAKCVMAPFA MTSLNQAVSN NPNPSTSLSS STSSCFRGST DFSSNDESYT NKPTSISDQF SINSLTDSGV IDSTSTDSSN MQSVSSVFTG TSHNGPDSLT TNNALTTQSE AKGDVVSGDP STDSEVSEAS VSPQNSGFSG SNISHVPHNN GSNDSSDCCH FLSFEEWKKK KVEKSPYKNE SRLLLQEEST LSKSNINNIG ENRSSEVVEH PEEDQGTIYK NRFNFASVDC AATIVKTDSN AKGASAILAE NKNSYLLNRC SSPQKFVVIE LCQDILIDTV VMGNLEFFSS NFRKVRFSVS DRFPVSSPTG WKVLGEFEAE NVRDVQSFKI NDSLMWAKYL KLEMLSHYGD EFYCPISIVR VHGKTMMEEF KMTEEQESLA NGIHTRQDVD DAYELSNLSN FSSLAMEGYE CKISLPYIGI NEFLEGMNGT NDVCDASFSF PAEHETNSDA IQKSNTKTSQ ESIYKNIMKR LSLLESNATL SLLYIEEQSK LLSEAFSNLE RRQTSSFEAL VDTYNSSLSN NLLHYKNFFL EAQNEITKFL TIQDYKHQKS LKDANEQVTS LSGQLTFQRR LVILNTVIIL CILVYVALTR DIYIEDSASS TKKSHSTGNY YFSVGSLKKR KYDVHYQEKK LRGIFPSSRN KYVQS // ID G8ZQM7_TORDC Unreviewed; 514 AA. AC G8ZQM7; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCE91514.1}; GN Name=TDEL0C06250 {ECO:0000313|EMBL:CCE91514.1}; GN ORFNames=TDEL_0C06250 {ECO:0000313|EMBL:CCE91514.1}; OS Torulaspora delbrueckii (strain ATCC 10662 / CBS 1146 / NBRC 0425 / OS NCYC 2629 / NRRL Y-866) (Yeast) (Candida colliculosa). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Torulaspora. OX NCBI_TaxID=1076872 {ECO:0000313|Proteomes:UP000005627}; RN [1] {ECO:0000313|EMBL:CCE91514.1, ECO:0000313|Proteomes:UP000005627} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 10662 / CBS 1146 / NBRC 0425 / NCYC 2629 / NRRL Y-866 RC {ECO:0000313|Proteomes:UP000005627}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE616744; CCE91514.1; -; Genomic_DNA. DR RefSeq; XP_003680725.1; XM_003680677.1. DR EnsemblFungi; CCE91514; CCE91514; TDEL_0C06250. DR GeneID; 11501932; -. DR KEGG; tdl:TDEL_0C06250; -. DR InParanoid; G8ZQM7; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000005627; Chromosome 3. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005627}; KW Reference proteome {ECO:0000313|Proteomes:UP000005627}. FT COILED 363 383 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 514 AA; 58920 MW; 240857EE741F84D0 CRC64; MQVIEPSRCA SVNEEEKNEL NNTFISFDEW RVAKLSEEIP DKRIRSKESI ETTEGETIGE DLEVEIGFFA SMEDSNVSDH EEMDGEAHKH RFNFASLDCA ATIVKTNPEA SGASSILNEN KDKYLLNPCS VPNKFVITEL CQDILVEEVA IANYEFFSST FNKLRFSVSD RYPVAKNGWT VLGEFNAENS RDLQVFSIQN PQIWARYLRI EILSHHGNEY YCPISLLRVH GKTMMDEFKM DHSKVTVAKT ESQDIPQEEI PRDVTSEGVE DTDQCDMWPS IDEGNITSPP RNDFMQRCKC RLKPLKFEEF LRDLNETFCP APSHQNIPTS SVSAVSTEES IFKNIMKRLT SLEANTSLSV LYMEEQSKLL SNSFDNLERA QANKFDNLVS MFNDTLMDNL NVLRVFANQL KDQSIRIIEE QKLHNDQFTT QYVLRTEQLE KELKVQRNLV YTIIFVSLLI LFYQLHSREP ELDEFIKKEQ TFTSFESMED SDPSTCASFP VSPISISGSS VASD // ID G8ZVR4_TORDC Unreviewed; 656 AA. AC G8ZVR4; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 14-OCT-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCE92708.1}; GN Name=TDEL0E04650 {ECO:0000313|EMBL:CCE92708.1}; GN ORFNames=TDEL_0E04650 {ECO:0000313|EMBL:CCE92708.1}; OS Torulaspora delbrueckii (strain ATCC 10662 / CBS 1146 / NBRC 0425 / OS NCYC 2629 / NRRL Y-866) (Yeast) (Candida colliculosa). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Torulaspora. OX NCBI_TaxID=1076872 {ECO:0000313|Proteomes:UP000005627}; RN [1] {ECO:0000313|EMBL:CCE92708.1, ECO:0000313|Proteomes:UP000005627} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 10662 / CBS 1146 / NBRC 0425 / NCYC 2629 / NRRL Y-866 RC {ECO:0000313|Proteomes:UP000005627}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE616746; CCE92708.1; -; Genomic_DNA. DR RefSeq; XP_003681919.1; XM_003681871.1. DR EnsemblFungi; CCE92708; CCE92708; TDEL_0E04650. DR GeneID; 11500829; -. DR KEGG; tdl:TDEL_0E04650; -. DR InParanoid; G8ZVR4; -. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000005627; Chromosome 5. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005627}; KW Reference proteome {ECO:0000313|Proteomes:UP000005627}. FT COILED 221 248 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 656 AA; 75364 MW; B803FA18C362ADA9 CRC64; MSGFKETDSI HNKSLKSAYA DLLLEKVGNG AKSVVYDGDD ESCEEGELGM IEENEDELDS DYYGKFKRSI LKEGRDDESD KVSEMSADDD RWLDDNSTLD EDYTDEADRS FYGEEKEEYP ADETFHYEEE PQPGLTSKKW FMIGLSFIMV LFLGPMINGL LSSGGTSLTQ AGMPAIQKQI NHIYSELNTR EQRSKSDFDK TIKVVISQFE KKIKELLPSN VMKLQNQLES LSNKVNNLSF SLSQWENKQT PIFSIDNVTE WQSRLVEELN AQLPQQIPVV IDNDTSMLVL PELGKYMAQI ASSLVQQSEE PRVDHFLKYD LNSYVKEILS NEFQYVDRSF FLNELNRRMQ LNNHEIWQEM NDRVEQLKHE NSSPTGSVPQ QYSNILLKKL VNQIYNANQH QWEEDLDFAT FAQGTKLLNH LTSRTWAQGN AIGPVELLQD AKYSSTTYWQ CAGDRDCTWA IRFKDPVYIT RLSYLHGRFN KNLHMMNSAP KMISVYVKLA KSDKNIEVQK LIKLAKSFKQ GQPLSKDNQH IRIGQYDYSL TDNKVRQALP LPAWFIQLKP LVRSIVFEVN ENYGNKRFTS LRKFIINAVT PEDLQIMETN TFPFISNEPP EYATPVSDSK SLQIDEQLLR RQSSADDVAH TLTGAIPSFG QDEVDS // ID G9KRW6_MUSPF Unreviewed; 610 AA. AC G9KRW6; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Unc-84-like protein A {ECO:0000313|EMBL:AES07645.1}; DE Flags: Fragment; OS Mustela putorius furo (European domestic ferret) (Mustela furo). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Mustelidae; OC Mustelinae; Mustela. OX NCBI_TaxID=9669 {ECO:0000313|EMBL:AES07645.1}; RN [1] {ECO:0000313|EMBL:AES07645.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Lungs {ECO:0000313|EMBL:AES07645.1}; RX PubMed=23236062; DOI=10.1128/JVI.02476-12; RA Leon A.J., Banner D., Xu L., Ran L., Peng Z., Yi K., Chen C., Xu F., RA Huang J., Zhao Z., Lin Z., Huang S.H., Fang Y., Kelvin A.A., RA Ross T.M., Farooqui A., Kelvin D.J.; RT "Sequencing, annotation, and characterization of the influenza ferret RT infectome."; RL J. Virol. 87:1957-1966(2013). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JP019047; AES07645.1; -; mRNA. DR STRING; 9669.ENSMPUP00000017775; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 49 69 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 75 96 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 108 127 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 193 220 {ECO:0000256|SAM:Coils}. FT COILED 255 289 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:AES07645.1}. FT NON_TER 610 610 {ECO:0000313|EMBL:AES07645.1}. SQ SEQUENCE 610 AA; 68686 MW; 954A1C68777F4AB2 CRC64; DYKRKELLET HTAVRSQSSS PKRVAGAIWH IFSYAGHLLV QTLRRIGGWG WSVSKTLLSV LWLAVVAPGK AASGIFWWLG IGWYQFVTLI SWLNVFLLTR CLRNICKFLI LLIPLLLLLG AGLSLYGQGD LLSGLPVFNW TRAYGAWWVG SPESTFTPDA SHLHRPLEEG DQAYHWHRMS EVEREMTLLS GQCRNHDEKL RELAAVLQHL QARVDQMDGD SEETLSLVQR VVGQHLKEIG ADRLSGSQSD AVSVRQEQEL RLSNLEDLLG KLTEKSEAIQ KELEQTKLRT ASGAEEEQRL LSVVTHLELE LGRLKSELSS WQHLKSSCEE VHSIHGKVDA QVRETIRLLF SDGEQGRSPD WLLQALSSRF VSKDDLQALL RDLELQILKN ITHYISVTKQ VPDSETVVSA AKEAGVSGIT EAQARVIVNN ALKLYSQDKT GMVDFALESG GGSILSTRCS ETYETKTALI SLFGIPLWYF SQSPRVVIQP DIHPGNCWAF RGSQGYLVVR LSMKIRPTTF TLEHIPKTLS PTGNITSAPK DFAVYGLENE YQEEGQLLGQ FVYDQEGESL QMFHVLKRPD GAFQIVELRI LSNWGHPEYT CLYRFRVHGE // ID G9KRW7_MUSPF Unreviewed; 741 AA. AC G9KRW7; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Unc-84-like protein B {ECO:0000313|EMBL:AES07646.1}; DE Flags: Fragment; OS Mustela putorius furo (European domestic ferret) (Mustela furo). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Mustelidae; OC Mustelinae; Mustela. OX NCBI_TaxID=9669 {ECO:0000313|EMBL:AES07646.1}; RN [1] {ECO:0000313|EMBL:AES07646.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Lungs {ECO:0000313|EMBL:AES07646.1}; RX PubMed=23236062; DOI=10.1128/JVI.02476-12; RA Leon A.J., Banner D., Xu L., Ran L., Peng Z., Yi K., Chen C., Xu F., RA Huang J., Zhao Z., Lin Z., Huang S.H., Fang Y., Kelvin A.A., RA Ross T.M., Farooqui A., Kelvin D.J.; RT "Sequencing, annotation, and characterization of the influenza ferret RT infectome."; RL J. Virol. 87:1957-1966(2013). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JP019048; AES07646.1; -; mRNA. DR STRING; 9669.ENSMPUP00000013084; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 239 260 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 298 318 {ECO:0000256|SAM:Coils}. FT COILED 399 426 {ECO:0000256|SAM:Coils}. FT COILED 429 456 {ECO:0000256|SAM:Coils}. FT COILED 496 530 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:AES07646.1}. FT NON_TER 741 741 {ECO:0000313|EMBL:AES07646.1}. SQ SEQUENCE 741 AA; 82953 MW; C76649A27E53193D CRC64; ARSSRGSRVS SGGGSPAYLA MSRRSQRLSR YSQGDDDGGS SSGGSSVMGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDDHTSYY SESVVRESYF GSPRAASLAR SSILDDHLHS EPYWGEDLRV RRRRGTESSK LNGLAENRSS EDFLGSSSGY SSEDDFAGYS ETDHHSSGSR LRNAVSWAAS CLWTLVTSPG RLFGLLYWWV GTTWYRLTTA ASLLDVFVLT RRFSSVKTFL WFLLLLLLTT GLTYGAWYFY PYGLQTFHPA LVSWWASKSS GRQQDVWEPR DSSHFQAEQR ILSRVHSLER RLEALAAEFS SNWQKEAMRL ERLELRQGAA GGGGHVGLSQ EDTLELLEGL VSRREAALKE DFRRDTAAQI QEELVTLRAE HHRDSEDLLK KIVQASQESE ARLQQLKSEW QRMTQEAFRE NSAKELGRLE GQLAALRQEL AALSLKQSSV ADQVGLLPQQ LQAVRDDVES QFPAWVSQYL LRGGGTRAGL LQREEMQAQL QELERKILAH VAEMQGKSAR EAAASLGLTL QKEGVIGVTE EQVQRIVNQA LKRYSEDRIG MVDYALESGG ASVISTRCSE TYETKTALLS LFGIPLWYHS QSPRVILQPD VHPGNCWAFQ GPQGFAVVRL SARIRPTAVT LEHVPKSLSP NSTISSAPKD FSIFGFDEDL QHEGTLLGQF TYDQDGEPIQ TFYFQDPKMA TYQVVELRIS TNWGHPEYTC IYRFRVHGEP A // ID G9L068_MUSPF Unreviewed; 315 AA. AC G9L068; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AES10547.1}; DE Flags: Fragment; OS Mustela putorius furo (European domestic ferret) (Mustela furo). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Carnivora; Caniformia; Mustelidae; OC Mustelinae; Mustela. OX NCBI_TaxID=9669 {ECO:0000313|EMBL:AES10547.1}; RN [1] {ECO:0000313|EMBL:AES10547.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Lungs {ECO:0000313|EMBL:AES10547.1}; RX PubMed=23236062; DOI=10.1128/JVI.02476-12; RA Leon A.J., Banner D., Xu L., Ran L., Peng Z., Yi K., Chen C., Xu F., RA Huang J., Zhao Z., Lin Z., Huang S.H., Fang Y., Kelvin A.A., RA Ross T.M., Farooqui A., Kelvin D.J.; RT "Sequencing, annotation, and characterization of the influenza ferret RT infectome."; RL J. Virol. 87:1957-1966(2013). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JP021949; AES10547.1; -; mRNA. DR STRING; 9669.ENSMPUP00000015533; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; FT NON_TER 1 1 {ECO:0000313|EMBL:AES10547.1}. FT NON_TER 315 315 {ECO:0000313|EMBL:AES10547.1}. SQ SEQUENCE 315 AA; 35014 MW; 4DDF8EB74DA01F41 CRC64; SFPLDEQMYA KYVKMFIKYI KVELVSHFGS EHFCPLSLIR VFGTSMVEEY EEIADSQYQS ERQELYDEDY DYPLDYNTGE DKSSKNLLGS ATNAILNMVN IAANILGAKT EDLTEGNKSI SENATATAAS KMPDLAPVST PVPSPEYVTT EGHIQDTELT SPDTPKESPI VQLVQEEEEE AGPSTVTLLG SGEQEDESSP WFESETQIFC SELTTICCIS SFTEYIYKWC SVRVALYRQR SRTTVSKEKD HLVLPQPPLP LPAESVDVSV LQPPSGDLDS KRKEKEAETI VLGDLSSMHQ GDLINQSADA IELEP // ID G9MX95_HYPVG Unreviewed; 582 AA. AC G9MX95; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHK21027.1}; DE Flags: Fragment; GN ORFNames=TRIVIDRAFT_171029 {ECO:0000313|EMBL:EHK21027.1}; OS Hypocrea virens (strain Gv29-8 / FGSC 10586) (Gliocladium virens) OS (Trichoderma virens). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Hypocreaceae; OC Trichoderma. OX NCBI_TaxID=413071 {ECO:0000313|EMBL:EHK21027.1, ECO:0000313|Proteomes:UP000007115}; RN [1] {ECO:0000313|EMBL:EHK21027.1, ECO:0000313|Proteomes:UP000007115} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Gv29-8 {ECO:0000313|EMBL:EHK21027.1, RC ECO:0000313|Proteomes:UP000007115}; RX PubMed=21501500; DOI=10.1186/gb-2011-12-4-r40; RA Kubicek C.P., Herrera-Estrella A., Seidl-Seiboth V., Martinez D.A., RA Druzhinina I.S., Thon M., Zeilinger S., Casas-Flores S., Horwitz B.A., RA Mukherjee P.K., Mukherjee M., Kredics L., Alcaraz L.D., Aerts A., RA Antal Z., Atanasova L., Cervantes-Badillo M.G., Challacombe J., RA Chertkov O., McCluskey K., Coulpier F., Deshpande N., von Doehren H., RA Ebbole D.J., Esquivel-Naranjo E.U., Fekete E., Flipphi M., Glaser F., RA Gomez-Rodriguez E.Y., Gruber S., Han C., Henrissat B., Hermosa R., RA Hernandez-Onate M., Karaffa L., Kosti I., Le Crom S., Lindquist E., RA Lucas S., Luebeck M., Luebeck P.S., Margeot A., Metz B., Misra M., RA Nevalainen H., Omann M., Packer N., Perrone G., Uresti-Rivera E.E., RA Salamov A., Schmoll M., Seiboth B., Shapiro H., Sukno S., RA Tamayo-Ramos J.A., Tisch D., Wiest A., Wilkinson H.H., Zhang M., RA Coutinho P.M., Kenerley C.M., Monte E., Baker S.E., Grigoriev I.V.; RT "Comparative genome sequence analysis underscores mycoparasitism as RT the ancestral life style of Trichoderma."; RL Genome Biol. 12:R40.1-R40.15(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHK21027.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABDF02000076; EHK21027.1; -; Genomic_DNA. DR EnsemblFungi; EHK21027; EHK21027; TRIVIDRAFT_171029. DR InParanoid; G9MX95; -. DR OMA; YVIVELS; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007115; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007115}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007115}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 544 561 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 510 537 {ECO:0000256|SAM:Coils}. FT NON_TER 582 582 {ECO:0000313|EMBL:EHK21027.1}. SQ SEQUENCE 582 AA; 64076 MW; 69A6780B9750941E CRC64; MSFEDWKEMM LRETGQDPQD LHLRNSSGRQ KGDRSSPDME EGLGEEGEIS LTFDDYGDGD GTRPATYTDS ANSNKGDDVD EALVSKEGKA PIHRSKDAGQ TCKERFSYSS FDAGATILKA GPQAKNAKAI LVENKDSYML LECAAQNKYV IVELSDDILI DTIVIANFEF FSSMVRHFRV SVSDRYPVKM EKWREIGIFE AANSRDIQAF LVQNPQIWAK YIRLEFLTHY GNEYYCPVSL LRVHGSRMLD SWKDSETGRE DEAHEEEEVE EEILSIPTAT VMPSPAAVET APETRKDEET ILAPTSTSLL PLDKGPFYHL TATCAASPTL AADGKSGDLG FATASEGVKR QIKDGTLESN YASADSPRSN ATQARDPPTT PPLKQTDSSK PAAENAQIPA ESVDAAGSGS GTGSGTGKPR STSSASAATP TVQGSFFNSI TKRLQQVESN LTLSLKYVED QSRLMQDALQ KTEQKQVSKL TRFLGDLNHT VLAEMRNVRE QYEQIWQSTV LALESQREQS ERDIVALSTR LNLLADEVVF QKRMAIVQAI LLLSCLFLVI FSRGVPIPYL AALQEQAAGI AY // ID G9NEF0_HYPAI Unreviewed; 776 AA. AC G9NEF0; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 14-OCT-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHK51056.1}; GN ORFNames=TRIATDRAFT_210501 {ECO:0000313|EMBL:EHK51056.1}; OS Hypocrea atroviridis (strain ATCC 20476 / IMI 206040) (Trichoderma OS atroviride). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Hypocreaceae; OC Trichoderma. OX NCBI_TaxID=452589 {ECO:0000313|EMBL:EHK51056.1, ECO:0000313|Proteomes:UP000005426}; RN [1] {ECO:0000313|EMBL:EHK51056.1, ECO:0000313|Proteomes:UP000005426} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 20476 / IMI 206040 {ECO:0000313|Proteomes:UP000005426}; RX PubMed=21501500; DOI=10.1186/gb-2011-12-4-r40; RA Kubicek C.P., Herrera-Estrella A., Seidl-Seiboth V., Martinez D.A., RA Druzhinina I.S., Thon M., Zeilinger S., Casas-Flores S., Horwitz B.A., RA Mukherjee P.K., Mukherjee M., Kredics L., Alcaraz L.D., Aerts A., RA Antal Z., Atanasova L., Cervantes-Badillo M.G., Challacombe J., RA Chertkov O., McCluskey K., Coulpier F., Deshpande N., von Doehren H., RA Ebbole D.J., Esquivel-Naranjo E.U., Fekete E., Flipphi M., Glaser F., RA Gomez-Rodriguez E.Y., Gruber S., Han C., Henrissat B., Hermosa R., RA Hernandez-Onate M., Karaffa L., Kosti I., Le Crom S., Lindquist E., RA Lucas S., Luebeck M., Luebeck P.S., Margeot A., Metz B., Misra M., RA Nevalainen H., Omann M., Packer N., Perrone G., Uresti-Rivera E.E., RA Salamov A., Schmoll M., Seiboth B., Shapiro H., Sukno S., RA Tamayo-Ramos J.A., Tisch D., Wiest A., Wilkinson H.H., Zhang M., RA Coutinho P.M., Kenerley C.M., Monte E., Baker S.E., Grigoriev I.V.; RT "Comparative genome sequence analysis underscores mycoparasitism as RT the ancestral life style of Trichoderma."; RL Genome Biol. 12:R40.1-R40.15(2011). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHK51056.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABDG02000011; EHK51056.1; -; Genomic_DNA. DR EnsemblFungi; EHK51056; EHK51056; TRIATDRAFT_210501. DR OMA; YVIVELS; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000005426; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005426}; KW Reference proteome {ECO:0000313|Proteomes:UP000005426}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 776 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003524428. FT COILED 573 600 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 776 AA; 85331 MW; CA31B5C43625D699 CRC64; MQLLSCAWHG GFVLALLSSG LAWATDQGRP AEKAASPTTS TPGQCEVRTI NYITHTLPSL CFKSSWTDSL PTPTKLPATG DADNTTQHVT RPPVEEQAPP SATEQTDTFA NNSAGDTFAT PFMSFDDWKK MMLEQTGQDP QNLHLRNSSG RQKADRSSPD LDDVGLGEEG EISLVFDDYG EGEGPKAGPP TELANANSGD DADDTFTSKD GKTPIHRSKD AGKTCKERFS YSSFDAGATI LKAGPQAKNA KAILVENKDS YMLLECAAQN KYVIVELSDD ILIDTIVIAN FEFFSSMVRH FRVSVSDRYP VKMDKWRTLG IFEAANSRDI QAFLVENPEN PQIWGKYVRL EFLTHYGNEY YCPVSLLRVH GSTMLDSWRD SETSREDDIH EEDEEDLPVS TVATQPAAAG PRPKRPTKDG VVEVDSASQD SSKSDATQAR DASSAPPVKQ TDPSKPTTDN NAQTPVESSD AAGSSSSSGT TKSRTSSASS ATPTVQGSFY NSITKRLQQV ESNLTLSLQY VEDQSRIMQD ALRRTEQQQV TKLTRFLGDL NHTVLAEMRN VREQYEQIWQ STVLALESQR EQSERDIVAL STRLNLLADE VVFQKRMAIV QAILLLSCLF LVIFSRGVPI PYLAALQEQA GGIAFPSSPP YPGQGPHDIY KSDPGIPPDA QTLPDNVPVV SVSSFEAAPN VPSKRKPHSL SETTEPHEPI CEDGEEEYLR RHPLPSPPAE DKYQYVHYLD QEPRFHSLHH SSPALLNTPR KPLPSLPEHL ASSQDS // ID H0EIX3_GLAL7 Unreviewed; 629 AA. AC H0EIX3; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Putative Spindle pole body-associated protein sad1 {ECO:0000313|EMBL:EHL01604.1}; GN ORFNames=M7I_2492 {ECO:0000313|EMBL:EHL01604.1}; OS Glarea lozoyensis (strain ATCC 74030 / MF5533). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Leotiomycetes; OC Helotiales; Helotiaceae; Glarea. OX NCBI_TaxID=1104152 {ECO:0000313|EMBL:EHL01604.1, ECO:0000313|Proteomes:UP000005446}; RN [1] {ECO:0000313|EMBL:EHL01604.1, ECO:0000313|Proteomes:UP000005446} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 74030 / MF5533 {ECO:0000313|Proteomes:UP000005446}; RX PubMed=22302591; DOI=10.1128/EC.05302-11; RA Youssar L., Gruening B.A., Erxleben A., Guenther S., Huettel W.; RT "Genome sequence of the fungus Glarea lozoyensis: the first genome RT sequence of a species from the Helotiaceae family."; RL Eukaryot. Cell 11:250-250(2012). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHL01604.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AGUE01000049; EHL01604.1; -; Genomic_DNA. DR EnsemblFungi; EHL01604; EHL01604; M7I_2492. DR InParanoid; H0EIX3; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000005446; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR GO; GO:0003677; F:DNA binding; IEA:InterPro. DR InterPro; IPR017956; AT_hook_DNA-bd_motif. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00384; AT_hook; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005446}; KW Reference proteome {ECO:0000313|Proteomes:UP000005446}. FT COILED 338 372 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 629 AA; 70557 MW; B067C1C5EB9E634B CRC64; MSVRRTTPYG KDPLFLATTR AGRHTGQVIE SVLGDLLEPV REDATNASQA ARAARNSPAV RSTTGRPRGR PSKQPAQSSV DPDRTFGEES NLFGAAGFDT SSVNQSSDQE DYDDLANERK PLSNVYKPLR GPLQQPQPPR AVVNQQTPKP IATQHRPVTP KTPILQKKTS PANNNLNVPR PVKPSWEWRS TFDSVHQYLK DWLLQNLDYK YSYLKRKTEI DEKTISRIEK SLPDFLVVRK NDRGKMEIPL EFWGALRDLI LSDGDLLPSS APPTVVSTGE GVSMKEFEKK ADQLWQKYIK ENQAKITSWS STEFEEKFPH LFKKHILASK SEIVDMIRHS WKDNNEAVKQ ELSTLSKKLD KARDQIIQLQ DTPSSMSSDK LKAMISNHIN NILPIAQLEA LISANVKGNV NWGLSRVNHW SHGTGAVINI LMTSPNYAFP SMNQWTHQKL FRWFIGNPVP TPNPPEAALT KWDEHGECWC SPAKHDNGFG PTIGVISGSD IFPDQVVVEH ISPSASLEPG AAPREMELYA YIENFDTYDA VSGLSKEMFG EPDSKLKYHY VKVAEWTYNA ELGSNSAQAF NVAIDLKQFS APTNKLVVRA KNNWGGDSVE YTCLYRIRVH GEIAVAPKD // ID H0GNI2_SACCK Unreviewed; 587 AA. AC H0GNI2; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Slp1p {ECO:0000313|EMBL:EHN04626.1}; GN ORFNames=VIN7_4575 {ECO:0000313|EMBL:EHN04626.1}; OS Saccharomyces cerevisiae x Saccharomyces kudriavzevii (strain VIN7) OS (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=1095631 {ECO:0000313|EMBL:EHN04626.1, ECO:0000313|Proteomes:UP000009009}; RN [1] {ECO:0000313|EMBL:EHN04626.1, ECO:0000313|Proteomes:UP000009009} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=VIN7 {ECO:0000313|EMBL:EHN04626.1}; RX PubMed=22136070; DOI=10.1111/j.1567-1364.2011.00773.x; RA Borneman A.R., Desany B.A., Riches D., Affourtit J.P., Forgan A.H., RA Pretorius I.S., Egholm M., Chambers P.J.; RT "The genome sequence of the wine yeast VIN7 reveals an allotriploid RT hybrid genome with Saccharomyces cerevisiae and Saccharomyces RT kudriavzevii origins."; RL FEMS Yeast Res. 12:88-96(2012). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHN04626.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AGVY01000047; EHN04626.1; -; Genomic_DNA. DR EnsemblFungi; EHN04626; EHN04626; VIN7_4575. DR PhylomeDB; H0GNI2; -. DR Proteomes; UP000009009; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009009}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 542 559 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 587 AA; 67350 MW; E2A4A18FE6E55D40 CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGLHDTTV ITTGSTTNVQ KEHSSPLSTG SLRTHDFRQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGPERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNEWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWTILG EFEAGNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKKKISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IVQQAIR // ID H0GWW4_SACCK Unreviewed; 680 AA. AC H0GWW4; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 14-OCT-2015, entry version 10. DE SubName: Full=Mps3p {ECO:0000313|EMBL:EHN01701.1}; GN ORFNames=VIN7_8020 {ECO:0000313|EMBL:EHN01701.1}; OS Saccharomyces cerevisiae x Saccharomyces kudriavzevii (strain VIN7) OS (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=1095631 {ECO:0000313|EMBL:EHN01701.1, ECO:0000313|Proteomes:UP000009009}; RN [1] {ECO:0000313|EMBL:EHN01701.1, ECO:0000313|Proteomes:UP000009009} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=VIN7 {ECO:0000313|EMBL:EHN01701.1}; RX PubMed=22136070; DOI=10.1111/j.1567-1364.2011.00773.x; RA Borneman A.R., Desany B.A., Riches D., Affourtit J.P., Forgan A.H., RA Pretorius I.S., Egholm M., Chambers P.J.; RT "The genome sequence of the wine yeast VIN7 reveals an allotriploid RT hybrid genome with Saccharomyces cerevisiae and Saccharomyces RT kudriavzevii origins."; RL FEMS Yeast Res. 12:88-96(2012). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHN01701.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AGVY01000270; EHN01701.1; -; Genomic_DNA. DR EnsemblFungi; EHN01701; EHN01701; VIN7_8020. DR PhylomeDB; H0GWW4; -. DR Proteomes; UP000009009; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009009}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 154 172 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 680 AA; 78466 MW; 45252F1CAA4C5C24 CRC64; MNHTNEYRRE EVGAANEQFL YNKTVKSVYA DALKDKMNRE QTIGPHDIRK RSFADGSETD KCDVGKENDS TYEVFKKNLG VPIDHHDYED DDYDDDDDLY SEDDGYSDED YTDEADKSFI EESDSNDYDL EGDSEFEENW EGPDGSRGLQ WGKYIFYGVF FLVLYFLGYF LITSMKNNGD EGPKLGATSS SGKSFANLQK QVNHLYSELS NRDEKQSSEL DKTIKVIISQ FEKNIKKLLP SNLVNFENDI NSLTKQVQTI STSMSHLQRQ NYGFTVENVT QWQDQLIKQL DSHLPQEIPV VIDNSSSLLI IPELHNYLSV LISDVIESPG IITTGGDKNP WEYDLNHYVK EILSNELQYI DKDYFIREMN TRLQSNKQEI WQEIANKLES QQQQHVQQDY SKAPQQYSSI LMKRLINQIY NSNQHQWEDD LDFATYVQGT KLLNHLTSPT WKQGNGVQPI ELLTDSKQSS STYWQCENEP GCSWAIRFKT PLYLTKISYI HGRFTNNLHI MNSAPKVISL YVKLSQIKET KGLRSLAGQY GFGQPHKRDQ NYINIAKFEY RLTDTRIRQQ ISLPPWFIQL KPLIRSIVFQ VDENYGNQKF TSLRKFIING VTPQDLQIIE NNEFPVLLGD VPEYGVVQNN NKGKRKALPS TPPYASPSMI SSRFHPASSV PSFGQDELDQ // ID H0H182_SACCK Unreviewed; 588 AA. AC H0H182; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Slp1p {ECO:0000313|EMBL:EHN00174.1}; GN ORFNames=VIN7_9994 {ECO:0000313|EMBL:EHN00174.1}; OS Saccharomyces cerevisiae x Saccharomyces kudriavzevii (strain VIN7) OS (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=1095631 {ECO:0000313|EMBL:EHN00174.1, ECO:0000313|Proteomes:UP000009009}; RN [1] {ECO:0000313|EMBL:EHN00174.1, ECO:0000313|Proteomes:UP000009009} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=VIN7 {ECO:0000313|EMBL:EHN00174.1}; RX PubMed=22136070; DOI=10.1111/j.1567-1364.2011.00773.x; RA Borneman A.R., Desany B.A., Riches D., Affourtit J.P., Forgan A.H., RA Pretorius I.S., Egholm M., Chambers P.J.; RT "The genome sequence of the wine yeast VIN7 reveals an allotriploid RT hybrid genome with Saccharomyces cerevisiae and Saccharomyces RT kudriavzevii origins."; RL FEMS Yeast Res. 12:88-96(2012). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EHN00174.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AGVY01000365; EHN00174.1; -; Genomic_DNA. DR EnsemblFungi; EHN00174; EHN00174; VIN7_9994. DR PhylomeDB; H0H182; -. DR Proteomes; UP000009009; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009009}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 22 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 543 560 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 334 354 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 588 AA; 67043 MW; FCD418471E5191DA CRC64; MTNRLLIYGL ILWVSIIGSL AFDRNKSAQN DKVGQQDSTL STTEPATNIQ DEFSSLLSTG SLSSRDLKQT SKNEIRQAGI QGTSMEKEKG ALTQPVNSVN PGDSSNSFLS FDEWKKMKSK EHSSSSPERH IARVREPVDP SCYKEKECIG EELEIDLGFL TAKDEWNEGG DEKQENIEEK ESIGNVYKKQ FNYASLDCAA TIVKSNSEAI GATSILIESK DKYLLNPCSA PQQFVVIELC EDILVEEIDI ANYEFFSSTF KKFRVSVSDR IPVVKNDWTI LGEFEAENSR ELQRFQIHNP QIWASYLKIE ILSHYDDEFY CPVSLIRAYG KTMMDEFKLD QLKAQEDKEQ LIAEKNIDNL AGSNIQEECN NIETHLEAIN LNAMSNIAGA LSCTSKLIPL KFDEFFKDVN ASFCPPKQAI SSSSSVVPVI PEESIFKNIM KRLSQLETNS SLTVSYIEEQ SKLLSRSFEQ LEMVHEAKFG HLVTVFNNTM MNNLDLLDNF ANKLKDQSLR ILEEQKLEND KFTNRHLLHL ERLEKEVSFQ RRIVYASFFA FVGLTSYLLI TRELYFEDFE ESKNDCIEKP NIVQQAIR // ID H0UVN4_CAVPO Unreviewed; 435 AA. AC H0UVN4; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCPOP00000001077}; GN Name=LOC100734565 {ECO:0000313|Ensembl:ENSCPOP00000001077}; OS Cavia porcellus (Guinea pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000001077}; RN [1] {ECO:0000313|Ensembl:ENSCPOP00000001077} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000001077}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [2] {ECO:0000313|Ensembl:ENSCPOP00000001077} RP IDENTIFICATION. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000001077}; RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCPOP00000001077}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKN02012512; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10141.ENSCPOP00000001077; -. DR Ensembl; ENSCPOT00000001205; ENSCPOP00000001077; ENSCPOG00000001193. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H0UVN4; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005447; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005447}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005447}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 170 192 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 212 239 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 435 AA; 48391 MW; E3048F5871C5CEA7 CRC64; MRRSARRGSD TTPQEHRLSL YSENSDSSDS VPSRDSSGHW FPGQGPGEPE GRRDRVSSCR ELALRPRLPR GSPRAGSSRQ EAAPGNHNLE TACGAATVRG GASVEPMAAS PAVSEEQRSL LAILDLRRET PALRTTKSFL SLLFQVPRVL LLLVRDALLG VCREVCSVRF LFAASLLSVF LAALSWCLLH LLPPLENEPK AMLSPSEYQE RVRSHGQQLQ QLQAELNKLR QEVARVRAAH SERVAKLVFQ RLNEDFVQKP DYALSSIAPG ASIDLEKTSQ DYEDTNTSYF WNHFSFWSYA RPPTVILEPD VFPGNCWAFE GDQGQVVIRL AGHVQLSDIT LQHPPPRVAH TGDASSAPRD FAVFGLQVDD KTEVFLGRFT FDVKKSAIQT FHLQNDSPSA FPKVKIQILS NWGHPRFTCL YRVRAHGLRI SGGPH // ID H0UY87_CAVPO Unreviewed; 908 AA. AC H0UY87; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCPOP00000002090}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSCPOP00000002090}; OS Cavia porcellus (Guinea pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000002090}; RN [1] {ECO:0000313|Ensembl:ENSCPOP00000002090} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000002090}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [2] {ECO:0000313|Ensembl:ENSCPOP00000002090} RP IDENTIFICATION. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000002090}; RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCPOP00000002090}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKN02032235; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10141.ENSCPOP00000002090; -. DR Ensembl; ENSCPOT00000002333; ENSCPOP00000002090; ENSCPOG00000002301. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H0UY87; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005447; Unassembled WGS sequence. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005447}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005447}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 356 377 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 383 404 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 416 435 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 500 520 {ECO:0000256|SAM:Coils}. FT COILED 599 619 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 908 AA; 102215 MW; A254B089FCF15D5F CRC64; YTPPQCVPEN TGYTYALSSS YSSDALDFEI EHRLDPVFDS PRMSRRSLRL VTAAYSSGDS QAMDTHSCAS STASLRDRAS KTVKQRRSTS KQASFSVNHL SGKATSSSIS QGSSGNLQGA VPLRPPVLDE TLIREQTKVD HFWGLDDDGD LKGVNKAATQ GNGDLAADTA AHNGYTCRDC SMLSERTDML TAHPAAHGPA SKIYSRDRNL KRGVSFYLDR TLWLARYTSS SFSSFLVQLF QVVLMKLYCE SNNYKLKNFF PSSFSPPNNA FKVFNMKAHR SYCGRMTVTE LPREDPDRPS VHGELLCDDC KGKKHLETHT AARWQPSKLH RVVGAMGHLC TRAGYFLMQT LGRVRAAGWL VSKTVWSVLW LAIVAPGKVA SGVFSWLGIG WYQFVTLISW LNVFLLTRCL RNICKFLILL IPLLLLVGVG LSLWGQSDFF SFLPVLNWTD VRTVERVEDP KDTFRPGSSQ LQVLKDKEEA SWWLQENDGR QQVTSFSAQC HNHEEKLGEL TVLLQKLQAR VDQIDDGREG LSLQVKDVVG QHLQETDIMT FHHEHEVRLS KLEDIFRKLT EKSEAIHKEL EQTKLRTISG AEEQLLPRVE RLEEELSLLR SQLSDWQQLK ASCEQVGAAQ SLVDAQVKET PRLMFTEDQQ GSSLEWLIQR LSSHYVSRDD LQTLLRDLEL QVLKNITHHL VVTGQKPTSE TVVSAVSEAG ISGITEEQAR VIVNNALKLY SQDKTGMVDF ALESGGGSIL STRCSETYET KTALLSLFGI PLWYFSQSPR VVIQPDIYPG NCWAFKGSQG YLVVRLSMKI HLTMFTMEHI PKTLSPMGNI SSAPKDFAVY GLENEYQEEG QPLGQFTYDQ EGESLQMFPA LEIPDRAFQI VELRILSNWG HLEYTCLYRF RVHGQPAH // ID H0V6Q8_CAVPO Unreviewed; 727 AA. AC H0V6Q8; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCPOP00000005384}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSCPOP00000005384}; OS Cavia porcellus (Guinea pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000005384}; RN [1] {ECO:0000313|Ensembl:ENSCPOP00000005384} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000005384}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [2] {ECO:0000313|Ensembl:ENSCPOP00000005384} RP IDENTIFICATION. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000005384}; RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCPOP00000005384}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKN02031258; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10141.ENSCPOP00000005384; -. DR Ensembl; ENSCPOT00000006028; ENSCPOP00000005384; ENSCPOG00000005965. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H0V6Q8; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005447; Unassembled WGS sequence. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005447}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005447}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 174 192 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 223 244 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 414 448 {ECO:0000256|SAM:Coils}. FT COILED 488 508 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 727 AA; 81175 MW; F7D260D1717935FD CRC64; MSRRSQRLTR YSQVDDDGGS SSSGGSSVAG SQSTLFKDSP LRTLKRKSSN MKHLSPVPQL GPSSDPHTSY YSESMVRESY IGSPQAVSLA RSALLDDRLH SEPYWSEDLH VRRRRGTGGT ESSKANGLAE SKATEDFLGS SSGYSSEDDF AGYADIEQHS SGSRLGSVVS RAGSFVWTVV TFPGRLFGLL YWWLGTSWYR LTTAASLLDV FVLTRRFSSL KTFFWFLLLL LLLTGLTYGA WYFYPFGLQT LHPAVVSWWA AKDSRKQPEV WSSGDSALHL QAEQRVLSQV HILERRLEAL ATEFSSKWQR ESIRLDRLEL QQGTAGQGGG SGLSYEDTLA LLEGLVSRRE AALKEDFRRD TAARIQEELA TVRAEHHQDS EDLFKKIVQA SQESEAHIQQ FKSEWQRTMQ EAIRENSAQE LGRLETQLAG LQQDLAALTL KQSTVEDEVG LLPQKIQAVR EEVESQFPAW VGHFLLQGGG TRAGLLQREE VQAQLQELES KILAHVTKMQ GRSAQEAAAL LGQTLQKEGV VGVTEEQVHR IVKQALQRYS EDRIGMVDYA LESGGASVIS TRCSETYETK TALLSLFGIP LWYHSQSPRV ILQPDVHPGN CWAFQGPQGF AVVRLSARIR PTAVTLEHVP KALSPNSTIS SAPKDFSIFG FDEDLQQEGS LLGTFTYDQD GEPIQTFYFQ TPKMATYQVV ELRILTNWGH PEYTCIYRFR VHGEPAP // ID H0VC21_CAVPO Unreviewed; 1251 AA. AC H0VC21; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCPOP00000007468}; GN Name=SUCO {ECO:0000313|Ensembl:ENSCPOP00000007468}; OS Cavia porcellus (Guinea pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000007468}; RN [1] {ECO:0000313|Ensembl:ENSCPOP00000007468} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000007468}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [2] {ECO:0000313|Ensembl:ENSCPOP00000007468} RP IDENTIFICATION. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000007468}; RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCPOP00000007468}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKN02017800; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02017801; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02017802; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02017803; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10141.ENSCPOP00000007468; -. DR Ensembl; ENSCPOT00000008383; ENSCPOP00000007468; ENSCPOG00000008308. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; H0VC21; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000005447; Unassembled WGS sequence. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005447}; KW Reference proteome {ECO:0000313|Proteomes:UP000005447}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 28 {ECO:0000256|SAM:SignalP}. FT CHAIN 29 1251 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003541765. FT COILED 931 951 {ECO:0000256|SAM:Coils}. FT COILED 981 1001 {ECO:0000256|SAM:Coils}. FT COILED 1189 1209 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1251 AA; 139470 MW; B8A1D3EFFDF3DB99 CRC64; ISKLLNVIIL NCVKLIFILR LPSWHVCCKE SSSASAKSYY SQDDNCALEN EDVQFQKKNE RKVPVNAELS GKPSSDLPIP PEENKLKDET IVDVQQNIES QKLRPPVTET LPAVDLHEDS SSVVMSNENV ENTSSLSTSE ITPTEKLDEI EKSGTVPITK PSETEHSETN CDVGEALDPD APVQQPSFVS PPESLVGQHI ENVSSSHGKG KITKSEFESK ASASEQGGDN PKSALNASDN VKNESSDLVK PGETDATPAV NPKDPEDIPT FDEWKKKVME VDKNKCQSLH PSSNGGPHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LISHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYQSERQ ELFDEDYDYP LDYNTVEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKSLSEN ATATTAPKMP ESTPVSTPVP SPEYVTAEIR DTELSTPDTP KESPIVQLVQ EEEEEASPST VTLLGSDEQE DESSPWFESE TQIFCSELST TCCISSFSEY VYQWCSVRIA SFRQRSRTAV SEGKDDHVST RPSVLLPAES VDVSILQPPS RELDSKNVER ETETVTLDDL SRAHQGDLAN HTVELIELEP SQTPTLSQSF LLDITPEISL LPKTEGTESV KHETQHTPSQ VITQESSVDL DNETEKKSES FSSTEKLTMI YETNKVDEAT DTTVKEDVIS MQIITKVSET IVSPINTATV SDSEEGDAKM TITDTAKQIL TPLDSSLTEV KEEEQSPEDA LLRGLQRTAT DFYAELQNST DLGYTNGNLV HGSNQKESVF MRLNNRIKAL EVNMSLSGRY LEELSQRYRK QMEEMQKAFN KTIVKLQNTS RIAEEQDQRQ TEAIQLLQAQ LTNMTQLVSN LSTTVAELKR EVSDRQTYLV ISLVLCVILG LMLCMQRCRS TSQFDGDYIS KLPKSNQYPS PKSIRCFSSY DDTNLKRRIS FPLIRSKSLQ FTGNEVDPND LYIVEPLKFS PEKKKKRCKY KTEKIETIKP ADPLHPIANG DIKGRKPFTN QRDFSNMGEV YHSSYKGPPS EGSSETSSQS EESYFCGISA CTSLCNGQSQ KTKTEKRALK RRRSKVQDQG KLIKTLIQTK SGSLPSLHDI IRGNKEITVG TFGVTAVSGH I // ID H0VF96_CAVPO Unreviewed; 343 AA. AC H0VF96; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCPOP00000008732}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSCPOP00000008732}; OS Cavia porcellus (Guinea pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000008732}; RN [1] {ECO:0000313|Ensembl:ENSCPOP00000008732} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000008732}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [2] {ECO:0000313|Ensembl:ENSCPOP00000008732} RP IDENTIFICATION. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000008732}; RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCPOP00000008732}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKN02012484; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02012485; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02012486; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10141.ENSCPOP00000008732; -. DR Ensembl; ENSCPOT00000009815; ENSCPOP00000008732; ENSCPOG00000009728. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H0VF96; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005447; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005447}; KW Reference proteome {ECO:0000313|Proteomes:UP000005447}. FT COILED 127 147 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 343 AA; 39215 MW; 6F7BB6C3EFBA3A86 CRC64; MPRSSRNPVG PCVLSEDMAY NTRPRRNILN MYICRGSPPA PGQPAWFNCL ACFLRTQAQQ VLFNTCRCKL LCQKLMEKTG VLLLCALEWW MYRFLLPSTL FQDDSINSPL QSLRLYQEKV RHHSGEIQDL RGSVNLLIAK LKEMEAMSDE EKLTQKIMKM IQGDYIEKPD FALKSIGATI DFEHTSATYN HDKARSYWNW IRLWNYAQPP DVILEPNMTP GNCWAFEGDR GQVTIRLAQK VYLSNITLQH IPKTISLSGS LDTAPKDFVV YGMESSPGEE VFLGAFQFQP ENIIQMFPLQ NPQQRAFGAV KVKISSNWGN PRFTCLYRVR VHGSVAPPLK TLT // ID H0VP11_CAVPO Unreviewed; 2605 AA. AC H0VP11; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCPOP00000012213}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSCPOP00000012213}; OS Cavia porcellus (Guinea pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000012213}; RN [1] {ECO:0000313|Ensembl:ENSCPOP00000012213} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000012213}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [2] {ECO:0000313|Ensembl:ENSCPOP00000012213} RP IDENTIFICATION. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000012213}; RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCPOP00000012213}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKN02047570; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02047571; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02047572; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02047573; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02047574; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAKN02047575; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10141.ENSCPOP00000012213; -. DR Ensembl; ENSCPOT00000013701; ENSCPOP00000012213; ENSCPOG00000013566. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H0VP11; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000005447; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 2. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 5. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005447}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000005447}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1240 1260 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2605 AA; 289431 MW; D8D1A7B726277681 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVSLLI LKVIENSEDH SSLGASKMVM IFSYINLPFV FLHLTKPHAA LKVSAQLLGK CISVCVFSFD LLIIVSRIEL RTRALRCFAS LADRFTRRCD PAPLAKHGLT ELLSRMAAAG GTVSGPSSAC KPRSTTGAPT TADSKLSNQV STIVSLLSTL CRGSPVVTHD LLRSELPDSI ESALQGDERC VLDTMRLVDL LLVLLFEGRK ALPKSSAGST GRIPGLRRLD SSGERSHRQL IDCIRSKDTD ALIDAIDTGA FEVNFMDDVG QTLLNWASAF GTQEMVRGEF LCERGADVNK RSKGRRHYIM LDGFGRPQVA KDTLLRHGAN PDLRDEDGKT PLDKARERGH SEVVAILQSP GDWMCPVNKG DDKKKKDTNK DEEECNEPKG DPEMAPIYLK RLLPVFAQTF QQTMLPSIRK ASLALIRKMI HFCSEALLKE VCDSDVGHNL PTILVEITAT VLDQEDDDDG HLLALQIIRD LVDKGGDIFL DQLARLGVIS KVSTLAGPSS DDENEEESKP EKEDEPQEDA KELQQGKPYH WRDWSIIRGR DCLYIWSDAA ALELSNGSNG WFRFILDGKL ATMYSSGSPE GGSDSSESRS EFLEKLQRAR GQVKPSTSSQ PILSTPGPTK LTVGNWSLTC LKEGEIAIHN SDGQQATILK EDLPGFVFES NRGTKHSFTA ETSLGSEFVT GWTSEPEDKL YPKYYTNTYK VRTMARDLYD DHFKAVESMP RGVVVTLRNI ATQLESSWEL HTNRQCIEGE NTWRDLMKTA LENLIVLLKD ENTISPYEMC SSGLVQALLT VLNNVSLFKK TDQLVERINV FKTAFSENED DESRPAVALI RKLIAVLESI ERLPLHLYDT PGSTYNLQIL TRRLRFRLER APGETALIDR TGRMLKMEPL ATVESLEQYL LKMVAKQWYD FDRSSFVFVR KLREGQNFIF RHQHDFDENG IIYWIGTNAK TAYEWVNPAA YGLVVVTSSE GRNLPYGRLE DILSRDNSAL NCHSNDDKNA WFAIDLGLWV IPSAYTLRHA RGYGRSALRN WVFQVSKDGQ NWTSLYTHVD DCSLNEPGST ATWPLDPPKD EKQGWRHVRI KQMGKNASGQ THYLSLSGFE LYGTVNGVCE DQLGKAAKEA EANLRRQRRL VRSQVLKYMV PGARVIRGLD WKWRDQDGSP QGEGTVTGEL HNGWIDVTWD AGGSNSYRMG AEGKFDLKLA PGYDPDTVAS PKPVSSTVSG TTQSWSSLVK NNCPDKTSAA AGSSSRKGSS SSVCSVASSS DISLGSTKTE RRSEIVMEHS IVSGADVHEP IVVLSSAENV PQTEIGSSSS ASTSTLTAET GSENAERKLG PDSSVRTPGE SSAISMGIVS VSSPDVSSVS ELTNKEAASQ RPLSSSASNR LSVSSLLAAG APMSSSASVP NLSSRETSSL ESFVRRVANI ARTNATNNMN LSRSSSDNNT NTLGRNVMST ATSPLMGAQS FPNLTTPGTT STVTMSTSSV TSSSNAATAT TVLSVGQSLS NTLTTSLTST SSESDTGQEA EYSLYDFLDS CRASTLLAEL DDDEDLPEPD EEDDENEDDN QEDQEYEEVM ILRRPSLQRR AGSRSDVTHH AVTSQLPQVP AGAGSRPIGE QEEEEYETKG GRRRTWDDDY VLKRQFSALV PAFDPRPGRT NVQQTTDLEI PPPGTPHSEL LEEVECTPSP RLALTLKVTG LGTTREVELP LTNFRSTIFY YVQKLLQLSC NGNVKSDKLR RIWEPTYTII YREMKDSDKE KENGKTGCWS IEHVEQYLGT DELPKNDLIT YLQKNADAAF LRHWKLTGTN KSIRKNRNCS QLIAAYKDFC EHGTKSGLNQ GAISTLQNSD ILNLTKEQPQ AKAGNGQSSC GVEDVLQLLR ILYIVASDPY SRISQEDGDE QPQFTFPPDE FTSKKITTKI LQQIEEPLAL ASGALPDWCE QLTSKCPFLI PFETRQLYFT CTAFGASRAI VWLQNRREAT VERTRTTSSV RRDDPGEFRV GRLKHERVKV PRGESLMEWA ENVMQIHADR KSVLEVEFLG EEGTGLGPTL EFYALVAAEF QRTDLGAWLC DDNFPDDESR HVDLGGGVKP PGYYVQRSCG LFTAPFPQDS DELERITKLF HFLGIFLAKC IQDNRLVDLP ISKPFFKLMC MGDIKSNMSK LIYESRGDRD LHCTESQSEA STEEGHDSLS VGSFEEDSKS EFILDPPKPK PPAWFNGILT WEDFELVNPH RARFLKEIKD LAIKRRQILS NKGLSEDEKN TKLQELVLKN PSGSGPPLSI EDLGLNFQFC PSSRIYGFTA VDLKPSGEDE MITMDNAEEY VDLMFDFCMH TGIQKQMEAF RDGFNKVFPM EKLSSFSHEE VQMILCGNQS PSWAAEDIIN YTEPKLGYTR DSPGFLRFVR VLCGMSSDER KAFLQFTTGC STLPPGGLAN LHPRLTVVRK VDATDASYPS VNTCVHYLKL PEYSSEEIMR ERLLAATMEK GFHLN // ID H0VQK0_CAVPO Unreviewed; 353 AA. AC H0VQK0; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCPOP00000012826}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSCPOP00000012826}; OS Cavia porcellus (Guinea pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000012826}; RN [1] {ECO:0000313|Ensembl:ENSCPOP00000012826} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000012826}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [2] {ECO:0000313|Ensembl:ENSCPOP00000012826} RP IDENTIFICATION. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000012826}; RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCPOP00000012826}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKN02045318; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10141.ENSCPOP00000012826; -. DR Ensembl; ENSCPOT00000014378; ENSCPOP00000012826; ENSCPOG00000014236. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H0VQK0; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005447; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005447}; KW Reference proteome {ECO:0000313|Proteomes:UP000005447}. SQ SEQUENCE 353 AA; 40542 MW; 227C4887671BC767 CRC64; VSRKGKLRQG ARPFPSCSEA PSSSTSMPLL SPAYRNPDAN GLTQSLWKIT LSIIFLSTFL LIGHRNHQWL KETAMPQKYE QLYAMFAEYG TRLYHYQARF RKPREQLELL KKESQTLENN FREILVLTEQ IATLKALLRD MQDGTFAVTR DQDNAEVPDE EMFQLVHYVL KRLREDQVQM ADYALKSAGA SIVESETSES YINNKTKLYW HGIGFLNHEM PPDIILQPDV HPGKCWAFPG SQGHILIRLA RKIIPMSVTM EHISEKVSPS GNISSAPKEF SVHGLMKRCE GEEVFLGRFI YNKTEATVQT FDLQHEISES LLCVRLKILS NWGHPQYTCL YRFRVHGIPS DHT // ID H0VXU8_CAVPO Unreviewed; 443 AA. AC H0VXU8; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCPOP00000015528}; GN Name=Spag4 {ECO:0000313|Ensembl:ENSCPOP00000015528}; OS Cavia porcellus (Guinea pig). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000015528}; RN [1] {ECO:0000313|Ensembl:ENSCPOP00000015528} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000015528}; RX PubMed=21993624; DOI=10.1038/nature10530; RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., RA Washietl S., Kheradpour P., Ernst J., Jordan G., Mauceli E., RA Ward L.D., Lowe C.B., Holloway A.K., Clamp M., Gnerre S., Alfoldi J., RA Beal K., Chang J., Clawson H., Cuff J., Di Palma F., Fitzgerald S., RA Flicek P., Guttman M., Hubisz M.J., Jaffe D.B., Jungreis I., RA Kent W.J., Kostka D., Lara M., Martins A.L., Massingham T., Moltke I., RA Raney B.J., Rasmussen M.D., Robinson J., Stark A., Vilella A.J., RA Wen J., Xie X., Zody M.C., Baldwin J., Bloom T., Chin C.W., Heiman D., RA Nicol R., Nusbaum C., Young S., Wilkinson J., Worley K.C., Kovar C.L., RA Muzny D.M., Gibbs R.A., Cree A., Dihn H.H., Fowler G., Jhangiani S., RA Joshi V., Lee S., Lewis L.R., Nazareth L.V., Okwuonu G., RA Santibanez J., Warren W.C., Mardis E.R., Weinstock G.M., Wilson R.K., RA Delehaunty K., Dooling D., Fronik C., Fulton L., Fulton B., Graves T., RA Minx P., Sodergren E., Birney E., Margulies E.H., Herrero J., RA Green E.D., Haussler D., Siepel A., Goldman N., Pollard K.S., RA Pedersen J.S., Lander E.S., Kellis M.; RT "A high-resolution map of human evolutionary constraint using 29 RT mammals."; RL Nature 478:476-482(2011). RN [2] {ECO:0000313|Ensembl:ENSCPOP00000015528} RP IDENTIFICATION. RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000015528}; RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCPOP00000015528}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAKN02039387; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 10141.ENSCPOP00000015528; -. DR Ensembl; ENSCPOT00000025681; ENSCPOP00000015528; ENSCPOG00000020244. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H0VXU8; -. DR OMA; GANSAPK; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005447; Unassembled WGS sequence. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005447}; KW Reference proteome {ECO:0000313|Proteomes:UP000005447}. FT COILED 210 237 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 443 AA; 49117 MW; 492616F010643622 CRC64; MRRSPRPGSV ASPQEHRLSF YSENSDSSDS VPSRDSSGHR FPGQGPGEPE GRRDRGSSCG EPALRRRLPR GSSWAGSSRQ EPSPGNHNLE TACGAATVRG GASEPAASPA VSEEQRSLLA ILDLRREMPA LRATKSFLSL LFQVPRVLLL LVRDALLGVC REVCSVHFLS AASLLSVFLA ALSWSLLHLL PPLENEPKAM LSPSEHQERL RSHGQQLQQL QAELNKLRKE VARVRKAHSE RVAKLVFQRL NEDFVQKPDY ALSSVAPGAT IDLEKTSHDY EDTNMAYFWN RFSFWNYARP PTVILEPDVF PGNCWAFQGH QGQVVIRLAG RVQLSDITLQ HPPPSVAHTE DASSAPRDFA VFGLQVDDKT EVFLGRFTFD VKKFAIQTFH LQNDSPSAFP KVKIQILSNW GHPRFTCLYR VRAHGLRISE GEGSATGATR GPH // ID H0WGK7_OTOGA Unreviewed; 760 AA. AC H0WGK7; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOGAP00000000463}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSOGAP00000000463}; OS Otolemur garnettii (Small-eared galago) (Garnett's greater bushbaby). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. OX NCBI_TaxID=30611 {ECO:0000313|Ensembl:ENSOGAP00000000463, ECO:0000313|Proteomes:UP000005225}; RN [1] {ECO:0000313|Ensembl:ENSOGAP00000000463} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Broad Institute Genome Sequencing Platform; RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K., Jaffe D.B., RA Gnerre S., MacCallum I., Przybylski D., Ribeiro F.J., Burton J.N., RA Walker B.J., Sharpe T., Hall G.; RT "Version 3 of the genome sequence of Otolemur garnettii (Bushbaby)."; RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSOGAP00000000463} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOGAP00000000463}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAQR03016211; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 30611.ENSOGAP00000000463; -. DR Ensembl; ENSOGAT00000000516; ENSOGAP00000000463; ENSOGAG00000000515. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; H0WGK7; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005225; Unassembled WGS sequence. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 206 226 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 257 278 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 316 336 {ECO:0000256|SAM:Coils}. FT COILED 406 444 {ECO:0000256|SAM:Coils}. FT COILED 447 474 {ECO:0000256|SAM:Coils}. FT COILED 521 548 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 760 AA; 85448 MW; 30FE9CE03E292A2A CRC64; LISPPPRVTW GQPQASLVFL FSLPKPSGKG SISHLIMSRR SQRLTRYSQG DDDGNSSSGG SSVTGSHNSL FKDSPLRTLK RKSSNMKRLS PAPQLGPSSD AHTSYYSESM VQESCIGNPR AALLARNALL HDLHSDSYWS EDLRVRRRRG TGGTESSKVN GLTEGKVSED FLGSSSGYSS EDDFMGYSDT DQQNPGSRLR SAVSRLGSFF WMVVTFPGRL FGLLYWWIGT TWYRLTTAAS LLDVFVLTRR FSSLKTFLWF LLLLLFLTCL TYGAWYFYPY GLQTFHPAVV SWWAAKDSRT QQEGWDSRDS PHFQAEQRVL SRVHSLERRL EALAAEFSSN WQKEALRLER LELRQGAAGQ GGGGDLSHED TLALLEGLVS RREAALKDDL RRDTTARIQE ELAALRAEHQ QDSEDLFKKI VQASQESEAR IHQLKSEWQR MTQESFQEHF VKELGRLEDQ LAGLRQELAA LTLKQSSVAD EVGLLPQQIQ AVRDDVESQF PAWINQFLLR GGGTRTGFLQ KEEMQAQLQE LENRILAHMA EMQGKSATEA AASLGLTLQK EGVIGVTEEQ VHRIVKQALQ RYSEDRIGMV DYALESGGAS VISTRCSETY ETKTALLSLF GIPLWYHSQS PRVILQPDVH PGNCWAFQGP QGFAVVRLSA RIRLTAVTLE HVPKALSPNS TISSAPKDFA IFGFDEDLQQ EGTLLGKFTY DQDGEPIQTF YFQPPKMATY QVVELRILTN WGHPEYTCIY RFRVHGEPAH // ID H0X4Z0_OTOGA Unreviewed; 913 AA. AC H0X4Z0; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOGAP00000010307}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSOGAP00000010307}; OS Otolemur garnettii (Small-eared galago) (Garnett's greater bushbaby). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. OX NCBI_TaxID=30611 {ECO:0000313|Ensembl:ENSOGAP00000010307, ECO:0000313|Proteomes:UP000005225}; RN [1] {ECO:0000313|Ensembl:ENSOGAP00000010307} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Broad Institute Genome Sequencing Platform; RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K., Jaffe D.B., RA Gnerre S., MacCallum I., Przybylski D., Ribeiro F.J., Burton J.N., RA Walker B.J., Sharpe T., Hall G.; RT "Version 3 of the genome sequence of Otolemur garnettii (Bushbaby)."; RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSOGAP00000010307} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOGAP00000010307}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAQR03120015; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03120016; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03120017; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03120018; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 30611.ENSOGAP00000010307; -. DR Ensembl; ENSOGAT00000011513; ENSOGAP00000010307; ENSOGAG00000011509. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; H0X4Z0; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005225; Unassembled WGS sequence. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 335 352 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 364 381 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 387 408 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 420 438 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 504 531 {ECO:0000256|SAM:Coils}. FT COILED 556 590 {ECO:0000256|SAM:Coils}. FT COILED 604 624 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 913 AA; 102907 MW; 53B0F2D401DAD94A CRC64; EVVIMDFSRL HMYTPPQCVP ENSGYTYALS SSYSSDALDF ETEHKLDPVF DSPRMSRRSL RLVTTAYPPG DSQAVGTHSC ISSTGSLKDG VARTAKQRRS ATKMAFSINH VSRQVTSSGA SQGGSNNLQP PVLDESLICE QTKVDHFWGL DDDGDLKGGN KAAIQGNGDM AAGATAYNGY TCSDCTMLSQ RRDALTAHSA AQGLMTRIYS RERDPKRGSS FYMDRILWLA KYTSSSFSSF LVQLFQVVLM KLNYESENYK LKTDESKDCE SKSYKSKSHE VKAHPSYCGR MNVREFLRED GHLNVNGGSL CGDCKGKKHL ETHTATHPQC PRPPGLAGAL GHVCACAGYF LMQTLRRIGA AGWFMSRMVW SVLWLAIVAP GKAASGLFWW LGVGWYQFVT LISWLNVFLL TRCLRNFCKF LILIIPLLLL LAGLSLWGQG DFLSFLPILN WIDIQRTQRV DDPRNILKPE TSHLNQPLQG DDEAFQWRWT RDMEQQVASL SGQCRSHDER LQELTAVLQK LQAQVDQVDD GRAGLSVLVR DAVGQHLRET DFMTFHQEHD LRISNLEDIL KKLTEKSEAI RKELEQTKLK AIREADEQHL LSTVRHLELE LDHLKSELSS WQHVKTSCEK IDVIHEKVDA QVRETVKLIF SEDQQDGSLE WLLQKFSSQF VSKDDLQILL QDLELQILKN ITHHISVTKQ MPTSETVVSA VHKAGVSGIT EAQARVIVNN ALKLYSQDKT GMVDFALESG GGSILSTRCS ETYETKTALI SLFGIPLWYF SQSPRVVIQP DMYPGNCWAF KGSQGYLVVR LSMVIHPTAV TLEHIPKTLS PTGNITSAPK DFAVYGLENE YQEEGWLLGQ FTYDHEGEPL QMFHVLERPD RAFQVVELRI FSNWGHPEYT CLYRFRVHGQ PVK // ID H0X5H6_OTOGA Unreviewed; 1399 AA. AC H0X5H6; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOGAP00000010537}; GN Name=SUCO {ECO:0000313|Ensembl:ENSOGAP00000010537}; OS Otolemur garnettii (Small-eared galago) (Garnett's greater bushbaby). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. OX NCBI_TaxID=30611 {ECO:0000313|Ensembl:ENSOGAP00000010537, ECO:0000313|Proteomes:UP000005225}; RN [1] {ECO:0000313|Ensembl:ENSOGAP00000010537} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Broad Institute Genome Sequencing Platform; RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K., Jaffe D.B., RA Gnerre S., MacCallum I., Przybylski D., Ribeiro F.J., Burton J.N., RA Walker B.J., Sharpe T., Hall G.; RT "Version 3 of the genome sequence of Otolemur garnettii (Bushbaby)."; RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSOGAP00000010537} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOGAP00000010537}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAQR03038748; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03038749; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03038750; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03038751; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03038752; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03038753; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03038754; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03038755; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03038756; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03038757; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 30611.ENSOGAP00000010537; -. DR Ensembl; ENSOGAT00000011779; ENSOGAP00000010537; ENSOGAG00000011770. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; H0X5H6; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000005225; Unassembled WGS sequence. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005225}; KW Reference proteome {ECO:0000313|Proteomes:UP000005225}. FT COILED 1081 1101 {ECO:0000256|SAM:Coils}. FT COILED 1131 1151 {ECO:0000256|SAM:Coils}. FT COILED 1337 1357 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1399 AA; 156006 MW; 9DA2D305F157FB6B CRC64; RGVLARPFPF TNQHLAQWAS PLTPGKCLLN FITTAFCLPS SVLPRTVLKK GKLNVTIPKL LGLLRSETTD FSVKIVDSMR NRERERRVLE GDYSSRGALR GAARRTEGRG ASTRRPLQQR QSPESCEAPL PAQLSAPRRA RTEREPPSAT ALRTLAPILA LLLRLLHLGL GSGGCREDVP PSGRGKKEEK MKKYRRALAL VSCLSLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFQKKN KSNFRNSMPI CKIYRRTAKH SSFFDLAYVL EITNSIFNLY YVKFIQIVFL VEIEKSGTIS VAKPSETEQS ETDCDVGEAL DANAPIEQPS FVGPPESLVG QHIENSSSSH GKGKITKSEF ESKVGASEQG GDDSKSALNS SDNLKNESSD YTKPGEIDPS SVASPKDPED IPTFDEWKKK VMEVEKEKSQ SMHPSSNGGS HATKKVQKNR NNYASVECGA KILAANPEAK STSAILIENM DLYMLNPCST KIWFVIELCE PIQVKQFDIA NYELFSSTPK DFLVSISDRY PTNKWIKLGT FHGRDERNVQ SFPLDEQMYA KYVKVELVSH FGSEHFCPLS LIRVFGTSMV EEYEEIADSQ YHSERQELFD EDYDYPLDYN TGEDKSSKNL LGSATNAILN MVNIAANILG AKTEDLTEGN KSISENATAT PAPKMPESTP VSTPVPPPEY VTAEKHIHDM EPSTLDTAKE SPIVQLVQEE EEEASQSTVT LLGSGEQEDE SSPWFESETQ IFCSELSTVC CVSSFSEYIF KWCSVRIALY WQRSRTAWSK GKNYLVSAQP PLLLPAESVE VSVLQPSKGE LDSKDVVREA ESVLLGDLSS MHQGDLMNHT VDAIELEPSH PQTLSQSLLL DITPEINSLS DIEVSESVKY TAGHIPSQII PQEHSIEVDN ETEKKSESFS SIEKPSVIYE TNKINEVIDN TIKEDMNSMH ITTKLSETIV PPLNTATVPD NEDGEAKMNI ADTPKQILTP VVDSSSLPEV KEEEQSPEDA LLKGLQRTAT DFYAELQNST DLGYANGNLV HGSNQKESVF MRLNNRIKAL EVNMSLSGRY LEELSQRYRK QMEEMQKAFN KTIVKLQNTS RIAEEQDQRQ TEAIQLLQAQ LTNMTQLVSN LSTTVAELKQ EVSDRQSYLV ISLVLCVVLG LMLCMQRCRN TSQFDGDYIS KLPKSNQYPS PKRCFSSYDD MNLKRRTSFP LIRSKSLQLT GKEVDPNDLY IVEPLKFSPE KKKKRCKYKI EKIETIKSAD PLHPIANGDI KGKKPFTNQR DFSNMGEVYH SSYKGPPSEG SSETSSQSEE SYFCGISACT SLCNGQSQKT KTEKRALKRR RSKVQDQGKL LKTLIQTKSG SLPSLHDIIR GNKEITVGTF GVTAVSGHT // ID H0X7I2_OTOGA Unreviewed; 360 AA. AC H0X7I2; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOGAP00000011370}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSOGAP00000011370}; OS Otolemur garnettii (Small-eared galago) (Garnett's greater bushbaby). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. OX NCBI_TaxID=30611 {ECO:0000313|Ensembl:ENSOGAP00000011370, ECO:0000313|Proteomes:UP000005225}; RN [1] {ECO:0000313|Ensembl:ENSOGAP00000011370} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Broad Institute Genome Sequencing Platform; RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K., Jaffe D.B., RA Gnerre S., MacCallum I., Przybylski D., Ribeiro F.J., Burton J.N., RA Walker B.J., Sharpe T., Hall G.; RT "Version 3 of the genome sequence of Otolemur garnettii (Bushbaby)."; RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSOGAP00000011370} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOGAP00000011370}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAQR03138868; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03138869; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AAQR03138870; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_003797796.1; XM_003797748.1. DR STRING; 30611.ENSOGAP00000011370; -. DR Ensembl; ENSOGAT00000012691; ENSOGAP00000011370; ENSOGAG00000012687. DR GeneID; 100948130; -. DR CTD; 256979; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; H0X7I2; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005225; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005225}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005225}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 48 69 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 360 AA; 40801 MW; 3C784F86B826682B CRC64; MSGRTKTRRA ARFFRGCSDE ASSSSSANIL QSEPENPCTN RLTRPRKIIL CVILGLTFLF IGLRSLQWLK EIEFPQKSRQ VYSVIGEFAS RLYNYQARLR MPKEQLELLK KESQTLENNF REILFLIEQI DVLKALLRDI KDGVYNQSWG AHGDPPGSQN TTETLDEEMS NLVNYVLKKL REDQVQMADY ALKSAGASII EAGTSESYKN NKAKLYWHGI GFLNYEMPPD IILQPDVYPG KCWAFPGSQG HTLIKLARKI VPTSVTMEHI SEKVSPSGNI SSAPKQFSVY GILKKCEGDE IFLGQFIYNK TGTTVQTFKL QHAVSESVLC VKLKILSNWG HSEYTCLYRF RVHGTPSVHT // ID H0XDS1_OTOGA Unreviewed; 369 AA. AC H0XDS1; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSOGAP00000014031}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSOGAP00000014031}; OS Otolemur garnettii (Small-eared galago) (Garnett's greater bushbaby). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. OX NCBI_TaxID=30611 {ECO:0000313|Ensembl:ENSOGAP00000014031, ECO:0000313|Proteomes:UP000005225}; RN [1] {ECO:0000313|Ensembl:ENSOGAP00000014031} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Broad Institute Genome Sequencing Platform; RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K., Jaffe D.B., RA Gnerre S., MacCallum I., Przybylski D., Ribeiro F.J., Burton J.N., RA Walker B.J., Sharpe T., Hall G.; RT "Version 3 of the genome sequence of Otolemur garnettii (Bushbaby)."; RL Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSOGAP00000014031} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSOGAP00000014031}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAQR03054150; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 30611.ENSOGAP00000014031; -. DR Ensembl; ENSOGAT00000015666; ENSOGAP00000014031; ENSOGAG00000015660. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; H0XDS1; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005225; Unassembled WGS sequence. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005225}; KW Reference proteome {ECO:0000313|Proteomes:UP000005225}. FT COILED 153 173 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 369 AA; 42137 MW; F5E534CC5E375ED2 CRC64; MPRPLRNPGD LNSEDVVHGA RARRWVTQRG RNTCRTEDPC PNTNDTLPLP VSINAPALNL IQCMLGCMSW LTYAACFLRT QLQQVLSNTC KCKMFCQKLM EKTGVLVLCA FGFWMFSIHL PSKMEVWQDD SINSPMQSLR MYQEKVRHHA GEIQDLRGSM NLLIAKLQEM EAMSDEQKMA QKIMKMIQGD YIERPDFALK SIGASIDFEH TSATYNHDKA RSYWNWIRLW NYAQPPDVIL EPNVTPGNCW AFEGDRGQVT IRLAQKVYLS NLSLQHIPKT ISLSGSLDTA PKDFVIYGME KSPKEEVFLG AFQFQPENII QTFPLQNQPA RAFGSVKVKV SSNWGNPRFT CLYRVRVHGS VMPPKEQPN // ID H0Y6N5_HUMAN Unreviewed; 634 AA. AC H0Y6N5; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 27. DE SubName: Full=SUN domain-containing protein 1 {ECO:0000313|Ensembl:ENSP00000406653}; DE Flags: Fragment; GN Name=SUN1 {ECO:0000313|Ensembl:ENSP00000406653}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000406653, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000406653, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12853948; DOI=10.1038/nature01782; RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A., Pepin K.H., RA Wagner-McPherson C., Layman D., Maas J., Jaeger S., Walker R., RA Wylie K., Sekhon M., Becker M.C., O'Laughlin M.D., Schaller M.E., RA Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., RA Sun H., Edwards J., Bradshaw-Cordum H., Ali J., Andrews S., Isak A., RA Vanbrunt A., Nguyen C., Du F., Lamar B., Courtney L., Kalicki J., RA Ozersky P., Bielicki L., Scott K., Holmes A., Harkins R., Harris A., RA Strong C.M., Hou S., Tomlinson C., Dauphin-Kohlberg S., RA Kozlowicz-Reilly A., Leonard S., Rohlfing T., Rock S.M., RA Tin-Wollam A.-M., Abbott A., Minx P., Maupin R., Strowmatt C., RA Latreille P., Miller N., Johnson D., Murray J., Woessner J.P., RA Wendl M.C., Yang S.-P., Schultz B.R., Wallis J.W., Spieth J., RA Bieri T.A., Nelson J.O., Berkowicz N., Wohldmann P.E., Cook L.L., RA Hickenbotham M.T., Eldred J., Williams D., Bedell J.A., Mardis E.R., RA Clifton S.W., Chissoe S.L., Marra M.A., Raymond C., Haugen E., RA Gillett W., Zhou Y., James R., Phelps K., Iadanoto S., Bubb K., RA Simms E., Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., RA Baertsch R.A., Brent M.R., Keibler E., Flicek P., Bork P., Suyama M., RA Bailey J.A., Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., RA Eddy S.R., McPherson J.D., Olson M.V., Eichler E.E., Green E.D., RA Waterston R.H., Wilson R.K.; RT "The DNA sequence of human chromosome 7."; RL Nature 424:157-164(2003). RN [2] {ECO:0000213|PubMed:20068231} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=20068231; DOI=10.1126/scisignal.2000475; RA Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L., RA Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., RA Mann M.; RT "Quantitative phosphoproteomics reveals widespread full RT phosphorylation site occupancy during mitosis."; RL Sci. Signal. 3:RA3-RA3(2010). RN [3] {ECO:0000313|Ensembl:ENSP00000406653} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000406653}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC073957; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AC099731; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; KF458356; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; H0Y6N5; -. DR Ensembl; ENST00000433212; ENSP00000406653; ENSG00000164828. DR UCSC; uc003sji.3; human. DR HGNC; HGNC:18587; SUN1. DR GeneTree; ENSGT00390000011587; -. DR ChiTaRS; SUN1; human. DR NextBio; 35519950; -. DR Proteomes; UP000005640; Chromosome 7. DR ExpressionAtlas; H0Y6N5; baseline and differential. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|MaxQB:H0Y6N5, KW ECO:0000213|PeptideAtlas:H0Y6N5}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 108 131 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 138 157 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 232 252 {ECO:0000256|SAM:Coils}. FT COILED 277 311 {ECO:0000256|SAM:Coils}. FT COILED 324 344 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSP00000406653}. SQ SEQUENCE 634 AA; 71044 MW; 3ED87AAD5F5EFF93 CRC64; XSKAHASYYG RMNVREVLRE DGHLSVNGEA LCDDCKGKRH LDAHTAAHSQ SPRLPGRAGT LWHIWACAGY FLLQILRRIG AVGQAVSRTA WSALWLAVVA PGKAASGVFW WLGIGWYQFV TLISWLNVFL LTRCLRNICK FLVLLIPLFL LLAGLSLRGQ GNFFSFLPVL NWASMHRTQR VDDPQDVFKP TTSRLKQPLQ GDSEAFPWHW MSGVEQQVAS LSGQCHHHGE NLRELTTLLQ KLQARVDQME GGAAGPSASV RDAVGQPPRE TDFMAFHQEH EVRMSHLEDI LGKLREKSEA IQKELEQTKQ KTISAVGEQL LPTVEHLQLE LDQLKSELSS WRHVKTGCET VDAVQERVDV QVREMVKLLF SEDQQGGSLE QLLQRFSSQF VSKGDLQTML RDLQLQILRN VTHHVSVTKQ LPTSEAVVSA VSEAGASGIT EAQARAIVNS ALKLYSQDKT GMVDFALESG GGSILSTRCS ETYETKTALM SLFGIPLWYF SQSPRVVIQP DIYPGNCWAF KGSQGYLVVR LSMMIHPAAF TLEHIPKTLS PTGNISSAPK DFAVYGLENE YQEEGQLLGQ FTYDQDGESL QMFQALKRPD DTAFQIVELR IFSNWGHPEY TCLYRFRVHG EPVK // ID H0Y742_HUMAN Unreviewed; 710 AA. AC H0Y742; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=SUN domain-containing protein 1 {ECO:0000313|Ensembl:ENSP00000409909}; DE Flags: Fragment; GN Name=SUN1 {ECO:0000313|Ensembl:ENSP00000409909}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000409909, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000409909, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12853948; DOI=10.1038/nature01782; RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A., Pepin K.H., RA Wagner-McPherson C., Layman D., Maas J., Jaeger S., Walker R., RA Wylie K., Sekhon M., Becker M.C., O'Laughlin M.D., Schaller M.E., RA Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., RA Sun H., Edwards J., Bradshaw-Cordum H., Ali J., Andrews S., Isak A., RA Vanbrunt A., Nguyen C., Du F., Lamar B., Courtney L., Kalicki J., RA Ozersky P., Bielicki L., Scott K., Holmes A., Harkins R., Harris A., RA Strong C.M., Hou S., Tomlinson C., Dauphin-Kohlberg S., RA Kozlowicz-Reilly A., Leonard S., Rohlfing T., Rock S.M., RA Tin-Wollam A.-M., Abbott A., Minx P., Maupin R., Strowmatt C., RA Latreille P., Miller N., Johnson D., Murray J., Woessner J.P., RA Wendl M.C., Yang S.-P., Schultz B.R., Wallis J.W., Spieth J., RA Bieri T.A., Nelson J.O., Berkowicz N., Wohldmann P.E., Cook L.L., RA Hickenbotham M.T., Eldred J., Williams D., Bedell J.A., Mardis E.R., RA Clifton S.W., Chissoe S.L., Marra M.A., Raymond C., Haugen E., RA Gillett W., Zhou Y., James R., Phelps K., Iadanoto S., Bubb K., RA Simms E., Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., RA Baertsch R.A., Brent M.R., Keibler E., Flicek P., Bork P., Suyama M., RA Bailey J.A., Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., RA Eddy S.R., McPherson J.D., Olson M.V., Eichler E.E., Green E.D., RA Waterston R.H., Wilson R.K.; RT "The DNA sequence of human chromosome 7."; RL Nature 424:157-164(2003). RN [2] {ECO:0000313|Ensembl:ENSP00000409909} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000409909}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC073957; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AC099731; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; KF458356; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; H0Y742; -. DR STRING; 9606.ENSP00000384015; -. DR PaxDb; H0Y742; -. DR Ensembl; ENST00000429178; ENSP00000409909; ENSG00000164828. DR UCSC; uc003sjg.3; human. DR HGNC; HGNC:18587; SUN1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR ChiTaRS; SUN1; human. DR NextBio; 35520047; -. DR Proteomes; UP000005640; Chromosome 7. DR Bgee; H0Y742; -. DR ExpressionAtlas; H0Y742; baseline and differential. DR GO; GO:0043231; C:intracellular membrane-bounded organelle; IDA:HPA. DR GO; GO:0031965; C:nuclear membrane; IDA:HPA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|MaxQB:H0Y742, KW ECO:0000213|PeptideAtlas:H0Y742}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 184 207 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 214 233 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 308 328 {ECO:0000256|SAM:Coils}. FT COILED 353 387 {ECO:0000256|SAM:Coils}. FT COILED 400 420 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSP00000409909}. SQ SEQUENCE 710 AA; 79591 MW; 5E4E6D3C56E4CDA3 CRC64; GDVGAAAATA HNGFSCSNCS MLSERKDVLT AHPAAPGPVS RVYSRDRNQK CGASFYVNRI LWLARYTASS FSSFLVQLFQ VVLMKLSYES ENYKLKTHES KDCESESYKS KSHESKAHAS YYGRMNVREV LREDGHLSVN GEALCYFLLQ ILRRIGAVGQ AVSRTAWSAL WLAVVAPGKA ASGVFWWLGI GWYQFVTLIS WLNVFLLTRC LRNICKFLVL LIPLFLLLAG LSLRGQGNFF SFLPVLNWAS MHRTQRVDDP QDVFKPTTSR LKQPLQGDSE AFPWHWMSGV EQQVASLSGQ CHHHGENLRE LTTLLQKLQA RVDQMEGGAA GPSASVRDAV GQPPRETDFM AFHQEHEVRM SHLEDILGKL REKSEAIQKE LEQTKQKTIS AVGEQLLPTV EHLQLELDQL KSELSSWRHV KTGCETVDAV QERVDVQVRE MVKLLFSEDQ QGGSLEQLLQ RFSSQFVSKG DLQTMLRDLQ LQILRNVTHH VSVTKQLPTS EAVVSAVSEA GASGITEAQA RAIVNSALKL YSQDKTGMVD FALESGGGSI LSTRCSETYE TKTALMSLFG IPLWYFSQSP RVVIQPDIYP GNCWAFKGSQ GYLVVRLSMM IHPAAFTLEH IPKTLSPTGN ISSAPKDFAV YGLENEYQEE GQLLGQFTYD QDGESLQMFQ ALKRPDDTAF QIVELRIFSN WGHPEYTCLY RFRVHGEPVK // ID H0YJP0_HUMAN Unreviewed; 1465 AA. AC H0YJP0; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 29-OCT-2014, sequence version 2. DT 11-NOV-2015, entry version 30. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|Ensembl:ENSP00000451860}; DE Flags: Fragment; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSP00000451860}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000451860, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000451860, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12508121; DOI=10.1038/nature01348; RA Heilig R., Eckenberg R., Petit J.-L., Fonknechten N., Da Silva C., RA Cattolico L., Levy M., Barbe V., De Berardinis V., Ureta-Vidal A., RA Pelletier E., Vico V., Anthouard V., Rowen L., Madan A., Qin S., RA Sun H., Du H., Pepin K., Artiguenave F., Robert C., Cruaud C., RA Bruels T., Jaillon O., Friedlander L., Samson G., Brottier P., RA Cure S., Segurens B., Aniere F., Samain S., Crespeau H., Abbasi N., RA Aiach N., Boscus D., Dickhoff R., Dors M., Dubois I., Friedman C., RA Gouyvenoux M., James R., Madan A., Mairey-Estrada B., Mangenot S., RA Martins N., Menard M., Oztas S., Ratcliffe A., Shaffer T., Trask B., RA Vacherie B., Bellemere C., Belser C., Besnard-Gonnet M., RA Bartol-Mavel D., Boutard M., Briez-Silla S., Combette S., RA Dufosse-Laurent V., Ferron C., Lechaplais C., Louesse C., Muselet D., RA Magdelenat G., Pateau E., Petit E., Sirvain-Trukniewicz P., Trybou A., RA Vega-Czarny N., Bataille E., Bluet E., Bordelais I., Dubois M., RA Dumont C., Guerin T., Haffray S., Hammadi R., Muanga J., Pellouin V., RA Robert D., Wunderle E., Gauguet G., Roy A., Sainte-Marthe L., RA Verdier J., Verdier-Discala C., Hillier L.W., Fulton L., McPherson J., RA Matsuda F., Wilson R., Scarpelli C., Gyapay G., Wincker P., Saurin W., RA Quetier F., Waterston R., Hood L., Weissenbach J.; RT "The DNA sequence and analysis of human chromosome 14."; RL Nature 421:601-607(2003). RN [2] {ECO:0000213|PubMed:18669648} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=18669648; DOI=10.1073/pnas.0805139105; RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., RA Elledge S.J., Gygi S.P.; RT "A quantitative atlas of mitotic phosphorylation."; RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008). RN [3] {ECO:0000213|PubMed:19413330} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=19413330; DOI=10.1021/ac9004309; RA Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., RA Mohammed S.; RT "Lys-N and trypsin cover complementary parts of the phosphoproteome in RT a refined SCX-based approach."; RL Anal. Chem. 81:4493-4501(2009). RN [4] {ECO:0000213|PubMed:20068231} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=20068231; DOI=10.1126/scisignal.2000475; RA Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L., RA Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., RA Mann M.; RT "Quantitative phosphoproteomics reveals widespread full RT phosphorylation site occupancy during mitosis."; RL Sci. Signal. 3:RA3-RA3(2010). RN [5] {ECO:0000213|PubMed:21269460} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21269460; DOI=10.1186/1752-0509-5-17; RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., RA Burckstummer T., Bennett K.L., Superti-Furga G., Colinge J.; RT "Initial characterization of the human central proteome."; RL BMC Syst. Biol. 5:17-17(2011). RN [6] {ECO:0000213|PubMed:21406692} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21406692; DOI=10.1126/scisignal.2001570; RA Rigbolt K.T., Prokhorova T.A., Akimov V., Henningsen J., RA Johansen P.T., Kratchmarova I., Kassem M., Mann M., Olsen J.V., RA Blagoev B.; RT "System-wide temporal characterization of the proteome and RT phosphoproteome of human embryonic stem cell differentiation."; RL Sci. Signal. 4:RS3-RS3(2011). RN [7] {ECO:0000313|Ensembl:ENSP00000451860} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. RN [8] {ECO:0000213|PubMed:24275569} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., RA Wang L., Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human RT liver phosphoproteome."; RL J. Proteomics 96:253-262(2014). CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000451860}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AL121808; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL136418; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; KC877516; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9606.ENSP00000382269; -. DR PaxDb; H0YJP0; -. DR PRIDE; H0YJP0; -. DR Ensembl; ENST00000553957; ENSP00000451860; ENSG00000092148. DR HGNC; HGNC:20157; HECTD1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR ChiTaRS; HECTD1; human. DR NextBio; 35523893; -. DR Proteomes; UP000005640; Chromosome 14. DR Bgee; H0YJP0; -. DR ExpressionAtlas; H0YJP0; baseline and differential. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Proteomics identification {ECO:0000213|MaxQB:H0YJP0, KW ECO:0000213|PeptideAtlas:H0YJP0}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}. FT COILED 719 739 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSP00000451860}. FT NON_TER 1465 1465 {ECO:0000313|Ensembl:ENSP00000451860}. SQ SEQUENCE 1465 AA; 162188 MW; DA0625BA80BBB089 CRC64; VFAQTFQQTM LPSIRKASLA LIRKMIHFCS EALLKEVCDS DVGHNLPTIL VEITATVLDQ EDDDDGHLLA LQIIRDLVDK GGDIFLDQLA RLGVISKVST LAGPSSDDEN EEESKPEKED EPQEDAKELQ QGKPYHWRDW SIIRGRDCLY IWSDAAALEL SNGSNGWFRF ILDGKLATMY SSGSPEGGSD SSESRSEFLE KLQRARGQVK PSTSSQPILS APGPTKLTVG NWSLTCLKEG EIAIHNSDGQ QATILKEDLP GFVFESNRGT KHSFTAETSL GSEFVTGWTG KRGRKLKSKL EKTKQKVRTM ARDLYDDHFK AVESMPRGVV VTLRNIATQL ESSWELHTNR QCIESENTWR DLMKTALENL IVLLKDENTI SPYEMCSSGL VQALLTVLNN SMDLDMKQDC SQLVERINVF KTAFSENEDD ESRPAVALIR KLIAVLESIE RLPLHLYDTP GSTYNLQILT RRLRFRLERA PGETALIDRT GRMLKMEPLA TVESLEQYLL KMVAKQWYDF DRSSFVFVRK LREGQNFIFR HQHDFDENGI IYWIGTNAKT AYEWVNPAAY GLVVVTSSEG RNLPYGRLED ILSRDNSALN CHSNDDKNAW FAIDLGLWVI PSAYTLRHAR GYGRSALRNW VFQVSKDGQN WTSLYTHVDD CSLNEPGSTA TWPLDPPKDE KQGWRHVRIK QMGKNASGQT HYLSLSGFEL YGTVNGVCED QLGKAAKEAE ANLRRQRRLV RSQVLKYMVP GARVIRGLDW KWRDQDGSPQ GEGTVTGELH NGTTQSWSSL VKNNCPDKTS AAAGSSSRKG SSSSVCSVAS SSDISLGSTK TERRSEIVME HSIVSGADVH EPIVVLSSAE NVPQTEVGSS SSASTSTLTA ETGSENAERK LGPDSSVRTP GESSAISMGI VSVSSPDVSS VSELTNKEAA SQRPLSSSAS NRLSVSSLLA AGAPMSSSAS VPNLSSRETS SLESFVRRVA NIARTNATNN MNLSRSSSDN NTNTLGRNVM STATSPLMGA QSFPNLTTPG TTSTVTMSTS SVTSSSNVAT ATTVLSVGQS LSNTLTTSLT STSSESDTGQ EAEYSLYDFL DSCRASTLLA ELDDDEDLPE PDEEDDENED DNQEDQEYEE VMILRRPSLQ RRAGSRSDVT HHAVTSQLPQ VPAGAGSRPI GEQEEEEYET KGGRRRTWDD DYVLKRQFSA LVPAFDPRPG RTNVQQTTDL EIPPPGTPHS ELLEEVECTP SPRLALTLKV TGLGTTREVE LPLTNFRSTI FYYVQKLLQL SCNGNVKSDK LRRIWEPTYT IMYREMKDSD KEKENGKMGC WSIEHVEQYL GTDELPKNDL ITYLQKNADA AFLRHWKLTG TNKSIRKNRN CSQLIAAYKD FCEHGTKSGL NQGAISTLQS SDILNLTKEQ PQAKAGNGQN SCGVEDVLQL LRILYIVASD PYSRISQEDG DEQPQFTFPP DEFTS // ID H0YJV8_HUMAN Unreviewed; 185 AA. AC H0YJV8; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|Ensembl:ENSP00000452233}; DE Flags: Fragment; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSP00000452233}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000452233, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000452233, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12508121; DOI=10.1038/nature01348; RA Heilig R., Eckenberg R., Petit J.-L., Fonknechten N., Da Silva C., RA Cattolico L., Levy M., Barbe V., De Berardinis V., Ureta-Vidal A., RA Pelletier E., Vico V., Anthouard V., Rowen L., Madan A., Qin S., RA Sun H., Du H., Pepin K., Artiguenave F., Robert C., Cruaud C., RA Bruels T., Jaillon O., Friedlander L., Samson G., Brottier P., RA Cure S., Segurens B., Aniere F., Samain S., Crespeau H., Abbasi N., RA Aiach N., Boscus D., Dickhoff R., Dors M., Dubois I., Friedman C., RA Gouyvenoux M., James R., Madan A., Mairey-Estrada B., Mangenot S., RA Martins N., Menard M., Oztas S., Ratcliffe A., Shaffer T., Trask B., RA Vacherie B., Bellemere C., Belser C., Besnard-Gonnet M., RA Bartol-Mavel D., Boutard M., Briez-Silla S., Combette S., RA Dufosse-Laurent V., Ferron C., Lechaplais C., Louesse C., Muselet D., RA Magdelenat G., Pateau E., Petit E., Sirvain-Trukniewicz P., Trybou A., RA Vega-Czarny N., Bataille E., Bluet E., Bordelais I., Dubois M., RA Dumont C., Guerin T., Haffray S., Hammadi R., Muanga J., Pellouin V., RA Robert D., Wunderle E., Gauguet G., Roy A., Sainte-Marthe L., RA Verdier J., Verdier-Discala C., Hillier L.W., Fulton L., McPherson J., RA Matsuda F., Wilson R., Scarpelli C., Gyapay G., Wincker P., Saurin W., RA Quetier F., Waterston R., Hood L., Weissenbach J.; RT "The DNA sequence and analysis of human chromosome 14."; RL Nature 421:601-607(2003). RN [2] {ECO:0000213|PubMed:21269460} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21269460; DOI=10.1186/1752-0509-5-17; RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., RA Burckstummer T., Bennett K.L., Superti-Furga G., Colinge J.; RT "Initial characterization of the human central proteome."; RL BMC Syst. Biol. 5:17-17(2011). RN [3] {ECO:0000313|Ensembl:ENSP00000452233} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000452233}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AL121808; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL136418; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; KC877516; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; H0YJV8; -. DR STRING; 9606.ENSP00000382269; -. DR PaxDb; H0YJV8; -. DR Ensembl; ENST00000557369; ENSP00000452233; ENSG00000092148. DR HGNC; HGNC:20157; HECTD1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR ChiTaRS; HECTD1; human. DR NextBio; 35523960; -. DR Proteomes; UP000005640; Chromosome 14. DR ExpressionAtlas; H0YJV8; baseline and differential. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR InterPro; IPR000421; FA58C. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS50022; FA58C_3; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Proteomics identification {ECO:0000213|MaxQB:H0YJV8, KW ECO:0000213|PeptideAtlas:H0YJV8}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}. FT COILED 76 96 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSP00000452233}. FT NON_TER 185 185 {ECO:0000313|Ensembl:ENSP00000452233}. SQ SEQUENCE 185 AA; 20050 MW; 996C3B7FA55515A3 CRC64; XSKDGQNWTS LYTHVDDCSL NEPGSTATWP LDPPKDEKQG WRHVRIKQMG KNASGQTHYL SLSGFELYGT VNGVCEDQLG KAAKEAEANL RRQRRLVRSQ VLKYMVPGAR VIRGLDWKWR DQDGSPQGEG TVTGELHNAS PLMGAQSFPN LTTPGTTSTV TMSTSSVTSS SNVATATTVL SVGQS // ID H0ZDQ1_TAEGU Unreviewed; 564 AA. AC H0ZDQ1; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTGUP00000008713}; DE Flags: Fragment; GN Name=SUN1 {ECO:0000313|Ensembl:ENSTGUP00000008713}; OS Taeniopygia guttata (Zebra finch) (Poephila guttata). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; OC Estrildidae; Estrildinae; Taeniopygia. OX NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000008713, ECO:0000313|Proteomes:UP000007754}; RN [1] {ECO:0000313|Ensembl:ENSTGUP00000008713, ECO:0000313|Proteomes:UP000007754} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20360741; DOI=10.1038/nature08819; RA Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W., RA Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A., RA Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P., RA Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., RA London S.E., Li Y., Lin Y.C., George J., Sweedler J., Southey B., RA Gunaratne P., Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., RA Itoh Y., Whitney O., Pfenning A.R., Howard J., Volker M., RA Skinner B.M., Griffin D.K., Ye L., McLaren W.M., Flicek P., RA Quesada V., Velasco G., Lopez-Otin C., Puente X.S., Olender T., RA Lancet D., Smit A.F., Hubley R., Konkel M.K., Walker J.A., RA Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z., Eichler E.E., RA Stapley J., Slate J., Ekblom R., Birkhead T., Burke T., Burt D., RA Scharff C., Adam I., Richard H., Sultan M., Soldatov A., Lehrach H., RA Edwards S.V., Yang S.P., Li X., Graves T., Fulton L., Nelson J., RA Chinwalla A., Hou S., Mardis E.R., Wilson R.K.; RT "The genome of a songbird."; RL Nature 464:757-762(2010). RN [2] {ECO:0000313|Ensembl:ENSTGUP00000008713} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTGUP00000008713}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABQF01038279; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 59729.ENSTGUP00000008713; -. DR Ensembl; ENSTGUT00000008806; ENSTGUP00000008713; ENSTGUG00000008442. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H0ZDQ1; -. DR OMA; CEEITTH; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007754; Chromosome 14. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007754}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007754}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 45 61 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 73 98 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 144 164 {ECO:0000256|SAM:Coils}. FT COILED 208 242 {ECO:0000256|SAM:Coils}. FT COILED 248 268 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSTGUP00000008713}. SQ SEQUENCE 564 AA; 63609 MW; 4FEEECD006DD3719 CRC64; GYFMLHALRT AGATGWLVSQ KVSLLWLAIL SPGRAASGMF RLLRTGWYQL VTLMSLLKVF LVRRCLPKNY RWLLFLIPLL FLLGIMSLGL LNGLISLLPP LNWTGIGRIQ RTDDSVHVPE PEDDSFHSVQ PPKDTTNIFD FGRISELEKQ MAFMSDRCHH QNKEYNEVMS LLQNLHNQVA KMKSGGKGPG SSCGHLINSR QTDFLALHKK HELRIQTLED LLRELSAESK DIQKEFDLAK SKSVRTLLHL LMSKIKELEL ELASMKSKLL SGEGVKTSCE KMDVILEKVD AQVKESVKLM LFGTQEEDLP ESLLQWLASN FVSKSDLQTL LRDLELQILK NITLHMSVTN QKVTSEVVTN AVTNAGISGI TEAQAQIIVN NALKLYSQDK TGMVDFALES GGGSILSTRC SETYETKTAL ISLFGIPLWY FSQSPRVVIQ PDMYPGNCWA FKGSQGYLVV RLSMKIYPTA FTLEHIPKTL SPTGNITSAP RNFAVYGLDD EYQEEGKLLG EYVYDQDGEP LQMFPVMEEN EDAFQIVELR IFSNWGHAEY TCLYRFRVHG KPAE // ID H0ZJ94_TAEGU Unreviewed; 712 AA. AC H0ZJ94; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTGUP00000010662}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSTGUP00000010662}; OS Taeniopygia guttata (Zebra finch) (Poephila guttata). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; OC Estrildidae; Estrildinae; Taeniopygia. OX NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000010662, ECO:0000313|Proteomes:UP000007754}; RN [1] {ECO:0000313|Ensembl:ENSTGUP00000010662, ECO:0000313|Proteomes:UP000007754} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20360741; DOI=10.1038/nature08819; RA Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W., RA Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A., RA Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P., RA Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., RA London S.E., Li Y., Lin Y.C., George J., Sweedler J., Southey B., RA Gunaratne P., Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., RA Itoh Y., Whitney O., Pfenning A.R., Howard J., Volker M., RA Skinner B.M., Griffin D.K., Ye L., McLaren W.M., Flicek P., RA Quesada V., Velasco G., Lopez-Otin C., Puente X.S., Olender T., RA Lancet D., Smit A.F., Hubley R., Konkel M.K., Walker J.A., RA Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z., Eichler E.E., RA Stapley J., Slate J., Ekblom R., Birkhead T., Burke T., Burt D., RA Scharff C., Adam I., Richard H., Sultan M., Soldatov A., Lehrach H., RA Edwards S.V., Yang S.P., Li X., Graves T., Fulton L., Nelson J., RA Chinwalla A., Hou S., Mardis E.R., Wilson R.K.; RT "The genome of a songbird."; RL Nature 464:757-762(2010). RN [2] {ECO:0000313|Ensembl:ENSTGUP00000010662} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTGUP00000010662}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABQF01015966; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_012424680.1; XM_012569226.1. DR STRING; 59729.ENSTGUP00000010662; -. DR Ensembl; ENSTGUT00000010774; ENSTGUP00000010662; ENSTGUG00000010300. DR GeneID; 100223417; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H0ZJ94; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007754; Chromosome 1A. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007754}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007754}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 218 238 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 277 304 {ECO:0000256|SAM:Coils}. FT COILED 367 387 {ECO:0000256|SAM:Coils}. FT COILED 397 424 {ECO:0000256|SAM:Coils}. FT COILED 466 493 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 712 AA; 78784 MW; F9EC47787A878A11 CRC64; MSRRSQRLVT TRYYPGDEDA TTSSSSTSLL GGTQLPFKET TGRTIRRKSS STKRLSPTPS TQTSYYSESM MSESYLGGSR GLPALGRSLL DDDLDSSTYW GGELSTRRRR GTGDTESSKI NGVLESKTYD TYTSSSGYSS EDDYAGHYYS GQSSSGSGLR TAASRVGSFL WQVFTSPVQL LRWLFSGLAG AWRRLTGTAS HLDNVPFSRR YPRLKRSLLL LLLLLLLAAA AYGAWYFYPY GLSTLSLPSF PWWGAGKLSS SSDLPGAGDL TTLDQGEHRL LARFQSLEKR FEALEAELSR WELRRGAAAV TAGGEPPPGD ILALLEGLVS RRDAGLKEHL HTDMANHLQG ELDVLRAQVQ RDLDGRLGKM AQASQEMEAR LLELNSEWQS SVQEKLRGTF QQEVGKLEQE VAALRRELAS LKSDQEVMGK HVKGILEQLK TVRADVEAQF PAWIGRFLAQ SQQDGAAAFI LQREDLQAEL QALERKILAK VQEDRRLSAR DAQAGIGVAL RQGGTAGVTE EQVHLIVGQA LKRYSEDRVG MVDYALESAG ASVINTRCSE TYETRTALLS LFGIPLWYHS QSPRVILQPD VNPGNCWAFR GSQGFAVIRL SGIIRPTAVT LEHIPKALSP QGTIPSAPKD FAVYGLKEER EEEGLLLGQF TYNHDGDPIQ TFYLEGDSMG TYQLVELRVL SNWGHPEYTC IYRFRVHGEP AH // ID H0ZNP8_TAEGU Unreviewed; 1578 AA. AC H0ZNP8; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTGUP00000012225}; DE Flags: Fragment; OS Taeniopygia guttata (Zebra finch) (Poephila guttata). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; OC Estrildidae; Estrildinae; Taeniopygia. OX NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000012225, ECO:0000313|Proteomes:UP000007754}; RN [1] {ECO:0000313|Ensembl:ENSTGUP00000012225, ECO:0000313|Proteomes:UP000007754} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20360741; DOI=10.1038/nature08819; RA Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W., RA Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A., RA Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P., RA Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., RA London S.E., Li Y., Lin Y.C., George J., Sweedler J., Southey B., RA Gunaratne P., Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., RA Itoh Y., Whitney O., Pfenning A.R., Howard J., Volker M., RA Skinner B.M., Griffin D.K., Ye L., McLaren W.M., Flicek P., RA Quesada V., Velasco G., Lopez-Otin C., Puente X.S., Olender T., RA Lancet D., Smit A.F., Hubley R., Konkel M.K., Walker J.A., RA Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z., Eichler E.E., RA Stapley J., Slate J., Ekblom R., Birkhead T., Burke T., Burt D., RA Scharff C., Adam I., Richard H., Sultan M., Soldatov A., Lehrach H., RA Edwards S.V., Yang S.P., Li X., Graves T., Fulton L., Nelson J., RA Chinwalla A., Hou S., Mardis E.R., Wilson R.K.; RT "The genome of a songbird."; RL Nature 464:757-762(2010). RN [2] {ECO:0000313|Ensembl:ENSTGUP00000012225} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTGUP00000012225}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABQF01002856; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABQF01002857; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 59729.ENSTGUP00000012225; -. DR Ensembl; ENSTGUT00000012359; ENSTGUP00000012225; ENSTGUG00000011864. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H0ZNP8; -. DR OMA; YTSTHES; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000007754; Chromosome 5. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007754}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007754}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 206 226 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSTGUP00000012225}. SQ SEQUENCE 1578 AA; 175339 MW; 509BB9E1FA03D839 CRC64; AKQWYDFDRA SFVFVRKLRE GQTFVFRHQH DFDENGIIYW IGTNAKTAYE WVNPAAYGLV VVTSSEGRNL PYGRLEDILS RDSSALNCHT NDDKNAWFAI DLGLWVIPSA YTLRHARGYG RSALRNWVFQ VSKDGQNWTT LYTHVDDCSL NEPGSTATWP LDPPKDEKQG WRHVRIKQMG KNASGQTHYL SLSGFELYGT VNGVCEDQLG KAAKEAEANL RRQRRLVRSQ VLKYMVPGAR VIRGIDWKWR DQDGSPQGEG TVTGELHNGW IDVTWDAGGS NSYRMGAEGK FDLKLAPGYD PDSAASPKPV SSTVSGTTQS WSSLVKNNCP DKTTAAAGSS SRKGSSSSVC SVASSSDISL GSTKMERRSE SVLEQNIVSG TDIHEPIVVL SSADSVPQAD IGSSSSASTS TLTADMGNEN AERKLGPDNS IRTPGESSAI SMGIVSVSSP DVSSVSELTN KEAASQRPLS SSASNRLSVS SLLAAGAPMS SSASVPNLSS RETSSLESFV RRVANIARTN ATNNMNLSRS SSDNNTNTLG RNVVSTATSP LMGAQSFPNL TTTGTTSTVT MSTSSVTSSS NVATATTVLS VGQSLSNTLT TSLTSTSSES DTGQEAEYSL YDFLDSCRAS TLLAELDDDE DLPEPDEEDD ENEDDNQEDQ EYEEVMVRYN SFSFLHMVKT TRKCVTAVFM NSQFFNQIIL FSGIKTKQLG FLGEEEEYET KGGRRRTWDD DYVLKRQFSA LVPAFDPRPG RTNVQQTTDL EIPPPGTPHS ELLEEVECMP SPRLALTLKV SGLGTTREVE LPLTNFRSTI FYYVQKLLQL SCNGSVKSDK LRRIWEPTYT IMYREMKDSD KEKESGKMGC WSVEHVEQYL GTDELPKNDL ITYLQKNADS AFLRHWKLTG TNKSIRKNRN CSQLIAAYKD FCEHGSKSGL SQGAISTLQN SDILSLAKEQ PQAKAGSGQN SCGVEDVLQL LRILYIVASD PYTTRISQEE GDEHPQFNFP PDEFTSKKIT TKILQQIEEP LALASGALPD WCEQLTSKCP FLIPFETRQL YFTCTAFGAS RAIVWLQNRR EATVERTRTT STVRRDDPGE FRVGRLKHER VKVPRGESLM EWAENVMQIH ADRKSVLEVE FLGEEGTGLG PTLEFYALVA AEFQRTDLGA WLCDDDFPDD ESRQVDIGGG LKPPGYYVQR SCGLFTAPFP QDSDELERIT KLFHFLGIFL AKCIQDNRLV DLPISKPFFK LMCMGDIKSN MSKLIYESRG DRDLHCTESQ SEASTEEGHD SLSVGSLEED SKSEFILDPP KPKPPAWFNG ILTWEDFELV NPHRARFLKE IKDLAIKRRQ ILSNKNLSED EKNTKLQELM LKNPSGSGPP LSIEDLGLNF QFCPSSKVYG FTAVDLKPGG EDETVTMDNA EEYVDLMFDF CMHTGIQKQM EAFRDGFNRV FPMEKLSSFS HEEVQMILCG NQSPSWAAED IINYTEPKLG YTRDSPGFLR FVRVLCGMSS DERKAFLQFT TGCSTLPPGG LANLHPRLTV VRKVDATDAS YPSVNTCVHY LKLPEYSSEE IMRERLLAAT MEKGFHLN // ID H1A3S9_TAEGU Unreviewed; 1148 AA. AC H1A3S9; DT 22-FEB-2012, integrated into UniProtKB/TrEMBL. DT 22-FEB-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTGUP00000017542}; GN Name=SUCO {ECO:0000313|Ensembl:ENSTGUP00000017542}; OS Taeniopygia guttata (Zebra finch) (Poephila guttata). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; OC Estrildidae; Estrildinae; Taeniopygia. OX NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000017542, ECO:0000313|Proteomes:UP000007754}; RN [1] {ECO:0000313|Proteomes:UP000007754} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=20360741; DOI=10.1038/nature08819; RA Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W., RA Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A., RA Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P., RA Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., RA London S.E., Li Y., Lin Y.C., George J., Sweedler J., Southey B., RA Gunaratne P., Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., RA Itoh Y., Whitney O., Pfenning A.R., Howard J., Volker M., RA Skinner B.M., Griffin D.K., Ye L., McLaren W.M., Flicek P., RA Quesada V., Velasco G., Lopez-Otin C., Puente X.S., Olender T., RA Lancet D., Smit A.F., Hubley R., Konkel M.K., Walker J.A., RA Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z., Eichler E.E., RA Stapley J., Slate J., Ekblom R., Birkhead T., Burke T., Burt D., RA Scharff C., Adam I., Richard H., Sultan M., Soldatov A., Lehrach H., RA Edwards S.V., Yang S.P., Li X., Graves T., Fulton L., Nelson J., RA Chinwalla A., Hou S., Mardis E.R., Wilson R.K.; RT "The genome of a songbird."; RL Nature 464:757-762(2010). RN [2] {ECO:0000313|Ensembl:ENSTGUP00000017542} RP IDENTIFICATION. RG Ensembl; RL Submitted (JAN-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTGUP00000017542}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABQF01040846; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABQF01040847; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABQF01040848; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABQF01040849; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 59729.ENSTGUP00000017542; -. DR Ensembl; ENSTGUT00000017943; ENSTGUP00000017542; ENSTGUG00000017265. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; H1A3S9; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000007754; Unassembled WGS sequence. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007754}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007754}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 906 924 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 830 850 {ECO:0000256|SAM:Coils}. FT COILED 880 900 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1148 AA; 126543 MW; C6599122C91CAF8B CRC64; ETSETSQPEA VSPPSADVNE ASSSIVPSTE NTSSSPTSEI PPVSQPDAIE NSRADIPVVS SSEAEQSEPD CDIGGTLEAD PQSEPSSFVS PPESLAGQHI ENISSSHGKG KKTKSEFESK VSAAEKGADE QKSALNASEN LKREKDFKKT GEIDPTSVIT PKDPGDIPTF DEWKKKVMEV EKEKSQSMHP SAVGGQHSTK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILMENMDLYM LNPCSTKIWF VVELCEPVQV KQFDIANHEL FSSTPKDFLV SISDRYPTNK WIKLGTFHAR DERNVQSFPL DEQMYAKYVK MFIKYIKVEL ISHFGSDIFS KVKLKRRVFG TSMVEEYEEI ADSQYQSDFM SCSSLYIDYL LDYNTGEEKS SKNLLGSATN AILSMVNIAA NMLGAKTEES PETEAGNKSV SENVTATPAT STAAPRLPEP TPVPSPELVT TDIPQIEKEQ LKVDLTKESP IVQLVQEYEE DTSQSTVTLL SSDEQEEEAA AWFELETEKY CCDMAAVCCI STFSEYLLKW CSVAAAMHRQ HSKTGGEERP PHPPQPALTE SAQTAAEEPL PEQLDSKAEK PPGSAVAVDF SAVHEEVSNE TTESIELEPS HPQTVSQSLL LEVTSEAKPA PTTDMVLEPP KEDSGQVAPR VTLQVDVPEL STDMEKAESS VVEESPEPSV ATEVKEMSTR ETFATPVIPK PTETVVQPES TVDMVASDAV EGKESSAEVQ KPAVPPVELP VPAETKEEEQ AAEEALLALP VSGPQRTATD FYAELQNSTE LGYANGNLVH GSNQKESVFM RLNNRIKALE VNMSLSSRYL EELSQRYRKQ MEEMQKAFNK TIIKLQNTSR IAEEQDQRQT EAIQLLQAQL TNMTQLASNL SATVAELKRE VSDRQTYLVI SLVLCLILGL VLFVQRCRSP SQFCEDYLSK IPKSNHYPSP KRCFSSYDDM NLKRRTSLPL VRSQSFQLAG KEVDPDDLYI VEPKFSPEKK KKRCKYKSEK PETIKPTTEP LHPIANGEIK GRKPFTNQRD FSNIGEVYHS SYKGPPSEGS SETSSQSDES YFCGISACTS LCNGQTQKTK TEKRAVKRRR SKVSDQGKLI KTLIQTKSGS MPSLHDIIKG NKDITVGTLG VTTVSGHI // ID H1VRH8_COLHI Unreviewed; 411 AA. AC H1VRH8; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 16-SEP-2015, entry version 15. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:CCF42834.1}; GN ORFNames=CH063_02935 {ECO:0000313|EMBL:CCF42834.1}; OS Colletotrichum higginsianum (strain IMI 349063) (Crucifer anthracnose OS fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Glomerellales; Glomerellaceae; OC Colletotrichum. OX NCBI_TaxID=759273 {ECO:0000313|Proteomes:UP000007174}; RN [1] {ECO:0000313|Proteomes:UP000007174} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=IMI 349063 {ECO:0000313|Proteomes:UP000007174}; RX PubMed=22885923; DOI=10.1038/ng.2372; RA O'Connell R.J., Thon M.R., Hacquard S., Amyotte S.G., Kleemann J., RA Torres M.F., Damm U., Buiate E.A., Epstein L., Alkan N., RA Altmueller J., Alvarado-Balderrama L., Bauser C.A., Becker C., RA Birren B.W., Chen Z., Choi J., Crouch J.A., Duvick J.P., Farman M.A., RA Gan P., Heiman D., Henrissat B., Howard R.J., Kabbage M., Koch C., RA Kracher B., Kubo Y., Law A.D., Lebrun M.-H., Lee Y.-H., Miyara I., RA Moore N., Neumann U., Nordstroem K., Panaccione D.G., Panstruga R., RA Place M., Proctor R.H., Prusky D., Rech G., Reinhardt R., RA Rollins J.A., Rounsley S., Schardl C.L., Schwartz D.C., Shenoy N., RA Shirasu K., Sikhakolli U.R., Stueber K., Sukno S.A., Sweigard J.A., RA Takano Y., Takahara H., Trail F., van der Does H.C., Voll L.M., RA Will I., Young S., Zeng Q., Zhang J., Zhou S., Dickman M.B., RA Schulze-Lefert P., Ver Loren van Themaat E., Ma L.-J., RA Vaillancourt L.J.; RT "Lifestyle transitions in plant pathogenic Colletotrichum fungi RT deciphered by genome and transcriptome analyses."; RL Nat. Genet. 44:1060-1065(2012). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:CCF42834.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CACQ02005678; CCF42834.1; -; Genomic_DNA. DR EnsemblFungi; CCF42834; CCF42834; CH063_02935. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007174; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007174}; KW Reference proteome {ECO:0000313|Proteomes:UP000007174}. SQ SEQUENCE 411 AA; 45145 MW; 6C2E81ECE38783B0 CRC64; MAGLNITEAR MQEKLAKKVL KTNQGAKNAK AILLENKDSY MLLECSAENK FVIVELSDDI LVDTVVLANF EFFSSMIRHF RVSVSDRYPV KVDKWKDLGT FEAKNSRDIQ PFLVQNPLIW AKYVRIEFLT HYGNEFYCPV SLLRVHGTRM LESWKDQETA AEDEEVVDEP LAELTGPASG NAVGESIDGQ ENATMAAVEE TAQDISSDVS ADPQMPSHIF RPSEGGNPSC LRPISSVYPS STNPTGTESQ RIHQDSDHVA GQNKSTSVSA PDPRYAERHP TPSSAPALSA GAHASSANTE AKTGGPGDNT SSTKASPQAT PPVRNKNNST AAAASPTVQE SFFKAVSKRL QYLESNVSLS LKYIEDQSRS LQETQLLAER KQLSRIDVFL DSLNHTVLSE LRAVRQQSGW R // ID H1VYW3_COLHI Unreviewed; 440 AA. AC H1VYW3; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCF45425.1}; GN ORFNames=CH063_14516 {ECO:0000313|EMBL:CCF45425.1}; OS Colletotrichum higginsianum (strain IMI 349063) (Crucifer anthracnose OS fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Glomerellales; Glomerellaceae; OC Colletotrichum. OX NCBI_TaxID=759273 {ECO:0000313|Proteomes:UP000007174}; RN [1] {ECO:0000313|Proteomes:UP000007174} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=IMI 349063 {ECO:0000313|Proteomes:UP000007174}; RX PubMed=22885923; DOI=10.1038/ng.2372; RA O'Connell R.J., Thon M.R., Hacquard S., Amyotte S.G., Kleemann J., RA Torres M.F., Damm U., Buiate E.A., Epstein L., Alkan N., RA Altmueller J., Alvarado-Balderrama L., Bauser C.A., Becker C., RA Birren B.W., Chen Z., Choi J., Crouch J.A., Duvick J.P., Farman M.A., RA Gan P., Heiman D., Henrissat B., Howard R.J., Kabbage M., Koch C., RA Kracher B., Kubo Y., Law A.D., Lebrun M.-H., Lee Y.-H., Miyara I., RA Moore N., Neumann U., Nordstroem K., Panaccione D.G., Panstruga R., RA Place M., Proctor R.H., Prusky D., Rech G., Reinhardt R., RA Rollins J.A., Rounsley S., Schardl C.L., Schwartz D.C., Shenoy N., RA Shirasu K., Sikhakolli U.R., Stueber K., Sukno S.A., Sweigard J.A., RA Takano Y., Takahara H., Trail F., van der Does H.C., Voll L.M., RA Will I., Young S., Zeng Q., Zhang J., Zhou S., Dickman M.B., RA Schulze-Lefert P., Ver Loren van Themaat E., Ma L.-J., RA Vaillancourt L.J.; RT "Lifestyle transitions in plant pathogenic Colletotrichum fungi RT deciphered by genome and transcriptome analyses."; RL Nat. Genet. 44:1060-1065(2012). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:CCF45425.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CACQ02007744; CCF45425.1; -; Genomic_DNA. DR EnsemblFungi; CCF45425; CCF45425; CH063_14516. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000007174; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007174}; KW Reference proteome {ECO:0000313|Proteomes:UP000007174}. FT COILED 147 178 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 440 AA; 49727 MW; 0C150D78AE0DB5F3 CRC64; MQLDKHGRPV VDEEFWHALR DLMKGDTEIL TMDRGHGGYH HVSDEHWRAV RARLLKDPVY QSAASPAPPG PSTAEIENIA QTRFDKSWEK WLKTNDKKVA KILEPAFSTS IPDKISKDLD HKIEKFVKDQ FKNKSTNDVV VTRTEFIRHL QDEFVKHRNE VKAEAQELQR KLEYYINDAV KSASEQAPPA GVSRSEMVQV VDGMVRQAIA NAGLEALAQG KIGATWDREL RHQVNFLGRH TGVVVDASAT TPDWDPPVHG KFRSASWYKS TRRDPKSVPP SATLWNWEDE GDCWCGKVTA HKSGVEVGVR IGYLLSHEII PQHVVVEHIL PGATLDPDAR PREIDVLAYI TEINTRNRIK DFAAAHFPRS KIILSDGWVR IGHFTYESRD ALNGVYVHKL SSELLSLGAV TDQVVLQVVS NYGADHTCLY RVRLYGEKQS // ID H2ATL5_KAZAF Unreviewed; 566 AA. AC H2ATL5; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 14-OCT-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCF57715.1}; GN Name=KAFR0D00680 {ECO:0000313|EMBL:CCF57715.1}; GN ORFNames=KAFR_0D00680 {ECO:0000313|EMBL:CCF57715.1}; OS Kazachstania africana (strain ATCC 22294 / BCRC 22015 / CBS 2517 / OS CECT 1963 / NBRC 1671 / NRRL Y-8276) (Yeast) (Kluyveromyces OS africanus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Kazachstania. OX NCBI_TaxID=1071382 {ECO:0000313|EMBL:CCF57715.1, ECO:0000313|Proteomes:UP000005220}; RN [1] {ECO:0000313|EMBL:CCF57715.1, ECO:0000313|Proteomes:UP000005220} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 22294 / BCRC 22015 / CBS 2517 / CECT 1963 / NBRC 1671 / RC NRRL Y-8276 {ECO:0000313|Proteomes:UP000005220}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE650824; CCF57715.1; -; Genomic_DNA. DR RefSeq; XP_003956850.1; XM_003956801.1. DR EnsemblFungi; CCF57715; CCF57715; KAFR_0D00680. DR GeneID; 13885673; -. DR KEGG; kaf:KAFR_0D00680; -. DR InParanoid; H2ATL5; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000005220; Chromosome 4. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005220}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005220}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 566 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003558881. FT TRANSMEM 519 536 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 566 AA; 64639 MW; 6B964175E74D8D32 CRC64; MKCQIVAILL YCIAYVTRGA AEEGHKEDTK LSDNNSQQDN AGTFVSFEDW KRSKEEDISN SNRRNREPVD PSCYGDGECI GEEMEFEVNF FTGNDDNLPS HKNGGDRENY EEEPEGKLYK HKFNYASLDC AATIVKTNSE ASGATSILIE NKDKYLLNPC SAANKFVVIE LCEDILIEEI IIANFEYFSS TFKHIRVSAS DRFPVNKNKW TSLGEFDCEN VRNLQRFSIP NPQIWARYLR LEILSHFDDE FYCPISLVRV HGKTMMDEFK LSEIANNDID SDLQLVKNLT EVDPSLKETR VGEAADGKQA DPEEETKENV LDKCSTWPYV DLDNKTLIPH ISSFFNVCES QFEPLKFEEF LKEINLDIMN ITNTLAVANS PVSSSLQSNS NSNSKPSKEA AQLPLSPEES IFKNIIKRIA TLEHNATLTI LYMEEQSKLL SSSFENLEAK YAVRFDNLIG MFNDTIMNNI ELLKVFAKQL KDQSLGILEE TKNNNDAFIK SSDLRIISLE NEIRYQRKLV LFTLGILAVY HIYLLFSKIL YIEVDEDEVE KEEVKEQKPK DPVGKS // ID H2B0H5_KAZAF Unreviewed; 602 AA. AC H2B0H5; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 14-OCT-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CCF60125.1}; GN Name=KAFR0J00570 {ECO:0000313|EMBL:CCF60125.1}; GN ORFNames=KAFR_0J00570 {ECO:0000313|EMBL:CCF60125.1}; OS Kazachstania africana (strain ATCC 22294 / BCRC 22015 / CBS 2517 / OS CECT 1963 / NBRC 1671 / NRRL Y-8276) (Yeast) (Kluyveromyces OS africanus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Kazachstania. OX NCBI_TaxID=1071382 {ECO:0000313|EMBL:CCF60125.1, ECO:0000313|Proteomes:UP000005220}; RN [1] {ECO:0000313|EMBL:CCF60125.1, ECO:0000313|Proteomes:UP000005220} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 22294 / BCRC 22015 / CBS 2517 / CECT 1963 / NBRC 1671 / RC NRRL Y-8276 {ECO:0000313|Proteomes:UP000005220}; RX PubMed=22123960; DOI=10.1073/pnas.1112808108; RA Gordon J.L., Armisen D., Proux-Wera E., OhEigeartaigh S.S., RA Byrne K.P., Wolfe K.H.; RT "Evolutionary erosion of yeast sex chromosomes by mating-type RT switching accidents."; RL Proc. Natl. Acad. Sci. U.S.A. 108:20024-20029(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE650830; CCF60125.1; -; Genomic_DNA. DR RefSeq; XP_003959260.1; XM_003959211.1. DR EnsemblFungi; CCF60125; CCF60125; KAFR_0J00570. DR GeneID; 13883775; -. DR KEGG; kaf:KAFR_0J00570; -. DR InParanoid; H2B0H5; -. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000005220; Chromosome 10. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005220}; KW Reference proteome {ECO:0000313|Proteomes:UP000005220}. FT COILED 193 220 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 602 AA; 71055 MW; AF06950361344E5A CRC64; MDAGSAERSQ NSIRDKYRDL LSQRLNESVK RTEDDDSDDE SYEPNNSEGY EFESEDDEDY SMVEDPDKSF IQDNGDYEDT GLYSTEEDDA YYYDDDDEEE EEEEYRQPWF KSWGSWLLIF LFFSLLFKQW GWNTETGSLQ SVGSMKSIQR QINHLYNELN SRDLKQKNEL DNKLKIVISQ FEKNIKKLLP SNVLNYQSEI DKLNKKVNSL SQQEVTLENV TEWQKELINQ LKENLPNEIP VMVNNSSSVL VIPELHTYLS EIISNLVQNN VPPKMLEYDL NNYIREILTN EFQYVEKNYF INELEHKLQR NKLEIWNEIQ NNLETWKNDQ NSYKESSSPQ YSTIFLKRLI NEIYNVNQHQ WDDDLDFATW SQGTKLIKHL TSPVYSKGNR LPSTELLSDS KYISSSTYWQ CPNDKCSLAL RFSKPLHLTK ISYLHGRFKN NLHMMSSAPK LISMYIKLAN QNGSKKLIHI AKKFDMGQVY RKDSTYIRVA TYRYNLANDK IKQDFALPKW FIEFKPLIKS VLFQVEENHG NKDYVSLRKF IINGVLEEDL QILRNDQLEL RFNRNTPEYS AQSEEDLLTH QQPKLITNSH IPSFGQDELD IA // ID H2KRG1_CLOSI Unreviewed; 1335 AA. AC H2KRG1; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 14-OCT-2015, entry version 9. DE SubName: Full=Protein osteopotentia homolog {ECO:0000313|EMBL:GAA28979.2}; GN ORFNames=CLF_106254 {ECO:0000313|EMBL:GAA28979.2}; OS Clonorchis sinensis (Chinese liver fluke). OC Eukaryota; Metazoa; Platyhelminthes; Trematoda; Digenea; OC Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis. OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA28979.2, ECO:0000313|Proteomes:UP000008909}; RN [1] {ECO:0000313|Proteomes:UP000008909} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., RA Lv X., Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., RA Hu X., Xu J., Yu X.; RT "The draft genome of the carcinogenic human liver fluke Clonorchis RT sinensis."; RL Submitted (SEP-2011) to the EMBL/GenBank/DDBJ databases. RN [2] RP NUCLEOTIDE SEQUENCE. RC STRAIN=Henan; RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W., RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L., RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., RA Xu J., Wu Z., Yu X.; RT "The genome and transcriptome sequence of Clonorchis sinensis provide RT insights into the carcinogenic liver fluke."; RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DF143161; GAA28979.2; -; Genomic_DNA. DR InParanoid; H2KRG1; -. DR Proteomes; UP000008909; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008909}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008909}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 1053 1078 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 870 897 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1335 AA; 146297 MW; 454690E73213BD09 CRC64; MLSEGSGAAA SNGGGTEQKT NKGTEKPSRT SHSTTSQSRQ ARGDKPHPGV HSSSESTDER RSKSQATRGN NLARGESQMS VEDEHVGSLA PDDLAEKSSI GQDSSFNVHD GDGVHMPSGQ SRVSSSTDPE IHRAESARSN SVDGVKSSVS NDVPSEPKST GLRSAVPVTL RRNVAAVACG AKLLDSSKAI KNAESILNGN NDEYMNVPCS AEKWFVIEVC EPVQLRVVEM ANYELFSSRV KSFRVLVSDR YPAKTWDTIG VFTAQDVKGL QTFDVSSDKL IKYVKFEMLE HYGSEHYCPI SMIRLFGLVS DDLDDDDDEL PMHQVNQQLP TGGVPSSTLF DSVPAVDPKN VPYGDADLSR SHQGTDAVEL PTVLNPPESM KIPQADPLRQ AGSTDLPPSS QRVIDDTHPG EDTGHGPQQG LPNRAAQLRT KPQVDSLKDL KLSGELHGGH ADSGKDRAVR KLDHSEKCRV DTNGVDGAPM DRVAVTDATT TMSTPVQSIG TTPMHPVHGD RSAGSGVPSR QSSADFVRKT DDHSSAVTET DGNFFAKIKN FFRNVFSTSF FRVTVPETTH VDYNLSAIQL CPHHLAAIDS LLSYGLLNAS LDSSTTLFLS GRSRRGITAL STCLFQLEQL VQPVTVVLGH SPVTVTHRIL SIPRKVGATA FQSCSEASQL LNITFFNGYH PTDQLQATVR PNVEYNILQL LVAYQYRYQF KQHSTVVSSA LVSHSPLAPC GSWWTPNQLS LPENSVCPKL PDFLLTVDRY SEQDLVQRPP QVNLSLSAEI SSHSTHPRQP ELSMSETIVV PAGLSGSRKD TVMMRLSNRV RLLERNVSVS MRYLEELSQS YRHQMERLSR SFNLTNAWLK ATAQGAQERD QLQQARIDQL ELRLEELLTR IRNGSLASAP NASVGLLDDA ASLRNLQNKQ NNTLAIEMFT PPLPNFDPPP DWNPSPSVYY GQWTQLDEDW GDEDAFVETE EETPTHASHA DEPSTKSGLP HKSSSISCVF PLRDHAGRQC IDVHTEEPDS WWQYFRTSWN RCTRLFLSSS RWLSHVVSTH TTLFGMISMA CVHFMFALTS HVLIYLVWLR PHGRKLSMPP SGSFTLIQSR SGEWLLTQNP THKTGLISSG RSQKPTVPGG TVAISCRGSK DVVDSIRQSP AERKIGVALV KPSSSIPRCI PSAQNRKEAT SPGNIEYHQV TPTDAIAKIS QVSSPAFPTT SSTRQTNGGL KVNGSMYYEK FVPSTHSATH TLPRSVGSHF SKVQSSNTDF PVYMPRAPDP SAKFVQAQHK SIGASPSISC PSLITQHSTT TTTEGQGSSN SSTLTSRNAR RRRKRLERAK MNQLC // ID H2L8X7_ORYLA Unreviewed; 1051 AA. AC H2L8X7; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSORLP00000002309}; GN Name=LOC101169776 {ECO:0000313|Ensembl:ENSORLP00000002309}; OS Oryzias latipes (Japanese rice fish) (Japanese killifish). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; OC Oryziinae; Oryzias. OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000002309, ECO:0000313|Proteomes:UP000001038}; RN [1] {ECO:0000313|Ensembl:ENSORLP00000002309, ECO:0000313|Proteomes:UP000001038} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000002309, RC ECO:0000313|Proteomes:UP000001038}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSORLP00000002309} RP IDENTIFICATION. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000002309}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSORLP00000002309}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BAAF04076824; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BAAF04076825; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8090.ENSORLP00000002309; -. DR Ensembl; ENSORLT00000002310; ENSORLP00000002309; ENSORLG00000001852. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; H2L8X7; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000001038; Chromosome 4. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001038}; KW Reference proteome {ECO:0000313|Proteomes:UP000001038}. FT COILED 357 379 {ECO:0000256|SAM:Coils}. FT COILED 726 746 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1051 AA; 116150 MW; 5D9AC0B4F55D2BC0 CRC64; NTAKEPDPSV PSKEDIPTFD EWKKQVLEVE KEKSQTLHTS TSGSSHPVKK AQKNFKNNYA SVECGAKILS ANSDAKSTSA ILMENMDLYM LNPCNNKIWF VIELCEPIQV KQLDIANFEL FSSTPKDFLV SISDRYPTNK WVKLGTFHAR DERIVQSFPL DEQLYAKFVK VELLSHFGSE HFCPLSLIRV FGTSMVEEYE EIADSQYPSE RPEYLDEDYD YPPGYQPSED KISKNLLCSA TNAIIDMVSN IAANVLGGKP RPEGGAELEV DHEPSFTEAK TRPLPKEKFR CPAAKHSQPH TINKNKISED SAAQDISHTS SDSSSLPPSS ESDEDRQIVT LVEEEEEDEP KQSTVTLMKE DAEEEETDEE KKRQEADWKE WESQLYCPLS FFSSLSLSCS ATLPELLLRW CSARVAKERL RSLKRRGDRP QTHTHPDVDT HPTPSITASV PTPPLESLPL TEKAPEPEES DRIYAEGGAV GETPSTDDHP RTITPEAHST EPVILLEPSR TSSIPPHTFT DIQGSLTRPT PEKTLLPPQR DGSRVSVATP PLEAALILET QQTSNATPSP SVSCAPSFSE TTVSTVSEQP TQTPPTAPRP EDLQTPPTDL PATPPPADGH SETLRQSNKA AQVLIEGDPL PPSGEPHRGE EAVEEELLST SGSSNTQRAA ADFYAELQNA GDYSGGAVSG NGGAVHGSSQ KESVFMRLNN RIKALEMNMS LSSRYLEELS QRYRKQMEEM QRAFNKTITK LQNTSRIAAE QDQKQTDSIQ LLQTQLENIT KLMLNLTATV TQLQMEVSDR QSYLVVSLVL CLLLGLLLCL QCCRSSTPNS PTNADALPKS NNYPSPKRCF SSYDDMNLKR RVTCPLVRSK SFQLPSSEGP DDLYIVEPLR FSPEKKRKRS KTKSLDKVET LKGFDPSAAS ANGGPKCNGF HQCQASLLPP CLQNEIPTET SSCSSSTHSE ESYTSRLPPP SPTFPSAGLC NGHALPFPPS QPSAKSRQEK RSMKRRKSRQ TDLHFPHMPG GGALPALQQF MKGNKEMSVG TIRVSTLSGH V // ID H2LX55_ORYLA Unreviewed; 984 AA. AC H2LX55; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSORLP00000010708}; OS Oryzias latipes (Japanese rice fish) (Japanese killifish). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; OC Oryziinae; Oryzias. OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000010708, ECO:0000313|Proteomes:UP000001038}; RN [1] {ECO:0000313|Ensembl:ENSORLP00000010708, ECO:0000313|Proteomes:UP000001038} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000010708, RC ECO:0000313|Proteomes:UP000001038}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSORLP00000010708} RP IDENTIFICATION. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000010708}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSORLP00000010708}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BAAF04067649; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8090.ENSORLP00000010708; -. DR Ensembl; ENSORLT00000010709; ENSORLP00000010708; ENSORLG00000008524. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2LX55; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001038; Chromosome 8. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001038}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001038}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 353 380 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 400 422 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 434 451 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 568 599 {ECO:0000256|SAM:Coils}. FT COILED 649 669 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 984 AA; 109387 MW; B6AE7088BB352867 CRC64; MDFSQLHTYT PPQCAPENTG YTYSLSSSYS TAALEFEKEH QIAPVYESPR MSRRSLRLQS GATHYGNEGL ADHCHNQNSS YNSTSKEMRL SKTLRSRKQQ QQSLSSSMSL SLSQVATPRQ TLSFSAVSTP IDLTSVYSRS HTRKQTVSTI STTTTVDLCR GQSSAQKSNV NGKASASKSH MALSNGYICN NCSLHAQKTD ACMSNSSCQA QEVSTEALSS TSSPFTSVLM SVSNTSHSSF SGSINVNDQA TDDLSHLNLN GSQCKLQHTH THLTHTLWVL TVCFSKSVFK SRDYSRSTLG FTSTSKRASL IPILSDISIL TSLTGNDGTG KHYSAANTVL LTERSRSKGV VGALWSVLLF TGYFVVKASS ALASGFYSGA KNLLSCLWLL MTTPVKASKG LLWFLGQGWY QLVSLMSLLN IFFLTQCVPR LWKLLWFLLP FLLLLVLWLW GPSAATLFAY LPAINLTEWN LASAFSFSNF KKIPASVPVT EPPMKQAPVA PVSQATTLNV DLKRLEDVER QLALLWERVK QEDEQQDQRH QDIFSLYSKL KEQLHMQTNR ESLGVWVSSL LEERLSSLRE KLEQENTQRT QNEEQQEQRQ ESQAVRLAEL ELLLKALSGK TEKQTAFLKK SVISTAAVSV IAGVQQEDHD ALLVEVQRLE AELVKIRQEL QTVVGCKGKC GQLDTLQETV SSQVRRELQA LFFGAGGSED QQGEVPESLI SWLSQRYVGT PDLQAALASL ELSILKNVSQ QLELSRAQTL TEAESKTQNM VQTITGTVQH TSSSEGLTEE QVKLIVQNAL KLFSQDRTGQ VDYALESGDS GGSILSTRCS ETYETKTALM SLFGLPLWYF SQSPRVVIQP DVYPGNCWAF KGSQGYLVIR LSLRILPTSF CLEHIPKSLS PTGNITSAPR NFTVFGLDDE YQDKGKLLGH YTYEENGEAL QIFPVKEEND KTFQIIEVRV LSNWGHPEYT CLYRFRVHGE PRPE // ID H2M1W7_ORYLA Unreviewed; 901 AA. AC H2M1W7; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSORLP00000012402}; GN Name=SUCO (2 of 2) {ECO:0000313|Ensembl:ENSORLP00000012402}; OS Oryzias latipes (Japanese rice fish) (Japanese killifish). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; OC Oryziinae; Oryzias. OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000012402}; RN [1] {ECO:0000313|Ensembl:ENSORLP00000012402} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000012402}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSORLP00000012402} RP IDENTIFICATION. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000012402}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSORLP00000012402}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BAAF04061043; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BAAF04061044; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8090.ENSORLP00000012402; -. DR Ensembl; ENSORLT00000012403; ENSORLP00000012402; ENSORLG00000009886. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR OMA; KIWFIIE; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000001038; Chromosome 17. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001038}; KW Reference proteome {ECO:0000313|Proteomes:UP000001038}. FT COILED 625 645 {ECO:0000256|SAM:Coils}. FT COILED 668 688 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 901 AA; 98443 MW; 30BA60A50726C7F7 CRC64; MEVEKEKTLS TDTFINGGSH VPKKVQKHFN NYASVECGAK ILGANPEAKS TSAILKENMD LYMLNPCSTK IWFIIELCEP IQVRQLDIAN FELFSSTPKD FLVSISDRYP TNKWLKLGTF HARDERTVQS FPLDEQLYAK YVKMFTKYIK VELVSHFGSE HFCPLSLIRV FGTSMVEEYE IADPSERADD QDDELVDPTG FFPGEPKSSE NLIGSAKDAI MNMVNNIANN VLGGSAEIKD SEDMQILPGT DVPPAEPPSS ETPLPEEAAN TESEVPQEEK RLVILLEKEE EEPMRSTVIL LEKEDKPYEE QRDAPEHHQV DPSCCSPAPS LRDYVQRQSS AKSRKCQTGD REPESPPIHP STPLAPSSEP EPPHTDDEAA SELTSQPVEE SISEVLDPSL TLNLPALIVS ESSCAQPGTT VQTPPLSASA EEPQRSQDAS VEEEHLQTAE SPSSSVHVQP TISFPAVDTS VTPTVDKSNI DATSTEMTVP VPTQDLFSVL PPTYSAQSEQ PAEAPSAPDG APGPLEGSGP IPEPVIEAEP SGGQAADADA EDVPASSNAN GQSPHPNSAA FPQGSASDIY AEPFNGTEQN GNPMHGSSQK ESVFMRLNNR IKALEVNMSL SGRYLEQLSQ RYRKQMEEMQ RAFNKTIIKL QNTARIAEEQ DLRQTESIQL LQSQLENMTQ LLLNLSVRVS QLHDEVSDRQ TYLLLSLMLC LCLGLLLCVN RSRFTGSPAI EPEPPAPKSY TYCCAERPLS SCEEAGLKRS ASYPLIHTDS LQLATSAADP EVLHAEETQS LCPTNRKRRR RKLKLLDKGE VLKASIKAAP ELCNGAACKG APVSTNPTAL TKRLLLPTFR DSPSEGSSEG SSQSDEPSFC GIAAACPRVC DGLPPSKTRA EKRALGRRPP K // ID H2M4G7_ORYLA Unreviewed; 125 AA. AC H2M4G7; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSORLP00000013335}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSORLP00000013335}; OS Oryzias latipes (Japanese rice fish) (Japanese killifish). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; OC Oryziinae; Oryzias. OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000013335, ECO:0000313|Proteomes:UP000001038}; RN [1] {ECO:0000313|Ensembl:ENSORLP00000013335, ECO:0000313|Proteomes:UP000001038} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000013335, RC ECO:0000313|Proteomes:UP000001038}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSORLP00000013335} RP IDENTIFICATION. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000013335}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSORLP00000013335}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BAAF04093962; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8090.ENSORLP00000013335; -. DR Ensembl; ENSORLT00000013336; ENSORLP00000013335; ENSORLG00000010634. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2M4G7; -. DR OMA; IRITHVT; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000001038; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001038}; KW Reference proteome {ECO:0000313|Proteomes:UP000001038}. SQ SEQUENCE 125 AA; 14230 MW; CEA945BEC44FAA08 CRC64; LLPGKCWAFH GVQGTLVISL SHPIRISHVT LDHLPRYNAP TGRIDSAPKD FQVYLDCIFL SQGMKNDTEE GMLLGAFTYD ENGESSQTFE LPNPSDVVYR IVELRVLSNW GHMEYTCLYR FRVHG // ID H2M4P6_ORYLA Unreviewed; 2580 AA. AC H2M4P6; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSORLP00000013415}; OS Oryzias latipes (Japanese rice fish) (Japanese killifish). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; OC Oryziinae; Oryzias. OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000013415, ECO:0000313|Proteomes:UP000001038}; RN [1] {ECO:0000313|Ensembl:ENSORLP00000013415, ECO:0000313|Proteomes:UP000001038} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000013415, RC ECO:0000313|Proteomes:UP000001038}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSORLP00000013415} RP IDENTIFICATION. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000013415}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSORLP00000013415}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BAAF04094490; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BAAF04094491; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8090.ENSORLP00000013415; -. DR Ensembl; ENSORLT00000013416; ENSORLP00000013415; ENSORLG00000010702. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H2M4P6; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000001038; Chromosome 22. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 2. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001038}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001038}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 819 839 {ECO:0000256|SAM:Coils}. FT COILED 1247 1267 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2580 AA; 284719 MW; 63B4E39DF03D3B40 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE ASGLNCVLSF IRDSGHLVHK DTLHSAMAVV SRLCSKMEPQ DSSLETCVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGSVSGS SSACKPGRPS TGATPSASDS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ALPNSMESAL GGDERCVLDT MRLVDLLLVL LFEGRKALPK STAGSAGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKRKKDA SKEEEEGSEP KGDPEMAPFY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MVHYSSEVLL REVCDSETGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDV FLDQLARLGV INKVSTLAGP TSDDENEDEV KPEKEEEVQE DAKEIQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPATA SQPILSMVGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVKSMAREL YDDHFKAVES MPRGVVVTLR NIATQLESAW ELHINRQCLE GENTWRDLMK TALENLIVVL KDENTISPYE MCSSGLVQAL FTVLSNVSSV ELDLKHDCKP LMERINVFKA AFSENEDNES RPAVALVRKL IAVLESIERL PLHLYDTPGS SYNLQILTRR LRFRLERAPG ETALIDRTGR MLKMEPLATV ESLEQYLLKM VAKQWYDFER SSFVFVRKLR EGQIFTFRHQ HDFDENGIIY WVGTNAKTAY EWVNPAAYGL VVVTSSEGRN LPYGRLEDIL SRDSSALNCH TNDDKNAWFA VDLGLWVIPS AYTLRHARGY GRSALRNWVF QVSKDGQNWS TLYNHIDDGS LNEPGSTATW PLDPSKEEKQ GWRHIRIKQM GKNASGQTHY LSLSGLELYG TVTAVCEDQL GKAVKEAEAN LRRQRRLFRS QVMKYIVPGA RVIRGIDWKW RDQDGNPAGE GTVTGEAHNG WIDVTWDAGG SNSYRMGAEG KFDLKLAPGY DPELAATAPS PKPVSSAVSG PASSTQPSWS SLVKNNCPDK GGASSSSRKG SSSSVCSVAS SSDISLSSSI GLPGAGSLQL DRKAEGLLLD QAAGVEGVTG GGTGSDGHQQ EPIVVLSSVT ESGSASSSGT LTADVSTAGD EHHSPSAMAT AVDPATAISM GLVSVSSPDV SSVSEPSSKD THSQRPLCSA VNTRLSVSSL LAASAPMSSS ASVPNLSSRE ASLMESFVRR APNMSRTNAT NNMNLSRSSS DNNTNTLGRN VMSTATSPLM GAQSFPNLTT TGTTSTVTMS TSIVTCGNNV ATATTGLSVG QLLSNTLTTS LTSTSSESDT GQEAEFSLYD FLDSCRANTL LAELDDEEDL PEPDDDDDEN EDDNQEDQEY EEVLALLAPR LFTPKPTVCQ LRLWAAFQEE EEYETKGGRR RTWDDDFVLK RQFSALVPAF DPRPGRTNVQ QTTDLEIPPP GTPRSEVREE VECAPSPHLA LTLKVAGLGT AREVEIPLSN YKLTIFYYVQ RLLQLSCNGA VKTDKLRRIW EPTYTIMYRE LKDSDKEKES GKMVSLCGGW GWSRHFRMFN DLFEHVVGSR SGGLSPSSGS ANQSSEILGW AKEAVLAKAG CSQNACGVED VLQLLRILYV IGRDSTAAVR GPYVDELQFN SPPEEFSSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATMERSR PSTTVRRDDP GEFRVGRLKH ERVKVPRGES MMEWAESVMQ IHANRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTSL GIWLCDDDFP DDESRQVDLG GGLKPPGFYV QRSCGLFPAP FPQDSEELER ITRLFHFLGI FLAKCIQDNR LVDLPVSQPF FKLLCMGDIK SNMSKLLYQS RGSPQGHLSE RAPLLLLTEV QSEASTEESQ ETYSVGSFDE DSKSDFIMDP PKPKPPAWFH GIMSWDDFHL VNPHSRASFL KEVKELAVRR RQILTSKSLS EDEKNTRLQD LMLRNPLGSG PPLSVEDLGL NFQFCPSSKI HGFFAVDLKP NGDDEMVSLE NAEEYVELMF DFCMHAGIQK QMEAFREGFN RVFPMEKLSS FSQKEVQMIL CGNQSPCWTA DDIISYTEPK LGYTRDSPGF LRFIRVLCGM STDERKAFLQ FTTGCSTLPP GGLANLHPRL TIVRKVDATD SSYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID H2MDT0_ORYLA Unreviewed; 157 AA. AC H2MDT0; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSORLP00000016731}; GN Name=SUN1 (2 of 2) {ECO:0000313|Ensembl:ENSORLP00000016731}; OS Oryzias latipes (Japanese rice fish) (Japanese killifish). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; OC Oryziinae; Oryzias. OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000016731}; RN [1] {ECO:0000313|Ensembl:ENSORLP00000016731} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000016731}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSORLP00000016731} RP IDENTIFICATION. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000016731}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSORLP00000016731}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BAAF04058646; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 8090.ENSORLP00000016731; -. DR Ensembl; ENSORLT00000016732; ENSORLP00000016731; ENSORLG00000013348. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; CFGFRGS; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001038; Chromosome 19. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001038}; KW Reference proteome {ECO:0000313|Proteomes:UP000001038}. SQ SEQUENCE 157 AA; 17812 MW; 935481E4D72CABC8 CRC64; ETYETRAALL SMFGVPLWYF SQSPRTVIQP DVHPGNCWAF RGSTGFLVIR LSMSILPTAF TLEHIPKALT PGGMLLSAPR HFSVYGLTAE NQEHGRLLGT FTYREDGEAL QTFLVTVSPL EENKEFFQII EVQVLSNWGH PDYTCMYRFR VHGTPTK // ID H2N4P3_PONAB Unreviewed; 1254 AA. AC H2N4P3; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPPYP00000000582}; GN Name=SUCO {ECO:0000313|Ensembl:ENSPPYP00000000582}; OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pongo. OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000000582, ECO:0000313|Proteomes:UP000001595}; RN [1] {ECO:0000313|Ensembl:ENSPPYP00000000582, ECO:0000313|Proteomes:UP000001595} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Wilson R.K., Mardis E.; RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome."; RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSPPYP00000000582} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPPYP00000000582}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGA01089568; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01089569; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_003775543.1; XM_003775495.2. DR STRING; 9601.ENSPPYP00000000582; -. DR Ensembl; ENSPPYT00000000607; ENSPPYP00000000582; ENSPPYG00000000504. DR GeneID; 100461791; -. DR KEGG; pon:100461791; -. DR CTD; 51430; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; H2N4P3; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000001595; Chromosome 1. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001595}; KW Reference proteome {ECO:0000313|Proteomes:UP000001595}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1254 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003566248. FT COILED 936 956 {ECO:0000256|SAM:Coils}. FT COILED 986 1006 {ECO:0000256|SAM:Coils}. FT COILED 1192 1212 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1254 AA; 139317 MW; 69419B67633BD087 CRC64; MKKHRRALAL VSCLFLCSVV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFQKKD EREGPINAES LGKSGSNLPI SPKEHKLKDD SIVDVQNTES KKLSPPVVET LPTVDLHEES SNAVVDSETV ENISSSSTSE ITPISKLDEI EKSGTIPIAK PSETEQSETD CDVGEALDAS APIEQPSFVS PPDSLVGQHI ENVSSSHGKG KITKSEFESK VSASEQGGGD PKSALNASDN LKNESSDYTK PGDIDPTSVA SPKDPEDIPT FDEWKKKVME VEKEKSQSMH ASSNGGSHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LLSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYHSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKSISEN ATATAAPKMP ESTPVSTPVP SPEYVTTEVH THDMEPSTPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWFE SETQIFCSEL TTICCISSFS EYIYKWCSVK VALYRQRSRT ALSKGKDYLV SAQPPLLLPA ESVDVSVLQP LSGELENKNM EREAETVVLG DLSSSVHQDD LVNHTVDAVE LEPSHSQTLS QSLVLDITPE INLLPKIEVS ESVEYEAGHT PSQVIPQESS VEIDNEAEQK SESFSSIEKP SITYETNKVN ELMDNIIKED VNSMQIFTKL SETIVPPINT ATVPDNEDGE AKMNIADTAK QTLISVVDSS SLPEVKEEEQ SPEDALLRGL QRTATDFYAE LQNSTDLGYA NGNLVHGSNQ KESVFMRLNN RIKALEVNMS LSGRYLEELS QRYRKQMEEM QKAFNKTIVK LQNTSRIAEE QDQRQTEAIQ LLQAQLTNMT HLVSNLSATV AELKREVSDR QSYLVISLVL CVVLGLMLCM QRCRNTSQFD GDYISKLPKS NQYPSPKRCF SSYDDMNLKR RTSFPLMRSK SLQLTGKEVD PNDLYIVEPL KFSPEKKKKR CKYKIEKIET IKPAEPLHPI ANGDIKGRKP FTNQRDFSNM GEVYHSSYKG PPSEGSSETS SQSEESYFCG ISACTSLCNG QSQKTKTEKR ALKRRRSKVQ DQGKLIKTLI QTKSGSLPSL HDIIKGNKEI TVGTFGVTAV SGHI // ID H2NKZ2_PONAB Unreviewed; 2610 AA. AC H2NKZ2; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 24. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPPYP00000006507}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSPPYP00000006507}; OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pongo. OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000006507, ECO:0000313|Proteomes:UP000001595}; RN [1] {ECO:0000313|Ensembl:ENSPPYP00000006507, ECO:0000313|Proteomes:UP000001595} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Wilson R.K., Mardis E.; RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome."; RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSPPYP00000006507} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPPYP00000006507}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGA01394911; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01394912; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01394913; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01394914; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01394915; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01394916; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01394917; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01394918; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01394919; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01394920; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002824691.1; XM_002824645.3. DR ProteinModelPortal; H2NKZ2; -. DR STRING; 9601.ENSPPYP00000006507; -. DR Ensembl; ENSPPYT00000006763; ENSPPYP00000006507; ENSPPYG00000005724. DR GeneID; 100443926; -. DR KEGG; pon:100443926; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H2NKZ2; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000001595; Chromosome 14. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001595}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000001595}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289368 MW; C02FB51A2AABF98B CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAIST LQSSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EDGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID H2P1M8_PONAB Unreviewed; 379 AA. AC H2P1M8; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPPYP00000012191}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSPPYP00000012191}; OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pongo. OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000012191, ECO:0000313|Proteomes:UP000001595}; RN [1] {ECO:0000313|Ensembl:ENSPPYP00000012191, ECO:0000313|Proteomes:UP000001595} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Wilson R.K., Mardis E.; RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome."; RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSPPYP00000012191} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPPYP00000012191}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC188112; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002830245.1; XM_002830199.2. DR STRING; 9601.ENSPPYP00000012191; -. DR Ensembl; ENSPPYT00000012670; ENSPPYP00000012191; ENSPPYG00000010914. DR GeneID; 100459624; -. DR KEGG; pon:100459624; -. DR CTD; 140732; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2P1M8; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001595; Chromosome 20. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001595}; KW Reference proteome {ECO:0000313|Proteomes:UP000001595}. FT COILED 155 175 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 379 AA; 42970 MW; 1AE7860033ECB0AF CRC64; MPRSSRSPGD PGALLEDVAH NPRPRRIAQR GRNTSRMAED TSPNMNDNFL LPVRINAQAL GLTQCMLGCV SWFTCFACSL RTQAQQVLFN TCRCKLLCQK LVEKTGILLL CAFGFWMFSI HLPSKMKVWQ DDSINGPLQS LRLYQEQVRH HSGEIQDLRG SMNLLIAKLQ EMEAMSDEQK MAQKIMKMIH GDYIEKPDFA LKSTGASIDF EHTSATYNHE KAHSYWNWIQ LWNYAQPPDV ILEPNVTPGN CWAFEGDRGQ VTIQLAQKVY LSNLTLQHIP KTISLSGSLD TAPKDFVIYG MEGSPKEEVF LGAFQFQPEN VIQMFPLQNQ PARAFSAVKV KISSNWGNPG FTCLYRVRVH GSVAPPREQP HQNPYPERD // ID H2P1S2_PONAB Unreviewed; 438 AA. AC H2P1S2; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPPYP00000012235}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSPPYP00000012235}; OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pongo. OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000012235, ECO:0000313|Proteomes:UP000001595}; RN [1] {ECO:0000313|Ensembl:ENSPPYP00000012235, ECO:0000313|Proteomes:UP000001595} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Wilson R.K., Mardis E.; RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome."; RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSPPYP00000012235} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPPYP00000012235}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGA01363707; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002830299.1; XM_002830253.2. DR STRING; 9601.ENSPPYP00000012235; -. DR Ensembl; ENSPPYT00000012716; ENSPPYP00000012235; ENSPPYG00000010958. DR GeneID; 100454040; -. DR KEGG; pon:100454040; -. DR CTD; 6676; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2P1S2; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001595; Chromosome 20. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001595}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001595}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 438 AA; 48088 MW; 8799ABA4427FEE1F CRC64; MRRSSRPGSA SSSRKHTPNF FSENSSMSIT SEDSKGLRSA GTGPGEPEGR RARGPSCGEP ALSAGVPGGT TWAGSSQQKP APRSHNWQTA CGAATVRGGA SEPTGSPVVS EEPLDLLPTL DLRQEMPPPR VFKSFLSLLF QGLSVLLSLA GDVLVSMYRE VCSIRFLFTA VSLLSLFLSA FWLGLLYLVS PLENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL RKTSHDYADR NTAYFWNRFS FWNYARPPTV ILEPHVFPGN CWAFEGDQGQ VVIQLPGRVQ LSDITLQHPP PSVEHTGGAN SAPRDFAVFG LQVDDETEVS LGKFTFDVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP HFTCLYRVRA HGVRTSEGAE GSATGGPH // ID H2P4E4_PONAB Unreviewed; 721 AA. AC H2P4E4; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPPYP00000013195}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSPPYP00000013195}; OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pongo. OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000013195, ECO:0000313|Proteomes:UP000001595}; RN [1] {ECO:0000313|Ensembl:ENSPPYP00000013195, ECO:0000313|Proteomes:UP000001595} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Wilson R.K., Mardis E.; RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome."; RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSPPYP00000013195} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPPYP00000013195}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGA01111555; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01111556; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01111557; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002831186.1; XM_002831140.3. DR RefSeq; XP_002831188.1; XM_002831142.3. DR STRING; 9601.ENSPPYP00000013195; -. DR Ensembl; ENSPPYT00000013733; ENSPPYP00000013195; ENSPPYG00000011828. DR GeneID; 100453061; -. DR KEGG; pon:100453061; -. DR CTD; 25777; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2P4E4; -. DR KO; K19347; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001595; Chromosome 22. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001595}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001595}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 217 238 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 277 297 {ECO:0000256|SAM:Coils}. FT COILED 356 376 {ECO:0000256|SAM:Coils}. FT COILED 378 405 {ECO:0000256|SAM:Coils}. FT COILED 408 435 {ECO:0000256|SAM:Coils}. FT COILED 482 502 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 721 AA; 80767 MW; 8B513138E2FF9856 CRC64; MSRRSQRLTR YSQGDDDGSS SSGGSSVAGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDAHTSYY SESLVRESYI GTRFPPRSSL EELHGDADWG EDLRVRRRRG TGGSESSRAS GLMGRKATED FLGSSSGYSS EDDYVGYSDA DQQSSGSRLR SAVSRAGSLL WMVATSPGRL FRLLYWWAGT TWYRLTTAAS LLDVFVLTRR FSSLKTFLWF LLPLLLLTCL TYGAWYFYPY GLQTFHPALV SWWAAKDSRR QDEGWEARDS SPHFQAEQRV MSRVHSLERR LEALAAEFSS NWQKEAMRLE RLELQQGAPG QGGGGGLSHE DTLALLEGLV SRREAALKED FRRETAAHIQ EELSALRAEH QQDSEDLFKK IVRASQESEA RIQQLKSEWQ SMTQESFRES SVKELRRLED QLAGLQQELA ALALKQSSVA DEVGLLPQQF QAVRDDVESQ FPAWISQFLA RGGGSRVGLL QREEMQAQLR ELESKILTHV AEMQGKLARE AAASLGLTLQ KEGVIGVTEE QVHHIVKQAL QRYSEDRIGL ADYALESGGA SVISTRCSET YETKTALLSL FGIPLWYHSQ SPRVILQPDV HPGNCWAFQG PQGFAVVRLS ARIRPTAVTL EHVPKALSPN STISSAPKDF AIFGFDEDLQ QEGTLLGKFT YDQDGEPIQT FHFQAPTMAT YQVVELRILT NWGHPEYTCI YRFRVHGEPA H // ID H2PL98_PONAB Unreviewed; 968 AA. AC H2PL98; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPPYP00000019378}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSPPYP00000019378}; OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pongo. OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000019378, ECO:0000313|Proteomes:UP000001595}; RN [1] {ECO:0000313|Ensembl:ENSPPYP00000019378, ECO:0000313|Proteomes:UP000001595} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Wilson R.K., Mardis E.; RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome."; RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSPPYP00000019378} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPPYP00000019378}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGA01299809; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01299810; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01299811; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01299812; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01299813; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01299814; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01299815; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9601.ENSPPYP00000019378; -. DR Ensembl; ENSPPYT00000020141; ENSPPYP00000019378; ENSPPYG00000017288. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2PL98; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001595; Chromosome 7. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0006998; P:nuclear envelope organization; IEA:Ensembl. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001595}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001595}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 442 463 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 475 493 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 611 645 {ECO:0000256|SAM:Coils}. FT COILED 658 678 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 968 AA; 107567 MW; 366A4F64098EBEA2 CRC64; MRMRIEQACV PVCSSCPAQC LSGRSLREGS GETSLPSRVE QASTAVWFEV VNMDFSRLHM YSPPQCVPEN TGYTYALSSS YSSDALDFET EHKLDPVFDS PRMSRRSLRL ATTACTLGDG EAVGADSGTS SAVSLKNRAA RTAKQHRSAN KSAFSINHVS RQVTSSGVSH GGTVSLQDAV TRRPPVLDES WICEQTTVDH FWGLDDDGDL KGGNKAAIQG NGDVGAAAAT AHNGFSCSNC SMLSERKDVL TAHPVVPGPV LRVYSRDRNQ KCGASFYVNR ILWLARCTAS SFSSFLVQLF QVVLMKLSYE SENYKLKTHE SKDCESESYK SKSHESKAHA SYYGRMNVRE VLREDGHLSV NGEVLCDDCK GKRHLDAHTA APSQSPRPPG RAGTLRHIWA CAGYFLLQIL RRIGAAGRAV SRTVWSALWL AVAAPGKAAS GVFWWLGIGW YQFVTLISWL NVFLLTRCLR NICKFLVLLI PLFLLLAGLS LWGQGDFFSF LPVLNWASMH RTQRVDDPQD VFKPTTSRLN QPLQGDSEAF PWRWMSGVEQ QVASLSGQCH HHGEDLRELT TLLQKLQARV DRMDSGAAGP SASVRDTVGQ PLRETDFMAF HQEHEVRISH LEDILGKLRE KSEAIQKELE QTKQKTISAV GEQLLPTVEH LQLELDQLKS ELSSWRHVKT GCETVDAVRE RVDVQVREMV KLLFSEDEEG GSLEQLLQRF SSQFVSKGDL HTMLRDLELQ ILRNVTHHVS VTKQLPTSEA VVSAVSEAGA SGITEAQARA IVNNALKLYS QDKTGMVDFA LESGGGSILS TRCSETYETK TALMSLFGIP LWYFSQSPRV VIQPDIYPGN CWAFKGSQGY LVVRLSMMIH PAAFTLEHIP KTLSPTGNIS SAPKDFAVYG LENEYQEEGQ LLGQFTYDQD GESLQMFQAL KRPDDTAFQI VELRIFSNWG HPEYTCLYRF RVHGEPVK // ID H2PM58_PONAB Unreviewed; 357 AA. AC H2PM58; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPPYP00000019694}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSPPYP00000019694}; OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pongo. OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000019694, ECO:0000313|Proteomes:UP000001595}; RN [1] {ECO:0000313|Ensembl:ENSPPYP00000019694, ECO:0000313|Proteomes:UP000001595} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Wilson R.K., Mardis E.; RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome."; RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSPPYP00000019694} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPPYP00000019694}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; ABGA01303541; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01303542; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01303543; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01303544; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01303545; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01303546; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01303547; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01303548; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; ABGA01303549; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_002818010.1; XM_002817964.1. DR STRING; 9601.ENSPPYP00000019694; -. DR Ensembl; ENSPPYT00000020468; ENSPPYP00000019694; ENSPPYG00000017570. DR GeneID; 100441181; -. DR KEGG; pon:100441181; -. DR CTD; 256979; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2PM58; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001595; Chromosome 7. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001595}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001595}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 48 69 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 357 AA; 40579 MW; 70D35C8F79F20A18 CRC64; MSGKTKARRA AMLSRRCSED ASRSASGNAL LSEDENPDAN GVTRSWKIIL STMLTLTFLL VGLLNYQWLK ETDVPQKSRQ LYAIIAEYGS RLYKYQARLH MPKEQLELLK KESQTLENNF RQILFLIEQI DVLKALLRDM KDGTDNNHNW NTHGDPVEDP DHTEEMSNLV NYVLKKLRED QVQMADYALK SAGASIIEAG TSESYKNNKA KLYWHGIGFL NHEMPPDIIL QPDVYPGKCW AFPGSQGHTL IKLATKIIPT AVTMEHISEK VSPSGNIYSA PKEFSVYGIT KKCEGEEIFL GQFIYNKTGT TVQTFELQHA VSEYLLCVKL NIFSNWGHPK YTCLYRFRVH GTPGKHI // ID H2Q0M0_PANTR Unreviewed; 1406 AA. AC H2Q0M0; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPTRP00000002812}; GN Name=SUCO {ECO:0000313|Ensembl:ENSPTRP00000002812}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000002812, ECO:0000313|Proteomes:UP000002277}; RN [1] {ECO:0000313|Ensembl:ENSPTRP00000002812, ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136131; DOI=10.1038/nature04072; RG Chimpanzee sequencing and analysis consortium; RT "Initial sequence of the chimpanzee genome and comparison with the RT human genome."; RL Nature 437:69-87(2005). RN [2] {ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136134; DOI=10.1038/nature04101; RA Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., RA Rozen S., Wilson R.K., Page D.C.; RT "Conservation of Y-linked genes during human evolution revealed by RT comparative sequencing in chimpanzee."; RL Nature 437:100-103(2005). RN [3] {ECO:0000313|Ensembl:ENSPTRP00000002812} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPTRP00000002812}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACZ03000746; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03000747; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03000748; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03000749; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9598.ENSPTRP00000002812; -. DR PaxDb; H2Q0M0; -. DR Ensembl; ENSPTRT00000003061; ENSPTRP00000002812; ENSPTRG00000001689. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; H2Q0M0; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000002277; Chromosome 1. DR GO; GO:0016020; C:membrane; IEA:Ensembl. DR GO; GO:0005791; C:rough endoplasmic reticulum; IEA:Ensembl. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IEA:Ensembl. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IEA:Ensembl. DR GO; GO:0046850; P:regulation of bone remodeling; IEA:Ensembl. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002277}; KW Reference proteome {ECO:0000313|Proteomes:UP000002277}. FT COILED 1088 1108 {ECO:0000256|SAM:Coils}. FT COILED 1138 1158 {ECO:0000256|SAM:Coils}. FT COILED 1344 1364 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1406 AA; 156536 MW; 2C4E0B520F9DC186 CRC64; MRGFLARPFL STNQHLAQWG SPLPQGKGLV QLPSQHTRHS RPFHELCSKE ENSATVPKLI SLVVSGETID FSNNTMDSRR DWEREKRILE GKLQLPKALA RTQRARDEGR RAWTSRWPQQ RRSPESCEAP LSAPLWGPQR GLPGREPLRS RSASAIALRT IGHILALLLR LLHLGLGSGG CREDVPPSGR GKKEEKMKKH RRALALVSCL FLCSLVWLPS WRVCCKESSS ASASSYYSQD DNCALENEDV QFQKKNTESK KLSPPVVETL PTVDLHEESS NAVVDSETVE NISSSSTSEI TPISKLDEIE KSGTIPIAKP SETEQSETDC DVGEALDASA PIEQPSFVSP PDSLVGQHIE NVSSSHGKGK ITKSEFESKV SASEQGGGDP KSALNASDNL KNESSDYTKP GDIDPTSVAS PKDPEDIPTF DEWKKKVMEV EKEKSQSMHA SSNGGSHATK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILIENMDLYM LNPCSTKIWF VIELCEPIQV KQLDIANYEL FSSTPKDFLV SISDRYPTNK WIKLGTFHGR DERNVQSFPL DEQMYAKYVK VELLSHFGSE HFCPLSLIRV FGTSMVEEYE EIADSQYHSE RQELFDEDYD YPLDYNTGED KSSKNLLGSA TNAILNMVNI AANILGAKTE DLTEGNKSIS ENATATAAPK MPESTPVSTP VPSPEYVTTE VHTHDMEPST PDTPKESPIV QLVQEEEEEA SPSTVTLLGS GEQEDESSPW FESETQIFCS ELTTICCISS FSEYIYKWCS VRVALYRQRS RTALSKGKDY LVSAQPPLLL PAESVDVSVL QPLSGELENK NIEREAETVV LGDLSSSMHQ DDLVNHTVDA VELEPSHSQT LSQSVLLDIT PEINPLPKIE VSESVEYETG HIPSQVIPQE SSVEIDNETE QKSESFSSIE KPSITYETNK VNELMDNIIK EDVNSMQIFT KLSETIVPPI NTATVPDNED GEAKMNIADT AKQTLISVVD SSSLPEVKEE EQSPEDALLR GLQRTATDFY AELQNSTDLG YANGNLVHGS NQKESVFMRL NNRIKALEVN MSLSGRYLEE LSQRYRKQME DFQKAFNKTI VKLQNTSRIA EEQDQRQTEA IQLLQAQLTN MTQLVSNLSA TVAELKREVS DRQSYLVISL VLCVVLGLML CMQRCRNTSQ FDGDYISKLP KSNQYPSPKR CFSSYDDMNL KRRTSFPLMR SKSLQLTGKE VDPNDLYIVE PLKFSPEKKK KRCKYKIEKI ETIKPEEPLH PIANGDIKGR KPFTNQRDFS NMGEVYHSSY KGPPSEGSSE TSSQSEESYF CGISACTSLC NGQSQKTKTE KRALKRRRSK VQDQGKLIKT LIQTKSGSLP SLHDIIKGNK EITVGTFGVT AVSGHI // ID H2Q0M1_PANTR Unreviewed; 1254 AA. AC H2Q0M1; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPTRP00000002813}; GN Name=SUCO {ECO:0000313|Ensembl:ENSPTRP00000002813}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000002813, ECO:0000313|Proteomes:UP000002277}; RN [1] {ECO:0000313|Ensembl:ENSPTRP00000002813, ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136131; DOI=10.1038/nature04072; RG Chimpanzee sequencing and analysis consortium; RT "Initial sequence of the chimpanzee genome and comparison with the RT human genome."; RL Nature 437:69-87(2005). RN [2] {ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136134; DOI=10.1038/nature04101; RA Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., RA Rozen S., Wilson R.K., Page D.C.; RT "Conservation of Y-linked genes during human evolution revealed by RT comparative sequencing in chimpanzee."; RL Nature 437:100-103(2005). RN [3] {ECO:0000313|Ensembl:ENSPTRP00000002813} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPTRP00000002813}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACZ03000746; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03000747; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03000748; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03000749; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_514000.2; XM_514000.4. DR STRING; 9598.ENSPTRP00000002812; -. DR PaxDb; H2Q0M1; -. DR Ensembl; ENSPTRT00000003062; ENSPTRP00000002813; ENSPTRG00000001689. DR GeneID; 457516; -. DR CTD; 51430; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000002277; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002277}; KW Reference proteome {ECO:0000313|Proteomes:UP000002277}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1254 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003569688. FT COILED 936 956 {ECO:0000256|SAM:Coils}. FT COILED 986 1006 {ECO:0000256|SAM:Coils}. FT COILED 1192 1212 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1254 AA; 139480 MW; 96FB4DAD97A435ED CRC64; MKKHRRALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFQKKD EREGPINAES LGKSGSNLPI SPKEHKLKDD SIVDVQNTES KKLSPPVVET LPTVDLHEES SNAVVDSETV ENISSSSTSE ITPISKLDEI EKSGTIPIAK PSETEQSETD CDVGEALDAS APIEQPSFVS PPDSLVGQHI ENVSSSHGKG KITKSEFESK VSASEQGGGD PKSALNASDN LKNESSDYTK PGDIDPTSVA SPKDPEDIPT FDEWKKKVME VEKEKSQSMH ASSNGGSHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LLSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYHSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKSISEN ATATAAPKMP ESTPVSTPVP SPEYVTTEVH THDMEPSTPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWFE SETQIFCSEL TTICCISSFS EYIYKWCSVR VALYRQRSRT ALSKGKDYLV SAQPPLLLPA ESVDVSVLQP LSGELENKNI EREAETVVLG DLSSSMHQDD LVNHTVDAVE LEPSHSQTLS QSVLLDITPE INPLPKIEVS ESVEYETGHI PSQVIPQESS VEIDNETEQK SESFSSIEKP SITYETNKVN ELMDNIIKED VNSMQIFTKL SETIVPPINT ATVPDNEDGE AKMNIADTAK QTLISVVDSS SLPEVKEEEQ SPEDALLRGL QRTATDFYAE LQNSTDLGYA NGNLVHGSNQ KESVFMRLNN RIKALEVNMS LSGRYLEELS QRYRKQMEDF QKAFNKTIVK LQNTSRIAEE QDQRQTEAIQ LLQAQLTNMT QLVSNLSATV AELKREVSDR QSYLVISLVL CVVLGLMLCM QRCRNTSQFD GDYISKLPKS NQYPSPKRCF SSYDDMNLKR RTSFPLMRSK SLQLTGKEVD PNDLYIVEPL KFSPEKKKKR CKYKIEKIET IKPEEPLHPI ANGDIKGRKP FTNQRDFSNM GEVYHSSYKG PPSEGSSETS SQSEESYFCG ISACTSLCNG QSQKTKTEKR ALKRRRSKVQ DQGKLIKTLI QTKSGSLPSL HDIIKGNKEI TVGTFGVTAV SGHI // ID H2QK96_PANTR Unreviewed; 437 AA. AC H2QK96; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPTRP00000023066}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSPTRP00000023066}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000023066, ECO:0000313|Proteomes:UP000002277}; RN [1] {ECO:0000313|Ensembl:ENSPTRP00000023066, ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136131; DOI=10.1038/nature04072; RG Chimpanzee sequencing and analysis consortium; RT "Initial sequence of the chimpanzee genome and comparison with the RT human genome."; RL Nature 437:69-87(2005). RN [2] {ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136134; DOI=10.1038/nature04101; RA Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., RA Rozen S., Wilson R.K., Page D.C.; RT "Conservation of Y-linked genes during human evolution revealed by RT comparative sequencing in chimpanzee."; RL Nature 437:100-103(2005). RN [3] {ECO:0000313|Ensembl:ENSPTRP00000023066} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPTRP00000023066}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACZ03123901; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03123902; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_514609.2; XM_514609.3. DR ProteinModelPortal; H2QK96; -. DR SMR; H2QK96; 241-422. DR STRING; 9598.ENSPTRP00000023066; -. DR PaxDb; H2QK96; -. DR PRIDE; H2QK96; -. DR Ensembl; ENSPTRT00000024987; ENSPTRP00000023066; ENSPTRG00000013445. DR GeneID; 458207; -. DR CTD; 6676; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2QK96; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002277; Chromosome 20. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002277}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002277}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 164 189 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 202 236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 437 AA; 48165 MW; 5D1E3BCAEEE8B702 CRC64; MRRSSRPGSA SSSRKHTPNF FSENSSMSIT SEDSKGLRSA EPGPGEPEGR RARGPSCGEP ALSAGVPGGT TWAGSSQQKP APRSHNWQTA CGAATVRGGA SEPTGSPVVS EEPLDLLPTL DLRQEMPPPR VFKSFLSLLF QGLSVLLSLA GDVLVSMYRE VCSIRFLFTA VSLLSLFLSA FWLGLLYLVS PLENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL QKTSHDYADR NTAYFWNRFS FWNYARPPTV ILEPHVFPGN CWAFEGDQGQ VVIQLPGRVQ LSDITLQHPP PSVEHTGGAN SAPRDFAVFG LQVYDETEVS LGKFTFDVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP RFTCLYRVRA HGVRTSEGAE GSAQGPH // ID H2QU30_PANTR Unreviewed; 786 AA. AC H2QU30; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 25. DE SubName: Full=Sad1 and UNC84 domain containing 1 {ECO:0000313|EMBL:JAA06521.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPTRP00000032154}; GN Name=SUN1 {ECO:0000313|EMBL:JAA06521.1, GN ECO:0000313|Ensembl:ENSPTRP00000032154}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000032154, ECO:0000313|Proteomes:UP000002277}; RN [1] {ECO:0000313|Ensembl:ENSPTRP00000032154, ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136131; DOI=10.1038/nature04072; RG Chimpanzee sequencing and analysis consortium; RT "Initial sequence of the chimpanzee genome and comparison with the RT human genome."; RL Nature 437:69-87(2005). RN [2] {ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136134; DOI=10.1038/nature04101; RA Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., RA Rozen S., Wilson R.K., Page D.C.; RT "Conservation of Y-linked genes during human evolution revealed by RT comparative sequencing in chimpanzee."; RL Nature 437:100-103(2005). RN [3] {ECO:0000313|Ensembl:ENSPTRP00000032154} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. RN [4] {ECO:0000313|EMBL:JAA06521.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Adipose stromal {ECO:0000313|EMBL:JAA06521.1}, RC Skeletal muscle {ECO:0000313|EMBL:JAA43820.1}, and RC Smooth vascular {ECO:0000313|EMBL:JAA16397.1}; RA Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.; RT "De novo assembly of the reference chimpanzee transcriptome from RT NextGen mRNA sequences."; RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC213335; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; GABC01004817; JAA06521.1; -; mRNA. DR EMBL; GABF01005748; JAA16397.1; -; mRNA. DR EMBL; GABE01000919; JAA43820.1; -; mRNA. DR RefSeq; XP_009450745.1; XM_009452470.1. DR STRING; 9598.ENSPTRP00000032154; -. DR PaxDb; H2QU30; -. DR Ensembl; ENSPTRT00000034785; ENSPTRP00000032154; ENSPTRG00000018833. DR GeneID; 472263; -. DR CTD; 23353; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002277; Chromosome 7. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002277}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002277}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 260 283 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 290 309 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 384 404 FT COILED 429 463 FT COILED 476 496 SQ SEQUENCE 786 AA; 87120 MW; D0F820B796764380 CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGANSGTSSA VSLKNRAART AKQRRSTNKS AFSINHVSRQ VTSSGVSHGG TVSLQDAVTR RPPVLDESWI REQTTVDHFW GLDDDGDLKG GNKAAIQGNG DVGAATAASA HNGFSCSNCS MLSERKDVLT AHPAAPGPVS RVYSRDRNQK CYFLLQILRR IGAAGQAVSR TAWSALWLAV VAPGKAASGV FWWLGIGWYQ FVTLISWLNV FLLTRCLRNI CKFLVLLIPL FLLLAGLSLR GQGNFFSFLP VLNWASMHRT QRVDDPQDVF KPTTSRLKQP LQGDSEALPW HWMSGVEQQV ASLSGQCHHH GENLRELTAL LQKLQARVDQ MDGGAAGPSA SVRDAVGQPP RETDFMAFHQ EHEVRISHLE DILGKLREKS EAIQKELEQT KQKTISAVGE QLLPTVEHLQ LELDQLKSEL SSWRHLKTGC ETVDAVQERV DVQVREMVKL LFSEDQQGGS LEQLLQRFSS QFVSKGDLHT MLRDLQLQIL RNVTHHVSVT KRLPTSEAVV SAVSEAGASG ITEAQARAIV NNALKLYSQD KTGMVDFALE SGGGSILSTR CSETYETKTA LMSLFGIPLW YFSQSPRVVI QPDIYPGNCW AFKGSQGYLV VRLSMMIHPA AFTLEHIPKT LSPTGNISSA PKDFAVYGLE NEYQEEGQLL GQFTYDQDGE SLQMFQALKR PDDTAFQIVE LRIFSNWGHP EYTCLYRFRV HGEPVK // ID H2QUK0_PANTR Unreviewed; 357 AA. AC H2QUK0; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPTRP00000032763}; GN Name=SUN3 {ECO:0000313|Ensembl:ENSPTRP00000032763}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000032763, ECO:0000313|Proteomes:UP000002277}; RN [1] {ECO:0000313|Ensembl:ENSPTRP00000032763, ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136131; DOI=10.1038/nature04072; RG Chimpanzee sequencing and analysis consortium; RT "Initial sequence of the chimpanzee genome and comparison with the RT human genome."; RL Nature 437:69-87(2005). RN [2] {ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136134; DOI=10.1038/nature04101; RA Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., RA Rozen S., Wilson R.K., Page D.C.; RT "Conservation of Y-linked genes during human evolution revealed by RT comparative sequencing in chimpanzee."; RL Nature 437:100-103(2005). RN [3] {ECO:0000313|Ensembl:ENSPTRP00000032763} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPTRP00000032763}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC190144; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_003318470.1; XM_003318422.2. DR STRING; 9598.ENSPTRP00000032763; -. DR PaxDb; H2QUK0; -. DR Ensembl; ENSPTRT00000035438; ENSPTRP00000032763; ENSPTRG00000019175. DR GeneID; 463403; -. DR KEGG; ptr:463403; -. DR CTD; 256979; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2QUK0; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002277; Chromosome 7. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002277}; KW Reference proteome {ECO:0000313|Proteomes:UP000002277}. FT COILED 99 119 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 357 AA; 40503 MW; 9B3ABAB8664200FA CRC64; MSGKTKARRA AMFFRRCSED ASGSASGNAL LSEDENPDAN GVTRSWKIIL STMLTLTFLL VGLLNHQWLK ETDVPQKSRQ LYAIVAEYGS RLYKYQARLR MPKEQLELLK KESQNLENNF RQILFLIKQI DVLKALLRDM KDGMDNNHNW NTHGDPVEDL DHTEEVSNLV NYVLKKLRED QVKMADYALK SAGASIIEAG TSESYKNNKA KLYWHGIGFL NHEMPPDIIL QPDVYPGKCW AFPGSQGHTL IKLATKIIPT AVTMEHISEK VSPSGNISSA PKEFSVYGIT KKCEGEEIFL GQFIYNKTGT TVQTFELQHA VSEYLLCVKL NIFSNWGHPK YTCLYRFRVH GTPGKHI // ID H2R106_PANTR Unreviewed; 379 AA. AC H2R106; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPTRP00000041560}; GN Name=SUN5 {ECO:0000313|Ensembl:ENSPTRP00000041560}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000041560, ECO:0000313|Proteomes:UP000002277}; RN [1] {ECO:0000313|Ensembl:ENSPTRP00000041560, ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136131; DOI=10.1038/nature04072; RG Chimpanzee sequencing and analysis consortium; RT "Initial sequence of the chimpanzee genome and comparison with the RT human genome."; RL Nature 437:69-87(2005). RN [2] {ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136134; DOI=10.1038/nature04101; RA Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., RA Rozen S., Wilson R.K., Page D.C.; RT "Conservation of Y-linked genes during human evolution revealed by RT comparative sequencing in chimpanzee."; RL Nature 437:100-103(2005). RN [3] {ECO:0000313|Ensembl:ENSPTRP00000041560} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPTRP00000041560}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACZ03124058; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03124059; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03124060; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_525300.2; XM_525300.4. DR STRING; 9598.ENSPTRP00000041560; -. DR PaxDb; H2R106; -. DR Ensembl; ENSPTRT00000045231; ENSPTRP00000041560; ENSPTRG00000024395. DR GeneID; 469915; -. DR KEGG; ptr:469915; -. DR CTD; 140732; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2R106; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002277; Chromosome 20. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002277}; KW Reference proteome {ECO:0000313|Proteomes:UP000002277}. FT COILED 155 182 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 379 AA; 42992 MW; 8C7DCA941C9F2A44 CRC64; MPRSSRSPGD PGALLEDVAH NPRPRRIAQR GRNTSRMAED TSPNMNDNIL LPVRNNDQAL GLTQCVLGCV SWFTCFACSL RTQAQQVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HLPSKMKVWQ DDSINGPLQS LRLYQEKVRH HSGEIQDLRG SMNQLIAKLQ EMEAMSDEQK MAQKIMKMIH GDYIEKPDFA LKSIGASIDF EHTSATYNHE KAHSYWNWIQ LWNYAQPPDV ILEPNVTPGN CWAFEGDRGQ VTIQLAQKVY LSNLTLQHIP KTISLSGSLD TAPKDFVIYG MEGSPKEEVF LGAFQFQPEN IIQMFPLQNQ PARAFGAVKV KISSNWGNPG FTCLYRVRVH GSVAPPREQP HQNPYPERD // ID H2R4A1_PANTR Unreviewed; 717 AA. AC H2R4A1; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Sad1 and UNC84 domain containing 2 {ECO:0000313|EMBL:JAA11292.1}; DE SubName: Full=Unc-84 homolog B {ECO:0000313|EMBL:JAA11293.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPTRP00000046802}; GN Name=SUN2 {ECO:0000313|EMBL:JAA11292.1, GN ECO:0000313|Ensembl:ENSPTRP00000046802}; GN Synonyms=UNC84B {ECO:0000313|EMBL:JAA11293.1}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000046802, ECO:0000313|Proteomes:UP000002277}; RN [1] {ECO:0000313|Ensembl:ENSPTRP00000046802} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15164055; DOI=10.1038/nature02564; RA Watanabe H., Fujiyama A., Hattori M., Taylor T.D., Toyoda A., RA Kuroki Y., Noguchi H., BenKahla A., Lehrach H., Sudbrak R., Kube M., RA Taenzer S., Galgoczy P., Platzer M., Scharfe M., Nordsiek G., RA Bloecker H., Hellmann I., Khaitovich P., Paabo S., Reinhardt R., RA Zheng H.-J., Zhang X.-L., Zhu G.-F., Wang B.-F., Fu G., Ren S.-X., RA Zhao G.-P., Chen Z., Lee Y.-S., Cheong J.-E., Choi S.-H., Wu K.-M., RA Liu T.-T., Hsiao K.-J., Tsai S.-F., Kim C.-G., Oota S., Kitano T., RA Kohara Y., Saitou N., Park H.-S., Wang S.-Y., Yaspo M.-L., Sakaki Y.; RT "DNA sequence and comparative analysis of chimpanzee chromosome 22."; RL Nature 429:382-388(2004). RN [2] {ECO:0000313|Ensembl:ENSPTRP00000046802, ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136131; DOI=10.1038/nature04072; RG Chimpanzee sequencing and analysis consortium; RT "Initial sequence of the chimpanzee genome and comparison with the RT human genome."; RL Nature 437:69-87(2005). RN [3] {ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136134; DOI=10.1038/nature04101; RA Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., RA Rozen S., Wilson R.K., Page D.C.; RT "Conservation of Y-linked genes during human evolution revealed by RT comparative sequencing in chimpanzee."; RL Nature 437:100-103(2005). RN [4] {ECO:0000313|Ensembl:ENSPTRP00000046802} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. RN [5] {ECO:0000313|EMBL:JAA11292.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Adipose stromal {ECO:0000313|EMBL:JAA11292.1}, RC Skeletal muscle {ECO:0000313|EMBL:JAA44579.1}, RC Skin {ECO:0000313|EMBL:JAA33073.1}, and RC Smooth vascular {ECO:0000313|EMBL:JAA20406.1}; RA Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.; RT "De novo assembly of the reference chimpanzee transcriptome from RT NextGen mRNA sequences."; RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACZ03126374; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03126375; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; GABC01000046; JAA11292.1; -; mRNA. DR EMBL; GABC01000045; JAA11293.1; -; mRNA. DR EMBL; GABF01001739; JAA20406.1; -; mRNA. DR EMBL; GABF01001738; JAA20407.1; -; mRNA. DR EMBL; GABD01000027; JAA33073.1; -; mRNA. DR EMBL; GABD01000026; JAA33074.1; -; mRNA. DR EMBL; GABE01000160; JAA44579.1; -; mRNA. DR EMBL; GABE01000159; JAA44580.1; -; mRNA. DR EMBL; GABE01000158; JAA44581.1; -; mRNA. DR RefSeq; XP_009436688.1; XM_009438413.1. DR STRING; 9598.ENSPTRP00000046802; -. DR PaxDb; H2R4A1; -. DR Ensembl; ENSPTRT00000044290; ENSPTRP00000046802; ENSPTRG00000023985. DR GeneID; 458837; -. DR KEGG; ptr:458837; -. DR CTD; 25777; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR KO; K19347; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000002277; Chromosome 22. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IEA:Ensembl. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IEA:Ensembl. DR GO; GO:0031022; P:nuclear migration along microfilament; IEA:Ensembl. DR GO; GO:0030335; P:positive regulation of cell migration; IEA:Ensembl. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002277}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002277}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 213 234 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 273 293 FT COILED 352 372 FT COILED 374 401 FT COILED 404 431 FT COILED 478 498 SQ SEQUENCE 717 AA; 80284 MW; FBE721C68A08F75F CRC64; MSRRSQRLTR YSQGDDDGSS SSGGSSVAGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDAHTSYY SESLVRESWF PPRSSLEELH GDANWGEDLR VRRRRGTGGS ESSRASGLVG RKATEDFLGS SSGYSSEDDY VGYSDVDQQS SGSRLRSAVS RAGSLLWMVA TSPGRLFRLL YWWAGTTWYR LTTAASLLDV FVLTRRFSSL KTFLWFLLPL LLLTCLTYGA WYFYPYGLQT FHPALVSWWA AKDSRRPDEG WEARDSSPHF QAEQRVMSRV HSLERRLEAL AAEFSSNWQK EAMRLERLEL RQGAPGQGGG GGLSHEDTLA LLEGLVSRRE AALKEDFRRE TAARIQEELS ALRAEHQQDS EDLFKKIVRA SQESEARIQQ LKSEWQSMTQ ESFQESSVKE LRRLEDQLAG LQQELAALAL KQSSVADEVG LLPQQIQVVR DDVESQFPAW ISQFLARGGG GRVGLLQREE MQAQLRELES KILTHVAEMQ GKSAREAAAS LGLTLQKEGV IGVTEEQVHH IVKQALQRYS EDRIGLADYA LESGGASVIS TRCSETYETK TALLSLFGIP LWYHSQSPRV ILQPDVHPGN CWAFQGPQGF AVVRLSARIR PTAVTLEHVP KALSPNSTIS SAPKDFAIFG FDEDLQQEGT LLGKFTYDQD GEPIQTFHFQ APTMATYQVV ELRILTNWGH PEYTCIYRFR VHGEPAH // ID H2RCS1_PANTR Unreviewed; 2612 AA. AC H2RCS1; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPTRP00000057753}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSPTRP00000057753}; OS Pan troglodytes (Chimpanzee). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000057753, ECO:0000313|Proteomes:UP000002277}; RN [1] {ECO:0000313|Ensembl:ENSPTRP00000057753, ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136131; DOI=10.1038/nature04072; RG Chimpanzee sequencing and analysis consortium; RT "Initial sequence of the chimpanzee genome and comparison with the RT human genome."; RL Nature 437:69-87(2005). RN [2] {ECO:0000313|Proteomes:UP000002277} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16136134; DOI=10.1038/nature04101; RA Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., RA Rozen S., Wilson R.K., Page D.C.; RT "Conservation of Y-linked genes during human evolution revealed by RT comparative sequencing in chimpanzee."; RL Nature 437:100-103(2005). RN [3] {ECO:0000313|Ensembl:ENSPTRP00000057753} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSPTRP00000057753}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACZ03095664; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03095665; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03095666; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03095667; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03095668; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03095669; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03095670; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AACZ03095671; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9598.ENSPTRP00000057753; -. DR PaxDb; H2RCS1; -. DR Ensembl; ENSPTRT00000066178; ENSPTRP00000057753; ENSPTRG00000006240. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H2RCS1; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000002277; Chromosome 14. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002277}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000002277}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1247 1267 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2612 AA; 289473 MW; ACEC33BD7389854E CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNVSLS SSKEQKYSFT PSVEINVFKT AFSENEDDES RPAVALIRKL IAVLESIERL PLHLYDTPGS TYNLQILTRR LRFRLERAPG ETALIDRTGR MLKMEPLATV ESLEQYLLKM VAKQWYDFDR SSFVFVRKLR EGQNFIFRHQ HDFDENGIIY WIGTNAKTAY EWVNPAAYGL VVVTSSEGRN LPYGRLEDIL SRDNSALNCH SNDDKNAWFA IDLGLWVIPS AYTLRHARGY GRSALRNWVF QVSKDGQNWT SLYTHVDDCS LNEPGSTATW PLDPPKDEKQ GWRHVRIKQM GKNASGQTHY LSLSGFELYG TVNGVCEDQL GKAAKEAEAN LRRQRRLVRS QVLKYMVPGA RVIRGLDWKW RDQDGSPQGE GTVTGELHNG WIDVTWDAGG SNSYRMGAEG KFDLKLAPGY DPDTVASPKP VSSTVSGTTQ SWSSLVKNNC PDKTSAAAGS SSRKGSSSSV CSVASSSDIS LGSTKTERRS EIVMEHSIVS GADVHEPIVV LSSAENVPQT EVGSSSSAST STLTAETGSE NAERKLGPDS SVRTPGESSA ISMGIVSVSS PDVSSVSELT NKEAASQRPL SSSASNRLSV SSLLAAGAPM SSSASVPNLS SRETSSLESF VRRVANIART NATNNMNLSR SSSDNNTNTL GRNVMSTATS PLMGAQSFPN LTTPGTTSTV TMSTSSVTSS SNVATATTVL SVGQSLSNTL TTSLTSTSSE SDTGQEAEYS LYDFLDSCRA STLLAELDDD EDLPEPDEED DENEDDNQED QEYEEVMILR RPSLQRRAGS RSDVTHHAVT SQLPQVPAGA GSRPIGEQEE EEYETKGGRR RTWDDDYVLK RQFSALVPAF DPRPGRTNVQ QTTDLEIPPP GTPHSELLEE VECTPSPRLA LTLKVTGLGT TREVELPLTN FRSTIFYYVQ KLLQLSCNGN VKSDKLRRIW EPTYTIMYRE MKDSDKEKEN GKMGCWSIEH VEQYLGTDEL PKNDLITYLQ KNADAAFLRH WKLTGTNKSI RKNRNCSQLI AAYKDFCEHG TKSGLNQGAI STLQSSDILN LTKEQPQAKA GNGQNSCGVE DVLQLLRILY IVASDPYSRI SQEDGDEQPQ FTFPPDEFTS KKITTKILQQ IEEPLALASG ALPDWCEQLT SKCPFLIPFE TRQLYFTCTA FGASRAIVWL QNRREATVER TRTTSSVRRD DPGEFRVGRL KHERVKVPRG ESLMEWAENV MQIHADRKSV LEVEFLGEEG TGLGPTLEFY ALVAAEFQRT DLGAWLCDDN FPDDESRHVD LGGGLKPPGY YVQRSCGLFT APFPQDSDEL ERITKLFHFL GIFLAKCIQD NRLVDLPISK PFFKLMCMGD IKSNMSKLIY ESRGDRDLHC TESQSEASTE EGHDSLSVGS FEEDSKSEFI LDPPKPKPPA WFNGILTWED FELVNPHRAR FLKEIKDLAI KRRQILSNKG LSEDEKNTKL QELVLKNPSG SGPPLSIEDL GLNFQFCPSS RIYGFTAVDL KPSGEDEMIT MDNAEEYVDL MFDFCMHTGI QKQMEAFRDG FNKVFPMEKL SSFSHEEVQM ILCGNQSPSW AAEDIINYTE PKLGYTRDSP GFLRFVRVLC GMSSDERKAF LQFTTGCSTL PPGGLANLHP RLTVVRKVDA TDASYPSVNT CVHYLKLPEY SSEEIMRERL LAATMEKGFH LN // ID H2RXI9_TAKRU Unreviewed; 153 AA. AC H2RXI9; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000004859}; GN Name=LOC101072075 {ECO:0000313|Ensembl:ENSTRUP00000004859}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000004859}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000004859} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551351; RA Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A., RA Hosoya S., Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.; RT "Integration of the genetic map and genome assembly of fugu RT facilitates insights into distinct features of genome evolution in RT teleosts and mammals."; RL Genome Biol. Evol. 3:424-442(2011). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000004859} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000004859}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000004859; -. DR Ensembl; ENSTRUT00000004888; ENSTRUP00000004859; ENSTRUG00000002111. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; WHISSAP; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}. SQ SEQUENCE 153 AA; 16905 MW; F993E35DA19B8BB9 CRC64; QTYTSPSGCL TLFGIPLWSL FTSPRTAIQG SPILAGTCWC FVGAEGTLAV SLSHPVKITH VTVDHLPSYN SPSGDIKSAP KDFEVHGMKT QAGEGTFLGK FLYDKFGEPT QTFSLPTPTD QSYDIVELRV FSNWGQKEYT CLYRFRVHGQ TDF // ID H2S4M7_TAKRU Unreviewed; 195 AA. AC H2S4M7; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000007349}; GN Name=SUN1 (1 of 2) {ECO:0000313|Ensembl:ENSTRUP00000007349}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000007349}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000007349} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551351; RA Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A., RA Hosoya S., Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.; RT "Integration of the genetic map and genome assembly of fugu RT facilitates insights into distinct features of genome evolution in RT teleosts and mammals."; RL Genome Biol. Evol. 3:424-442(2011). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000007349} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000007349}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000007349; -. DR Ensembl; ENSTRUT00000007394; ENSTRUP00000007349; ENSTRUG00000003138. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; FPLWYFS; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}. SQ SEQUENCE 195 AA; 21790 MW; 2BEF32A3AAD42F72 CRC64; LFQDVRVIVD NALRRFSEDR TGMSDFALES GGGSVLVARC SETYRTKVAL LSLFGFPLWY FSQSPRAVIQ PDVNPGNCWA FRGSSGYLVI RLSMPIFPTA VTLEHTPKAL TPSGKMDSAP RDFSVYGLDD ENQERGQLLG AYTYDQDGEA VQTFTVTEVC DRPFQMVEIQ VTSNWGHPEY TCLYRVRVHG TPADT // ID H2SE00_TAKRU Unreviewed; 163 AA. AC H2SE00; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000010631}; GN Name=SUN2 (2 of 2) {ECO:0000313|Ensembl:ENSTRUP00000010631}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000010631}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000010631} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551351; RA Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A., RA Hosoya S., Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.; RT "Integration of the genetic map and genome assembly of fugu RT facilitates insights into distinct features of genome evolution in RT teleosts and mammals."; RL Genome Biol. Evol. 3:424-442(2011). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000010631} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000010631}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000010631; -. DR Ensembl; ENSTRUT00000010689; ENSTRUP00000010631; ENSTRUG00000004471. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2SE00; -. DR OMA; IRITHVT; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}. SQ SEQUENCE 163 AA; 18197 MW; B5FAA617FE37E0C5 CRC64; GASVIMSRCS PTYTSTSARL TLFGIPLWRL YRGPRTVIQG VLMLPGVCWP FVGSKGTLGV SLSHPIRITH VTLEHASLSN SPTGEIKSAP KDFEVYGIKS QPEEETFLGS FMYDCRGEQS QTFTLQDPTE KVYDAVELHV LSNWGQEEYT CLYRFRVHGH IAP // ID H2SYM5_TAKRU Unreviewed; 1020 AA. AC H2SYM5; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000017513}; DE Flags: Fragment; GN Name=SUCO (1 of 2) {ECO:0000313|Ensembl:ENSTRUP00000017513}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000017513}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000017513} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000017513}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000017513} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000017513}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000017513; -. DR Ensembl; ENSTRUT00000017587; ENSTRUP00000017513; ENSTRUG00000007127. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR OMA; KIWFIIE; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}. FT COILED 701 721 {ECO:0000256|SAM:Coils}. FT COILED 744 771 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSTRUP00000017513}. SQ SEQUENCE 1020 AA; 111775 MW; 9FAE97E1DF578D29 CRC64; DASGDSDPTV SCKDPEDIPT FDEWKRKMME VEKEKTQAVH TANSGASHVG KKVQKNFNNY ASVECGAKIL GSNPEAKSTS AILMENMDMY MLNPCSNKIW FIIELCEPVQ VKQLDIANFE LFSSTPKDFI VSISDRYPTN KWQKLGTFHA RDERTVQSFP LDEHLYAKYV KVELLSHFGS EHFCPLSLIR VFGTSMVEEY EEIADPPERP DDLDDDFDYP PGYTPEVKLS KNLIGSAKDA ILNMVNNIAV NVLGGGAEMQ GNVSSHDVNE TEPSPKEPAA QNPPDDVPTV IPTTIAPSSE TPTTHTSDLE APHVEEEPAL PSEKDEEEPI SSTITLLEKE EESDEGKGKW RYVKHQPGIP NHCSALPPFS SRCCCDASLQ EYLHQRCSAS LSKKRKCQAV QQKQTIPSIE TPAWQRPLFP SGWHEPQQPH SEEHQPRETE QAAEPEPESS ASPSEAPQPP ENTAASHKDS ILELPLLEPS QTSNLPKHSV PDSSSAKPTP GVETPLLSSG EPEKNQDAPA EERHIEPSVS PSGSSHAHPA VPVDESSVGS AEETFKTVVS QPDVNTPDRT DQILSPTASP SYPDLPIFQE ADSVSTEGPG LVPDLGSEPE PSSGHPVITD TKTEEAPDDA SVAASSAAAP PVSPAPPTSP SLSDIYADPP NGTEQNGNQV HGSSQKESVF MRLNNRIKAL EMNMSLSGRY LEQLSQRYRK QMEEMQKAFN KTVIKLQNTS RIAEEQDQRQ TESIQLLQGQ LENMTRLVLN LSDRVSQLQV EVSERQTYLA LSLVLCFCVG LLLCANHCRI TAAPPSTEPE PPVGKSYSYC CPERQFSSCD EPGLKRSASY PLIHSESFQL ATTEGPEMLH TEDTQSLCTA NRKKRRRKIK PTEKVETLRP SFHATPKLCN GGSLCNGVPV TSIPAPLTKR LLPPVFRDSP SEGSSEGSSH SDDPSFCGIA TSCSRICDSV PPPKSRTEKR ASRRRRPKPG SPKKDKGKLL QISTIEDIMK RTREQSSGTF GVNVALSGPV // ID H2SYM6_TAKRU Unreviewed; 297 AA. AC H2SYM6; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000017514}; DE Flags: Fragment; GN Name=SUCO (1 of 2) {ECO:0000313|Ensembl:ENSTRUP00000017514}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000017514}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000017514} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000017514}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000017514} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000017514}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000017513; -. DR Ensembl; ENSTRUT00000017588; ENSTRUP00000017514; ENSTRUG00000007127. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSTRUP00000017514}. SQ SEQUENCE 297 AA; 33319 MW; 25EE57A7B5CA1785 CRC64; DASGDSDPTV SCKDPEDIPT FDEWKRKMME VEKEKTQAVH TANSGASHVG KKVQKNFNNY ASVECGAKIL GSNPEAKSTS AILMENMDMY MLNPCSNKIW FIIELCEPVQ VKQLDIANFE LFSSTPKDFI VSISDRYPTN KWQKLGTFHA RDERTVQSFP LDEHLYAKYV KMFAKYIKVE LLSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADPPERPDDL DDDFDYPPGY TPEVKLSKNL IGSAKDAILN MVNNIAVNVL GGGAEMQENT AASHKDSILE LPLLEPSQTS NLPKHSV // ID H2TG80_TAKRU Unreviewed; 315 AA. AC H2TG80; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000023678}; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSTRUP00000023678}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000023678}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000023678} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21551351; RA Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A., RA Hosoya S., Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.; RT "Integration of the genetic map and genome assembly of fugu RT facilitates insights into distinct features of genome evolution in RT teleosts and mammals."; RL Genome Biol. Evol. 3:424-442(2011). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000023678} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000023678}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000023678; -. DR Ensembl; ENSTRUT00000023776; ENSTRUP00000023678; ENSTRUG00000009424. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2TG80; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 82 102 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 315 AA; 35516 MW; 7AC04047B7B811FF CRC64; MGYYDKDGNP TICYRDSVSR VDKRRQNRAI TSDTSDRTDR DSYDIKRFLN YLKESISSES SSSGSCMSSN SKDTVITYKT KWLIFSSLVV LALMLPVISY HVDVNSIERP TSYDLVPTSP VCHKCTNQSF GNVMMRIQKL QTELHYLKEK LNYQLTDANF WTNFALESDA LKIFSLLGIQ LFSKVVPAAV IGGQHPPIPG NCWSFPGSHG NLFIELSHTI TVSNVTLDHV LKSVSPNDTI PSAPRHFTVY GLQSLDDKAV HLGKFMYDLE GNPSQTFAVK VHDSIRSKYI DLQIESNYGH ADYTCLYGFR VHGQI // ID H2TTA3_TAKRU Unreviewed; 2611 AA. AC H2TTA3; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000027911}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000027911, ECO:0000313|Proteomes:UP000005226}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000027911} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000027911}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000027911} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000027911}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000027911; -. DR Ensembl; ENSTRUT00000028021; ENSTRUP00000027911; ENSTRUG00000011055. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H2TTA3; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 819 839 {ECO:0000256|SAM:Coils}. FT COILED 1247 1267 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2611 AA; 288040 MW; 56A46818824E0E2D CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLSF IRDSGHLVHK DTLHSAMAVV SRLCSKMEPQ DSSLETCVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLCRM AAAGGTVSGP PSSCKPGRVS TGAAPPAPDS KLSNQVSTIV SLLSTLCRGS PLVTHDLLRS ALPDSMESAL GGDERCVLDT MRLVDLLLVL LFEGRKALPK STAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDV NKEEEEGSEP KGDPEMAPVY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MVHYSSEVLL KEVCDSESGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDV FLDQLARLGV INKVSTLAGP ASDDENEDES KPEKEEEVQE DAREIQQGKP YHWKDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPVTS SQPILSSVAP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVKSMAREL YDDHFKAVES MPRGVVVTLR NISTQLESAW ELHTNRQCIE GENTWRDLMK TALENLIVVL KDENTISPYE MCSSGLVQAL FTVLSNVSVA PPLMVNLQPP LMERINVFKA AFSENEDNES RPAVALIRKL IAVLESIERL PLHLYDTPGS LYNLQILTRR LRFRLERAPG ETALIDRTGR MLKMEPLATV ESLEQYLLKM VAKQWYDFER SSFVFVRKLR EGQSFTFRHQ HDFDENGIIY WVGTNAKTAY EWVNPAAYGL VVVTSSEGRN LPYGRLEDIL SRDSSALNCH TNDDKNAWFA VDLGLWVIPS AYTLRHARGY GRSALRNWVF QVSKDGQNWT TLYTHIDDCS LNEPGSTATW PLDPSKEEKQ GWRHVRIKQM GKNASGQTHY LSLSGLELYG TVTAVCEDQL GKAVKEAEAN LRRQRRLFRS QVMKYIVPGA RVVRGIDWKW RDQDGNPPGE GTVTGEAHNG WIDVTWDAGG SNSYRMGAEG KFDLKLAPGY DPESAATAPS PKPVSSTVSG QQQSWSSLVK NNCLDKGGAT SLGGASSSSR KGSSSSVCSV ASSSDISLSS SMGLMGVGGL RLEKRAEGLL LDQGVGMVTG SSVSTDVQQL EPIVVLSSVV DSGSGSASSS GTLTTDMPAP GDESRNKDST TDPATAISMG LVSVSSPDVS SVSESSGKDA PSQRPLCSAT NARLSVSSLL AAGAPMSSSA SVPNLSSREA SLMESFVRRA PNMSRTNATN NMNLSRSSSD NNTNTLGRNA MTSATSLMGA QSFPNLTTTG TTSTVTMSTS IVTSSNNVAT ATTGLSVGQL LSNTLTTSLT STSSESDTGQ EAEFSLYDFL DSCRANTLLA ELDDEEDLPE PDDDDDENED DNQEDQEYEE VLVIHPLFLL NYTVSSGTGS DVTDGLTFQE EEEYETKGGR RRTWDDDFVL KRQFSALVPA FDPRPGRTNV QQTTDLEIPP PGSPRSEVQE EVECAPSPHL SLTLKVAGLG TTREVELPLS NYKSTIFFYV QRLLQLSCSG TVKTDKLRRI WEPTYTIMYR ELKDSDKEKE SGKMVRAALT GWFTTWVKVG DDFLHHGSFL GHVCNSSLTT LPVPLLQDLC EHSTGISSRS GVLSPSSLLA NQSGEILGIA RELAQAKAGC GQSACGVEDV LQLLRILYII GGDSASNTRT MQEDFEELQF NAAPEEFTSK KITTKILQQI EEPLALASGA LPDWCEQLTA KCPFLIPFET RQLYFTCTAF GASRAIVWLQ NRREASMERS RPSTTVRRDD PGEFRVGRLK HERVKVPRGE AMMEWAESVM QLHADRKSVL EVEFQGEEGT GLGPTLEFYA LIAAEFQRTS LGIWLCDDDF PDDESRQVDL GGGLKPPGFY VQRSCGLFPA PFPQDSEELE RITKLFHFLG IFLAKCIQDN RLVDLPLSQP FFKLLCMGDI KSNWSKLLYQ SCSFTPGQDP ERSHLQPFLL LSESEASTEE SQETYSVGSF DEDSKSEFIM DPPKPKPPAW YHGILTWDDF QLVNPHRASF LKELKELAMK RRQILSSKSL SEDEKNTRLQ DLMLRNPLGS GPPLSIEDLG LNFQFCPSSK VHGFSALDLK PNGDNEMVTM ENAEEYVELM FDLCMHTGIQ KQMEAFREGF NRVFPMEKMS SFSHKEVQMI LCGNQSPSWT ADDIINYTEP KLGYTRDSPG FLRFVRVLCG MSSDERKAFL QFTTGCSTLP PGGLANLHPR LTIVRKVDAT DSSYPSVNTC VHYLKLPEYT SEDIMRERLL AATMEKGFHL N // ID H2UUD7_TAKRU Unreviewed; 1003 AA. AC H2UUD7; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000040562}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000040562, ECO:0000313|Proteomes:UP000005226}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000040562} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000040562}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000040562} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000040562}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000040562; -. DR Ensembl; ENSTRUT00000040704; ENSTRUP00000040562; ENSTRUG00000015866. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2UUD7; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 416 437 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 444 464 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 606 626 {ECO:0000256|SAM:Coils}. FT COILED 667 694 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1003 AA; 111108 MW; B4ADD320B248A4AA CRC64; MDFSQLHTYT PPQCAPENTG YTYSLSSSYS TAALEFEQEH QIAAVYESPR MSRRSLRLQT GAGHYASENV ADFSLGQSTT RRETSLTPSL SFLPALRTLR SKKQQSSSGG LSLSLSQAAT PINTPVSSGI VEESSAATDA ALLTGLEQSR LRQRTVTTTN TFSWRRICSD HRSGVNGDTG TSKSHSSIAN GYICKDCSFH SQKIDSFVAS SPQLGRASSD ILSSSSSSPF TSIYSRDRMA SHSSFCGSIN VKGLMTEDAA HLKLNGSLCK KLSSKPHDLH TPSAMNFVIQ LTINLNVYKL HTLSKLSGGF AYTNSLSAQR RSCTTVWFVC SPSHARAGDD CKGKQHAETH SSLHSQSSRL HHLAGALWSV LAYPGHCVVR SGKVLGCGAV TAFQSLLSLL WMFLTAPVKA VRRLLWFLAT GWYQLVSLMS VFNVFFLTQC LPRLWRLLLL LLPLLLLLAL WFWGPSSAAL LAYLPAINLT EWRPVSPLTL WYNLVPTSVS TPETPIGQTP ATPASQIPHF LPQSVLPPVA LTGADLERLE RVERQLALLW EQLQQRDHKQ DERHGNILGL YNTLKEQLHT QTDRESLGLW VSSLLDQRVG VLHGELEQEQ TRRVQSEEQQ ERLQQGQATR LAEIELLLST LAARTQEVQQ RQKLSEQEKQ VSLSLSLAVK QEDHDALLVE VQRLEAELIK VRQDLQGVVG CRGKCAQLDT LAQTVSAQVR KELQTLFFGS SGTGELPESL LHWLSQRYVS TPDLQASLAS LEMAILRNVS LQLEQNRATT LGAAESQAKT IFHTVSGAVQ HTAAAEGLTE EHVKIMVQNA LRLYSQDRTG LVDYALESGG GSILSTRCSE TYETKTALMS LFGLPLWYFS QSPRVVIQPD VYPGNCWAFK GSQGYLVIRL SLKIVPTSFC LEHIPKTLSP TGNITSAPRD FTVFGLDDEY QEEGKLLGQY TYQDDGEALQ MFPVMEQNDK SFQVIEVRVL SNWGHPDYTC LYRFRVHGDP QLQ // ID H2UUD8_TAKRU Unreviewed; 924 AA. AC H2UUD8; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000040563}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000040563, ECO:0000313|Proteomes:UP000005226}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000040563} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000040563}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000040563} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000040563}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSTRUT00000040705; ENSTRUP00000040563; ENSTRUG00000015866. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 250 268 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 280 300 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 355 375 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 387 408 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 415 437 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 602 622 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 924 AA; 102539 MW; 6BAB96C1AE840A76 CRC64; MDFSQLHTYT PPQCAPENTG YTYSLSSSYS TAALEFEQEH QIAAVYESPR MSRRSLRLQT GAGHYASENV ADFSLGQSTT RRETRTLRSK KQQSSSGGLS LSLSQAATPI NTPVSSGIVE ESSAATDAAL LTGLEQSRLR QRTVTTTNTF SCVDGHSGRR ICSDHRSGVN GDTGTSKSHS SIANGYICKD CSFHSQKIDS FVASSPQLGR ASSDILSSSS SSPFTSIYSR DRSVLRSICN TCVRYSKQSL APFVSLLTVI FSSVVWLGSQ ARASTGKGYY AYYYSFTHNL LLCFFCLWFV CSPSHARAGD DCKGKQHAET HSSLHSQSSR LHHLAGALWS VLAYPGHCVV RSGKVLGCGA VTAFQSLLSL LWMFLTAPVK AVRRLLWFLA TGWYQLVSLM SVFNVFFLTQ CLPRLWRLLL LLLPLLLLLV SPLTLWYNLV PTSVSTPETP IGQTPATPAS QIPSVLPPVA LTGADLERLE RVERQLALLW EQLQQRDHKQ DERHGNILGL YNTLKEQLHT QTDRESLGLW VSSLLDQRVG VLHGELEQEQ SEEQQERLQQ GQATRLAEIE LLLSTLAART QEVQQRQKLS EQEKQAVKQE DHDALLVEVQ RLEAELIKVR QDLQGVVGCR GKCAQVSAQV RKELQTLFFG SSGTGELPES LLHWLSQRYV STPDLQASLA SLEMAILRNV SLQLEQNRAT TLGAAESQAK TIFHTVSGAV QHTAAAEGLT EEHVKIMVQN ALRLYSQDRT GLVDYALESG GGSILSTRCS ETYETKTALM SLFGLPLWYF SQSPRVVIQP DVYPGNCWAF KGSQGYLVIR LSLKIVPTSF CLEHIPKTLS PTGNITSAPR DFTVFGLDDE YQEEGKLLGQ YTYQDDGEAL QMFPVMEQND KSFQVIEVRV LSNWGHPDYT CLYRFRVHGD PQLQ // ID H2UUD9_TAKRU Unreviewed; 861 AA. AC H2UUD9; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000040564}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000040564, ECO:0000313|Proteomes:UP000005226}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000040564} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000040564}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000040564} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000040564}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSTRUT00000040706; ENSTRUP00000040564; ENSTRUG00000015866. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 292 312 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 324 345 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 352 374 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 539 559 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 861 AA; 95326 MW; 54A302ED707CF730 CRC64; MDFSQLHTYT PPQCAPENTG YTYSLSSSYS TAALEFEQEH QIAAVYESPR MSRRSLRLQT GAGHYASENV ADFSLGQSTT RRETRTLRSK KQQSSSGGLS LSLSQAATPI NTPVSSGIVE ESSAATDAAL LTGLEQSRLR QRTVTTTNTF SCVDGHSGRR ICSDHRSGVN GDTGTSKSHS SIANGYICKD CSFHSQKIDS FVASSPQLGR ASSIYSRDRS QSVVWLGSQA RASTGKVCSP SHARAGDDCK GKQHAETHSS LHSQSSRLHH LAGALWSVLA YPGHCVVRSG KVLGCGAVTA FQSLLSLLWM FLTAPVKAVR RLLWFLATGW YQLVSLMSVF NVFFLTQCLP RLWRLLLLLL PLLLLLVSPL TLWYNLVPTS VSTPETPIGQ TPATPASQIP SVLPPVALTG ADLERLERVE RQLALLWEQL QQRDHKQDER HGNILGLYNT LKEQLHTQTD RESLGLWVSS LLDQRVGVLH GELEQEQSEE QQERLQQGQA TRLAEIELLL STLAARTQEV QQRQKLSEQE KQAVKQEDHD ALLVEVQRLE AELIKVRQDL QGVVGCRGKC AQVSAQVRKE LQTLFFGSSG TGELPESLLH WLSQRYVSTP DLQASLASLE MAILRNVSLQ LEQNRATTLG AAESQAKTIF HTVSGAVQHT AAAEGLTEEH VKIMVQNALR LYSQDRTGLV DYALESGGGS ILSTRCSETY ETKTALMSLF GLPLWYFSQS PRVVIQPDVY PGNCWAFKGS QGYLVIRLSL KIVPTSFCLE HIPKTLSPTG NITSAPRDFT VFGLDDEYQE EGKLLGQYTY QDDGEALQMF PVMEQNDKSF QVIEVRVLSN WGHPDYTCLY RFRVHGDPQL Q // ID H2UUE0_TAKRU Unreviewed; 781 AA. AC H2UUE0; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000040565}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000040565, ECO:0000313|Proteomes:UP000005226}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000040565} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000040565}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000040565} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000040565}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSTRUT00000040707; ENSTRUP00000040565; ENSTRUG00000015866. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 231 254 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 261 281 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 456 476 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 781 AA; 87100 MW; C3A5D7B53956E19B CRC64; MDFSQLHTYT PPQCAPENTG YTYSLSSSYS TAALEFEQEH QIAAVYESPR MSRRSLRLQT GAGHYASENV ADFSLGQSTT RRETSSGGLS LSLSQAATPR KTLSFSAVNT PVSSGIVEES SAATDAALLT GLEQSRLRQR TVTTTNTFSC VDGHSGRRIC SDHRSGVNGD TGTSKSHSSI ANGYICKDCS FHSQKIDSFV ASSPQSSSPF TSIYSRDRSQ RKKTVKAVRR LLWFLATGWY QLVSLMSVFN VFFLTQCLPR LWRLLLLLLP LLLLLALWFW GPSSAALLAY LPAINLTEWR PVSPLTLWYN LVPTSVSTPE TPIGQTPATP ASQIPVERQL ALLWEQLQQR DHKQDERHGN ILGLYNTLKE QLHTQTDRES LGLWVSSLLD QRVGVLHGEL EQEQSEEQQE RLQQGQATRL AEIELLLSTL AARTQEVQQR QKLSEQEKQA VKQEDHDALL VEVQRLEAEL IKVRQDLQGV VGCRGKCAQL DTLAQTVRKE LQTLFFGSSG TGELPESLLH WLSQRYVSTP DLQASLASLE MAILRNVSLQ LEQNRATTLG AAESQAKTIF HTVSGAVQHT AAAEGLTEEH VKIMVQNALR LYSQDRTGLV DYALESGGGS ILSTRCSETY ETKTALMSLF GLPLWYFSQS PRVVIQPDVY PGNCWAFKGS QGYLVIRLSL KIVPTSFCLE HIPKTLSPTG NITSAPRDFT VFGLDDEYQE EGKLLGQYTY QDDGEALQMF PVMEQNDKSF QVIEVRVLSN WGHPDYTCLY RFRVHGDPQL Q // ID H2UUE1_TAKRU Unreviewed; 856 AA. AC H2UUE1; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000040566}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000040566, ECO:0000313|Proteomes:UP000005226}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000040566} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000040566}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000040566} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000040566}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSTRUT00000040708; ENSTRUP00000040566; ENSTRUG00000015866. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 281 301 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 313 334 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 341 361 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 463 483 {ECO:0000256|SAM:Coils}. FT COILED 528 548 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 856 AA; 95038 MW; E83393AF0DDACD93 CRC64; MDFSQLHTYT PPQCAPENTG YTYSLSSSYS TAALEFEQEH QIAAVYESPR MSRRSLRLQT GAGHYASENV ADFSLGQSTT RRETRTLRSK KQQSSSGGLS LSLSQAATPR KTLSFSAVNT PVSSGIVEES SAATDAALLT GLEQSRLRQR TVTTTNTFSC VDGHSGRRIC SDHRSGVNGD TGTSKSHSSI ANGYICKDCS FHSQKIDSFV ASSPQSSSPF TSIYSRDRSQ RKKTGDDCKG KQHAETHSSL HSQSSRLHHL AGALWSVLAY PGHCVVRSGK VLGCGAVTAF QSLLSLLWMF LTAPVKAVRR LLWFLATGWY QLVSLMSVFN VFFLTQCLPR LWRLLLLLLP LLLLLALWFW GPSSAALLAY LPAINLTEWR PVSPLTLWYN LVPTSVSTPE TPIGQTPATP ASQIPSVLPP VALTGADLER LERVERQLAL LWEQLQQRDH KQDERHGNIL GLYNTLKEQL HTQTDRSEEQ QERLQQGQAT RLAEIELLLS TLAARTQEVQ QRQKLSEQEK QAVKQEDHDA LLVEVQRLEA ELIKVRQDLQ GVVGCRGKCA QLDTLAQTVS AQVRKELQTL FFGSSGTGEL PESLLHWLSQ RYVSTPDLQA SLASLEMAIL RNVSLQLEQN RATTLGAAES QAKTIFHTVS GAVQHTAAAE GLTEEHVKIM VQNALRLYSQ DRTGLVDYAL ESGGGSILST RCSETYETKT ALMSLFGLPL WYFSQSPRVV IQPDVYPGNC WAFKGSQGYL VIRLSLKIVP TSFCLEHIPK TLSPTGNITS APRDFTVFGL DDEYQEEGKL LGQYTYQDDG EALQMFPVME QNDKSFQVIE VRVLSNWGHP DYTCLYRFRV HGDPQL // ID H2UYJ8_TAKRU Unreviewed; 1030 AA. AC H2UYJ8; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000042027}; DE Flags: Fragment; GN Name=LOC101080077 {ECO:0000313|Ensembl:ENSTRUP00000042027}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000042027, ECO:0000313|Proteomes:UP000005226}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000042027} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000042027}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000042027} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000042027}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000042028; -. DR Ensembl; ENSTRUT00000042171; ENSTRUP00000042027; ENSTRUG00000016443. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR OrthoDB; EOG7MPRDC; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}. FT COILED 352 379 {ECO:0000256|SAM:Coils}. FT COILED 705 725 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSTRUP00000042027}. SQ SEQUENCE 1030 AA; 113656 MW; 8AA103F53704A16A CRC64; GDVPTSRETD PSVPSKEDIP TFDEWKKQVM EVEMEKSQSL YTSTTGSPNS AKKVQKNFKN NYASVECGAK ILAANSEAKS TSAILKENMD LYMLNPCSNK IWFVIELCEP IQVKQLDIAN FELFSSTPKD FLVSISDRYP TSKWVKLGTF HARDERIVQS FPLDEQLFAK YIKMFIKYIK VELLSHFGSE HFCPLSLIRV FGTSMVEEYE EIAESQYLSE RMEYLDEDYD YPPGYQLADD NPNGSKNLLG SATNAILNMV NNIAANVLGA TPELEGGAES EDNITAEGAD RGGTEASPDF ALLASAELEH PASQENSSES SGPSSLKDSH DHRQIVTLVE EEEEEEPRQS TVTLMEEEGE EEEEKREEET RDADRKQRDS RIYCPLFSSL SLSCMASLPE LLHRWCSARL AKERLHSLRR RQLSIQTHAH PGPNTPSHTH TLPLIPAPAA TPVKEDVPLT EIAPEPKVPS MPQNDGKTVE VHIEPNFPDT HTPELNILLE PSRTVIPTHG FSDPQSSQAT GSTPPLQAAS ILETQQASTA VPTLSASISL QSSEVASSSD VVLPGFEQPV KPVPKTSRPE PVIPPLDKSA ADSGDSQGLN VQASQPSKKP SDSVAQSGEP QQVEDVADED LLSSSGNSNV QRTATDFYAE LQSSGEPNAG AANGNGILLN GGAVHGSNQK ESVFMRLNNR IKALEMNMSL SSRYLEELSQ RYRKQMEEMQ RAFNKTIIKL QNTSRIAQEQ DQKQTESIQV LQSQLVNITR LMLNLTTTVG QLQREVSDRQ SYLVVSLVLC LFLGLLLFLQ CCCRSSPSTS STNSAPIPRS NHYPSPKRCF SSYDDMNLKR RMTCPIIHSN SLPLCSTEVG PDDLYIVEPL RFSPEKKKKR CKSRSLDKVD FLKEYNSCAT LTNGGPKCNG FHPCLSLEEV SSLSTCPSME SHPEASSCSS TVNSEESHVS RLAPQTPPYT SASLCNGHGL TLGTQQLATM SRQEKRLLKR QKSRQAELPF SAVPSLQQLI KGNKEISVGT IEMTAVTGHF // ID H2UYJ9_TAKRU Unreviewed; 1061 AA. AC H2UYJ9; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000042028}; DE Flags: Fragment; GN Name=LOC101080077 {ECO:0000313|Ensembl:ENSTRUP00000042028}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000042028, ECO:0000313|Proteomes:UP000005226}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000042028} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000042028}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000042028} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000042028}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 31033.ENSTRUP00000042028; -. DR Ensembl; ENSTRUT00000042172; ENSTRUP00000042028; ENSTRUG00000016443. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; H2UYJ9; -. DR OMA; SSPWFES; -. DR TreeFam; TF105817; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}. FT COILED 352 379 {ECO:0000256|SAM:Coils}. FT COILED 734 754 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSTRUP00000042028}. SQ SEQUENCE 1061 AA; 116814 MW; A9D4007351268A5A CRC64; PTSRETDPSV PSKEDIPTFD EWKKQVMEVE MEKSQSLYTS TTGSPNSAKK VQKNFKNNYA SVECGAKILA ANSEAKSTSA ILKENMDLYM LNPCSNKIWF VIELCEPIQV KQLDIANFEL FSSTPKDFLV SISDRYPTSK WVKLGTFHAR DERIVQSFPL DEQLFAKYIK VELLSHFGSE HFCPLSLIRV FGTSMVEEYE EIAESQYLSE RMEYLDEDYD YPPGYQLADD NPNGSKNLLG SATNAILNMV NNIAANVLGA TPELEGGAES EGTHLSSRTD NITAEGADRG GTEASPDFAL HRLASAELEH PASQENSSES SGPSSLKDSH DHRQIVTLVE EEEEEEPRQS TVTLMEEEGE EEEEKREEET RDADRKQRDS RIYCPLFSSL SLSCMASLPE LLHRWCSARL AKERLHSLRR RQLSIQTHAH PGPNTPSHTH TLPLIPAPAA TPVKEDVPLT EIAPEPKVPS MPQNDGKTVE VHIEPNFPDT HTPELNILLE PSRTVIPTHG FSDPQSSQMW PTSTEEVKVL VAQATGSTPP LQAASILETQ QASTAVPTLS ASISLQSSEV ASSSDVVLPG FEQPVKPVPK TSRPEPVIPP LGDLPTALPL TDVHVDKSAA DSGDSQGLNV QASQPSKKPS DSVAQSGEPQ QVEDVADEDL LSSSGNSNVQ RTATDFYAEL QSSGEPNAGA ANGNGILLNG GAVHGSNQKE SVFMRLNNRI KALEMNMSLS SRYLEELSQR YRKQMEEMQR AFNKTIIKLQ NTSRIAQEQD QKQTESIQVL QSQLVNITRL MLNLTTTVGQ LQREVSDRQS YLVVSLVLCL FLGLLLFLQC CCRSSPSTSS TNSAPIPRSN HYPSPKRCFS SYDDMNLKRR MTCPIIHSNS LPLCSTEVGP DDLYIVEPLR FSPEKKKKRC KSRSLDKVDF LKEYNSCATL TNGGPKCNGS LPSLPPPVEE VSSLSTCPSM ESHPEASSCS STVNSEESHV SRLAPQTPPY TSASLCNGHG LTLGTQQLAT MSRQEKRLLK RQKSRQAELP FSAVPSLQQL IKGNKEISVG TIEMTAVTGH F // ID H2UYK0_TAKRU Unreviewed; 981 AA. AC H2UYK0; DT 21-MAR-2012, integrated into UniProtKB/TrEMBL. DT 21-MAR-2012, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTRUP00000042029}; GN Name=LOC101080077 {ECO:0000313|Ensembl:ENSTRUP00000042029}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000042029, ECO:0000313|Proteomes:UP000005226}; RN [1] {ECO:0000313|Ensembl:ENSTRUP00000042029} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSTRUP00000042029}; RX PubMed=17554307; DOI=10.1038/nature05846; RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., RA Yamada T., Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., RA Shimada A., Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., RA Asakawa S., Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., RA Sugano S., Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., RA Nomoto H., Nogata K., Morishita T., Endo T., Shin-I T., Takeda H., RA Morishita S., Kohara Y.; RT "The medaka draft genome and insights into vertebrate genome RT evolution."; RL Nature 447:714-719(2007). RN [2] {ECO:0000313|Ensembl:ENSTRUP00000042029} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTRUP00000042029}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSTRUT00000042173; ENSTRUP00000042029; ENSTRUG00000016443. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000005226; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005226}; KW Reference proteome {ECO:0000313|Proteomes:UP000005226}. FT COILED 656 676 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 981 AA; 108757 MW; 081FC52D910FF6A5 CRC64; MEVEMEKTLP GQSLYTSTTG SPNSAKKVQK NFKNNYASVE CGAKILAANS EAKSTSAILK ENMDLYMLNP CSNKIWFVIE LCEPIQVKQL DIANFELFSS TPKDFLVSIS DRYPTSKWVK LGTFHARDER IVQSFPLDEQ LFAKYIKMFI KYIKVELLSH FGSEHFCPLS LIRVFGTSMV EEYEEIAESQ YLSERMEYLD EDYDYPPGYQ LADDNPNGSK NLLGSATNAI LNMVNNIAAN VLGATPELEG GAQRGLTEGA QRHLPTLPCK TNQTFTHITI NSILFKENSS ESSGPSSLKD SHDHRQIVTL VEEEEEEEPR QSTVTLMEEE GEEEEEKKQR DSRIYCPLFS SLSLSCMASL PELLHRWCSA RLAKERLHSL RRRQLSIQTH AHPGPNTPSH THTLPLIPAP AATPVKEDVP LTEIAPEPKV PSMPQNDGKT VEVHIEPNFP DTHTPELNIL LEPSRTVIPT HGFSDPQSSQ MWPTSTEESS EVASSSDVVL PGFEQPVKPV PKTSRPEPVI PPLGDLPTAL PLTDVHVDKS AADSGDSQGL NVQASQPSKK PSDSVAQSGE PQQVEDVADE DLLSSSGNSN VQRTATDFYA ELQSSGEPNA GAANGNGILL NGGAVHGSNQ KESVFMRLNN RIKALEMNMS LSSRYLEELS QRYRKQMEEM QRAFNKTIIK LQNTSRIAQE QDQKQTESIQ VLQSQLVNIT RLMLNLTTTV GQLQREVSDR QSYLVVSLVL CLFLGLLLFL QCCCRSSPST SSTNSAPIPR SNHYPSPKRC FSSYDDMNLK RRMTCPIIHS NSLPLCSTEV GPDDLYIVEP LRFSPEKKKK RCKSRSLDKV DFLKEYNSCA TLTNGGPKCN GFHPCLSLEE VSSLSTCPSM ESHPEASSCS STVNSEESHV SRLAPQTPPY TSASLCNGHG LTLGTQQLAT MSRQEKRLLK RQKSRQAELP FSAVPSLQQL IKGNKEISVG TIEMTAVTGH F // ID H2VWU6_CAEJA Unreviewed; 466 AA. AC H2VWU6; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:CJA06766}; GN Name=WBGene00125970 {ECO:0000313|EnsemblMetazoa:CJA06766}; OS Caenorhabditis japonica. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=281687 {ECO:0000313|EnsemblMetazoa:CJA06766}; RN [1] {ECO:0000313|EnsemblMetazoa:CJA06766} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=DF5081 {ECO:0000313|EnsemblMetazoa:CJA06766}; RG Caenorhabditis japonica Sequencing Consortium; RA Wilson R.K.; RL Submitted (AUG-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblMetazoa:CJA06766} RP IDENTIFICATION. RC STRAIN=DF5081 {ECO:0000313|EnsemblMetazoa:CJA06766}; RG EnsemblMetazoa; RL Submitted (NOV-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 281687.CJA06766; -. DR EnsemblMetazoa; CJA06766; CJA06766; CJA06766. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; H2VWU6; -. DR Proteomes; UP000005237; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005237}; KW Reference proteome {ECO:0000313|Proteomes:UP000005237}. SQ SEQUENCE 466 AA; 53244 MW; EC0C9D9C8D4A486B CRC64; MALRSVSPTF SNRSSPPITR SVSRNGSRHL QPAPAGFDTS TPLTRRSIQP GMHVETIERV FESADETDVN LNSSQFIYKE HFTVTEMTSM KKEMWYDWLK YRIRMLRRRF IPSMKTFREL LTIVLLVTMT CYYLRDNHKT ETENSNNQFY VDAEQKFHKS ISNLKSDFHN FEKRINLRVE SIENQLEVLK GWNDSVMLEL QNIKFSQTDL AVSLQNLKTD LNIEKQKISE VISVEPIVVT PPATELQMHA FSQSSINRRP LPGVNVANSL IGASIDESCS SRTASAKDGI FYDVMSYVVS FQEGYVLLDR DVLSPGEAWC TYDQRPTLTV KLARYILPTA VSYQHVRWNG IVPNHAPKLY DLVACRSPCC TQWEPLITNC EYKPSLDGQD DQEQFCSVPS TVHSTPINHV QFRFRENHGN MTKTCAYLVR VYGDPVDGPE EEHSSTDNGT GHLETEFLNA SPTQTV // ID H2XAJ8_CAEJA Unreviewed; 191 AA. AC H2XAJ8; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:CJA26236}; GN Name=WBGene00181808 {ECO:0000313|EnsemblMetazoa:CJA26236}; OS Caenorhabditis japonica. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=281687 {ECO:0000313|EnsemblMetazoa:CJA26236}; RN [1] {ECO:0000313|EnsemblMetazoa:CJA26236} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=DF5081 {ECO:0000313|EnsemblMetazoa:CJA26236}; RG Caenorhabditis japonica Sequencing Consortium; RA Wilson R.K.; RL Submitted (AUG-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblMetazoa:CJA26236} RP IDENTIFICATION. RC STRAIN=DF5081 {ECO:0000313|EnsemblMetazoa:CJA26236}; RG EnsemblMetazoa; RL Submitted (NOV-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 281687.CJA26236; -. DR EnsemblMetazoa; CJA26236; CJA26236; CJA26236. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000005237; Unassembled WGS sequence. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005237}; KW Reference proteome {ECO:0000313|Proteomes:UP000005237}. SQ SEQUENCE 191 AA; 21983 MW; E5F1D9F8D386AC5C CRC64; MWAERSRHSH FDQSRRRNLA SIQRFLPKET TTTKPPTSPP PTPNTAKNEK ITKNPEEKPQ KQKSEPILPA GGSTSQREMV LMKLSKRISA VETNLTLSTE YLSELSKQYV TQMSGYQQEL KETRKASKQS AQSLEVAMRA KMSIVKRELR ELRQSVLLLQ KLEHQRNKQA KNEMSRNIFM SSCHYSSNVP P // ID H2XKE2_CIOIN Unreviewed; 2486 AA. AC H2XKE2; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCINP00000030124}; OS Ciona intestinalis (Transparent sea squirt) (Ascidia intestinalis). OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. OX NCBI_TaxID=7719 {ECO:0000313|Ensembl:ENSCINP00000030124, ECO:0000313|Proteomes:UP000008144}; RN [1] {ECO:0000313|Ensembl:ENSCINP00000030124} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15114417; DOI=10.1007/s00239-003-2559-6; RA Gissi C., Iannelli F., Pesole G.; RT "Complete mtDNA of Ciona intestinalis reveals extensive gene RT rearrangement and the presence of an atp8 and an extra trnM gene in RT ascidians."; RL J. Mol. Evol. 58:376-389(2004). RN [2] {ECO:0000313|Ensembl:ENSCINP00000030124} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCINP00000030124}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; EAAA01000712; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7719.ENSCINP00000030124; -. DR Ensembl; ENSCINT00000033162; ENSCINP00000030124; ENSCING00000023878. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H2XKE2; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000008144; Unassembled WGS sequence. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 2. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008144}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008144}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT TRANSMEM 2212 2230 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 467 487 {ECO:0000256|SAM:Coils}. FT COILED 493 514 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2486 AA; 278968 MW; 4BFCAA92259F2CF9 CRC64; MADVDPDTLL EWLQMGVGQE RDMQLIALEQ LCMLLLMSDN VDRCFEMCPP RSFLPALCKI FLDETAPDNV VEVAARAMTY YLDVSADCTR RIVAVDGAVK AICNRLSLRL SDDRTNKDLS EQCVKVLEFI CTREPGAVFE AGGLSAVMKF VCNCGSIIHK DTLHSSMFVV SRLCGKVEAT SDSLPECIQS LSSLLHYDDA HVADSALKCF SSLADRFARK GVNPEPLDAF GLTDELIKRL GNCGNRVPGT PFSKTKQTGG TPNATPDSKA NFGITTVVNL LCTLCRGSSE ITHKVLRSDL SQAIEDALKG NERCCLDTMR ICDLLLILLF EGRQAIPKHY LGLFGAPRCL NLSNRRMETM DGERSHRQLI DCIRSKDTEA LIDAVESGVY DVNFMDDVGQ TLLNWAAAFG TQEMVDFLCD RGADVNKGQR SSSLHYAACF GRPAVVKTLL LNSANPTLHD EEGKTALEKA KERNDDGHKE VVKLLEDPGI LQCVKFQEQT KDAEKKSKEE EKQNLIKGDS PVASVFINKL LPLFLDTYQV TVYPSVAQSA LSLLHKTVKY VVEQQLRDAV LSAQNIPEKF ANVVSTALDQ EDNDEGHLAA LEIIQNLMDK CYDLFACSLN HEWIANKIRD MFAPHEDKED DGAVGGQAEG IYLFLYINIP VAQESSNKLE ECMPTSVEDA TSMKPGSLYS WKKTWTFARG KGCLYLWSSA TAIELSHGSN GWFRYILDGK LSTMYSSGSP EGGTDSSESR SEFLEKLQRA FMEASTGEVM LPVLSKPDSV RITAGNWTIV SNKEDELLIT NSDGQQATIL KKEMSGFLFE SNRGTRHAFT PESLLSMDFL NKSVDKKTTQ PSMNKEEELK DKIVVLSRSL YDEYFTSKKQ AFKNVVTDLK KIAQKIEEFS SRDSDINRKC IFYVVTRIVQ QSSLTQLRTL LEEDKGVSAF DLYNSGLIQA MLKALLTNLN ACPIVCFKGT CSGRDDRIEK FKLVFSKATS DAVQVLVRKL ISVLESIERF PVQLYDSPTS FNRGLHLLSR KFSFKMEYQC CADDSNLVDY TGRTLKMETL SSVDDLEQHI LKMASKQWYD HDVSNHAYVM RAREGEVNFK YSSDFDENGI IYWIGTNAKN EYDWTNPASY GLVHVTSSDE GGLPYGKLED ILSRDTTPCN CHTSDNEKAW IALDFGLQII PTKYSLRHSR GYSRSALRNW LFQASNDGKN WTTLVTHTND KSLNQPGSTA SWSIPVDEDE KRGWHQFRIQ QNGKNSSRHM TYLSISGFEI YGAVKGVSHD PPGCAYKKER KGLQVQVTTG ANKQMVPGTR VVRGVDWKWR NQDSRSVGTV SSPIQNETSS ANRSRYNLNH KSFTHTNNEK SQRSVLSMMR AFHSRKDSKR SQKSGKNLPD VGRSDPAVVR SESASSSSSY GDDFLYLDED TEEEKKQERS CAPELDGRTC RLGVYKSSPA PSLVQRRIRK LLNEVGVILK TSRTGVDTNR PSAAFTTTPT GNSHLLPPGE PVSASVSVPN LSSSEAASRM MESFVRSITR APAILNVNDL SNIDEFQYEH NSPSLGYKGR FSVSSPLTSA QSVPNLSAPV TVASNSSSPS IVSSHSVLQA ITQALVTNAN NESEAVSDLF RCLVEDLPTC DPTSSSHSDI FTELDDENDE DENEEDEEFD QLMMQQAEEE VTVAISSYTW DDDHVIKRNL PALIPAFDPR PGRMNLLQTV DLEIPPPGSN ETKDNTKNVL PTNQPKKLQL YLKGINLNKE TIQIPLEKGC TMLNGVQKLM LNHSSSTKSS NLRKLWDTTY TIVYSEVKNS SKFNSCIKVQ QWTPDFLVSH LTSGKLTKHE VLEYLKQNAT KAFISRWNLS KDSTAAQIMQ AFNEFYWDPT NQLKPSNHKT TICAKDVHDV LELLKTLYAM YGDDDIILTE DLICKKLNTK LYQQIEDVVC LSCHALPEWC NFVMKNYSFL FNFDVRNKFF SSTAFGPSRS IVWMQNSSSS QLDRSMAAAS MRRDDPSGFH DLMQLGRLRH ERVKVPREEE TLLDWAINVL DLHAEKKSML EVEFIGEEGT GLGPTLEFYS LVAAELQRKE LGMWLVDDNF IQVDPPEVHG MKRFDYYVQK SGGLFPSPLP QNNIDQVVKL FHFLGILLAK CLQDARLIDL PLSKTFLKLM CSEQQSSAVE QSRFPTFLDI LEFCLNDVST SCSGHFHQIK MNLIKFGYQC ILKFFISNIK NMKIITFVIL LVIFLLIKLF RHTFLSKLLE LCDVRDSILC DLSLTDVEKT SQLNELYLEY NGTKCKVEDL GLTFQFLPSS SVYNYTSYPL TKEGANIDLN LENARQYVNL TLNFYFKVGL EKQMAAFTEG FNRVFPISNL LLFTPDELHL NLCGDQTPQW SRDDVLAYTE PRLGFTKESS LNSRGFLHLV NVICDLTGHE RKSFLQFATG CSSLPPGGLA NLSPRLTIVR KVDSGDGSYP SVNTCVHYLK LPDYSTEAIL KERLLAATRE KGFHLN // ID H2YD24_CIOSA Unreviewed; 294 AA. AC H2YD24; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCSAVP00000003222}; OS Ciona savignyi (Pacific transparent sea squirt). OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000003222, ECO:0000313|Proteomes:UP000007875}; RN [1] {ECO:0000313|Ensembl:ENSCSAVP00000003222} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., RA Ait-zahra M., Allen N., Allen T., An P., Anderson M., Anderson S., RA Arachchi H., Armbruster J., Bachantsang P., Baldwin J., Barry A., RA Bayul T., Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., RA Borowsky M., Boukhgalter B., Brunache A., Butler J., Calixte N., RA Calvo S., Camarata J., Campo K., Chang J., Cheshatsang Y., Citroen M., RA Collymore A., Considine T., Cook A., Cooke P., Corum B., Cuomo C., RA David R., Dawoe T., Degray S., Dodge S., Dooley K., Dorje P., RA Dorjee K., Dorris L., Duffey N., Dupes A., Elkins T., Engels R., RA Erickson J., Farina A., Faro S., Ferreira P., Fischer H., RA Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G., Gnerre S., RA Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K., Hafez N., RA Hagopian D., Hagos B., Hall J., Hatcher B., Heller A., Higgins H., RA Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E., Iliev I., RA Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M., Karlsson E., RA Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E., Labutti K., RA Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T., RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., RA Lui A., Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., RA Manning J., Marabella R., Maru K., Matthews C., Mauceli E., RA Mccarthy M., Mcdonough S., Mcghee T., Meldrim J., Meneus L., RA Mesirov J., Mihalev A., Mihova T., Mikkelsen T., Mlenga V., Moru K., RA Mozes J., Mulrain L., Munson G., Naylor J., Newes C., Nguyen C., RA Nguyen N., Nguyen T., Nicol R., Nielsen C., Nizzari M., Norbu C., RA Norbu N., O'donnell P., Okoawo O., O'leary S., Omotosho B., RA O'neill K., Osman S., Parker S., Perrin D., Phunkhang P., Piqani B., RA Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V., Raymond C., RA Retta R., Richardson S., Rise C., Rodriguez J., Rogers J., Rogov P., RA Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T., RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C., RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., RA Stetson K., Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., RA Tenzing P., Tesfaye S., Theodore J., Thoulutsang Y., Topham K., RA Towey S., Tsamla T., Tsomo N., Vallee D., Vassiliev H., RA Venkataraman V., Vinson J., Vo A., Wade C., Wang S., Wangchuk T., RA Wangdi T., Whittaker C., Wilkinson J., Wu Y., Wyman D., Yadav S., RA Yang S., Yang X., Yeager S., Yee E., Young G., Zainoun J., Zembeck L., RA Zimmer A., Zody M., Lander E.; RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000003222} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCSAVP00000003222}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 51511.ENSCSAVP00000003222; -. DR Ensembl; ENSCSAVT00000003271; ENSCSAVP00000003222; ENSCSAVG00000001915. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H2YD24; -. DR OMA; FPLWYFS; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007875; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007875}; KW Reference proteome {ECO:0000313|Proteomes:UP000007875}. SQ SEQUENCE 294 AA; 32792 MW; B6840C0F1CE3925E CRC64; LQSSPKSSKS SNRRYQHLAA VFVQWLQLRG FVDEQSVVVL QNNIAKNVSN LAVELSVQLE KKIQSYRKEQ ASRSTITLDT IIPKHPHTKT STGGITEALI KAWIGESLEV YSADRIGIAD FALESSGGYI VSTRCSKSFQ RKTALVSIFG IPIYYNINTP RSVIQPNVMP GDCWAFQGSE GYIVIGLSAA VLPDSFTLEH IPQSIALYKN ISSAPKDFSV YGLQSSSDVD GEHLGSYRYN KELSSIQNFK AEPKKSDQIF RFIELRIASN WGNPHFTCVY RFRVHGTKVE DSSD // ID H2Z0Z7_CIOSA Unreviewed; 2505 AA. AC H2Z0Z7; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCSAVP00000011259}; GN Name=Csa.1003 {ECO:0000313|Ensembl:ENSCSAVP00000011259}; OS Ciona savignyi (Pacific transparent sea squirt). OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000011259, ECO:0000313|Proteomes:UP000007875}; RN [1] {ECO:0000313|Ensembl:ENSCSAVP00000011259} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., RA Ait-zahra M., Allen N., Allen T., An P., Anderson M., Anderson S., RA Arachchi H., Armbruster J., Bachantsang P., Baldwin J., Barry A., RA Bayul T., Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., RA Borowsky M., Boukhgalter B., Brunache A., Butler J., Calixte N., RA Calvo S., Camarata J., Campo K., Chang J., Cheshatsang Y., Citroen M., RA Collymore A., Considine T., Cook A., Cooke P., Corum B., Cuomo C., RA David R., Dawoe T., Degray S., Dodge S., Dooley K., Dorje P., RA Dorjee K., Dorris L., Duffey N., Dupes A., Elkins T., Engels R., RA Erickson J., Farina A., Faro S., Ferreira P., Fischer H., RA Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G., Gnerre S., RA Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K., Hafez N., RA Hagopian D., Hagos B., Hall J., Hatcher B., Heller A., Higgins H., RA Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E., Iliev I., RA Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M., Karlsson E., RA Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E., Labutti K., RA Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T., RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., RA Lui A., Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., RA Manning J., Marabella R., Maru K., Matthews C., Mauceli E., RA Mccarthy M., Mcdonough S., Mcghee T., Meldrim J., Meneus L., RA Mesirov J., Mihalev A., Mihova T., Mikkelsen T., Mlenga V., Moru K., RA Mozes J., Mulrain L., Munson G., Naylor J., Newes C., Nguyen C., RA Nguyen N., Nguyen T., Nicol R., Nielsen C., Nizzari M., Norbu C., RA Norbu N., O'donnell P., Okoawo O., O'leary S., Omotosho B., RA O'neill K., Osman S., Parker S., Perrin D., Phunkhang P., Piqani B., RA Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V., Raymond C., RA Retta R., Richardson S., Rise C., Rodriguez J., Rogers J., Rogov P., RA Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T., RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C., RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., RA Stetson K., Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., RA Tenzing P., Tesfaye S., Theodore J., Thoulutsang Y., Topham K., RA Towey S., Tsamla T., Tsomo N., Vallee D., Vassiliev H., RA Venkataraman V., Vinson J., Vo A., Wade C., Wang S., Wangchuk T., RA Wangdi T., Whittaker C., Wilkinson J., Wu Y., Wyman D., Yadav S., RA Yang S., Yang X., Yeager S., Yee E., Young G., Zainoun J., Zembeck L., RA Zimmer A., Zody M., Lander E.; RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000011259} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCSAVP00000011259}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSCSAVT00000011391; ENSCSAVP00000011259; ENSCSAVG00000006589. DR GeneTree; ENSGT00530000063470; -. DR OrthoDB; EOG7Z69BD; -. DR Proteomes; UP000007875; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007875}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007875}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 447 467 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2505 AA; 279068 MW; A9B8B3C2824535F1 CRC64; MADVDPDTLL EWLQTGVGQE RDMQLIALEQ LCMLLLMSDN VDRCFEMCPP RSFLPALCKI FLDETAPDNV VEVAARAMTY YLDVSAECTR RIVAVDGAVK AICNRLSLRL LDDRTNKDLS EQCVKVLEFI CTREPGAVFE AGGLSSVMKF ICNCGSIIHK DTLHSSMFVV SRLCGKMEVA SESLPECIQS LSSLLHYDDA HVADSALRCF SSLADRFTRK GVNPEPLDAY GLTDELIKRL GKYAMQIQFS NFGITTVVNL LCTLCRGSSE ITHKVLGSDL TKAIEDAMKG DERCCLDTMR ICDLLLILLF EGRQAIPKHY VGLFGAPRCL NLTNRRMENL DGDRSHRQLI DCIRSKDTEA LIDAVESGVY DVNFMDDVGQ TLLNWAAAFG TQEMVDFLCD RGADVNKGQR SSSLHYAACF GRPSVVKSLL LHSANTSLHD EEGKTALEKA RERNDEGHKE VVKLLEDPVK KKQVMLQEQS KDADKRLKDE EKSNLVKGDP PAVSMFINKL LPLFLDTYQG AVYPTVAQSS MSLLHKTVKY VSEQQLADVA RTQQNIPEKF ANVVSTAFDQ EDNDEVHLTA LEIVQSLMDK CYDLFASSLN HDWIADKIRD LHAPGEDKKE EGAIAMKPGS LYSWKKTWTF ARGKGCLYLW SSATAIELSH GSNGWFRYIL DGKLSTMYSS GSPEGGTDSS GNHFFLHIDH RCKVGVPYCI PCFLIRLAYF TGVLFTLKHG APESYNYLLP PYLSVVCLPE SRSEFLDKLQ RSFMEASTTE GLASNLPRCS RKPGSLKLAA GNCVTHSKED ELLVTNTDGQ QATILKKDLS GFLFESNRGT RHAFTPESLL SMDFLNRSAD KKPAQPTRNK EEELKDKIRK LSRVLYEDYF TSKHQALKSV VNDLKILAQD IEQCTSVGTE SSEQNNMFET SLTKLRTLLR DDKCVSAFDL YNSGLIQSLL KALVPAVFIY FFLQDQSEKS RRSLLTRVEA FKTTFCESNN PTVRLLVKKL ISVLESIERF PVQLYDSPTS FNRGIHLLSR KFSFKMDYQC CTTDSSLVDY TGRTLKMETL SSVDDLEQYV LKMASKQWYD HEASTHAYVL QAKQGEITFN YSGDFDENGI VYWIGTNAKN ESDWTNPASH GLVHVTSSDE GGLPYGKLAD ILSRDSISCN CHTSDDEKAW LAIDFGLHII PTMYTLRHSR GYSRSALRNW LFQASNDGQT WTTLITHRND KSLNQPGSTA SWPVSPESDE TKGWRHFRIQ QNGKNSSRHM TYLSISGFEI YGKVTGVSDE APGAAYKKER KTLKAQASKQ MVPGARVVRG VDWKWRNQDS RGLGTVNSAI HNGWVDVTWD NGISNLYRMG AEDKFDVKLA PPRDDKLPSD PNSNIFSRRG VLSSLIRSNR LAHASGRHQR YEQYLAARRN SEASSVSRSR YNVSHKSYSQ NSNEVGRNAR NRARSDSDDS DHQKSQRSVL SMMRAFHTRK EGKKSQKGSK NAPDTGKSEG GGVRSESASS SSSYGDEVLY LDEDIEEEKK QERSCAPELD GRTCRLGVYK STPAPSLVQR RIRKLLNEVH GGDSNQPSSS HGIEIGTSSH KNIVRYIVSS VTSSSSESLR NDPLTEIDVL NLLVHGQQPP PEERKIASSD ATQKSASLES VAGEVDKCVV SVYIYMQGQL IHIPKFSIHN KPIPTQVSAS VSVPNLSSSE CASRMMESFV RSITRAPAIL NITYNSDPPL PGPLTTAQSV PNLSAPVTVA SSFSSPSIVG SNSVLQAITQ ALATNGNHDS ETAMLGLPPP SAMSSNKSSN STSWDDDHVI KRNFPALIPA FDPRPGRLNL PQTVDLTIPA PGGNQFFPGT DSKASTKALA PDQPRKVYFY LKGTNMAGEA VQIPLEKGRT MLDCVQQLVL NPCSSTKSSN LRKLWEAQYT IVYSETELDT NKVRLHLITL TSCSVHQLKP PSNHRATICT KEVHDLLELL RCLQQNYCYD AGLTADDLLC KKLSTKLYQQ IEDVVCLACH ALPEWCNGMM KRYPFLFNFD VRNKFFSSTA FGPARSVVWM QNTSSSQFDR SGGSRNLGMS AVASLARRDD PTGLHDLIQL GRLRHERVKV PRDDATLLDW AVNVLDVHAE KKSILEVEFM GEEGTGLGPT LEFYSLVAAE LQRKDLAMWL VDDNFTHKPR DVIESVDEKK SDHYVQRPGG LFPAPLPQDN IDEVVGLFGF LGTLLAKCLQ DSRLIDLPLS TSFIKLICRE SPICPQSQVG ICLTHDLVSI DPARHTFLAK LVALSDERDE IMNNGSLSDA EKTHKVEGLL LDYNGTKCKV EDLGLTLQFL PTSSVYKFTS YPLVEGGERV DLTLQNARQY VDLTINFYFE LGLRKQMAAF RDGFNRVFPI TNLLSFTKDE LHLKLCGDQT PQWTRDDVIA YTEPKLGFTK DSTGFLHFVN VMCDLSGSER KSFLQFATGC SSLPPGGLAN LSPHLTIVKK VDSGDGSYPS VNTCVHYLKL PDYSSEAILK ERLLAATREK GFHLN // ID H2Z0Z8_CIOSA Unreviewed; 2677 AA. AC H2Z0Z8; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCSAVP00000011260}; GN Name=Csa.1003 {ECO:0000313|Ensembl:ENSCSAVP00000011260}; OS Ciona savignyi (Pacific transparent sea squirt). OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000011260, ECO:0000313|Proteomes:UP000007875}; RN [1] {ECO:0000313|Ensembl:ENSCSAVP00000011260} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., RA Ait-zahra M., Allen N., Allen T., An P., Anderson M., Anderson S., RA Arachchi H., Armbruster J., Bachantsang P., Baldwin J., Barry A., RA Bayul T., Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., RA Borowsky M., Boukhgalter B., Brunache A., Butler J., Calixte N., RA Calvo S., Camarata J., Campo K., Chang J., Cheshatsang Y., Citroen M., RA Collymore A., Considine T., Cook A., Cooke P., Corum B., Cuomo C., RA David R., Dawoe T., Degray S., Dodge S., Dooley K., Dorje P., RA Dorjee K., Dorris L., Duffey N., Dupes A., Elkins T., Engels R., RA Erickson J., Farina A., Faro S., Ferreira P., Fischer H., RA Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G., Gnerre S., RA Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K., Hafez N., RA Hagopian D., Hagos B., Hall J., Hatcher B., Heller A., Higgins H., RA Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E., Iliev I., RA Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M., Karlsson E., RA Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E., Labutti K., RA Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T., RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., RA Lui A., Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., RA Manning J., Marabella R., Maru K., Matthews C., Mauceli E., RA Mccarthy M., Mcdonough S., Mcghee T., Meldrim J., Meneus L., RA Mesirov J., Mihalev A., Mihova T., Mikkelsen T., Mlenga V., Moru K., RA Mozes J., Mulrain L., Munson G., Naylor J., Newes C., Nguyen C., RA Nguyen N., Nguyen T., Nicol R., Nielsen C., Nizzari M., Norbu C., RA Norbu N., O'donnell P., Okoawo O., O'leary S., Omotosho B., RA O'neill K., Osman S., Parker S., Perrin D., Phunkhang P., Piqani B., RA Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V., Raymond C., RA Retta R., Richardson S., Rise C., Rodriguez J., Rogers J., Rogov P., RA Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T., RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C., RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., RA Stetson K., Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., RA Tenzing P., Tesfaye S., Theodore J., Thoulutsang Y., Topham K., RA Towey S., Tsamla T., Tsomo N., Vallee D., Vassiliev H., RA Venkataraman V., Vinson J., Vo A., Wade C., Wang S., Wangchuk T., RA Wangdi T., Whittaker C., Wilkinson J., Wu Y., Wyman D., Yadav S., RA Yang S., Yang X., Yeager S., Yee E., Young G., Zainoun J., Zembeck L., RA Zimmer A., Zody M., Lander E.; RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000011260} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 2 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCSAVP00000011260}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 51511.ENSCSAVP00000011260; -. DR Ensembl; ENSCSAVT00000011392; ENSCSAVP00000011260; ENSCSAVG00000006589. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H2Z0Z8; -. DR OMA; NRQCIEG; -. DR TreeFam; TF323674; -. DR Proteomes; UP000007875; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 2. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 1. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007875}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007875}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 500 520 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2677 AA; 298274 MW; 991AFF5BD3C2B74A CRC64; MADVDPDTLL EWLQTGVGQE RDMQLIALEQ LCMLLLMSDN VDRCFEMCPP RSFLPALCKI FLDETAPDNV VEVAARAMTY YLDVSAECTR RIVAVDGAVK AICNRLSLRL LDDRTNKDLS EQCVKVLEFI CTREPGAVFE AGGLSSVMKF ICNCGSIIHK DTLHSSMFVV SRLCGKMEVA SESLPECIQS LSSLLHYDDA HVADSALRCF SSLADRFTRK GVNPEPLDAY GLTDELIKRL GNVFNTIHST LTVLHILHHL QCTQHLITLT PIHFSCGNRV ASVPLNKSLQ SGGTPGATPD SKANFGITTV VNLLCTLCRG SSEITHKVLG SDLTKAIEDA MKGDERCCLD TMRICDLLLI LLFEGRQAIP KHYVGLFGAP RCLNLTNRRM ENLDGDRSHR QLIDCIRSKD TEALIDAVES GVYDVNFMDD VGQTLLNWAA AFGTQEMVDF LCDRGADVNK GQRSSSLHYA ACFGRPSVVK SLLLHSANTS LHDEEGKTAL EKARERNDEG HKEVVKLLED PVMLQEQSKD ADKRLKDEEK SNLVKGDPPA VSMFINKLLP LFLDTYQGAV YPTVAQSSMS LLHKTVKYVS EQQLADVART QQNIPEKFAN VVSTAFDQED NDEVHLTALE IVQSLMDKCY DLFASSLNHD WIADKIRDLH APGEDKKEEG AIGGEIKIKQ EGKTEINNSV MIIFQLTENT FSSSVEDATA MKPGSLYSWK KTWTFARGKG CLYLWSSATA IELSHGSNGW FRYILDGKLS TMYSSGSPEG GTDSSESRSE FLDKLQRSFM EASTTEGLAS NLPRCSRKPG SLKLAAGNCV THSKEDELLV TNTDGQQATI LKKDLSGFLF ESNRGTRHAF TPESLLSMDF LNRSADKKPA QPTRNKEEEL KDKIRKLSRV LYEDYFTSKH QALKSVVNDL KILAQDIEQC TSQNNMFETS LTKLRTLLRD DKCVSAFDLY NSGLIQSLLK ALVFIYFFLQ DQSEKSRRSL LTRVEAFKTT FCESNNPTVR LLVKKLISVL ESIERFPVQL YDSPTSFNRG IHLLSRKFSF KMDYQCCTTD SSLVDYTGRT LKMETLSSVD DLEQYVLKMA SKQWYDHEAS THAYVLQAKQ GEITFNYSGD FDENGIVYWI GTNAKNESDW TNPASHGLVH VTSSDEGGLP YGKLADILSR DSISCNCHTS DDEKAWLAID FGLHIIPTMY TLRHSRGYSR SALRNWLFQA SNDGQTWTTL ITHRNDKSLN QPGSTASWPV SPESDETKGW RHFRIQQNGK NSSRHMTYLS ISGFEIYGKV TGVSDEAPGA AYKKERKTLK AQVTFCSNIH KQMVPGARVV RGVDWKWRNQ DSRGLGTVNS AIHNGWVDVT WDNGISNLYR MGAEDKFDVK LAPPRDDKLP SDPNSNIFSR RGVLSSLIRS NRRNSEASSV SRSRYNVSHK SYSQNSNEVG RNARNRARSD SDDSDHQKSQ RSVLSMMRAF HTRKEGKKSQ KGSKNAPDTG KSEGGGVRSE SASSSSSYGD EVLYLDEDIE EEKKQERSCA PELDGRTCRL GVYKSTPAPS LKFKCKSFPL KYVYSILTIY ATNVFKSNKY IYRRLQPTKL QSRNRNRNLQ SQEHCPMCGE LNDLSAHTVD CLRTNFYIVS SVTSSSSESL RNDPLTEIDV LNLLVHGQQP PPEERKIASS DATQKSASLE SVAGEVDKAV RPAIDHLLGS MLPHPIIAIY CHLESLIHNK PIPTQVSASV SVPNLSSSEC ASRMMESFVR SITRAPAILN VNDLSNIEEG STGMGLGRDS TDAVGKRVNL FGNFLKKTLR PLTTAQSVPN LSAPVTVASS FSSPSIVGSN SVLQAITQAL ATNGNHDSET DLFRFVEELP TCDADSSSPP CDIFTELDDE NEEENEDYEE FDQIMVSLGA MLGLPPPSAM SSNKSSNSTS WDDDHVIKRN FPALIPAFDP RPGRLNLPQT VDLTIPAPGG NQFFPGTDSK ASTKALAPDQ PRKVYFYLKG TNMAGEAVQI PLEKGRTMLD CVQQLVLNPC SSTKSSNLRK LWEAQYTIVY SETELDTNKV RLHLITLTSF TNDQTTQEFH WNPGNQLKPS NHRATICTKE VHDLLELLRC LQQNYCYDAG LTADDLLCKK LSTKLYQQIE DVVCLACHAL PEWCNGMMKR YPFLFNFDVR NKFFSSTAFG PARSVVWMQN TSSSQFDRSA VASLARRDDP TGLHDLIQLG RLRHERVKVP RDDATLLDWA VNVLDVHAEK KSILEVEFMG EEGTGLGPTL EFYSLVAAEL QRKDLAMWLV DDNFTHKPRD RVESVDEKKS DHYVQRPGGL FPAPLPQDNI DEVVGLFGFL GTLLAKCLQD SRLIDLPLST SFIKLICRES PICPQSQVGI CLTQLNEAIT IFISCSIQYS VIKTCNITVT SSDTSQSNLA TGVLGLSDLV SIDPARHTFL AKLVALSDER DEIMNNGSLS DAEKTHKVEG LLLDYNGTKC KVEDLGLTLQ FLPTSSVYKF TSYPLVEGGE RVDLTLQNAR QYVDLTINFY FELGLRKQMA AFRDGFNRVF PITNLLSFTK DELHLKLCGD QTPQWTRDDV IAYTEPKLGF TKDSTGFLHF VNVMCDLSGS ERKSFLQFAT GCSSLPPGGL ANLSPHLTIV KKVDSGDGSY PSVNTCVHYL KLPDYSSEAI LKERLLAATR EKGFHLN // ID H2Z0Z9_CIOSA Unreviewed; 2632 AA. AC H2Z0Z9; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCSAVP00000011261}; GN Name=Csa.1003 {ECO:0000313|Ensembl:ENSCSAVP00000011261}; OS Ciona savignyi (Pacific transparent sea squirt). OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000011261, ECO:0000313|Proteomes:UP000007875}; RN [1] {ECO:0000313|Ensembl:ENSCSAVP00000011261} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., RA Ait-zahra M., Allen N., Allen T., An P., Anderson M., Anderson S., RA Arachchi H., Armbruster J., Bachantsang P., Baldwin J., Barry A., RA Bayul T., Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., RA Borowsky M., Boukhgalter B., Brunache A., Butler J., Calixte N., RA Calvo S., Camarata J., Campo K., Chang J., Cheshatsang Y., Citroen M., RA Collymore A., Considine T., Cook A., Cooke P., Corum B., Cuomo C., RA David R., Dawoe T., Degray S., Dodge S., Dooley K., Dorje P., RA Dorjee K., Dorris L., Duffey N., Dupes A., Elkins T., Engels R., RA Erickson J., Farina A., Faro S., Ferreira P., Fischer H., RA Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G., Gnerre S., RA Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K., Hafez N., RA Hagopian D., Hagos B., Hall J., Hatcher B., Heller A., Higgins H., RA Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E., Iliev I., RA Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M., Karlsson E., RA Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E., Labutti K., RA Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T., RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., RA Lui A., Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., RA Manning J., Marabella R., Maru K., Matthews C., Mauceli E., RA Mccarthy M., Mcdonough S., Mcghee T., Meldrim J., Meneus L., RA Mesirov J., Mihalev A., Mihova T., Mikkelsen T., Mlenga V., Moru K., RA Mozes J., Mulrain L., Munson G., Naylor J., Newes C., Nguyen C., RA Nguyen N., Nguyen T., Nicol R., Nielsen C., Nizzari M., Norbu C., RA Norbu N., O'donnell P., Okoawo O., O'leary S., Omotosho B., RA O'neill K., Osman S., Parker S., Perrin D., Phunkhang P., Piqani B., RA Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V., Raymond C., RA Retta R., Richardson S., Rise C., Rodriguez J., Rogers J., Rogov P., RA Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T., RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C., RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., RA Stetson K., Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., RA Tenzing P., Tesfaye S., Theodore J., Thoulutsang Y., Topham K., RA Towey S., Tsamla T., Tsomo N., Vallee D., Vassiliev H., RA Venkataraman V., Vinson J., Vo A., Wade C., Wang S., Wangchuk T., RA Wangdi T., Whittaker C., Wilkinson J., Wu Y., Wyman D., Yadav S., RA Yang S., Yang X., Yeager S., Yee E., Young G., Zainoun J., Zembeck L., RA Zimmer A., Zody M., Lander E.; RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000011261} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCSAVP00000011261}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSCSAVT00000011393; ENSCSAVP00000011261; ENSCSAVG00000006589. DR GeneTree; ENSGT00530000063470; -. DR Proteomes; UP000007875; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007875}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007875}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 442 462 {ECO:0000256|SAM:Coils}. FT COILED 477 497 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2632 AA; 293562 MW; 7B4BBA597A8A1D49 CRC64; MADVDPDTLL EWLQTGVGQE RDMQLIALEQ LCMLLLMSDN VDRCFEMCPP RSFLPALCKI FLDETAPDNV VEVAARAMTY YLDVSAECTR RIVAVDGAVK AICNRLSLRL LDDRTNKDLS EQCVKVLEFI CTREPGAVFE AGGLSSVMKF ICNCGSIIHK DTLHSSMFVV SRLCGKMEVA SESLPECIQS LSSLLHYDDA HVADSALRCF SSLADRFTRK GVNPEPLDAY GLTDELIKRL GNLQSGGTPG ATPDSKANFG ITTVVNLLCT LCRGSSEITH KVLGSDLTKA IEDAMKGDER CCLDTMRICD LLLILLFEGR QAIPKHYVGL RMENLDGDRS HRQLIDCIRS KDTEALIDAV ESGVYDVNFM DDVGQTLLNW AAAFGTQEMV DFLCDRGADV NKGQRSSSLH YAACFGRPSV VKSLLLHSAN TSLHDEEGKT ALEKARERND EGHKEVVKLL EDPETWIASE EAVKKKQVML QEQSKDADKR LKDEEKNQTF IISLRRLLPL FLDTYQGAVY PTVAQSSMSL LHKTVKYVSE QQLADVARTQ QNIPEKFANV VSTAFDQEDN DEVHLTALEI VQSLMDKCYD LFASSLNHDW IADKIRDLHA PGEDKKEEGA IGGEIKIKQE GIQLWCQECN NNCLALKICT PGVKHNDMQL KSNTQPYKHA YFVTTLHPGK KQQFAPNSNI NTTDLIHLLS CGSSRAGAKL DSLAVSSSVE DATAMKPGSL YSWKKTWTFA RGKGCLYLWS SATAIELSHG SNGWFRYILD GKLSTMYSSG SPEGGTDSSE SRSEFLDKLQ RSFMEASTTE GLASNLPRCS RKPGSLKLAA GNCVTHSATR RTNCWSLTQM DSRWATILKK DLSGFLFESN RGTRHAFTPE SLLSMDFLNR SADKKPAQPT RNKEEELKDK IRKLSRVLYE DYFTSKHQAL KSVVNDLKIL AQDIEQCTSV GTESSEQNNM FETSLTKLRT LLRDDKCVSA FDLYNSGLIQ SLLKALDQSE KSRRSLLTRV EAFKTTFCES NNPTVRLLVK KLISVLESIE RFPVQLYDSP TSFNRGIHLL SRKFSFKMDY QCCTTDSSLV DYTGRTLKME TLSSVDDLEQ YVLKMASKQW YDHEASTHAY VLQAKQGEIT FNYSGDFDEN GIVYWIGTNA KNESDWTNPA SHGLVHVTSS DEGGLPYGKL ADILSRDSIS CNCHTSDDEK AWLAIDFGLH IIPTMYTLRH SRGYSRSALR NWLFQASNDG QTWTTLITHR NDKSLNQPGS TASWPVSPES DETKGWRHFR IQQNGKNSSR HMTYLSISGF EIYGKVTGVS DEAPGAAYKK ERKTLKAQVT FCSNIQFKLV FFMSKVSTLL LYKEFNCDSI KVHVIITELL IRILIRINGC HDKQMVPGAR VVRGVDWKWR NQDSRGLGTV NSAIHNGWVD VTWDNGISNL YRMGAEDKFD VKLAPPRDDK LPSDPNSNIF SRRGVLSSLI RSNSVKTCKG TLPVMVNFSE IYISYTFSLR NSEASSVSRS RYNVSHKSYS QNSNEVGRNA RNRARSDSDD SDHQKSQRSV LSMMRAFHTR KEGKKSQKGS KNAPDTGKSE GGGVRSESAS SSSSYGDEVL YLDEDIEEEK KQERSCAPEL DGRTCRLGVY KSTPAPSLVQ RRIRKLLNEK FKCKSFPLKY VYSILTIYAT NVFKSNKYIY RRLQPTKLQS RNRNRNLQSQ EHCPMCGELN DLSAHTVDYI VSSVTSSSSE SLRNDPLTEI DVLNLLVHGQ QPPPEERKIA SSDATQKSAS LESVAGEVDK CVVSVYIYMQ GQLIHIPKFS IHNKPIPTQV SASVSVPNLS SSECASRMME SFVRSITRAP AILNVNDLSN IEEGSTGMGL GRDSTDAITY NSDPPLPGPL TTAQSVPNLS APVTVASSFS SPSIVGSNSV LQAITQALAT NGNHDSETGF SNKSSNSTSW DDDHVIKRNF PALIPAFDPR PGRLNLPQTV DLTIPAPGTD SKASTKALAP DQPRKVYFYL KGTNMAGEAV QIPLEKGRTM LDCVQQLVLN PCSSTKSSNL RKLWEAQYTI VYSETELDTN KVRLHLITLT SSTICTKEVH DLLELLRCLQ QNYCYDAGLT ADDLLCKKLS TKLYQQIEDV VCLACHALPE WCNGMMKRYP FLFNFDVRNK FFSSTAFGPA RSVVWMQNTS SSQFDRSGGS RNLGMSAVAS LARRDDPTGL HDLIQLGRLR HERVKVPRDD ATLLDWAVNV LDVHAEKKSI LEVEFMGEEG TGLGPTLEFY SLVAAELQRK DLAMWLVDDN FTHKPRDVSF ESVDEKKSDH YVQRPGGLFP APLPQDNIDE VVGLFGFLGT LLAKCLQDSR LIDLPLSTSF IKLICRESPI CPQSQVGICL THDLVSIDPA RHTFLAKLVA LSDERDEIMN NGSLSDAEKT HKVEGLLLDY NGTKCKVEDL GLTLQFLPTS SVYKFTSYPL VEGGERVDLT LQNARQYVDL TINFYFELGL RKQMAAFRDG FNRVFPITNL LSFTKDELHL KLCGDQTPQW TRDDVIAYTE PKLGFTKDST GFLHFVNVMC DLSGSERKSF LQFATGCSSL PPGGLANLSP HLTIVKKVDS GDGSYPSVNT CVHYLKLPDY SSEAILKERL LAATREKGFH LN // ID H2Z100_CIOSA Unreviewed; 2489 AA. AC H2Z100; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSCSAVP00000011262}; GN Name=Csa.1003 {ECO:0000313|Ensembl:ENSCSAVP00000011262}; OS Ciona savignyi (Pacific transparent sea squirt). OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Enterogona; OC Phlebobranchia; Cionidae; Ciona. OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000011262, ECO:0000313|Proteomes:UP000007875}; RN [1] {ECO:0000313|Ensembl:ENSCSAVP00000011262} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., RA Ait-zahra M., Allen N., Allen T., An P., Anderson M., Anderson S., RA Arachchi H., Armbruster J., Bachantsang P., Baldwin J., Barry A., RA Bayul T., Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., RA Borowsky M., Boukhgalter B., Brunache A., Butler J., Calixte N., RA Calvo S., Camarata J., Campo K., Chang J., Cheshatsang Y., Citroen M., RA Collymore A., Considine T., Cook A., Cooke P., Corum B., Cuomo C., RA David R., Dawoe T., Degray S., Dodge S., Dooley K., Dorje P., RA Dorjee K., Dorris L., Duffey N., Dupes A., Elkins T., Engels R., RA Erickson J., Farina A., Faro S., Ferreira P., Fischer H., RA Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G., Gnerre S., RA Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K., Hafez N., RA Hagopian D., Hagos B., Hall J., Hatcher B., Heller A., Higgins H., RA Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E., Iliev I., RA Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M., Karlsson E., RA Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E., Labutti K., RA Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T., RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., RA Lui A., Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., RA Manning J., Marabella R., Maru K., Matthews C., Mauceli E., RA Mccarthy M., Mcdonough S., Mcghee T., Meldrim J., Meneus L., RA Mesirov J., Mihalev A., Mihova T., Mikkelsen T., Mlenga V., Moru K., RA Mozes J., Mulrain L., Munson G., Naylor J., Newes C., Nguyen C., RA Nguyen N., Nguyen T., Nicol R., Nielsen C., Nizzari M., Norbu C., RA Norbu N., O'donnell P., Okoawo O., O'leary S., Omotosho B., RA O'neill K., Osman S., Parker S., Perrin D., Phunkhang P., Piqani B., RA Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V., Raymond C., RA Retta R., Richardson S., Rise C., Rodriguez J., Rogers J., Rogov P., RA Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T., RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C., RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., RA Stetson K., Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., RA Tenzing P., Tesfaye S., Theodore J., Thoulutsang Y., Topham K., RA Towey S., Tsamla T., Tsomo N., Vallee D., Vassiliev H., RA Venkataraman V., Vinson J., Vo A., Wade C., Wang S., Wangchuk T., RA Wangdi T., Whittaker C., Wilkinson J., Wu Y., Wyman D., Yadav S., RA Yang S., Yang X., Yeager S., Yee E., Young G., Zainoun J., Zembeck L., RA Zimmer A., Zody M., Lander E.; RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000011262} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSCSAVP00000011262}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR Ensembl; ENSCSAVT00000011394; ENSCSAVP00000011262; ENSCSAVG00000006589. DR GeneTree; ENSGT00530000063470; -. DR Proteomes; UP000007875; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 2. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007875}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007875}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 467 487 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2489 AA; 277159 MW; 7B52A0FE8B4A5677 CRC64; MADVDPDTLL EWLQTGVGQE RDMQLIALEQ LCMLLLMSDN VDRCFEMCPP RSFLPALCKI FLDETAPDNV VEVAARAMTY YLDVSAECTR RIVAVDGAVK AICNRLSLRL LDDRTNKDLS EQCVKVLEFI CTREPGAVFE AGGLSSVMKF ICNCGSIIHK DTLHSSMFVV SRLCGKMEVA SESLPECIQS LSSLLHYDDA HVADSALRCF SSLADRFTRK GVNPEPLDAY GLTDELIKRL GNCGNRVASV PLNKSLQSGG TPGATPDSKA NFGITTVVNL LCTLCRGSSE ITHKVLGSDL TKAIEDAMKG DERCCLDTMR ICDLLLILLF EGRQAIPKHY VGLFGAPRCL NLTNRRMENL DGDRSHRQLI DCIRSKDTEA LIDAVESGVY DVNFMDDVGQ TLLNWAAAFG TQEMVDFLCD RGADVNKGQR SSSLHYAACF GRPSVVKSLL LHSANTSLHD EEGKTALEKA RERNDEGHKE VVKLLEDPAV KKKQVMLQEQ SKDADKRLKD EEKSNLVKGD PPAVSMFINK LLPLFLDTYQ GAVYPTVAQS SMSLLHKTVK YVSEQQLADV ARTQQNIPEK FANVVSTAFD QEDNDEVHLT ALEIVQSLMD KCYDLFASSL NHDWIADKIR DLHAPGEDKK EEGAIGGEIK IKQEVSSSVE DATAMKPGSL YSWKKTWTFA RGKGCLYLWS SATAIELSHG SNGWFRYILD GKLSTMYSSG SPEGGTDSSE SRSEFLDKLQ RSFMEASTTE GLASNLPRCS RKPGSLKLAA GNHSLCNKED ELLVTNTDGQ QATILKKDLS GFLFESNRGT RHAFTPESLL SMDFLNRSAD KKPAQPTRNK EEELKDKIRK LSRVLYEDYF TSKHQALKSV VNDLKILAQD IEQCTSVGTE QNNMFETSLT KLRTLLRDDK CVSAFDLYNS GLIQSLLKAL VFIYFFLQDQ SEKSRRSLLT RVEAFKTTFC ESNNPTVRLL VKKLISVLES IERFPVQLYD SPTSFNRGIH LLSRKFSFKM DYQCCTTDSS LVDYTGRTLK METLSSVDDL EQYVLKMASK QWYDHEASTH AYVLQAKQGE ITFNYSGDFD ENGIVYWIGT NAKNESDWTN PASHGLVHVT SSDEGGLPYG KLADILSRDS ISCNCHTSDD EKAWLAIDFG LHIIPTMYTL RHSRGYSRSA LRNWLFQASN DGQTWTTLIT HRNDKSLNQP GSTASWPVSP ESDETKGWRH FRIQQNGKNS SRHMTYLSIS GFEIYGKVTG VSDEAPGAAY KKERKTLKAQ VTFCSNIHKQ MVPGARVVRG VDWKWRNQDS RGLGTVNSAI HNGWVDVTWD NGISNLYRMG AEDKFDVKLA PPRDDKLPSD PNSNIFSRRG VLSSLIRSNS YSQNSNEVGR NARNRARSDS DDSDHQKSQR SVLSMMRAFH TRKEGKKSQK GSKNAPDTGK SEGGGVRSES ASSSSSYGDE VLYLDEDIEE EKKQERSCAP ELDGRTCRLG VYKSTPAPSL KTSLRPGSET SHRPPTGFNA AAPHNSHLLS PGEPVSASVS VPNLSSSECA SRMMESFVRS ITRAPAILNV NDLSNIEEGS TGMGLGRDST DAVGPLTTAQ SVPNLSAPVT VASSFSSPSI VGSNSVLQAI TQALATNGNH DSETDLFRFV EELPTCDADS SSPPCDIFTE LDDENEEENE DYEEFDQIMV SLGAMLGLPP PSAMSSNKSS NSTSWDDDHV IKRNFPALIP AFDPRPGRLN LPQTVDLTIP APGTDSKAST KALAPDQPRK VYFYLKGTNM AGEAVQIPLE KGRTMLDCVQ QLVLNPCSST KSSNLRKLWE AQYTIVYSET ELDTNKGELW TPEYVANHLG SEKLPTAELV QYFKENANES FCSHWVESKS SDKDHLISAF KEFHWNPGNQ LKPSNHRATI CTKEVHDLLE LLRCLQQNYC YDAGLTADDL LCKKLSTKLY QQIEDVVCLA CHALPEWCNG MMKRYPFLFN FDVRNKFFSS TAFGPARSVV WMQNTSSSQF DRSAVASLAR RDDPTGLHDL IQLGRLRHER VKVPRDDATL LDWAVNVLDV HAEKKSILEV EFMGEEGTGL GPTLEFYSLV AAELQRKDLA MWLVDDNFTH KPRDVIESVD EKKSDHYVQR PGGLFPAPLP QDNIDEVVGL FGFLGTLLAK CLQDSRLIDL PLSTSFIKLI CRESPICPQS QVGICLTQLN EAITIFISCS IQYSVIKTCN ITMHLKYFFK TTKPTTLTFL NVYISHRRHT FLAKLVALSD ERDEIMNNGS LSDAEKTHKV EGLLLDYNGT KCKVEDLGLT LQFLPTSSVY KFTSYPLVEG GERVDLTLQN ARQYVDLTIN FYFELGLRKQ MAAFRDGFNR VFPITNLLSF TKDELHLKLC GDQTPQWTRD DVIAYTEPKL GFTKDSTGFL HFVNVMCDLS GSERKSFLQF ATGCSSLPPG GLANLSPHLT IVKKVDSGDG SYPSVNTCVH YLKLPDYSSE AILKERLLAA TREKGFHLN // ID H3A677_LATCH Unreviewed; 714 AA. AC H3A677; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLACP00000005148}; GN Name=SUN2 {ECO:0000313|Ensembl:ENSLACP00000005148}; OS Latimeria chalumnae (West Indian ocean coelacanth). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Coelacanthiformes; Coelacanthidae; Latimeria. OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000005148, ECO:0000313|Proteomes:UP000008672}; RN [1] {ECO:0000313|Ensembl:ENSLACP00000005148} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=9215903; RA Zardoya R., Meyer A.; RT "The complete DNA sequence of the mitochondrial genome of a 'living RT fossil,' the coelacanth (Latimeria chalumnae)."; RL Genetics 146:995-1010(1997). RN [2] {ECO:0000313|Ensembl:ENSLACP00000005148} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLACP00000005148}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AFYH01217950; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01217951; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01217952; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01217953; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01217954; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01217955; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01217956; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01217957; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01217958; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7897.ENSLACP00000005148; -. DR Ensembl; ENSLACT00000005194; ENSLACP00000005148; ENSLACG00000004576. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H3A677; -. DR OMA; ATERNEW; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008672; Unassembled WGS sequence. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008672}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008672}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 170 188 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 222 252 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 287 307 {ECO:0000256|SAM:Coils}. FT COILED 372 392 {ECO:0000256|SAM:Coils}. FT COILED 402 429 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 714 AA; 80571 MW; B39797A08F289D8D CRC64; MSRRSRRLQP YYSADDDGTS TSSMGSTSGN VISYKETPVK IFKRKSSSKR SSPVRRTPPP PRAKGSISNE SFSSLFSNVS DSSTLPLQSD QRTGGDYYWV RQSAGKQRNG SQPNRSNGFL PSEATSPSYD SYCASSGYSS AEEEYAGSDC TGYDTSPVGQ MKGFLAQIGY CLQMLFLNPV QAFLLFYWWL GTSWYRLTAT VAAHNVFILG RSVSRTALLV CFFFFLVLLL LLLLSLLLLV FFFFFTGIWL WYPSALPWTQ KHLSEPQVQP NKQHSQHHTE RSILDKVAAL EQSVRQLSLE LKQQKEEGCS RIEKAEGGAR LSREDAISAV EEFLNKRHQA MKEELMRDGE TYTQQKLSNY HQKHHKEFAD ALATLMQKSE DLQTKMSQLS SKAKSQFSAE EREKLLLTVV NLEERLDSMK AEMNGVQAKQ HELMLQLDSF PNSIQSIRDD VDSRMSAAIR QLLQDAMGLA TSFVLQEDLQ RILQELEGSL RREVSTQGSS FRADVVGETL EAAGITEVSM EVVSRIVDRA LRLYSEDRIG KVDYALESAE GASVISTRCS ETFETKTALL SLFGIPLWYH SQSPRVVLQP DVYPGNCWAF RGSQGFLVIH LASRIRPTAF TLEHIPKSVS PGGTITSAPR DFAVFGLEDE SEEHGISLGQ FMYDQDLQPI QSLFLFQGEH FRSYQVVELR IFSNWGHPEY TCIYRFRVHG EPAK // ID H3AHB7_LATCH Unreviewed; 2580 AA. AC H3AHB7; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLACP00000009038}; GN Name=HECTD1 {ECO:0000313|Ensembl:ENSLACP00000009038}; OS Latimeria chalumnae (West Indian ocean coelacanth). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Coelacanthiformes; Coelacanthidae; Latimeria. OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000009038, ECO:0000313|Proteomes:UP000008672}; RN [1] {ECO:0000313|Ensembl:ENSLACP00000009038} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=9215903; RA Zardoya R., Meyer A.; RT "The complete DNA sequence of the mitochondrial genome of a 'living RT fossil,' the coelacanth (Latimeria chalumnae)."; RL Genetics 146:995-1010(1997). RN [2] {ECO:0000313|Ensembl:ENSLACP00000009038} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLACP00000009038}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AFYH01171133; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01171134; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01171135; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01171136; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01171137; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01171138; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01171139; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01171140; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01171141; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01171142; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7897.ENSLACP00000009038; -. DR Ensembl; ENSLACT00000009107; ENSLACP00000009038; ENSLACG00000007980. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H3AHB7; -. DR OMA; NRQCIEG; -. DR TreeFam; TF323674; -. DR Proteomes; UP000008672; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008672}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008672}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1209 1236 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2580 AA; 286149 MW; 068E12F1B2BB4E38 CRC64; MSDNVDRCFE TCPPRTFLPA LCKIFLDESA PDNVLEVTAR AITYYLDVSA ECTRRIVGVD GAIKALCNRL VVVELNNRTS RDLAEQCVKV LELICTRESG AVFEAGGLNC VLTFIRDSGH LVHKDTLHSA MAVVSRLCGK MEPQDSSLET CVESLSSLLK HEDHQVSDGA LRCFASLADR FTRRGVDPAP LAKHGLTEEL LSRMAAAGGT VTGPSLACKS GRSTTGAPST TPDSKLSNQV STIVSLLSTL CRGSPVVTHD LLRSELPDSI ESALQGDERC VLDTMRLVDL LLVLLFEGRK ALPKSSAGST GRIPGLRRLD SSGERSHRQL IDCIRSKDTD ALIDAIDTGA FEVNFMDDVG QTLLNWASAF GTQEMVEFLC ERGADVNRGQ RSSSLHYAAC FGRPQVAKTL LRHGANPDLR DEDGKTPLDK ARERGHSEVV AILQSPGDWM CPVNKGDDKK KKDANKEEEE TNEPKGDPEM APIYLKRLLP VFAQTFQQTM LPSIRKASLA LIRKMIHYCS EALLKEVCDS DAGHNLPTVL VEITATVLDQ EDDDDGHLLA LQIIRDLVDK GGDLFLDQLA RLGVINKVST LAGPSSDDEN EEESKPEKED EPQEDAKELQ MGKPYHWRDW SIIRGRDCLY IWSDAAALEL SNGSNGWFRF ILDGKLATMY SSGSPEGGSD SSESRSEFLE KLQKARSQVK PTTTSQPILS TTGPTKLTVG NWSLTCLKEG EIAIHNSDGQ QATILREDLP GFVFESNRGT KHSFTAETSL GSEFVTGWTG KRGRKLKSKL EKTKQKVRTM ARDLYDDHFK AVESMPRGVV VTLRNIATQL ESSWELHTNR QYIEGENTWR DLMKTALENL IVLLKDENTI SPYEMCSSGL VQALLTVLNN TVDLDVKQDC SQLVERINVF KTAFSENEDD DSCPAVALIR KLIAVLESIE RLPLHLYDTP GSTYNLQILT RRLRFRLERA PGETALIDRT GRMLKMEPLA TVESLEQYLL KMVAKQWYDF DRSSFVFVRK LREGQIFTFR HQHDFDENGI VYWIGTNAKT AYEWVNPAAY GLAVVTSSEG RNLPYGRLED ILSRDSSALN CHTNDDKNAW FAIDLGLWVI PSAYTLRHAR GYGRSALRNW VFQVSKDGQN WTTMYTHVDD CSLNEPGSTA TWPLEPSKDE KQGWRHIRIK QMGKNASGQT HYLSLSGFEL YGTVIGVCED QLGKAAKEAE ANLRRQRRLV RSQAQKYMVQ GARVIRGIDW KWRDQDGNPP GEGSVTGELH NGWIDVTWDA GGSNSYRMGA EGKFDLKLAP GYDPDSAPSP KPVSSTVSGT TQGWSGSVKN NCPDKTSVAG AGSSSRKGSS SSVCSVASSS DISLSSTKIE RRSESVVEQN TTTSTENHEP IVVLSTAETV PQAEVGSASS ASTSTLTAET GSESVDRKLG PDSSIRTAGE SGAISMGIVS VSSPDVSSVS ELSNKETASQ RPLSSSASNR LSVSSLLAAG APMSSSASVP NLSSRETSSL ESFVRRVANI ARTNATNNMN LSRSSSDNNT NTLGRNVMST AASPLMGAQS FPNLTTTGTT STVTMSTSSV TSSNNVVTAT TSLSVGQSLS NTLTTSLTST SSESDTGQEA EYSLYDFLDS CRASTLLAEL DDEEDLPEPD EEDDENEDDN QEDQEYEEVM LLLLPSLKIS LDFKSYLKHF YYLQILPTIV QVLYVAEPKV LPEQEEEEYE IKGGRRRTWD DDFVLKRQFS ALVPAFDPRP GRTNVQQTTD LEIPPPGTPC SELLEEVECA PSPHLALILK VAGLGATREV ELPLANYRST IFYYVQKLLQ LSCNGNVKSD KLRRIWEPTY TIMYREMKDS DKEKESGKTG FWSVEHVEQY LGTDELPKND LITYLQKNAD SAFLRHWKLT GTNKSIRKNR NCSQLIAAYK DFCEHGSKSS GLSHGSHSTL HSCDILIAAR EQPQAKAGSG QNACGVEDVL QLLRILYIIA SDPYSTRTAQ EEGEEQLQFN VSAEEFTSKK ITTKILQQIE EPLALASAAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATMERTR TTTTVRRDDP GEFRVGRLKH ERVKVPRGET LMEWAENVMQ VHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRKEL GIWLCDDDFP DDESRQVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYDS RGDRDFHYTE SQSEASTEEG HDSLSVGSLD EDSKSEFILD PPKPKPPAWF HGILTWEDFE LVNLHRARFL KEIKELAVKR RQILSNKALS EDEKNTKLQD LMLKNPSGSG PPLSIEDLGL NFQFCPSSKV HGFAAVDLKP NGEDEVVTID NAEEYVELMF DFCMHTGIQK QMGAFRDGFN RVFPMEKLSS FSHEEVQMIL CGNQSPSWTA EDITNYTEPK LGYTRDSPGF MRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TIVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID H3B3L7_LATCH Unreviewed; 1247 AA. AC H3B3L7; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLACP00000016488}; GN Name=SUCO {ECO:0000313|Ensembl:ENSLACP00000016488}; OS Latimeria chalumnae (West Indian ocean coelacanth). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Coelacanthiformes; Coelacanthidae; Latimeria. OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000016488, ECO:0000313|Proteomes:UP000008672}; RN [1] {ECO:0000313|Ensembl:ENSLACP00000016488} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=9215903; RA Zardoya R., Meyer A.; RT "The complete DNA sequence of the mitochondrial genome of a 'living RT fossil,' the coelacanth (Latimeria chalumnae)."; RL Genetics 146:995-1010(1997). RN [2] {ECO:0000313|Ensembl:ENSLACP00000016488} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLACP00000016488}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AFYH01093907; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093908; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093909; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093910; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093911; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093912; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093913; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093914; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093915; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093916; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_005998588.1; XM_005998526.1. DR STRING; 7897.ENSLACP00000016488; -. DR Ensembl; ENSLACT00000016602; ENSLACP00000016488; ENSLACG00000014529. DR GeneID; 102365149; -. DR KEGG; lcm:102365149; -. DR CTD; 51430; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; H3B3L7; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000008672; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008672}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008672}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 28 {ECO:0000256|SAM:SignalP}. FT CHAIN 29 1247 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003580886. FT TRANSMEM 1006 1024 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 930 950 {ECO:0000256|SAM:Coils}. FT COILED 980 1000 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1247 AA; 138760 MW; 1EB1F20FC92D82D1 CRC64; MMKERLSCLV VYLLISVLYW CPDRYVCCKE DSPSTQFEQE DTIFENQDEQ TNKKGEGDKS VQAEPLNDVE LKTTNHLGED NLLDCTKVDD QNVKVEEPSA SGSETESLVN VDESTNLAAE IENISRSATS EISSVSQSSA IENSSAGFPV VSSSEAEQLE PDCDTGGVQE ADYLSEPPSL VNPSKSLSGD HMESVLSSHD KESTVSNFSH KLPATQKTPD KQKANTSQST KGEADQTKPI DSKTTTSPKD PGDIPTFDEW KKQVMEVEKE KSQSMHPSSN GGSSSAKKVQ KNRNNYASVE CGAKILASNP EAKSTSAILI ENMDLYMLNP CSAKIWFVIE LCEPIQVKQL DIANYELFSS TPKDFLVSIS DRYPTSKWVK LGTFHAREER TVQSFPLDEQ MYAKYVKMFI KYIKVELISH FGSEHFCPLS LIRVFGTSMV EEYEEIADSQ YNIERQEPYD EDYDYLIDYN AREDKSSKNL LGSATNAILN MVNIAANMLG AKTDFEETSE AKVNESVPSE NVTSTTQMSE QTPLPTPVLE AADVTLSTTT EIGDTDMKPS DLETQTESPI VQLVQEYEDE TSLSTITLLE HEEEEEEKLA WYDMETQIYC SKLDISCVSS FSEYIHKWCL VIVTHHRLRS STTTYKPIRN YSTQTTLPPI TSTVQMDLTN DSIEIAVPDK LESTTAQSEL LTQSVTTPVR DSLFNRSTEL ELEPSQFSAV PATNTSDSLP DPKPTSSSLV SETTSKPVEQ ISTHSSGPEK KTEIQMESVK KTVDVHSPSS VTDPVSETKL DSTRELMETQ LVIEATETTS LGDQTLTEVE NETTGSKETV LEPHKPVSKP SELDPMESPQ VPEGKEEEQA AEELLLTVPS SGGLQRTVTD IYAELQNSVE LGNINGNQVH GSNQKESVFM RLNNRIKALE MNMSLSSRYL EELSQRYRKQ MEEMQKAFNK TIIKLQNTSR IAEEQDQRQT EAIQLLQAQL ANVTLLASNL SATVAELKRE VSDRQSYLVI ALVLCVVLGL MVCVQRCRTT PRSHKDYQPL PKSNHYPSPK RCFSSYDDMN LKRRTSCPLI RSQSCHTTST EVGPDDLYIV EPLKFSPEKK KKRCRIKTEK IETIKPTASF LPVVNGGIKI NNPLTNHNDF SGMGEVYSSS YKGPPSEGSS EASSQSEESY FCGISACTSI CNGQTQKTKS EKRALKRRRS KPQDRGRLIQ DLIQTKSGSM PSLHEIMKGN KEITVGSFGV TAVSGHV // ID H3B3L8_LATCH Unreviewed; 1105 AA. AC H3B3L8; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLACP00000016489}; GN Name=SUCO {ECO:0000313|Ensembl:ENSLACP00000016489}; OS Latimeria chalumnae (West Indian ocean coelacanth). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Coelacanthiformes; Coelacanthidae; Latimeria. OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000016489, ECO:0000313|Proteomes:UP000008672}; RN [1] {ECO:0000313|Ensembl:ENSLACP00000016489} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=9215903; RA Zardoya R., Meyer A.; RT "The complete DNA sequence of the mitochondrial genome of a 'living RT fossil,' the coelacanth (Latimeria chalumnae)."; RL Genetics 146:995-1010(1997). RN [2] {ECO:0000313|Ensembl:ENSLACP00000016489} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLACP00000016489}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AFYH01093907; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093908; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093909; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093910; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093911; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093912; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093913; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093914; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093915; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01093916; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7897.ENSLACP00000016488; -. DR Ensembl; ENSLACT00000016603; ENSLACP00000016489; ENSLACG00000014529. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000008672; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008672}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008672}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 864 882 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 788 808 {ECO:0000256|SAM:Coils}. FT COILED 838 858 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1105 AA; 123005 MW; FE4BD81B2955B5C8 CRC64; SAGFPVVSSS EAEQLEPDCD TGGVQEADYL SEPPSLVNPS KSLSGDHMES VLSSHDKEST VSNFSHKLPA TQKTPDKQKA NTSQSTKGEK ADQTKPIDSK TTTSPKDPGD IPTFDEWKKQ VMEVEKEKSQ SMHPSSNGGS SSAKKVQKNR NNYASVECGA KILASNPEAK STSAILIENM DLYMLNPCSA KIWFVIELCE PIQVKQLDIA NYELFSSTPK DFLVSISDSR YPTSKWVKLG TFHAREERTV QSFPLDEQMY AKYVKMFIKY IKVELISHFG SEHFCPLSLI RVFGTSMVEE YEEIADSQYN IERQEPYDED YDYLIDYNAR EDKSSKNLLG SATNAILNMV NIAANMLGAK TDFEETSEAK VNESVPSENV TSTTQMSEQT PLPTPVLEAA DVTLSTTTEI GDTDMKPSDL ETQTESPIVQ LVQEYEDETS LSTITLLEHE EEEEEKLAWY DMETQIYCSK LDISCVSSFS EYIHKWCLVI VTHHRLRSST TTYKPIRNYS TQTTLPPITS TVQMDLTNDS IEIAVPDKLE STTAQSELLT QSVTTPVRDS LFNRSTELEL EPSQFSAVPA TNTSDSLPDP KPTSSSLVSE TTSKPVEQIS THSSGPEKKT EIQMESVKKT VDVHSPSSVT DPVSETKLDS TRELMETQLV IEATETTSLG DQTLTEVENE TTGSKETVLE PHKPVSKPSE LDPMESPQVP EGKEEEQAAE ELLLTVPSSG GLQRTVTDIY AELQNSVELG NINGNQVHGS NQKESVFMRL NNRIKALEMN MSLSSRYLEE LSQRYRKQME EMQKAFNKTI IKLQNTSRIA EEQDQRQTEA IQLLQAQLAN VTLLASNLSA TVAELKREVS DRQSYLVIAL VLCVVLGLMV CVQRCRTTPR SHKDYQPLPK SNHYPSPKRC FSSYDDMNLK RRTSCPLIRS QSCHTTSTEV GPDDLYIVEP LKFSPEKKKK RCRIKTEKIE TIKPTASFLP VVNGGIKINN PLTNHNDFSG MGEVYSSSYK GPPSEGSSEA SSQSEESYFC GISACTSICN GQTQKTKSEK RALKRRRSKP QDRGRLIQDL IQTKSGSMPS LHEIMKGNKE ITVGSFGVTA VSGHV // ID H3B4W3_LATCH Unreviewed; 923 AA. AC H3B4W3; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLACP00000016934}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSLACP00000016934}; OS Latimeria chalumnae (West Indian ocean coelacanth). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Coelacanthiformes; Coelacanthidae; Latimeria. OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000016934, ECO:0000313|Proteomes:UP000008672}; RN [1] {ECO:0000313|Ensembl:ENSLACP00000016934} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=9215903; RA Zardoya R., Meyer A.; RT "The complete DNA sequence of the mitochondrial genome of a 'living RT fossil,' the coelacanth (Latimeria chalumnae)."; RL Genetics 146:995-1010(1997). RN [2] {ECO:0000313|Ensembl:ENSLACP00000016934} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLACP00000016934}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AFYH01079163; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01079164; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01079165; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_005997024.1; XM_005996962.1. DR STRING; 7897.ENSLACP00000016934; -. DR Ensembl; ENSLACT00000017053; ENSLACP00000016934; ENSLACG00000014915. DR GeneID; 102354065; -. DR KEGG; lcm:102354065; -. DR CTD; 23353; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H3B4W3; -. DR KO; K19347; -. DR OMA; MKLNYES; -. DR TreeFam; TF323915; -. DR Proteomes; UP000008672; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008672}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008672}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 331 355 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 386 404 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 411 428 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 483 503 {ECO:0000256|SAM:Coils}. FT COILED 563 590 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 923 AA; 103924 MW; FA9330E4E5E7056D CRC64; MDFSRLHTYA PPQYVPENTG YTYALSSSYS SDALDFEVKH QIDPVYDSPR MSRRGLRLVT TGYYTTEDAF NDSITSSSSH TGNISSYKEK SSKSVRQHRT ASNQSAVTHL TSRKATSSSS FLSQSSFHSH ASGMSMVSTV LDESAIQERT EVGHIWGLDD DGDDGLQKGG GDATVIKANG GMLLTESQST LNGYKCNDCS MLSERKDVLT AQSTPYMASS VVYSRDRQKH KSRGVHVYLN RILHLPKYAA TSLASLFVQL FQTVLWKSDF ESKAHSCYCG SMNVKQFIDG DGHLTLNGES LCDDCKGMKH LENYTAVHAQ STKSRRVART FWHIISYAGY FLLQAVQSVG SAGWFVTRKV LAVLWLAIVS PGKAASGAFW WLGTGWYQLI TLMSLFNVFL LTRCLPKVCK LFLFLIPLLL LLVGLWYLNP STLLSLLPVF NRTEVQKVPP LDEPIASFGA QQIYSGSGPP PESSTEFFDF SRMTELEKQM ALLSDRCQQS GQQYDAQYSR IMLLLEKLQQ QVAQTDDQER MSVLISTLVN QHLKEVKLGG MDLTQQNDFM AWHQDHESRI RELEELLRKL SVRSEEVSMD LKMAKASTTS ENDEQNRHLL AEVNRLDLEF NRIKSELLAV QSLKTTCEKI DTIHETVDAQ VKESVKMFVF GDKQYDVPES ILEWLSTQCV SKNDFQSVLQ DLEMRILKNI TLYRAEFKQM PTAEVVTGAI TNVGITGITE EQARVIVKNA LNLYSQDKTG MVDFAMESGG GSILSTRCSE TYETKTALMS LFGIPLWYFS QSPRVVIQPD IHPGNCWAFK GSQGYLVVRL SLLIYPTAFT LEHIPKALSP TGNISSAPKD FTVYGLEDEY QEEGKLLGQY TYNEEGESLQ TFYVLEETDK AYQIVELRIL SNWGHPEYTC VYRFRVHGNP LKK // ID H3B4W4_LATCH Unreviewed; 546 AA. AC H3B4W4; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLACP00000016935}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSLACP00000016935}; OS Latimeria chalumnae (West Indian ocean coelacanth). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Coelacanthiformes; Coelacanthidae; Latimeria. OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000016935, ECO:0000313|Proteomes:UP000008672}; RN [1] {ECO:0000313|Ensembl:ENSLACP00000016935} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=9215903; RA Zardoya R., Meyer A.; RT "The complete DNA sequence of the mitochondrial genome of a 'living RT fossil,' the coelacanth (Latimeria chalumnae)."; RL Genetics 146:995-1010(1997). RN [2] {ECO:0000313|Ensembl:ENSLACP00000016935} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLACP00000016935}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AFYH01079163; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01079164; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01079165; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7897.ENSLACP00000016934; -. DR Ensembl; ENSLACT00000017054; ENSLACP00000016935; ENSLACG00000014915. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000008672; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008672}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008672}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 28 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 40 62 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 115 135 {ECO:0000256|SAM:Coils}. FT COILED 194 221 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 546 AA; 62036 MW; B81E2736E4C1423F CRC64; GKAASGAFWW LGTGWYQLIT LMSLFNVFLL TRCLPKVCKL FLFLIPLLLL LVVVGLMFYG VVGLHNALLP VFNRTEVQKV PPLDEPIASF GAQQIYSGSG PPPESSTEFF DFSRMTELEK QMALLSDRCQ QSGQQYDAQY SRIMLLLEKL QQQVAQTDDQ ERMSVLISTL VNQHLKEVKL GGMDLTQNDF MAWHQDHESR IRELEELLRK LSVRSEEVSM DLKMAKASTT SENDEQNRHL LAEVNRLDLE FNRIKSELLA VQSLKTTCEK VDAQVKESVK MFVFGDKQYD VPESILEWLS TQCVSKNDFQ SVLQDLEMRI LKNITLYRAE FKQMPTAEVV TGAITNVGIT GITEEQARVI VKNALNLYSQ DKTGMVDFAM ESGGGSILST RCSETYETKT ALMSLFGIPL WYFSQSPRVV IQPDIHPGNC WAFKGSQGYL VVRLSLLIYP TAFTLEHIPK ALSPTGNISS APKDFTVYGL EDEYQEEGKL LGQYTYNEEG ESLQTFYVLE ETDKAYQIVE LRILSNWGHP EYTCVYRFRV HGNPLK // ID H3B4W5_LATCH Unreviewed; 545 AA. AC H3B4W5; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSLACP00000016936}; GN Name=SUN1 {ECO:0000313|Ensembl:ENSLACP00000016936}; OS Latimeria chalumnae (West Indian ocean coelacanth). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Coelacanthiformes; Coelacanthidae; Latimeria. OX NCBI_TaxID=7897 {ECO:0000313|Ensembl:ENSLACP00000016936, ECO:0000313|Proteomes:UP000008672}; RN [1] {ECO:0000313|Ensembl:ENSLACP00000016936} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=9215903; RA Zardoya R., Meyer A.; RT "The complete DNA sequence of the mitochondrial genome of a 'living RT fossil,' the coelacanth (Latimeria chalumnae)."; RL Genetics 146:995-1010(1997). RN [2] {ECO:0000313|Ensembl:ENSLACP00000016936} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSLACP00000016936}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AFYH01079163; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01079164; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AFYH01079165; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7897.ENSLACP00000016934; -. DR Ensembl; ENSLACT00000017055; ENSLACP00000016936; ENSLACG00000014915. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000008672; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008672}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008672}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 6 27 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 39 59 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 114 134 {ECO:0000256|SAM:Coils}. FT COILED 193 220 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 545 AA; 62136 MW; C89DC8BB0F395328 CRC64; KAASGAFWWL GTGWYQLITL MSLFNVFLLT RCLPKVCKLF LFLIPLLLLL GKNVSLWYLN PSTLLSLLPV FNRTEVQKVP PLDEPIASFG AQQIYSGSGP PPESSTEFFD FSRMTELEKQ MALLSDRCQQ SGQQYDAQYS RIMLLLEKLQ QQVAQTDDQE RMSVLISTLV NQHLKEWGLI LDVASQNDFM AWHQDHESRI RELEELLRKL SVRSEEVSMD LKMAKASTTS ENDEQNRHLL AEVNRLDLEF NRIKSELLAV QSLKTTCEKV DAQVKESVKM FVFGDKQYDV PESILEWLST QCVSKNDFQS VLQDLEMRIL KNITLYRAEF KQMPTAEVVT GAITNVGITG ITEEQARVIV KNALNLYSQD KTGMVDFAME SGGGSILSTR CSETYETKTA LMSLFGIPLW YFSQSPRVVI QPDIHPGNCW AFKGSQGYLV VRLSLLIYPT AFTLEHIPKA LSPTGNISSA PKDFTVYGLE DEYQEEGKLL GQYTYNEEGE SLQTFYVLEE TDKAYQIVEL RILSNWGHPE YTCVYRFRVH GNPLK // ID H3BVY6_TETNG Unreviewed; 1042 AA. AC H3BVY6; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000000148}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000000148, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000000148} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000000148} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000000148}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01013708; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000008952; -. DR Ensembl; ENSTNIT00000001535; ENSTNIP00000000148; ENSTNIG00000006204. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}. FT COILED 344 364 {ECO:0000256|SAM:Coils}. FT COILED 721 741 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1042 AA; 114446 MW; 49CDB4F7569D614A CRC64; PELALSLSPN GCLYNPSNYY SQAGDVTDPS VPSKEDIPTF DEWKKQVMEV EMEKSQSLYT STTGSPHSAK KVQKSFKNNY ASVECGAKIL AANSEAKSTS AILKENMDLY MLNPCSNKIW FVIELCEPIQ VKQLDIANFE LFSSTPKDFL VSISDRYPTN KWVKLGTFHA RDERIVQSFP LDEQLFAKYL KIELLSHFGS EHFCPLSLIR VFGTSMVEEY EEIADSQYLS ERMEYLDEDY DYPPGYQLAE DNPNGSKNLL GSATNAILNM VNNIAANVLG ATPELEGGTE SEGTAVSCFP RSCFSSEQAS PVDSSHTIGN TTTGGDKKES TEAFPDSALR QSTVTLMEEE GEEEEDRRQE ETRDADRNQS DSHIYCPLFS SLSLSCMASL PELLHRWCSA RLAKERLRSL RRRQLGIQTH THPAPNTPSP IHTPLLIPVP APTPVTEELS QTETVLKLEV PLMPQNDVKM AEVHIAQPNT PDTHTPELNV LLEPSRTVIP THGFSDTQSF SVGLTSTNEV KVLPPVKEVA QATVSTPPLQ VASIPETQPA VVASPTLTVS ISSQSLGSDV ASSSDAAPPV SEQPVKPPLK ASRPEPVVAP LGELPTVLPV ADIHTDRPAA DPSKEQLDPV MQGGDPQRVD DVTDEDLLSS GGNGNVQRTA TDFYAELQNG GESNAGAANG NGMLLNGGAV HGSSQKESVF MRLNNRIKAL EMNMSLSSRY LEELSQRYRK QMEEMQRAFN KTIIKLQNTS RIAQEQDQKQ TDSIQVLQSQ LVNITKLMLN LTTTVGQLQR EVSDRQSYLV VSLVLCLFLG LLLFLQCCCR SSPSTSSDTA PIPRSNHYPS PKRCFSSYDD MNLKRRMTCP IIHSNSLPLC CSEVGPDDLY IVEPLKFSPE KKKKRKSKSL DKVDLLKEYY PPAPLINGAP KCNGFHPCLS LQPLLEEVSS PSKESPSEPS SSPVNSEESH TSGLALQTAA YMSASQCNGH GLTLSMQQLA TMSRQEKRSL KRRKSRPAEM PFSAVPSLQQ LIKGNKEISV GTIGVTAVTG HF // ID H3C0L4_TETNG Unreviewed; 837 AA. AC H3C0L4; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000001780}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000001780, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000001780} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000001780} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000001780}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01015037; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR Ensembl; ENSTNIT00000003817; ENSTNIP00000001780; ENSTNIG00000017972. DR GeneTree; ENSGT00390000011587; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 290 313 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 320 338 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 479 506 {ECO:0000256|SAM:Coils}. FT COILED 516 536 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 837 AA; 92795 MW; 367D742DBDB0AB3C CRC64; MDFSQLHTYT PPQCAPENTG YTYSLSSSYS TAALEFEQEH QIAAVYESPR MSRRSLRLQA CALRTLRNKK QQSGSGGLSL SLSQAATPRK TLSFSAVNTP VNSGIFQESS TATDAALFTG LDESHLRQRT VTTTNTFTCV DGQAGRRICS DHSSGVNGDT SASKAHASLT NGYICKDCSF PSQKIDNILS SSSSSSPFTS VYSRDRSRRS KTGDDCKGKQ HAETHSSLLS QSSRLHRMAG TLWGHCLLWP GHCVLRSGKV LGCGAVRALR SLLSLLWMFL TAPVKAGRGL LWFLATGWYQ LVSLMTVLNV FFLTQCLPRL WRLLLLLLPF LLLLAVNLTE WRPISPLTLW YNLVPASASV SAPETPIRQT PATPASETPS MLPPVALSGA DLERLAHIER QLALLGAQLK QTDHKQDERH GNILELYNSL KDQLHTRTDR ESLGGSFNIH CISPPVCAAQ SEEQQESQQR GQATRLAEIE VLLNTLAAKT QEVQQKQKQF EQEKRESTRA VKQEDHAALL LDVQRLEAEL GKIRQDLQAV VGCRGKCEQL DTLKDTVSAQ VRKELQTLFF GSGGTGELPE SLLHWLSQRY VSSPDLQALL ASLEMSILRN VSLQLEHSRV STLGEAESQV SGAVQHTAAT EGLPEEQVKI IVQNALRLYS QDRTGLVDYA LESGGGSILS TRCSETYETK TALMSLFGLP LWYFSQSPRV VIQPDVYPGN CWAFKGSQGY LVIRLSLKIV PTSFCLEHIP RTLSPTGNIT SAPRDFTVFG LDDEYQEEGK LLGQYTYQED GDALQMFPVQ EQNDKSFQII EMRVLSNWGH QEYTCLYRFR VHGNPQL // ID H3C7S9_TETNG Unreviewed; 183 AA. AC H3C7S9; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000004301}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000004301, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000004301} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000004301} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000004301}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01006467; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000004301; -. DR Ensembl; ENSTNIT00000004439; ENSTNIP00000004301; ENSTNIG00000001916. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H3C7S9; -. DR OMA; NIMMHIE; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}. SQ SEQUENCE 183 AA; 20660 MW; 4F933858F14C0FD1 CRC64; NIMMHIEKLR TELNDVKRKL NHQLPDPNFW TNFALESHGA KVYKKLGIQI FSKVGPASVI QGQHPPIPGN CWSFPGSHGN LFIELSHMVT VSHVTLDHVS SSVVPADTIS SAPRQFSVYG RQRLDDRAVH LGKFTYDLEG NPTQTFAVKV YDTIAFKYID LQIDSNYGHA DYTCFYGFRV HGL // ID H3CA85_TETNG Unreviewed; 2585 AA. AC H3CA85; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000005157}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000005157, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000005157} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000005157} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000005157}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01007089; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000005157; -. DR Ensembl; ENSTNIT00000005303; ENSTNIP00000005157; ENSTNIG00000002605. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR InParanoid; H3CA85; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 2. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 820 840 {ECO:0000256|SAM:Coils}. FT COILED 1248 1268 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2585 AA; 285137 MW; 9F3BB1636D2CEE36 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLSF IRDSGHLVHK DTLHSAMAVV SRLCSKMEPQ DPSLETCVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLCRM AAAGGAASGP PSSCKPGRAS TGAGPPAPDS KLSNQVSTIV SLLSTLCRGS PLVTHDLLRS ALPDSMESAL GGDERCVLDT MRLVDLLLVL LFEGRKALPK STAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDV NKEEEEGSEP KGDPEMAPVY LKRLLPVFAQ TFQQTMLPSI RRKASLALIR KMVHYSSEVL LREVCDSESG HNLPTVLVEI TATVLDQEDD DDGHLLALQI IRDLVDKGGD VFLDQLARLG VINKVSTLAG PASDDENEDE SKPEKEEEVQ EDAREIQQGK PYHWKDWSII RGRDCLYIWS DAAALELSNG SNGWFRFILD GKLATMYSSG SPEGGSDSSE SRSEFLEKLQ RARSQVKPVT SSQPILSTVA PTKLTVGNWS LTCLKEGEIA IHNSDGQQAT ILKEDLPGFV FESNRGTKHS FTAETSLGSE FVTGWTGKRG RKLKSKLEKT KQKVKSMARE LYDDHFKAVE SMPRGVVVTL RNISTQLESA WELHTNRQQC VEGENTWRDL MKTALENLIV VLKDENTISP YEMCSSGLVQ ALFTVLNNVS VPRLLLHPQG PLMERINVFK AAFSENEDNE SRPAVALIRK LIAVLESIER LPLHLYDTPG SLYNLQILTR RLRFRLERAP GETALIDRTG RMLKMEPLAT VESLEQYLLK MVAKQWYDFE RSSFVFVRKL REGQSFTFRH QHDFDENGII YWVGTNAKTA YEWVNPAAYG LVVVTSSEGR NLPYGRLEDI LSRDSSALNC HTNDDKNAWF AVDLGLWVLP SAYTLRHARG YGRSALRNWV FQVSKDGQNW TTLYTHVDDC SLNEPGSTAT WPLDPSKEEK QGWRHIRIKQ MGKNASGQTH YLSLSGLELY GTVTAVCEDQ LGKAVKEAEA NLRRQRRLFR SQVMKYIVPG ARVVRGIDWK WRDQDGNPPG EGTVTGEAHN GWIDVTWDAG GSNSYRMGAE GKFDLKLAPG YDPESAATAP SPKPVSSTVS GPSSMQQQQS WSSLVKNNCL DTPLGGASSS SRKGSSSSVC SVASSSDISL SSSMGLMGVG GLRLEKRAEG LLLDQGVGVG VGTGGGVGSD VQQLEPIVVL SSVVDSGSGS ASSSGTLPTD AAAPGDESRS KDSGTDPATA ISMGLVSVSS PDVSSVSESS GKDAPSQRPL CSATNARLSV SSLLAGAPMS SSASVPNLSS REASLMESFV RRAPNMSRTN ATNNMNLSRS SSDNNTNTLG RNALTTATSL MGAQSFPNLT TTGTTSTVTM STSIVTSSNN VATATTGLSV GQLLSNTLTT SLTSTSSESD TGQEAEFSLY DFLDSCRANT LAELDDEEDL PEPDDDDDEN EDDNQEDQEY EEVLVIQPSL AFWSGTGSDV TVQEEEEYET KGGRRRTWDD DFVLKRQFSA LVPAFDPRPG RTNVQQTTDL EIPPPGRSPR SEVQEEVECA PSPHLSLTLK VAGLGTSREV ELPLSNYKST IFFYVQRLLQ LSCSGAVKTD KLRRIWEPTY TIMYRELKDA DKEKESAKTV RRVWALSHPL TCCVLQDVCE HGTGFSARSG VLSPGSLLAS QSGEILGVAR EMAQAKAGCS QNACGVEDVL QLLRILYIIG GDSASNTRTM QEDFEELQFN ASPEEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTAK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATMERSR PSTTVRRDDP GEFRVGRLKH ERVKVPRGEA MMEWAESVMQ LHADRKSVLE VEFQGEEGTG LGPTLEFYAL VAAEFQRTSL GIWLCDDDFP DDESRQVDLG GGLKPPGFYV QRSCGLFPAA FPQDSEELER IAKLFHFLGI FLAKCIQDNR LVDLPLSQPF FKLLCMGDIK STWSRQLYQS CSFPPGQEPE RLHLQPFLLL SESEASTEES QETYSVGSFD EDSKSEFIMD PPKPKPPAWY HGILTWDDFQ LVNPHSRASF LKELKELAMK RRQILSSKSL SEDEKNTRLQ DLMLRNPLGS GPPLSIEDLG YPPLLNFQFC PSSKVHGFSA LDLKPNGDNE MVTMENAEEY VELMFDLCMH TGIQKQMEAF REGFNRVFQM EKMSSFSHKE VQMILCGNQS PSWTADDIIN YTEPKLGYTR DSPGFLRFVR VLCGMSSDER KAFLQFTTGC STLPPGGLAN LHPRLTIVRK VDATDSSYPS VNTCVHYLKL PEYTSEDIMR ERLLAATMEK GFHLN // ID H3CD45_TETNG Unreviewed; 189 AA. AC H3CD45; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000006168}; GN Name=SUN1 (1 of 2) {ECO:0000313|Ensembl:ENSTNIP00000006168}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000006168, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000006168} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000006168} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000006168}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01007940; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000006168; -. DR Ensembl; ENSTNIT00000006316; ENSTNIP00000006168; ENSTNIG00000003578. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; FPLWYFS; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}. SQ SEQUENCE 189 AA; 21172 MW; CBF9F8FDBA984534 CRC64; IVENALRRFS EDRTGMPDFA LESGGGSILS TRCSETYRTK VALLSLFGFP LWYFSQSPRA VIQPDVHPGN CWAFRGSSGF LVIRLSMPIF PTAITLEHTP KALSPSGKMH SAPRDFSVYG LDDENQERGH LLGVYTYDQD GDAVQTFTVS EVYERPFQLV EVQVTSNWGQ PDYTCLYRIR VHGTPADTL // ID H3CIA7_TETNG Unreviewed; 185 AA. AC H3CIA7; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000007985}; GN Name=SUN2 (2 of 4) {ECO:0000313|Ensembl:ENSTNIP00000007985}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000007985, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000007985} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000007985} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000007985}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01011339; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000007985; -. DR Ensembl; ENSTNIT00000008146; ENSTNIP00000007985; ENSTNIG00000005304. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR OMA; IRITHVT; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}. SQ SEQUENCE 185 AA; 20777 MW; 047F2B6D2503E1E4 CRC64; MTRLNRWLLE CIKVCVRVCE GASVITSRCS QTYTSASPRL TVFGIPLFTL SRGPRTVIQG SLKHPGECWS FVGSKGTLAV SLSHPIRITH VTMEHAQRSH SPTGEIKSAP RDFEVYGIRT QPEKETFLGN FTYDQFGEPS QTFALKDPGE EAYQAVELHV LTNWGQQEYT CLYRFRVHGH MAPAS // ID H3CK83_TETNG Unreviewed; 178 AA. AC H3CK83; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000008662}; GN Name=SUN2 (3 of 4) {ECO:0000313|Ensembl:ENSTNIP00000008662}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000008662, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000008662} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000008662} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000008662}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01006467; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000008662; -. DR Ensembl; ENSTNIT00000008831; ENSTNIP00000008662; ENSTNIG00000005929. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H3CK83; -. DR OMA; HKANITH; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 178 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003582178. SQ SEQUENCE 178 AA; 19776 MW; E06C121229B481C6 CRC64; HSTAWISALM LSSLLFFAGA KVYKKQSSNT YEKIEGFKIF GIQIFSKVGP ASVIQGQQPP IPGNCWSFPG SHGNLFIELS HMVTVSHVTL DHVPSSVVPA DTISSAPRQF SVYGRQRLDD RAVHLGKFTY DLEGNPSQTF AVKVYDTITL KYIDLQIESN YGHADYTCLY GFRVHGKI // ID H3CL23_TETNG Unreviewed; 1049 AA. AC H3CL23; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 19. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000008952}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000008952, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000008952} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000008952} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000008952}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01013708; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000008952; -. DR Ensembl; ENSTNIT00000009123; ENSTNIP00000008952; ENSTNIG00000006204. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; H3CL23; -. DR OMA; NGSPHPV; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}. FT COILED 351 371 {ECO:0000256|SAM:Coils}. FT COILED 728 748 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1049 AA; 115370 MW; FCD50A4A9E89417C CRC64; PELALSLSPN GCLYNPSNYY SQAGDVTDPS VPSKEDIPTF DEWKKQVMEV EMEKSQSLYT STTGSPHSAK KVQKSFKNNY ASVECGAKIL AANSEAKSTS AILKENMDLY MLNPCSNKIW FVIELCEPIQ VKQLDIANFE LFSSTPKDFL VSISDRYPTN KWVKLGTFHA RDERIVQSFP LDEQLFAKYL KMFIKYIKIE LLSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYLSERM EYLDEDYDYP PGYQLAEDNP NGSKNLLGSA TNAILNMVNN IAANVLGATP ELEGGTESEG TAVSCFPRSC FSSEQASPVD SSHTIGNTTT GGDKKESTEA FPDSALRQST VTLMEEEGEE EEDRRQEETR DADRNQSDSH IYCPLFSSLS LSCMASLPEL LHRWCSARLA KERLRSLRRR QLGIQTHTHP APNTPSPIHT PLLIPVPAPT PVTEELSQTE TVLKLEVPLM PQNDVKMAEV HIAQPNTPDT HTPELNVLLE PSRTVIPTHG FSDTQSFSVG LTSTNEVKVL PPVKEVAQAT VSTPPLQVAS IPETQPAVVA SPTLTVSISS QSLGSDVASS SDAAPPVSEQ PVKPPLKASR PEPVVAPLGE LPTVLPVADI HTDRPAADPS KEQLDPVMQG GDPQRVDDVT DEDLLSSGGN GNVQRTATDF YAELQNGGES NAGAANGNGM LLNGGAVHGS SQKESVFMRL NNRIKALEMN MSLSSRYLEE LSQRYRKQME EMQRAFNKTI IKLQNTSRIA QEQDQKQTDS IQVLQSQLVN ITKLMLNLTT TVGQLQREVS DRQSYLVVSL VLCLFLGLLL FLQCCCRSSP STSSDTAPIP RSNHYPSPKR CFSSYDDMNL KRRMTCPIIH SNSLPLCCSE VGPDDLYIVE PLKFSPEKKK KRKSKSLDKV DLLKEYYPPA PLINGAPKCN GFHPCLSLQP LLEEVSSPSK ESPSEPSSSP VNSEESHTSG LALQTAAYMS ASQCNGHGLT LSMQQLATMS RQEKRSLKRR KSRPAEMPFS AVPSLQQLIK GNKEISVGTI GVTAVTGHF // ID H3D6N6_TETNG Unreviewed; 141 AA. AC H3D6N6; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000016176}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000016176, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000016176} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000016176} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000016176}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01014751; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000016176; -. DR Ensembl; ENSTNIT00000016387; ENSTNIP00000016176; ENSTNIG00000013193. DR eggNOG; ENOG410IX85; Eukaryota. DR eggNOG; ENOG4111V3C; LUCA. DR GeneTree; ENSGT00390000017748; -. DR InParanoid; H3D6N6; -. DR OMA; NEETCWN; -. DR OrthoDB; EOG7NKKPC; -. DR TreeFam; TF300180; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}. SQ SEQUENCE 141 AA; 16102 MW; C8B1531E4AFD52BA CRC64; MASSLICNDV KSWVSSVLNR DVKQYGKKYL FDCNEETCWN SDQGERQWVV VEFPQSVRVS EVKMQFQGGF SARTCRLQGC LKEGDLDTIG QFYPEDNNCL QSFPIQEAPV TDKVKIMFEN STDFFGRIII YSLDVLGEKA S // ID H3DKT7_TETNG Unreviewed; 927 AA. AC H3DKT7; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000021135}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000021135, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000021135} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000021135} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000021135}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01015037; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000021135; -. DR Ensembl; ENSTNIT00000021368; ENSTNIP00000021135; ENSTNIG00000017972. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H3DKT7; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 375 398 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 405 426 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 571 605 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 927 AA; 102826 MW; 7493870812AB0944 CRC64; MDFSQLHTYT PPQCAPENTG YTYSLSSSYS TAALEFEQEH QIAAVYESPR MSRRSLRLQG HYSVDYSHSQ STTRTLRNKK QQSGSGGLSL SLSQAATPRK TLSFSAVNTP VNSGIFQESS TATDAALFTG LDESHLRQRT VTTTNTFTCV DGQAGVNGDT SASKAHASLT NGYICKDCSF PSQKIDTFIT QSSSSSSQLA QSSSDILSSS SSSSPFTSVY SRDRSRRSKT GVLASFTNSL RQAMSSSLSQ LCFVTIVLTH SSFCGSMNVK GLVTEDAAHL KLNGSLYCSS SHRLTRSGDD CKGKQHAETH SSLLSQSSRL HRMAGTLWGH CLLWPGHCVL RSGKVLGCGA VRALRSLLSL LWMFLTAPVK AGRGLLWFLA TGWYQLVSLM TVLNVFFLTQ CLPRLWRLLL LLLPFLLLLG EALWWWGPST AALLAYLPAV NLTEWRPISP LTLWQTPATP ASETPSMLPP VALSGADLER LAHIERQLAL LGAQLKQTDH KQDERHGNIL ELYNSLKDQL HTRTDRESLG VWVSSLLDQR VGVLQGELEQ EHAQRLQESQ QRGQATRLAE IEVLLNTLAA KTQEVQQKQK QFEQEKRRAV KQEDHAALLL DVQRLEAELG KIRASVSNWT HLKTRQAQVS AQVRKELQTL FFGSGGTGEL PESLLHWLSQ RYVSSPDLQA LLASLEMSIL RNVSLQLEHS RVSTLGEAES QAKAIFHTVS GAVQHTAATE GLPEEQVKII VQNALRLYSQ DRTGLVDYAL ESGGGSILST RCSETYETKT ALMSLFGLPL WYFSQSPRVV IQPDVYPGNC WAFKGSQGYL VIRLSLKIVP TSFCLEHIPR TLSPTGNITS APRDFTVFGL DDEYQEEGKL LGQYTYQEDG DALQMFPVQE QNDKSFQIIE MRVLSNWGHQ EYTCLYRFRV HGNPQLQ // ID H3DNC0_TETNG Unreviewed; 349 AA. AC H3DNC0; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 23. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000022019}; GN Name=SUN2 (4 of 4) {ECO:0000313|Ensembl:ENSTNIP00000022019}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000022019, ECO:0000313|Proteomes:UP000007303}; RN [1] {ECO:0000313|Ensembl:ENSTNIP00000022019} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|Ensembl:ENSTNIP00000022019} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSTNIP00000022019}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01015100; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 99883.ENSTNIP00000022019; -. DR Ensembl; ENSTNIT00000022255; ENSTNIP00000022019; ENSTNIG00000018834. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H3DNC0; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 89 108 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 112 132 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 349 AA; 39247 MW; CFF455E173AE4313 CRC64; MVRRSSRLQA GKYYAVSNGQ NSTPVASISY YETPVRSPRR ARVRSPRRKS PSPSPAQGRP PTRVQHNNQP FLKDSLLSPP PPRRHQKHLS AFLFVLILFF FLFFYPRLVH RIRSLEAQNL KLTKEWQLLQ QRPDGSSVSP ELQQHVDGLF RKLAAELDVL ANRGGSSEDQ RPVADRMADF ALESQGASVI SSRCSQTYTC PSPSLTLFGI PLWSSYRSPR TAIQGSPITA GTCWSFAGAE GTLAVSLSHP VKITHVTVDH LSRYNSPTGD IKSAPKDLEV YGMKTRAGEG TFLGRFRYDK LGESTQTFSL PKPTEEVYEM VELRVLSNWG QKEYTCLYRF RVHGQTDVS // ID H3DXL9_PRIPA Unreviewed; 246 AA. AC H3DXL9; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:PPA02164}; OS Pristionchus pacificus (Parasitic nematode). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. OX NCBI_TaxID=54126 {ECO:0000313|EnsemblMetazoa:PPA02164}; RN [1] {ECO:0000313|EnsemblMetazoa:PPA02164} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PS312 {ECO:0000313|EnsemblMetazoa:PPA02164}; RA Wilson R.K.; RT "Draft sequence assembly of the Pristionchus pacificus genome."; RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblMetazoa:PPA02164} RP IDENTIFICATION. RC STRAIN=PS312 {ECO:0000313|EnsemblMetazoa:PPA02164}; RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 54126.PPA02164; -. DR EnsemblMetazoa; PPA02164; PPA02164; PPA02164. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; H3DXL9; -. DR Proteomes; UP000005239; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005239}; KW Reference proteome {ECO:0000313|Proteomes:UP000005239}. SQ SEQUENCE 246 AA; 27085 MW; 09A53C92F7C576E7 CRC64; MKLIKEASEK GEPLTFKYIS DFDENGIIYW LGTNAKSESE WTNPASVGVV VATSSDAPRQ PFGRPEDILS RDPNALNCHT GDDKNGFFSI DMGAVIKVSN YTLRHSRGYS RSALRNWLFQ GSNDNKTWDV LSYHKNDTAL TEPGSTATFP IDAGKGAYRY FRISQNGENS SGSTYYLSLS GFEMYGTVLE AVEKEIRCDN EKGEKKSRLP ISSTPLSSPF PPLGSLGHPL HASAPSKLPH PGQLRL // ID H3EHJ8_PRIPA Unreviewed; 1059 AA. AC H3EHJ8; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:PPA09213}; GN Name=WBGene00098767 {ECO:0000313|EnsemblMetazoa:PPA09213}; OS Pristionchus pacificus (Parasitic nematode). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. OX NCBI_TaxID=54126 {ECO:0000313|EnsemblMetazoa:PPA09213}; RN [1] {ECO:0000313|EnsemblMetazoa:PPA09213} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PS312 {ECO:0000313|EnsemblMetazoa:PPA09213}; RA Wilson R.K.; RT "Draft sequence assembly of the Pristionchus pacificus genome."; RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblMetazoa:PPA09213} RP IDENTIFICATION. RC STRAIN=PS312 {ECO:0000313|EnsemblMetazoa:PPA09213}; RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 54126.PPA09213; -. DR EnsemblMetazoa; PPA09213; PPA09213; PPA09213. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; H3EHJ8; -. DR Proteomes; UP000005239; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005239}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005239}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 114 131 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 140 156 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 596 615 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 622 641 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 653 673 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 731 753 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 827 847 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1059 AA; 116996 MW; 79DF162B130056E4 CRC64; MGWEESGRRS PSLELIDRPP SLYRPPIRPY RTGYTYAYSK SYRNSTDYDD YEVPNMAGTS FYHDSSMRED DGVEGYSFMS TIYEEAVTIY YKTITAVYCS LATTYAGGRS IVEWIYYILY SIAYGVWYAG SHGGRWIGDA IYRVLYSIAY GLWLVLDSTG RFSYNGSSLL LQTLFDIVMF VPRKVNGFIN PPPINRRVTY SEDDNEVRYF THNERIVVSN GFSSSSSSSS HNQKSVFGGL WNGVMKYLKR GRRSHMEPVY ELRSRTIERE VGDDTDTDDE LGIDEHVSVV RHQTNQSVPT RRARTVKAMD EQDVIFDESS ENILTNLAYL PVDALVALYS LLSYTISSVG SGVSNVGYYS FHGTKSVFES VFDGFITILY YTMYAPIATV GSFVSNVASG NISSAPANPA NNGVANAART TTRKSRSHAV HHTTALGSAT TDEDEMMAAP VYSTHERHEE EDILADLRDT PVVRRSTRRL STSSNTSDRS ASGSNAASTS TARTTRSARA VKVPLLHSSS QSSSSSLFAP VGGVVWAVKD SITTILAKII EIIHFTFLLL IQCFKAIGQL IVGGASTIIS SIGALFGAIA SGTTTGSAGI LSMIGAVFNN IFNVFRSVFT ESASVVSSTG SAIGAGVGSI WGSRPSGSTL WNLFLWLVLL LPLIFFCLWL LALPPFKKEH DEVVAEYVKH YSSIMEDYYS YGQHHTKSFI GVTVDTIGAG ARSLWTIFAS LLQWILAAVF GLWESLLMFL AGLRIDRLFA FSPTPAIVPP IIVAPSSECP PVPAPVYIPG PPAPPVYIPA PPPTIDQEAL IAAIVAKVTA QMEQRMSDSL NGKIHIMEES VRRAEEELRA KITVTHEPFD YSNLDALIAA AIRKYDSDKT GLVDYALESS GGQIISTRCS ETYALSTRVE KIFDIPLYYS NYGPRVVIQR NSQALVPGEC WAFKGGIGYL TIKLAVPIKV TSVSYEHIPP SISRNGENLS APKTFTIFTY EKDEYDFASR FELGKFTYDA HGDPLQFFPA HPPYPVQIIE FQVDSNYGEQ YTCLYRFRVH GDNLAVVRK // ID H3F5P2_PRIPA Unreviewed; 788 AA. AC H3F5P2; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:PPA17472}; GN Name=WBGene00107026 {ECO:0000313|EnsemblMetazoa:PPA17472}; OS Pristionchus pacificus (Parasitic nematode). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. OX NCBI_TaxID=54126 {ECO:0000313|EnsemblMetazoa:PPA17472}; RN [1] {ECO:0000313|EnsemblMetazoa:PPA17472} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PS312 {ECO:0000313|EnsemblMetazoa:PPA17472}; RA Wilson R.K.; RT "Draft sequence assembly of the Pristionchus pacificus genome."; RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblMetazoa:PPA17472} RP IDENTIFICATION. RC STRAIN=PS312 {ECO:0000313|EnsemblMetazoa:PPA17472}; RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 54126.PPA17472; -. DR EnsemblMetazoa; PPA17472; PPA17472; PPA17472. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; H3F5P2; -. DR Proteomes; UP000005239; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005239}; KW Reference proteome {ECO:0000313|Proteomes:UP000005239}. SQ SEQUENCE 788 AA; 83921 MW; 83D32C64D02EA4A6 CRC64; MRNSNEAFDR RLTVLHLQPR SIEIANFELF SSGPAAVRFS AAERFPTQQW SVLGEWSLAD TRTVQTLPVA ADALTYAKFI KLELLAHHGA EHFCTLSTVR VLGVSMVDEY EAEAAVAARI AAPHAAVPPL TATEAVVTPP RVQHTQPKEQ QQQQQQVKKE EPPQAAVPTE STPITSTEVP PIAVVEPPPV APRPPASDAS PPASGSGLVQ DVMSGTLLKK IIEVVGGGKK KTADAPAAPP RLSAYDACDR TPTDGWPRAT CARRAFFCPP GTTPAPAAAR PSHAEETAAK RAAARRQFIA ATQREEAARA AAAAPPPQLP HTQIDAPPQR KATTTPVVQP QAAAPTPPVA EAAPKEAEPP VAAAKPQQEQ QPPPAAAAPP AAQQQQAAPP VGIFEGLPAG TNSHKLETIF IKLTKRVSAL ELNMSLSSEY LSELSKQYIG ECGVLFNLRK EVDALSAWLA TLRAQAGSVA LTRRISGAYA APQDGHHETS PAEERPSSAE VGHHRRHQTV DIGGEDYASY EEAYESTCPY SEGGGRRQPQ QGRPYGPQPR PPREPNASDA DDGEEEEGGH ADADDAFDLR HFRHHSDGIW TTEQVLYAVL GAQALTVALV LLMQACYARA FGRGRQPADP PAPAAVPAPD TAELERLIAA ALERRAQREA APPRVPVAAA AAASADASPR SSASSTASSS GAAPQPLQQL SGGKKKRRQR RSTAEQQQQQ QQNHHHCRQC GEGEEGPEFG LGLGDGLVLA VALPNWKRGG IRGPAPFRRF RPLIAPGCSR FLHCQTRV // ID H3FTA6_PRIPA Unreviewed; 426 AA. AC H3FTA6; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 14-OCT-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:PPA25528}; GN Name=WBGene00115082 {ECO:0000313|EnsemblMetazoa:PPA25528}; OS Pristionchus pacificus (Parasitic nematode). OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Diplogasterida; OC Neodiplogasteridae; Pristionchus. OX NCBI_TaxID=54126 {ECO:0000313|EnsemblMetazoa:PPA25528}; RN [1] {ECO:0000313|EnsemblMetazoa:PPA25528} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PS312 {ECO:0000313|EnsemblMetazoa:PPA25528}; RA Wilson R.K.; RT "Draft sequence assembly of the Pristionchus pacificus genome."; RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblMetazoa:PPA25528} RP IDENTIFICATION. RC STRAIN=PS312 {ECO:0000313|EnsemblMetazoa:PPA25528}; RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EnsemblMetazoa; PPA25528; PPA25528; PPA25528. DR InParanoid; H3FTA6; -. DR Proteomes; UP000005239; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005239}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005239}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 45 65 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 222 249 {ECO:0000256|SAM:Coils}. FT COILED 361 381 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 426 AA; 48311 MW; 9E40FB52AC469E6F CRC64; MILRDILVKI PFGKFLSRHA FEPGQVRKTT IEYTIENQGP LHSRYYPFCI AFYILIAALG SAYFFGATPT VVTENVLDPV VISVGDAVNG MKEKMGDAYM GVRDSMGDVY SGVKERIKDV VEEYKKIDEG GRNVTIEEKR AEEKERNRKI DEMITKWSDQ MNAKFKKLEA KIDSDTNSIN KRLDSFETAL HHLKMDLEEK MNSMVDKNES PSSQSLTSFV SIEEFRVNLE NTKKEMDELR RMVPREDTEE YLNLASYTSG ASIIDSATSS SLYSSVLNMF SPDRTPIFVL TERHLFPGDC MPLPSTGGEV GINLSTLGHI SHIEYYHLYW SEAAGIPQSA PKRIQIMGCA DDFATVNCET IAECEYNVEN TANRREEAKR RRIFGVPINC PVKLVHKEEM KSAVAKSIRI KILSNHGAEH TYASYW // ID H3GZ52_PHYRM Unreviewed; 653 AA. AC H3GZ52; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblProtists:Phyra83035}; OS Phytophthora ramorum (Sudden oak death agent). OC Eukaryota; Stramenopiles; Oomycetes; Peronosporales; Phytophthora. OX NCBI_TaxID=164328 {ECO:0000313|EnsemblProtists:Phyra83035}; RN [1] {ECO:0000313|EnsemblProtists:Phyra83035} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Pr102 {ECO:0000313|EnsemblProtists:Phyra83035}; RX PubMed=16946064; DOI=10.1126/science.1128796; RA Tyler B.M., Tripathy S., Zhang X., Dehal P., Jiang R.H.Y., Aerts A., RA Arredondo F.D., Baxter L., Bensasson D., Beynon J.L., Chapman J., RA Damasceno C.M.B., Dorrance A.E., Dou D., Dickerman A.W., Dubchak I.L., RA Garbelotto M., Gijzen M., Gordon S.G., Govers F., Grunwald N.J., RA Huang W., Ivors K.L., Jones R.W., Kamoun S., Krampis K., Lamour K.H., RA Lee M.-K., McDonald W.H., Medina M., Meijer H.J.G., Nordberg E.K., RA Maclean D.J., Ospina-Giraldo M.D., Morris P.F., Phuntumart V., RA Putnam N.H., Rash S., Rose J.K.C., Sakihama Y., Salamov A.A., RA Savidor A., Scheuring C.F., Smith B.M., Sobral B.W.S., Terry A., RA Torto-Alalibo T.A., Win J., Xu Z., Zhang H., Grigoriev I.V., RA Rokhsar D.S., Boore J.L.; RT "Phytophthora genome sequences uncover evolutionary origins and RT mechanisms of pathogenesis."; RL Science 313:1261-1266(2006). RN [2] {ECO:0000313|EnsemblProtists:Phyra83035} RP IDENTIFICATION. RC STRAIN=Pr102 {ECO:0000313|EnsemblProtists:Phyra83035}; RG EnsemblProtists; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS566078; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 164328.Phyra83035; -. DR EnsemblProtists; Phyra83035; Phyra83035; Phyra83035. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; H3GZ52; -. DR Proteomes; UP000005238; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005238}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005238}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 653 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003587088. FT TRANSMEM 534 556 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 462 489 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 653 AA; 71548 MW; 761E6BE340CEE6F0 CRC64; MRPKLLLLVL FVALCSVRGI VTPDVPSDST PSEDADAPVG DSVASSDAEP PPDEPGADSS GSPQRITEPD EPTADAQLAD EDVDPLMDVP SGLFEVVDAD SVDTRKRQNY ASLDAGATIL DAAPETKSPT NLLVPDKDRY MLTPCSNPRK WVVISLSEDV HADAIAVANY EKFSSPVKDF IVLGSVNYPT DTWLVLGNFT ATHTNGEQIF QLDAQQHVRY LKFRFLSHYG SEYYCTLSQL RVFGRTFTQV ISQLEKSIDA EVEALDAQAA LPVPQPSTVS ELSVPRIPDP TELMSQCLME KNNSVVAVFY DKQQRLEHYQ SHGMCCLVDY SPEQIEAEVA ASSNAKEQVA ATSTDSSDPD VSDGAGQTSG SVPSTASSSN TNSSSVPAAS GSNSNATAPS SAAAPASLLP TAHVTAASST QGLGRLESIF VRITKKIQAL EVNQSVMGRQ IEEFHTNQWA AIKMLQSNQE SLNEQLREIR TMIVDLKDHV AKELSANEQT LLSYGRLLDD VRRDNIALWN EMLIVREVIT TMKAGILCAI VLSGFIILFY LLRLLFRCVS KCKERADLRE WFWRMENHES TAEDQDTNSP VGSMAAGALR VNRKAQFGSS WDDSAIERKT LVSDMVGDGP QQFRRHRPKR FSQPTTALKR PRK // ID H6BRJ4_EXODN Unreviewed; 640 AA. AC H6BRJ4; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EHY53999.1}; GN ORFNames=HMPREF1120_02176 {ECO:0000313|EMBL:EHY53999.1}; OS Exophiala dermatitidis (strain ATCC 34100 / CBS 525.76 / NIH/UT8656) OS (Black yeast) (Wangiella dermatitidis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Chaetothyriomycetidae; Chaetothyriales; Herpotrichiellaceae; OC Exophiala. OX NCBI_TaxID=858893 {ECO:0000313|EMBL:EHY53999.1, ECO:0000313|Proteomes:UP000007304}; RN [1] {ECO:0000313|Proteomes:UP000007304} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 34100 / CBS 525.76 / NIH/UT8656 RC {ECO:0000313|Proteomes:UP000007304}; RX PubMed=24496724; DOI=10.1534/g3.113.009241; RA Chen Z., Martinez D.A., Gujja S., Sykes S.M., Zeng Q., Szaniszlo P.J., RA Wang Z., Cuomo C.A.; RT "Comparative genomic and transcriptomic analysis of Wangiella RT dermatitidis, a major cause of phaeohyphomycosis and a model black RT yeast human pathogen."; RL G3 (Bethesda) 0:0-0(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH226131; EHY53999.1; -; Genomic_DNA. DR RefSeq; XP_009154460.1; XM_009156212.1. DR EnsemblFungi; EHY53999; EHY53999; HMPREF1120_02176. DR GeneID; 20306815; -. DR InParanoid; H6BRJ4; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000007304; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007304}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007304}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 282 304 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 387 407 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 640 AA; 70047 MW; B4D01588EE10C692 CRC64; MPPRRSTARL SATPIRASSP TKRSTRGSSV LASEDAIPRR VTRGGSQQPS MAAEGAVNNP RLPEVQIQQS YAYGSSKSAV LPAQLVARNK MNLREMAETI DAGVEQAQQH LQHHIEETHA QLQNETDPRA ERARRRASRE PASREGSVTT DDVEKNKSQR VAAWASSLES SQLDEIPEED SSSGRPPSTP DNATHKDTDP SSFPSGIFDH SYNYERGLRR PNVTVRRKTG GDSTLQQAWK TAKTIADQSR QASARALGAT AQWTSRLFRA SGRAISDLPN SVFVQVMVSL LFGLFVATAA SFLFCHIYTS YICDAHSSSP IGVTLQRYCG GCVRASSSPL NFTLGANGGD LSKLSAALSG IQSQIQAIEG RLSEKLDSQY TIVDTDIKEL RRQHSELSSH IAGLQRVRGG GSVSSSGDVA SPVIAKVNYF APNNGANVDP HNTSPTRERR QALVSRVLSR MVGMTLYETK PAITALQPWQ DVGDYWCSSA SPANNDDQQD SMRLGVRVAE MIFPTEVVVE NYPNAGSLFP GSTPKRIQVW ADFQHLDSRE WESLNIRQMQ ADGPLSLGPT YALIGEVEYD ASVEAPHVQA FPLAVNQHDI HLYAAQSFVV RVVKNYGAEY TCLYRIRMHG VPALQYHDGR // ID H6C013_EXODN Unreviewed; 940 AA. AC H6C013; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 14-OCT-2015, entry version 13. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EHY57162.1}; GN ORFNames=HMPREF1120_05210 {ECO:0000313|EMBL:EHY57162.1}; OS Exophiala dermatitidis (strain ATCC 34100 / CBS 525.76 / NIH/UT8656) OS (Black yeast) (Wangiella dermatitidis). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Chaetothyriomycetidae; Chaetothyriales; Herpotrichiellaceae; OC Exophiala. OX NCBI_TaxID=858893 {ECO:0000313|EMBL:EHY57162.1, ECO:0000313|Proteomes:UP000007304}; RN [1] {ECO:0000313|Proteomes:UP000007304} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 34100 / CBS 525.76 / NIH/UT8656 RC {ECO:0000313|Proteomes:UP000007304}; RX PubMed=24496724; DOI=10.1534/g3.113.009241; RA Chen Z., Martinez D.A., Gujja S., Sykes S.M., Zeng Q., Szaniszlo P.J., RA Wang Z., Cuomo C.A.; RT "Comparative genomic and transcriptomic analysis of Wangiella RT dermatitidis, a major cause of phaeohyphomycosis and a model black RT yeast human pathogen."; RL G3 (Bethesda) 0:0-0(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH226133; EHY57162.1; -; Genomic_DNA. DR RefSeq; XP_009157623.1; XM_009159375.1. DR EnsemblFungi; EHY57162; EHY57162; HMPREF1120_05210. DR GeneID; 20309849; -. DR InParanoid; H6C013; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007304; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007304}; KW Reference proteome {ECO:0000313|Proteomes:UP000007304}. FT COILED 406 426 {ECO:0000256|SAM:Coils}. FT COILED 702 722 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 940 AA; 102551 MW; ED824ADD06A63038 CRC64; MAVGRPAPGI WIAAQAFFIA SILVVFTTAT SSTIATATCP FRTVNYITHS LPQQCLPSSR KANATSSVTE ATSYIDPESA SIPHASDRHA NTTEVVLTTS TTSLSLEFTS TSSEDATITG EPAQSVSSPP LPESTMSSAG ASVGSTDDES PLEDGKFLSF EDWKRENLKK AGQSEHVGKG HGLDRDTRKR PNIVQESLDA LGDDAEIDID FSGFVSDSLE SVAPGQSQSN HKLDETPSDV KPGSAPRAGV RKKDAGTTCK ERFNYASFDC AASVLKTNPE AKSPSAVLGK NKDSYMLNEC SAQNKFLILE LCDDIAIDTI VLANFEFFSS TFRTFRVSVS DKYPVKIDKW KTLGTFEARN SRDVQAFLVE NPVIWARYLR IEFLTHYGNE YYCPLSLVRV HGTTMLEDYK HDLESLQMEE DDVESQDSGE SLAMDELIPE AVAEPLLKVT SEASLSNPPV VETSDLGPSD SFPIKPETPS VIHIPSTTSM MMPMPTSSAN FSVPFDEPSM NEEAYGVCKS VDNPTTIEPE ELQPAATDQT TNVVVVTSTS MPTTTETAGN SQHTTSTAPA SNSTTTFADT QISSSIPDVA NNSSISKDPT VTAVNNSMKP ASSTTQATAA APTIQESFFK SVQKRLQMLE ANSSLSLQYI EEQSRALRDA FQKVDQRQMA KTTSFLEYLN TTVLNELRDF RQQYDQLWQS TVIELELQRE RYQQENLAIN ARLGILADEV IFQKRMSILQ MILILICLGL VIFSRGSLNS YLELPLVQSV LARSPSSKWL NLHSLETPSH SPGPSRSHST RAERVRHGIL KGHRRSTSED SVTDTLSPSD LYSPPTPVSF DSPSEEEEGL DDGNRLGDPE FDPSLIERPG TSPPVLPGTE TPPMSQSNGK IGHDMVDSAL LSSSPSQIAT TTPRVVVNDA TPPTKRLTWQ LPETWKDHGD // ID H7BZA7_HUMAN Unreviewed; 281 AA. AC H7BZA7; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=SUN domain-containing protein 3 {ECO:0000313|Ensembl:ENSP00000388627}; DE Flags: Fragment; GN Name=SUN3 {ECO:0000313|Ensembl:ENSP00000388627}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000388627, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000388627, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12853948; DOI=10.1038/nature01782; RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A., Pepin K.H., RA Wagner-McPherson C., Layman D., Maas J., Jaeger S., Walker R., RA Wylie K., Sekhon M., Becker M.C., O'Laughlin M.D., Schaller M.E., RA Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., RA Sun H., Edwards J., Bradshaw-Cordum H., Ali J., Andrews S., Isak A., RA Vanbrunt A., Nguyen C., Du F., Lamar B., Courtney L., Kalicki J., RA Ozersky P., Bielicki L., Scott K., Holmes A., Harkins R., Harris A., RA Strong C.M., Hou S., Tomlinson C., Dauphin-Kohlberg S., RA Kozlowicz-Reilly A., Leonard S., Rohlfing T., Rock S.M., RA Tin-Wollam A.-M., Abbott A., Minx P., Maupin R., Strowmatt C., RA Latreille P., Miller N., Johnson D., Murray J., Woessner J.P., RA Wendl M.C., Yang S.-P., Schultz B.R., Wallis J.W., Spieth J., RA Bieri T.A., Nelson J.O., Berkowicz N., Wohldmann P.E., Cook L.L., RA Hickenbotham M.T., Eldred J., Williams D., Bedell J.A., Mardis E.R., RA Clifton S.W., Chissoe S.L., Marra M.A., Raymond C., Haugen E., RA Gillett W., Zhou Y., James R., Phelps K., Iadanoto S., Bubb K., RA Simms E., Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., RA Baertsch R.A., Brent M.R., Keibler E., Flicek P., Bork P., Suyama M., RA Bailey J.A., Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., RA Eddy S.R., McPherson J.D., Olson M.V., Eichler E.E., Green E.D., RA Waterston R.H., Wilson R.K.; RT "The DNA sequence of human chromosome 7."; RL Nature 424:157-164(2003). RN [2] {ECO:0000313|Ensembl:ENSP00000388627} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000388627}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC069279; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; H7BZA7; -. DR STRING; 9606.ENSP00000297325; -. DR PaxDb; H7BZA7; -. DR Ensembl; ENST00000453071; ENSP00000388627; ENSG00000164744. DR HGNC; HGNC:22429; SUN3. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR NextBio; 35528444; -. DR Proteomes; UP000005640; Chromosome 7. DR ExpressionAtlas; H7BZA7; baseline and differential. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Proteomics identification {ECO:0000213|PeptideAtlas:H7BZA7}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}. FT COILED 19 39 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSP00000388627}. SQ SEQUENCE 281 AA; 32092 MW; F134B9497D851E3B CRC64; XYAIIAEYGS RLYKYQARLR MPKEQLELLK KESQNLENNF RQILFLIEQI DVLKALLRDM KDGMDNNHNW NTHGDPVEDP DHTEVLDEEV SNLVNYVLKK LREDQVEMAD YALKSAGASI IEAGTSESYK NNKAKLYWHG IGFLNHEMPP DIILQPDVYP GKCWAFPGSQ GHTLIKLATK IIPTAVTMEH ISEKVSPSGN ISSAPKEFSV YGITKKCEGE EIFLGQFIYN KTGTTVQTFE LQHAVSEYLL CVKLNIFSNW GHPKYTCLYR FRVHGTPGKH I // ID H7C2N0_HUMAN Unreviewed; 170 AA. AC H7C2N0; DT 18-APR-2012, integrated into UniProtKB/TrEMBL. DT 18-APR-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=SUN domain-containing protein 3 {ECO:0000313|Ensembl:ENSP00000406887}; DE Flags: Fragment; GN Name=SUN3 {ECO:0000313|Ensembl:ENSP00000406887}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000406887, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000406887, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12853948; DOI=10.1038/nature01782; RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A., Pepin K.H., RA Wagner-McPherson C., Layman D., Maas J., Jaeger S., Walker R., RA Wylie K., Sekhon M., Becker M.C., O'Laughlin M.D., Schaller M.E., RA Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., RA Sun H., Edwards J., Bradshaw-Cordum H., Ali J., Andrews S., Isak A., RA Vanbrunt A., Nguyen C., Du F., Lamar B., Courtney L., Kalicki J., RA Ozersky P., Bielicki L., Scott K., Holmes A., Harkins R., Harris A., RA Strong C.M., Hou S., Tomlinson C., Dauphin-Kohlberg S., RA Kozlowicz-Reilly A., Leonard S., Rohlfing T., Rock S.M., RA Tin-Wollam A.-M., Abbott A., Minx P., Maupin R., Strowmatt C., RA Latreille P., Miller N., Johnson D., Murray J., Woessner J.P., RA Wendl M.C., Yang S.-P., Schultz B.R., Wallis J.W., Spieth J., RA Bieri T.A., Nelson J.O., Berkowicz N., Wohldmann P.E., Cook L.L., RA Hickenbotham M.T., Eldred J., Williams D., Bedell J.A., Mardis E.R., RA Clifton S.W., Chissoe S.L., Marra M.A., Raymond C., Haugen E., RA Gillett W., Zhou Y., James R., Phelps K., Iadanoto S., Bubb K., RA Simms E., Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., RA Baertsch R.A., Brent M.R., Keibler E., Flicek P., Bork P., Suyama M., RA Bailey J.A., Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., RA Eddy S.R., McPherson J.D., Olson M.V., Eichler E.E., Green E.D., RA Waterston R.H., Wilson R.K.; RT "The DNA sequence of human chromosome 7."; RL Nature 424:157-164(2003). RN [2] {ECO:0000313|Ensembl:ENSP00000406887} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000406887}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC069279; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; H7C2N0; -. DR STRING; 9606.ENSP00000297325; -. DR PaxDb; H7C2N0; -. DR Ensembl; ENST00000412371; ENSP00000406887; ENSG00000164744. DR HGNC; HGNC:22429; SUN3. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR NextBio; 35529561; -. DR Proteomes; UP000005640; Chromosome 7. DR Bgee; H7C2N0; -. DR ExpressionAtlas; H7C2N0; baseline and differential. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Proteomics identification {ECO:0000213|PeptideAtlas:H7C2N0}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSP00000406887}. SQ SEQUENCE 170 AA; 18911 MW; 3FB0E8049F8AB13B CRC64; EDQVEMADYA LKSAGASIIE AGTSESYKNN KAKLYWHGIG FLNHEMPPDI ILQPDVYPGK CWAFPGSQGH TLIKLATKII PTAVTMEHIS EKVSPSGNIS SAPKEFSVYG ITKKCEGEEI FLGQFIYNKT GTTVQTFELQ GPWHTRQAHL EELVQKAMPH VQNIQECLFS // ID H8WWU5_CANO9 Unreviewed; 580 AA. AC H8WWU5; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 14-OCT-2015, entry version 15. DE SubName: Full=Slp1 protein {ECO:0000313|EMBL:CCG21085.1}; GN ORFNames=CORT_0A07000 {ECO:0000313|EMBL:CCG21085.1}; OS Candida orthopsilosis (strain 90-125) (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Candida/Lodderomyces clade; Candida. OX NCBI_TaxID=1136231 {ECO:0000313|EMBL:CCG21085.1, ECO:0000313|Proteomes:UP000005018}; RN [1] {ECO:0000313|EMBL:CCG21085.1, ECO:0000313|Proteomes:UP000005018} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=90-125 {ECO:0000313|Proteomes:UP000005018}; RX PubMed=22563396; DOI=10.1371/journal.pone.0035750; RA Riccombeni A., Vidanes G., Proux-Wera E., Wolfe K.H., Butler G.; RT "Sequence and analysis of the genome of the pathogenic yeast Candida RT orthopsilosis."; RL PLoS ONE 7:E35750-E35750(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; HE681719; CCG21085.1; -; Genomic_DNA. DR RefSeq; XP_003866524.1; XM_003866476.1. DR EnsemblFungi; CCG21085; CCG21085; CORT_0A07000. DR GeneID; 14537792; -. DR KEGG; cot:CORT_0A07000; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000005018; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005018}; KW Membrane {ECO:0000256|SAM:Phobius}; Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 580 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003616601. FT TRANSMEM 494 511 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 580 AA; 66368 MW; 40084ED97AAD9510 CRC64; MLSTWIIIAL TVYLEKCVGQ SSKDGNSESV TNTSSQEYSP INYTLPVFLS TEIIPSFANV YGERQELHLQ SPVLQSQNRN ESHPNSVIDD CHFMSFEEWK KQKIETNISI SNTSRNITEV SKPTTNVTTT NSTAVSVVEI TEQEGTTYKN KFNFASADCA ATIVKTNSQA KGASAILKEN KDSYLLNECS VQNKFIIIEL CQDILVSQVV LGNYEFFSSM YKDIRVSVSD RFPTQNWREL GQFTAKNIRD VQRFDIANPL IWARYLKLEI LSHYGNEFYC PISVVRVHGK TMIDEFKEDE EISSQQLQSP EPTTISELDA NDSELESLIN DTFNECSVVL PHLLLNEFLK DYNTTHSNHC LPSERNNNSA LITSTTATIA TTQESIYKNI IKRLTLLESN ATLSLLYIEE QSKLLSTAFS NLEKRQTANF NNLLRSVNST LLHQLTVFKE SYHDMYSQYS ELFHLQDHKY KHFIAESNQR MKNISSDLTF QKRLSFFNSI IIICLLVYVI LTREINVEVQ NQAVKRHNKS EEALVPDLHR QNKASVFNRS RKDRPSISDP ILAPTKPIHN DHHPKHRKST // ID H8Z9I7_NEMS1 Unreviewed; 289 AA. AC H8Z9I7; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 14-OCT-2015, entry version 10. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EHY66618.1}; GN ORFNames=NERG_00258 {ECO:0000313|EMBL:EHY66618.1}; OS Nematocida sp. 1 (strain ERTm2 / ATCC PRA-371) (Nematode killer OS fungus). OC Eukaryota; Fungi; Microsporidia; Nematocida. OX NCBI_TaxID=944018 {ECO:0000313|EMBL:EHY66618.1, ECO:0000313|Proteomes:UP000005622}; RN [1] {ECO:0000313|Proteomes:UP000005622} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ERTm2 {ECO:0000313|Proteomes:UP000005622}; RX PubMed=22813931; DOI=10.1101/gr.142802.112; RA Cuomo C.A., Desjardins C.A., Bakowski M.A., Goldberg J., Ma A.T., RA Becnel J.J., Didier E.S., Fan L., Heiman D.I., Levin J.Z., Young S., RA Zeng Q., Troemel E.R.; RT "Microsporidian genome analysis reveals evolutionary strategies for RT obligate intracellular growth."; RL Genome Res. 0:0-0(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JH604633; EHY66618.1; -; Genomic_DNA. DR EnsemblFungi; EHY66618; EHY66618; NERG_00258. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000005622; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005622}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005622}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 86 105 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 105 132 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 289 AA; 33552 MW; A4B00C5583FD64F7 CRC64; MENRRPLRIR KKEEIFRIKE IQTAEKDDTK GSLPHVSDEE PLTDSEEHNS LNISTPEEEA KEEEIKKEEV ERKLSFWSLY MQSSSFSVCV LVIGAVFGFL LHSYYRNLDN CINGLKDKIE SQKERLLELE SIFHSKNREI DVADYLEGAR ILYNITTDPY VEKKWFKSNV TGLSAEVAID RVCDKHHCYS FNGSEGKLGI AFKNEKIIRK IGIMHPLYND RTSAVKSFTV DCIMNDKHIN IGEFEYEIPG DSFQQFSITP TKCTGMIFKI KSNHGKKQYT CIYKIYAFE // ID H9FS03_MACMU Unreviewed; 785 AA. AC H9FS03; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=SUN domain-containing protein 1 isoform a {ECO:0000313|EMBL:AFE77412.1}; GN Name=SUN1 {ECO:0000313|EMBL:AFE77412.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFE77412.1}; RN [1] {ECO:0000313|EMBL:AFE77412.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Caudate {ECO:0000313|EMBL:AFE77412.1}; RX PubMed=25319552; DOI=10.1186/1745-6150-9-20; RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., RA Pandey S., Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., RA Tharp G.K., Marcais G., Roberts M., Ferguson B., Fox H.S., RA Treangen T., Salzberg S.L., Yorke J.A., Norgren R.B.Jr.; RT "A new rhesus macaque assembly and annotation for next-generation RT sequencing analyses."; RL Biol. Direct 9:20-20(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JU333657; AFE77412.1; -; mRNA. DR STRING; 9544.ENSMMUP00000022451; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 259 282 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 289 308 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 383 403 {ECO:0000256|SAM:Coils}. FT COILED 428 462 {ECO:0000256|SAM:Coils}. FT COILED 475 495 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 785 AA; 87283 MW; 9360624CF6C60323 CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADGGASSA VSLKNRAART AKQRRSTNKS AFSINHVSRQ VTSFGVSHSG TDSLQDAVTR QPPVLDESWI REQTTVDHFW GLDDDGDLKG GNKAAIQGNG DLGAAAATAH NGFSCSNCSM LSERKDMLTA HPAAPGPVSR VYSRDRNQKR YFLLQTLRRI GAAGRAVSRM AWSALWLAVV APGKAASGVF WWLGIGWYQF VTLISWLNVF LLTRCLRNIC KLLVLLVPLL LLLAGLSLRG QGDFFSFLPV LNWASTHRTQ RVDDPQDVFK PATSRLNQPL QGDNEAFPWH WMSGMEQQVT SLSGQCHHHG ENLRELTTLL QKLQARVDQM DNGAAGPSTS VRDAVGQPLK ETDFMAFHQE HEVRISHLED ILGKLREKSE AIQKELEQTK QKTVSAVGEQ LLPTVEHLQL ELDQLKSELS SWRHMKTGCE TVDALQERVD VQVRETVKLL FSEDQQGGSL EQLLQRFSSQ CVSRGDLHTM LRDLELQILR NVTHHISVTK RLPASEVVVS AVSEAGASGI TEAQARAIVN NALKLYSQDK TGMVDFALES GGGSILSTRC SETYETKTAL MSLFGIPLWY FSQSPRVVIQ PDIYPGNCWA FKGSQGYLVV RLSMMIHPAA FTLEHIPKTL SPTGNISSAP KDFAVYGLEN EYQEEGQLLG QFTYDQDGES LQMFQALKTP DDRVFQIVEL RIFSNWGHPE YTCLYRFRVH GEPVK // ID H9FSI8_MACMU Unreviewed; 720 AA. AC H9FSI8; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=SUN domain-containing protein 2 isoform b {ECO:0000313|EMBL:AFE77597.1}; GN Name=SUN2 {ECO:0000313|EMBL:AFE77597.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFE77597.1}; RN [1] {ECO:0000313|EMBL:AFE77597.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Caudate {ECO:0000313|EMBL:AFE77597.1}; RX PubMed=25319552; DOI=10.1186/1745-6150-9-20; RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., RA Pandey S., Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., RA Tharp G.K., Marcais G., Roberts M., Ferguson B., Fox H.S., RA Treangen T., Salzberg S.L., Yorke J.A., Norgren R.B.Jr.; RT "A new rhesus macaque assembly and annotation for next-generation RT sequencing analyses."; RL Biol. Direct 9:20-20(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JU333841; AFE77596.1; -; mRNA. DR EMBL; JU333842; AFE77597.1; -; mRNA. DR STRING; 9544.ENSMMUP00000009544; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 217 238 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 276 296 FT COILED 355 375 FT COILED 377 404 FT COILED 407 434 FT COILED 481 501 SQ SEQUENCE 720 AA; 80541 MW; 426A1FE3344FBD93 CRC64; MSRRSQRLTR YSQGDDDGSS SSGGSSVAGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDAHTSYY SESLVRESYI GTAFPPRSAL EELHGDADWG EDLRVRRRRG TGGSESSRAS GLVGRKAAED FLGSSSGYSS EDDYMGYSDA DQQSSGSRLW NAVSRAGSLL WMVATSPGRL FRLLYWWAGT TWYRLTTAAS LLDVFVLTRR FSSLKTFLWF LLPLLLLTCL TYGAWYFYPY GLQTFHPALV SWWAAKDSRR QDEGWESRDS SHFQAEQRVM SRVHSLERRL EALAAEFSSN WQKEAMRLER LELRQGAPGQ GGGGGLSHED TLALLEGLVS RHEAALKEDF RREAAARIQE ELSALRAEHQ QDSEDLFKKI VRASQESEAR IQQLKSEWQS MTQESFRESS VKELRRLEDQ LAGLQQELAA LALKQSLVAD EVGLLPQQIQ AVRDDVESQF PAWISQFLAR GGGGRVGLLQ REEMQAQLRE LESKILTHVA EMQGKSAREA AASLGMTLQK EGVIGVTEEQ VHRIVKQALQ RYSEDRIGLA DYALESGGAS VISTRCSETY ETKTALLSLF GIPLWYHSQS PRAILQPDVH PGNCWAFQGP QGFAVVRLSA RIRPTAVTLE HVPKALSPNS TISSAPKDFA IFGFDEDLQQ EGTLLGKFTY DQDGEPIQTF HFQAPSMATY QVVELRILTN WGHPEYTCIY RFRVHGEPAH // ID H9FSK0_MACMU Unreviewed; 2610 AA. AC H9FSK0; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:AFE77609.1}; GN Name=HECTD1 {ECO:0000313|EMBL:AFE77609.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFE77609.1}; RN [1] {ECO:0000313|EMBL:AFE77609.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Caudate {ECO:0000313|EMBL:AFE77609.1}, and RC Thymus {ECO:0000313|EMBL:AFH29037.1}; RX PubMed=25319552; DOI=10.1186/1745-6150-9-20; RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., RA Pandey S., Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., RA Tharp G.K., Marcais G., Roberts M., Ferguson B., Fox H.S., RA Treangen T., Salzberg S.L., Yorke J.A., Norgren R.B.Jr.; RT "A new rhesus macaque assembly and annotation for next-generation RT sequencing analyses."; RL Biol. Direct 9:20-20(2014). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JU333854; AFE77609.1; -; mRNA. DR EMBL; JU472233; AFH29037.1; -; mRNA. DR RefSeq; NP_001248188.1; NM_001261259.2. DR UniGene; Mmu.13270; -. DR ProteinModelPortal; H9FSK0; -. DR STRING; 9544.ENSMMUP00000004174; -. DR GeneID; 717177; -. DR KEGG; mcc:717177; -. DR CTD; 25831; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR KO; K12231; -. DR ExpressionAtlas; H9FSK0; baseline. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:Ensembl. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 2: Evidence at transcript level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:AFE77609.1}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2610 AA; 289368 MW; C02FB51A2AABF98B CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAIST LQSSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EDGDEQPQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID H9FSK1_MACMU Unreviewed; 2608 AA. AC H9FSK1; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 22. DE SubName: Full=E3 ubiquitin-protein ligase HECTD1 {ECO:0000313|EMBL:AFE77610.1}; GN Name=HECTD1 {ECO:0000313|EMBL:AFE77610.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFE77610.1}; RN [1] {ECO:0000313|EMBL:AFE77610.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Caudate {ECO:0000313|EMBL:AFE77610.1}, and RC Thymus {ECO:0000313|EMBL:AFH31857.1}; RX PubMed=25319552; DOI=10.1186/1745-6150-9-20; RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., RA Pandey S., Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., RA Tharp G.K., Marcais G., Roberts M., Ferguson B., Fox H.S., RA Treangen T., Salzberg S.L., Yorke J.A., Norgren R.B.Jr.; RT "A new rhesus macaque assembly and annotation for next-generation RT sequencing analyses."; RL Biol. Direct 9:20-20(2014). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JU333855; AFE77610.1; -; mRNA. DR EMBL; JU475053; AFH31857.1; -; mRNA. DR UniGene; Mmu.13270; -. DR STRING; 9544.ENSMMUP00000004174; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 2: Evidence at transcript level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:AFE77610.1}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2608 AA; 289138 MW; EE1435C1DD3B5571 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEILRRPSL QRRAGSRSDV THHAVTSQLP QVPAGAGSRP IGEQEEEEYE TKGGRRRTWD DDYVLKRQFS ALVPAFDPRP GRTNVQQTTD LEIPPPGTPH SELLEEVECT PSPRLALTLK VTGLGTTREV ELPLTNFRST IFYYVQKLLQ LSCNGNVKSD KLRRIWEPTY TIMYREMKDS DKEKENGKMG CWSIEHVEQY LGTDELPKND LITYLQKNAD AAFLRHWKLT GTNKSIRKNR NCSQLIAAYK DFCEHGTKSG LNQGAISTLQ SSDILNLTKE QPQAKAGNGQ NSCGVEDVLQ LLRILYIVAS DPYSRISQED GDEQPQFTFP PDEFTSKKIT TKILQQIEEP LALASGALPD WCEQLTSKCP FLIPFETRQL YFTCTAFGAS RAIVWLQNRR EATVERTRTT SSVRRDDPGE FRVGRLKHER VKVPRGESLM EWAENVMQIH ADRKSVLEVE FLGEEGTGLG PTLEFYALVA AEFQRTDLGA WLCDDNFPDD ESRHVDLGGG LKPPGYYVQR SCGLFTAPFP QDSDELERIT KLFHFLGIFL AKCIQDNRLV DLPISKPFFK LMCMGDIKSN MSKLIYESRG DRDLHCTESQ SEASTEEGHD SLSVGSFEED SKSEFILDPP KPKPPAWFNG ILTWEDFELV NPHRARFLKE IKDLAIKRRQ ILSNKGLSED EKNTKLQELV LKNPSGSGPP LSIEDLGLNF QFCPSSRIYG FTAVDLKPSG EDEMITMDNA EEYVDLMFDF CMHTGIQKQM EAFRDGFNKV FPMEKLSSFS HEEVQMILCG NQSPSWAAED IINYTEPKLG YTRDSPGFLR FVRVLCGMSS DERKAFLQFT TGCSTLPPGG LANLHPRLTV VRKVDATDAS YPSVNTCVHY LKLPEYSSEE IMRERLLAAT MEKGFHLN // ID H9FVI4_MACMU Unreviewed; 1253 AA. AC H9FVI4; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Protein osteopotentia homolog isoform 1 {ECO:0000313|EMBL:AFE78643.1}; GN Name=C1orf9 {ECO:0000313|EMBL:AFE78643.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFE78643.1}; RN [1] {ECO:0000313|EMBL:AFE78643.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Caudate {ECO:0000313|EMBL:AFE78643.1}, and RC Thymus {ECO:0000313|EMBL:AFH32703.1}; RX PubMed=25319552; DOI=10.1186/1745-6150-9-20; RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., RA Pandey S., Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., RA Tharp G.K., Marcais G., Roberts M., Ferguson B., Fox H.S., RA Treangen T., Salzberg S.L., Yorke J.A., Norgren R.B.Jr.; RT "A new rhesus macaque assembly and annotation for next-generation RT sequencing analyses."; RL Biol. Direct 9:20-20(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JU334889; AFE78642.1; -; mRNA. DR EMBL; JU334890; AFE78643.1; -; mRNA. DR EMBL; JU334891; AFE78644.1; -; mRNA. DR EMBL; JU475899; AFH32703.1; -; mRNA. DR STRING; 9544.ENSMMUP00000018073; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1253 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003619496. FT COILED 935 955 FT COILED 985 1005 FT COILED 1191 1211 SQ SEQUENCE 1253 AA; 139219 MW; ADAC3A151E995CEC CRC64; MKKHRRALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFLKKD EREGPINAES LGKSGSNLPV SPEEHKLKDD SIVDVQNTES KKLSPPVVET LPTVDLHEES SNAVVDSETV ENISSSSTSE ITPISKLDEI EKSGTIPIAK PSETEQSETD CDVGEALDAS APIEQPSFVS PPDSLVGQHI ENVSSSHGKG KITKSEFESK VSASEQDGGD QKSALNASDN VKNESSDYTK PGDIDPTSVT SPKDPEDIPT FDEWKKKVME VEKEKSQSMH PSSNGGSHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LVSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYHSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKSISEN ATATAAPKMP ESTPVSTPVP SPAYVTTEVD TNDMELSTPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWFE SETQIFCSEL TTICCISSFS EYIYKWCSVR VALYWQRSRT ALSKGKDYLV SAQPPLLPAE SVDISVLQPL SGELENKNIE REAETVVLGD LSSSMHQDDL VNHTVDAVEL EPSHSQTLSQ SLLLDITPEI NPLPKIEVSE SVEYEAGHIT SQVIPQESSV EIDNEAEQKS ESFSSIEKPS VTYETNKVNE VVDNIIKEDV NSMQIFTKLS ETIVPPINTA TVPDNEDGEA KMNVADTAKQ TLISVVDSSS LPEVKEEEQS PEDALLRGLQ RTATDFYAEL QNSTDLGYAN GNLVHGSNQK ESVFMRLNNR IKALEVNMSL SGRYLEELSQ RYRKQMEEMQ KAFNKTIVKL QNTSRIAEEQ DQRQTEAIQL LQAQLTNMTQ LVSNLSATVA ELKREVSDRQ SYLVISLVLC VVLGLMLCMQ RCRNTSQFDG DYISKLPKSN QYPSPKRCFS SYDDMNLKRR TSFPLMRSKS LQLTGKEVDP NDLYIVEPLK FSPEKKKKRC KYKIEKIETI KPAEPLHPIA NGDIKGRKPF TNQRDFSNIG EVYHSSYKGP PSEGSSETSS QSEESYFCGI SACTSLCNGQ SQKTKTEKRA LKRRRSKVQD QGKLIKTLIQ TKSGSLPSLH DIIKGNKEIT VGTFGVTAVS GHI // ID H9G4U1_ANOCA Unreviewed; 227 AA. AC H9G4U1; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000001142}; GN Name=LOC100566393 {ECO:0000313|Ensembl:ENSACAP00000001142}; OS Anolis carolinensis (Green anole) (American chameleon). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; OC Toxicofera; Iguania; Iguanidae; Polychrotinae; Anolis. OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000001142, ECO:0000313|Proteomes:UP000001646}; RN [1] {ECO:0000313|Ensembl:ENSACAP00000001142} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000001142}; RG The Genome Sequencing Platform; RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J., RA Lander E.S., Lindblad-Toh K.; RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard)."; RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001646} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21881562; DOI=10.1038/nature10390; RA Alfoldi J., Di Palma F., Grabherr M., Williams C., Kong L., RA Mauceli E., Russell P., Lowe C.B., Glor R.E., Jaffe J.D., Ray D.A., RA Boissinot S., Shedlock A.M., Botka C., Castoe T.A., Colbourne J.K., RA Fujita M.K., Moreno R.G., Ten Hallers B.F., Haussler D., Heger A., RA Heiman D., Janes D.E., Johnson J., de Jong P.J., Koriabine M.Y., RA Lara M., Novick P.A., Organ C.L., Peach S.E., Poe S., Pollock D.D., RA de Queiroz K., Sanger T., Searle S., Smith J.D., Smith Z., RA Swofford R., Turner-Maier J., Wade J., Young S., Zadissa A., RA Edwards S.V., Glenn T.C., Schneider C.J., Losos J.B., Lander E.S., RA Breen M., Ponting C.P., Lindblad-Toh K.; RT "The genome of the green anole lizard and a comparative analysis with RT birds and mammals."; RL Nature 477:587-591(2011). RN [3] {ECO:0000313|Ensembl:ENSACAP00000001142} RP IDENTIFICATION. RG Ensembl; RL Submitted (MAR-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSACAP00000001142}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 28377.ENSACAP00000001142; -. DR Ensembl; ENSACAT00000001174; ENSACAP00000001142; ENSACAG00000001215. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H9G4U1; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001646; Unassembled WGS sequence. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:InterPro. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001646}; KW Reference proteome {ECO:0000313|Proteomes:UP000001646}. SQ SEQUENCE 227 AA; 25845 MW; 9C935F6DCD3AFAEF CRC64; MKSIQKGIPN SIKQILKEND IPGENKDQVL EMINQAFRKT YEDHVQMPDW AQKTIGATID HSRTSKSYEP ENAKSCWYKY FFISTAKPPE TILQPDVYPG NCWAFHGSEG QVVIKLPERI FPTAVTVQHI PRAVSPVKGV TSALKDFSVY GIDDEINEET LLGTFMYDIE KETIQTFQLQ KEAAKQFLCM KFKVQSNWGN AEFTCIYRVR VHGNMSGNSV PSEKGQK // ID H9GAB9_ANOCA Unreviewed; 338 AA. AC H9GAB9; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000005431}; DE Flags: Fragment; GN Name=LOC100552827 {ECO:0000313|Ensembl:ENSACAP00000005431}; OS Anolis carolinensis (Green anole) (American chameleon). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; OC Toxicofera; Iguania; Iguanidae; Polychrotinae; Anolis. OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000005431, ECO:0000313|Proteomes:UP000001646}; RN [1] {ECO:0000313|Ensembl:ENSACAP00000005431} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RG The Genome Sequencing Platform; RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J., RA Lander E.S., Lindblad-Toh K.; RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard)."; RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001646} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21881562; DOI=10.1038/nature10390; RA Alfoldi J., Di Palma F., Grabherr M., Williams C., Kong L., RA Mauceli E., Russell P., Lowe C.B., Glor R.E., Jaffe J.D., Ray D.A., RA Boissinot S., Shedlock A.M., Botka C., Castoe T.A., Colbourne J.K., RA Fujita M.K., Moreno R.G., Ten Hallers B.F., Haussler D., Heger A., RA Heiman D., Janes D.E., Johnson J., de Jong P.J., Koriabine M.Y., RA Lara M., Novick P.A., Organ C.L., Peach S.E., Poe S., Pollock D.D., RA de Queiroz K., Sanger T., Searle S., Smith J.D., Smith Z., RA Swofford R., Turner-Maier J., Wade J., Young S., Zadissa A., RA Edwards S.V., Glenn T.C., Schneider C.J., Losos J.B., Lander E.S., RA Breen M., Ponting C.P., Lindblad-Toh K.; RT "The genome of the green anole lizard and a comparative analysis with RT birds and mammals."; RL Nature 477:587-591(2011). RN [3] {ECO:0000313|Ensembl:ENSACAP00000005431} RP IDENTIFICATION. RG Ensembl; RL Submitted (MAR-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSACAP00000005431}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 28377.ENSACAP00000005431; -. DR Ensembl; ENSACAT00000005549; ENSACAP00000005431; ENSACAG00000005488. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H9GAB9; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000001646; Unassembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001646}; KW Reference proteome {ECO:0000313|Proteomes:UP000001646}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSACAP00000005431}. SQ SEQUENCE 338 AA; 37831 MW; FC49E33E2D187B3B CRC64; QGIQKELELT KTKTASSDMD EQNQHFSSKI KHLELELSHV KSELLNSQGL KTSCASVDML QEKVDAQVME SVKFILFGHQ KGDLPESLLQ WLTSKFVSKS DLRVLLQDLE SRILRNITLH MSVTNTKSAS EVVTSVVNEA GIAGITEAQA RLIVNNALKL FSQDKTGMVD FALESGGGSV LSTRCSETYE TKTALISLFG IPLWYFSQSP RVVIQPDMYP GNCWAFKGSQ GYLVVRLSMV IHPTAFTLEH IPKTLSPTGN ITSAPKDFSV YGLEDEYQEG VLLGQYTYDQ DGEPLQMFQV TEATEKAFQI VELRIFSNWG HSEYTCLYRF RVHGRPAE // ID H9GJ94_ANOCA Unreviewed; 975 AA. AC H9GJ94; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000012557}; GN Name=suco {ECO:0000313|Ensembl:ENSACAP00000012557}; OS Anolis carolinensis (Green anole) (American chameleon). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; OC Toxicofera; Iguania; Iguanidae; Polychrotinae; Anolis. OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000012557, ECO:0000313|Proteomes:UP000001646}; RN [1] {ECO:0000313|Ensembl:ENSACAP00000012557} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000012557}; RG The Genome Sequencing Platform; RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J., RA Lander E.S., Lindblad-Toh K.; RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard)."; RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Proteomes:UP000001646} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=21881562; DOI=10.1038/nature10390; RA Alfoldi J., Di Palma F., Grabherr M., Williams C., Kong L., RA Mauceli E., Russell P., Lowe C.B., Glor R.E., Jaffe J.D., Ray D.A., RA Boissinot S., Shedlock A.M., Botka C., Castoe T.A., Colbourne J.K., RA Fujita M.K., Moreno R.G., Ten Hallers B.F., Haussler D., Heger A., RA Heiman D., Janes D.E., Johnson J., de Jong P.J., Koriabine M.Y., RA Lara M., Novick P.A., Organ C.L., Peach S.E., Poe S., Pollock D.D., RA de Queiroz K., Sanger T., Searle S., Smith J.D., Smith Z., RA Swofford R., Turner-Maier J., Wade J., Young S., Zadissa A., RA Edwards S.V., Glenn T.C., Schneider C.J., Losos J.B., Lander E.S., RA Breen M., Ponting C.P., Lindblad-Toh K.; RT "The genome of the green anole lizard and a comparative analysis with RT birds and mammals."; RL Nature 477:587-591(2011). RN [3] {ECO:0000313|Ensembl:ENSACAP00000012557} RP IDENTIFICATION. RG Ensembl; RL Submitted (MAR-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSACAP00000012557}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 28377.ENSACAP00000012557; -. DR Ensembl; ENSACAT00000012813; ENSACAP00000012557; ENSACAG00000012783. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; H9GJ94; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000001646; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001646}; KW Reference proteome {ECO:0000313|Proteomes:UP000001646}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 975 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003620043. SQ SEQUENCE 975 AA; 108903 MW; 42B452EBCBBF3F2B CRC64; KNKNISLSLL LCAFSCSPVF LNFPIWHVTC KEGPSSTILV SQNENSPLAN MDENIQEKEE TDRSVGTLSL EPINTIINNY PEEYPDDYTK ETECIYLAKI WRDGSRSIPK VMDMGKLETQ EEWSSVDLSE TSFNIAGISE STFSLPTSES SSVSQSSVIE NSSADIPVIT SETEQSELDC NLGGILETNS QSEASPLVTS PDSLVGQHIE NISSHRKGKT TKSEFDPQVA APEQKTDPKS ALNTSGNLRG EVKTEQRKMG EIDPTSVIAP KDPGDIPTFD EWKKQVMEVE KEKSQSMHPS SNGGQHPTKK VQKNRNNYAS VECGAKILAA NPEAKSTSAI LMENMDLYML NPCSTKIWFV VELCEPIQVR QLDIANHELF SSTPKDFLVS ISDRYPTNKW IKLGTFHARD ERNVQSFPLD EQMYAKYVKM FIKYLKVELV SHFGSEHFCP LSLIRVFGTS MVEEYEEIAD SQYQSERQEL FDEDYDYPLD YPSVGEEKSS KNLLGSATNA ILNMVNIAAN ILGAKTGEDS VEQGNKSVPE NTTATSMMTP ELPQPTVVPS LEPDTSEIPQ TENELLVLDR TRESPIVQLV HEDEEETSQS TVTLLPSDEQ EEEIPWFESE TQMYCYDLVT VCCISSFSEY VYKWCSAVAM FHRRHSKIDS TWGKYDYAAT WQDQLVPTKS LDVLIHEYTP EKLDTLNAEP SEIVTDVSSN LLDKGIINQT EGTFELEPSH PQTVSQSILL DVATGVKSVS TTEVSSEPGK HETASESSEI PFPEEIPAEE NGVVAPVTEK PSATTTVTEF QEMSTEEKTS AEIISKLTET VPRPECTVAT ESYNVETKDS SSEIEKQEVP LVESSSLELR EDEQTVEETF LSIPVSGLPR TATDFYAELQ NSTDLAYGNG NLIHGSNQKE SVFMRLNNRI KALEVNMSLS SRYLEELSQR YASVIFRFTL KVDEKIVCVR LNVCGMYSWC SKFAG // ID H9IUH4_BOMMO Unreviewed; 235 AA. AC H9IUH4; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:BGIBMGA000904-TA}; GN Name=LOC101738307 {ECO:0000313|EnsemblMetazoa:BGIBMGA000904-TA}; OS Bombyx mori (Silk moth). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. OX NCBI_TaxID=7091 {ECO:0000313|EnsemblMetazoa:BGIBMGA000904-TA, ECO:0000313|Proteomes:UP000005204}; RN [1] {ECO:0000313|EnsemblMetazoa:BGIBMGA000904-TA} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=p50T {ECO:0000313|EnsemblMetazoa:BGIBMGA000904-TA}; RX PubMed=19121390; DOI=10.1016/j.ibmb.2008.11.004; RG International Silkworm Genome Consortium; RT "The genome of a lepidopteran model insect, the silkworm Bombyx RT mori."; RL Insect Biochem. Mol. Biol. 38:1036-1045(2008). RN [2] {ECO:0000313|EnsemblMetazoa:BGIBMGA000904-TA} RP IDENTIFICATION. RC STRAIN=p50T (Dazao) {ECO:0000313|EnsemblMetazoa:BGIBMGA000904-TA}; RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BABH01001039; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR RefSeq; XP_012547158.1; XM_012691704.1. DR STRING; 7091.BGIBMGA000904-TA; -. DR EnsemblMetazoa; BGIBMGA000904-RA; BGIBMGA000904-TA; BGIBMGA000904. DR GeneID; 101738307; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; H9IUH4; -. DR OMA; FPLWYFS; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000005204; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005204}; KW Reference proteome {ECO:0000313|Proteomes:UP000005204}. SQ SEQUENCE 235 AA; 26312 MW; 325141F0649D61F9 CRC64; MERMSAVIPA VAAAAGRAKD ALEPSLRKNS RQALDNYDYD RQVADYALES AGGRILDTGD TIEHLVYESP ISWGLHLITS WMCRECQGAS AMIRPGTLPG ECWAFKGSKG QAMIRLLGTV KVMGVSVEHI PAHISPTREI SSAPRLFQVE GLEYRSDPYP HDFGTVEYDK EGKPIQYFEV LYPSTKGYSL IRIRVLTNWG HPVYTCVYRV RVHGELSGRN QNFGADDTEM RIENE // ID H9J7Y3_BOMMO Unreviewed; 395 AA. AC H9J7Y3; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:BGIBMGA005625-TA}; GN Name=LOC101741471 {ECO:0000313|EnsemblMetazoa:BGIBMGA005625-TA}; OS Bombyx mori (Silk moth). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. OX NCBI_TaxID=7091 {ECO:0000313|EnsemblMetazoa:BGIBMGA005625-TA, ECO:0000313|Proteomes:UP000005204}; RN [1] {ECO:0000313|EnsemblMetazoa:BGIBMGA005625-TA} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=p50T {ECO:0000313|EnsemblMetazoa:BGIBMGA005625-TA}; RX PubMed=19121390; DOI=10.1016/j.ibmb.2008.11.004; RG International Silkworm Genome Consortium; RT "The genome of a lepidopteran model insect, the silkworm Bombyx RT mori."; RL Insect Biochem. Mol. Biol. 38:1036-1045(2008). RN [2] {ECO:0000313|EnsemblMetazoa:BGIBMGA005625-TA} RP IDENTIFICATION. RC STRAIN=p50T (Dazao) {ECO:0000313|EnsemblMetazoa:BGIBMGA005625-TA}; RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BABH01020158; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7091.BGIBMGA005625-TA; -. DR EnsemblMetazoa; BGIBMGA005625-RA; BGIBMGA005625-TA; BGIBMGA005625. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; H9J7Y3; -. DR OMA; WVHTSPR; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000005204; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005204}; KW Reference proteome {ECO:0000313|Proteomes:UP000005204}. FT COILED 151 171 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 395 AA; 44889 MW; 3911FFB8A249F9BF CRC64; MSLFGIKSYN YSDFVIQEKE ILDDAYSSPR ISQDSNLADR MRALEAWALS VDSRLKLFDQ KLSKLDNIES QIEQYSLMHL QQNLMQILTR DNTDALALKL KAYFDQNYVT PDQMREASRV LNERLANIGQ AELDEDRIKE MVQEYLAMFE RRQLEVIVQK VEEHVKDVEV QRSGSGVDME AVRTLVAGML EVYDADKTGL VDYALESAGG QVLSTRCTEL YQIKSKQYWV LGVPVLWVHT SPRNALTAGA APADCWAFQG FPGYLVIKTY AIIEVTGFSL EHMSKLLAID GKIESAPKNF SVYGLHGELD PEPHLFGDYM YDADGKSIQY FPVKHPKTTN IDGIEYPVAY DIVELRIESN HGNPTYTCVY RFRVHGNPLA DVRRAAEDSM HDSQL // ID H9J944_BOMMO Unreviewed; 820 AA. AC H9J944; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 16. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:BGIBMGA006036-TA}; GN Name=LOC101735970 {ECO:0000313|EnsemblMetazoa:BGIBMGA006036-TA}; OS Bombyx mori (Silk moth). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. OX NCBI_TaxID=7091 {ECO:0000313|EnsemblMetazoa:BGIBMGA006036-TA, ECO:0000313|Proteomes:UP000005204}; RN [1] {ECO:0000313|EnsemblMetazoa:BGIBMGA006036-TA} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=p50T {ECO:0000313|EnsemblMetazoa:BGIBMGA006036-TA}; RX PubMed=19121390; DOI=10.1016/j.ibmb.2008.11.004; RG International Silkworm Genome Consortium; RT "The genome of a lepidopteran model insect, the silkworm Bombyx RT mori."; RL Insect Biochem. Mol. Biol. 38:1036-1045(2008). RN [2] {ECO:0000313|EnsemblMetazoa:BGIBMGA006036-TA} RP IDENTIFICATION. RC STRAIN=p50T (Dazao) {ECO:0000313|EnsemblMetazoa:BGIBMGA006036-TA}; RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BABH01004457; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7091.BGIBMGA006036-TA; -. DR EnsemblMetazoa; BGIBMGA006036-RA; BGIBMGA006036-TA; BGIBMGA006036. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; H9J944; -. DR OMA; DAVMSIM; -. DR OrthoDB; EOG7MPRDC; -. DR Proteomes; UP000005204; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000005204}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000005204}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 571 589 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 498 561 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 820 AA; 93119 MW; 068EA26BDBA8FDC8 CRC64; MLNTCNSRIW FVVELCEAVP AQKIEIANFE LFSSTPKDIA VYFSDRFPTR DWASVGQFTA QDMRDVQSFD LYPHLFGKFI KVEMLSHHGS EHYCPISLFK VYGTSEFEVL EKENSQHPAH IDEDEDDEII DVPDVPAAET EPSKNLFGSA RDAVMSIMKK AAQALVKTEV PKNVSSEHND TLADDAYRRC CSPSHIIVCD NCSESLYNDV YELISCNSDK LVSLMRQGFL RDTLKCTGIC QQFGLDFKST KTIEFSDERV AYMNALFPQK YLAALCNILA IKEKKVVLNT SFENEHNVTS DNSTEESPQV VNSGTNGEQE IKLAPDNSNG SDDKINETTQ SDEIPIDTLN DSSTLPVELL PEEVDNMEDK KEDVIVAEET SGDEPQEFIA PNIDKSIEIA TEQENKDGLL KTKIENGKEI TEKRDGNGEE SNDQLMMEND NFISDIDQIA ADPAPPGAPN TQNQNQQQTT LQKESVFLRL SNRVKTLERN MSLSGQYLEE LSRRYKRQVE EMQKTFEKTV QQMTEEKKKT NEREQKYLEQ MSNLQEQLAQ MTSAMHVLME ERDSWFGNIN FFRFIIFQAI IVALVIYYVS KRRRIEPILV PVPRKTRKKQ DKLRRKSVEG VSGHATPSTK KRRPSEEALQ IARQAIEDTK GSNEGEWQVA RKNRRRKTCI VLNAETAAKS WTRQDSIGKL QENTITLDDD EYVAPVSEPK QFNADVEPPK PDYTKTNGFF NNLKTKTMKT RRLSSPAFLR TFSRQSTRST PSPVVRSEEP IFNGNAKKKA ASESPTGSLW SESTELSPQD SESSGSKKKK SLKNILKKVF // ID H9JM75_BOMMO Unreviewed; 2360 AA. AC H9JM75; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 16-MAY-2012, sequence version 1. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:BGIBMGA010628-TA}; OS Bombyx mori (Silk moth). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; OC Bombycoidea; Bombycidae; Bombycinae; Bombyx. OX NCBI_TaxID=7091 {ECO:0000313|EnsemblMetazoa:BGIBMGA010628-TA, ECO:0000313|Proteomes:UP000005204}; RN [1] {ECO:0000313|EnsemblMetazoa:BGIBMGA010628-TA} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=p50T {ECO:0000313|EnsemblMetazoa:BGIBMGA010628-TA}; RX PubMed=19121390; DOI=10.1016/j.ibmb.2008.11.004; RG International Silkworm Genome Consortium; RT "The genome of a lepidopteran model insect, the silkworm Bombyx RT mori."; RL Insect Biochem. Mol. Biol. 38:1036-1045(2008). RN [2] {ECO:0000313|EnsemblMetazoa:BGIBMGA010628-TA} RP IDENTIFICATION. RC STRAIN=p50T (Dazao) {ECO:0000313|EnsemblMetazoa:BGIBMGA010628-TA}; RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BABH01033403; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BABH01033404; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BABH01033405; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BABH01033406; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BABH01033407; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7091.BGIBMGA010628-TA; -. DR EnsemblMetazoa; BGIBMGA010628-RA; BGIBMGA010628-TA; BGIBMGA010628. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; H9JM75; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR Proteomes; UP000005204; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 1. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000005204}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000005204}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2360 AA; 258926 MW; 3ED60F87BCB12C50 CRC64; MAEVDPETLL EWLLTGQGDE RDMQLIALEQ LCMLLLMSDN VDSCPPRTFL PALCKIFLDE CAPDNVLEVT ARAITYYLDV SAECTRRIVA IEGAVKAICN RLLTVDPNNR TSKDLAEQCI KVLELVCTRE AGAVWEGGGL PAVLHFITNH GTSVHKDTLH SAMAVVSRVC GKMEPGDARV GDAVSSLSML LRHSDARVAD AALRCFASLA DRFARAHADP APLAEHGLIE ELVRRLGSTE SGDDKCLTSA VSTTVSLLST LCRGSAQITH DLVRLELCSA IEKAVQADER WCLECMRLVD LLLVLLCEGR HAIATGANRN VGSSSGARST EASRGGERSH RQLIDCIRGK DTDALLSAVS SGAVDVNFTD DVGQTLLNWA AAFGTKEMVE FLCEKGADVN RGQRSSSLHY AACFGRPAIA KVLLRYGANA DLRDEDGKTP LDKARERHDQ GHREVAAILQ CPGEWLVVGN SESPSPSTDD DFPETGDKEM AEFYLERLVP LFCARYVGAG GGGVRRACLS LVRKMVHYAP PRLLRDAHPA ALLTNLVAAV LDNQDEYPVR RPHPRYLRSR YIRPPADDDD GHLTVLGISE ELMAKAADVY LEQFARLGVF SKVEALAITP IQYDSDGSST PELSGEDASC LSSGVAYSWG EWSLCRGRDA LYAWSDAAAL ELSTGSNGWF RFLLDGKLAT MYSSGSPEHQ TDNTGESTLL LFSSKLDPFA LLVAVVPFKF VLKLKHLVCN IESLGNEKSV TKKGQTNRIC LVRELENRGE FIEKLQKAKA SVRNFIPQPI LSKPGPNKLV LGNWSLTCEN GWADRRSLLT NNSTPTKPRS RISAKTEALK AQVCERARAL YSRHLASAVT RQPRPPVARL RALLARMQRI ATQPSKDWQR ELNESLDQLT ELLCGDEHLS AYELQSSGLA PALLQVLSPQ VNDGPGHLSE RSRVVSAWMC RASGAAGAAL AERLVAVLES VERLPVLAPH DAPPAPASAL HHLTKRIRLR VERESDEISE DSNANNNSGR NLKVEALTTV RQLERFLAKS VARQWYDMER STFLFVQKIK TEAPLSFTYD HDFDENGVLY FIGTNAGTCE WVNPGAHGLV YVWSSDGKQL PYGRPEDVLS RSPEPLNVHT NDDRRAFIAL DLGVHIVPTA YTLRHARGYG QSALRNWLFQ MSVDGLTWCT LVAHTDEQAL QEPGSTATWR LRTDSSYRYL RIQQNGKNAS GQKYYLSLSG LEIYGKVTGV VETAPRQNGA PHQTTGTTAS SACNNATAAN NTNNAGMAGG GGGARARRWS RGRGVCAGAR VSRGPDWKWR DQDGPHPALG TLTSELHNGW VDVRWDHGVR NSYRMGAEGK FDLKVVSGGT VIESSKASRK SHSTPSLPDA TSVDQVSVAS TEQASSADNI SSEEMVSNMS RPRTHATDLS AINNSTHHIN SDLATIVESL TLGAESNNCI TELGNTSFTN MEMGPTSITD ITKPYPAKEP LPESSAHECD ENEAGESQYG DNKKESQSGS GTMSASEPDL TQQGTGRLLE SLGVGRGNSA GRGSNVPRSN RNNHSSGLLP SLVRLALSSN FPGGLLSAAQ SYPSLSSNAQ NALTLSLTST SSESEQVSLE DFLESCRAPA LLTELEDDED GDEALDSDKE NEPTYQEVSR NLLSLMEEEA LEALRGSSGS GQNNRSRRPW DDDFVLKRQF SALIPAFDPR PGRTNLNQTV DLEIPLNEES EEEESWEEVP ENVGGGATSE TGRAPALRLV LSAGGASLPL ARGSWTLYRA VLLLHARLPH ADLHRDTTYT LTYKEVEGSE GAFASSDTED DEPNDPEGGI VGAEGSEGGM ATSCVRVLRR LRAAAPELPA EPFLSTKLTN KLHLQLQEPL ALAAAATPHW CQQLIDWCPF LFPLETRQMF FACTAFGTSR TIVWLQAQRD RALDRQRATN TVSPRRAELE ATEFRMGRLR HERVRIPRDP DMLRSAIQVM RVHASRKSVL EVEFAGEEGT GLGPTLEFYA LVAAELQRAD LALWLHDAPL HVDDDAPLHL MQPTEKPPGY YVSRPGGLFP APLPQESPIC DKVCKYFWFL GVFLAKVLQD GRLVDLPLSE PFLRIMCGEE LTKENLQEID PIRHRFLEKM LEAAEGYDKI MRDESLDENE KQKRVSELNV DGAAFEELSL TMTHIAPNVD PAVAVQPLCD GGEHIEVGAH NARLYAEWSA RWMVSAGVRR QVAWFKRGFA RVFPPRRLRA FSPSELRLLL CGERGPVWTR EHLLQYTEPK LGYTRDSPGF LRLVDVLVEM SICERKAFLQ FATGCSSLPP GGLANLHPRL TVVRKVDAGD GSYPSVNTCV HYLKLPEYSC KEVLRERLLA ATNERGFHLN // ID H9KYU0_CHICK Unreviewed; 243 AA. AC H9KYU0; DT 16-MAY-2012, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGALP00000000599}; DE Flags: Fragment; OS Gallus gallus (Chicken). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda; OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; OC Phasianidae; Phasianinae; Gallus. OX NCBI_TaxID=9031 {ECO:0000313|Ensembl:ENSGALP00000000599, ECO:0000313|Proteomes:UP000000539}; RN [1] {ECO:0000313|Proteomes:UP000000539} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Red jungle fowl {ECO:0000313|Proteomes:UP000000539}; RX PubMed=15592404; DOI=10.1038/nature03154; RG International Chicken Genome Sequencing Consortium; RA Hillier L.W., Miller W., Birney E., Warren W., Hardison R.C., RA Ponting C.P., Bork P., Burt D.W., Groenen M.A.M., Delany M.E., RA Dodgson J.B., Chinwalla A.T., Cliften P.F., Clifton S.W., RA Delehaunty K.D., Fronick C., Fulton R.S., Graves T.A., Kremitzki C., RA Layman D., Magrini V., McPherson J.D., Miner T.L., Minx P., Nash W.E., RA Nhan M.N., Nelson J.O., Oddy L.G., Pohl C.S., Randall-Maher J., RA Smith S.M., Wallis J.W., Yang S.-P., Romanov M.N., Rondelli C.M., RA Paton B., Smith J., Morrice D., Daniels L., Tempest H.G., RA Robertson L., Masabanda J.S., Griffin D.K., Vignal A., Fillon V., RA Jacobbson L., Kerje S., Andersson L., Crooijmans R.P., Aerts J., RA van der Poel J.J., Ellegren H., Caldwell R.B., Hubbard S.J., RA Grafham D.V., Kierzek A.M., McLaren S.R., Overton I.M., Arakawa H., RA Beattie K.J., Bezzubov Y., Boardman P.E., Bonfield J.K., RA Croning M.D.R., Davies R.M., Francis M.D., Humphray S.J., Scott C.E., RA Taylor R.G., Tickle C., Brown W.R.A., Rogers J., Buerstedde J.-M., RA Wilson S.A., Stubbs L., Ovcharenko I., Gordon L., Lucas S., RA Miller M.M., Inoko H., Shiina T., Kaufman J., Salomonsen J., RA Skjoedt K., Wong G.K.-S., Wang J., Liu B., Wang J., Yu J., Yang H., RA Nefedov M., Koriabine M., Dejong P.J., Goodstadt L., Webber C., RA Dickens N.J., Letunic I., Suyama M., Torrents D., von Mering C., RA Zdobnov E.M., Makova K., Nekrutenko A., Elnitski L., Eswara P., RA King D.C., Yang S.-P., Tyekucheva S., Radakrishnan A., Harris R.S., RA Chiaromonte F., Taylor J., He J., Rijnkels M., Griffiths-Jones S., RA Ureta-Vidal A., Hoffman M.M., Severin J., Searle S.M.J., Law A.S., RA Speed D., Waddington D., Cheng Z., Tuzun E., Eichler E., Bao Z., RA Flicek P., Shteynberg D.D., Brent M.R., Bye J.M., Huckle E.J., RA Chatterji S., Dewey C., Pachter L., Kouranov A., Mourelatos Z., RA Hatzigeorgiou A.G., Paterson A.H., Ivarie R., Brandstrom M., RA Axelsson E., Backstrom N., Berlin S., Webster M.T., Pourquie O., RA Reymond A., Ucla C., Antonarakis S.E., Long M., Emerson J.J., RA Betran E., Dupanloup I., Kaessmann H., Hinrichs A.S., Bejerano G., RA Furey T.S., Harte R.A., Raney B., Siepel A., Kent W.J., Haussler D., RA Eyras E., Castelo R., Abril J.F., Castellano S., Camara F., Parra G., RA Guigo R., Bourque G., Tesler G., Pevzner P.A., Smit A., Fulton L.A., RA Mardis E.R., Wilson R.K.; RT "Sequence and comparative analysis of the chicken genome provide RT unique perspectives on vertebrate evolution."; RL Nature 432:695-716(2004). RN [2] {ECO:0000313|Ensembl:ENSGALP00000000599} RP IDENTIFICATION. RC STRAIN=Red jungle fowl {ECO:0000313|Ensembl:ENSGALP00000000599}; RG Ensembl; RL Submitted (APR-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSGALP00000000599}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AADN03008418; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AADN03008453; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AADN03008470; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AADN03008473; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AADN03008474; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AADN03008475; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AADN03008478; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AADN03008490; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 9031.ENSGALP00000000599; -. DR PaxDb; H9KYU0; -. DR Ensembl; ENSGALT00000000600; ENSGALP00000000599; ENSGALG00000000443. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR InParanoid; H9KYU0; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR Proteomes; UP000000539; Chromosome 25. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000539}; KW Reference proteome {ECO:0000313|Proteomes:UP000000539}. FT COILED 13 33 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSGALP00000000599}. SQ SEQUENCE 243 AA; 27718 MW; 5A48EB72518AB73F CRC64; THLLPHHYRC HHRHALSQEI ERLQQAASEL RAQRDCYQTP EIERLQQAAS ELRAQLDCYQ TPDWALQSFG ATIDTRRTSP IYELRSWFSR HCFWCSVNPP DTILQPGVSL GECWPMEGQQ GQVVIRLRAK IRPSCVTLEH ITPEMTPSGT ASSAPRDVAV FGLDADSEEE VPLVSFTFDV GEGPTQTFLL KNNHSRAFRY IKVLVKSNWG HPRYTCLYRV QVHGKVTPDW ALQSLVRTQA GKP // ID H9Z7D3_MACMU Unreviewed; 720 AA. AC H9Z7D3; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=SUN domain-containing protein 2 isoform b {ECO:0000313|EMBL:AFH31849.1}; GN Name=SUN2 {ECO:0000313|EMBL:AFH31849.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFH31849.1}; RN [1] {ECO:0000313|EMBL:AFH31849.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Thymus {ECO:0000313|EMBL:AFH31849.1}; RX PubMed=25319552; DOI=10.1186/1745-6150-9-20; RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., RA Pandey S., Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., RA Tharp G.K., Marcais G., Roberts M., Ferguson B., Fox H.S., RA Treangen T., Salzberg S.L., Yorke J.A., Norgren R.B.Jr.; RT "A new rhesus macaque assembly and annotation for next-generation RT sequencing analyses."; RL Biol. Direct 9:20-20(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JU475045; AFH31849.1; -; mRNA. DR EMBL; JU475046; AFH31850.1; -; mRNA. DR STRING; 9544.ENSMMUP00000009544; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 217 238 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 276 296 {ECO:0000256|SAM:Coils}. FT COILED 355 375 {ECO:0000256|SAM:Coils}. FT COILED 377 404 FT COILED 407 434 {ECO:0000256|SAM:Coils}. FT COILED 481 501 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 720 AA; 80597 MW; 3649F6B4E9087FC6 CRC64; MSRRSQRLTR YSQGDDDGSS SSGGSSVAGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDAHTSYY SESLVRESYI GTAFPPRSAL EELHGDADWG EDLRVRRRRG TGGSESSRAS GLVGRKAAED FLGSSSGYSS EDDYMGYSDA DQQSSGSRLW NAVSRAGSLL WMVATSPGRL FRLLYWWAGT TWYRLTTAAS LLDVFVLTRR FSSLKTFLWF LLPLLLLTCL TYGAWYFYPY GLQTFHPALV SWWAAKDSRR QDEGWESRDS SHFQAEQRVM SRVHSLERRL EALAAEFSSN WQKEAMRLER LELRQGAPGQ GGGGGLSHED TLALLEGLVS RHEAALKEDF RREAAARIQE ELSALRAEHQ QDSEDLFKKI VRASQESEAR IQQLKSEWQS MTQESFRESS VKELRRLEDQ LAGLQQELVA LALKQSLVAD EVGLLPQQIQ AVRDDVESQF PAWISQFLAR GGGGRVGLLQ REEMQAQLRE LESKILTHVA EMQGKSAREA AASLGMTLQK EGVIGVTEEQ VHRIVKQALQ RYSEDRIGLA DYALESGGAS VISTRCSETY ETKTALLSLF GIPLWYHSQS PRVILQPDVH PGNCWAFQGP QGFAVVRLSA RIRPTAVTLE HVPKALSPNS TISSAPKDFA IFGFDEDLQQ EGTLLGKFTY DQDGEPIQTF HFQAPSMATY QVVELRILTN WGHPEYTCIY RFRVHGEPAH // ID HECD1_HUMAN Reviewed; 2610 AA. AC Q9ULT8; D3DS86; Q6P445; Q86VJ1; Q96F34; Q9UFZ7; DT 12-FEB-2003, integrated into UniProtKB/Swiss-Prot. DT 30-NOV-2010, sequence version 3. DT 11-NOV-2015, entry version 143. DE RecName: Full=E3 ubiquitin-protein ligase HECTD1; DE EC=6.3.2.-; DE AltName: Full=E3 ligase for inhibin receptor; DE AltName: Full=EULIR; DE AltName: Full=HECT domain-containing protein 1; GN Name=HECTD1; Synonyms=KIAA1131; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA]. RA Zhang H.; RT "EULIR is an E3 ubiquitin ligase for inhibin receptor."; RL Submitted (MAR-2003) to the EMBL/GenBank/DDBJ databases. RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12508121; DOI=10.1038/nature01348; RA Heilig R., Eckenberg R., Petit J.-L., Fonknechten N., Da Silva C., RA Cattolico L., Levy M., Barbe V., De Berardinis V., Ureta-Vidal A., RA Pelletier E., Vico V., Anthouard V., Rowen L., Madan A., Qin S., RA Sun H., Du H., Pepin K., Artiguenave F., Robert C., Cruaud C., RA Bruels T., Jaillon O., Friedlander L., Samson G., Brottier P., RA Cure S., Segurens B., Aniere F., Samain S., Crespeau H., Abbasi N., RA Aiach N., Boscus D., Dickhoff R., Dors M., Dubois I., Friedman C., RA Gouyvenoux M., James R., Madan A., Mairey-Estrada B., Mangenot S., RA Martins N., Menard M., Oztas S., Ratcliffe A., Shaffer T., Trask B., RA Vacherie B., Bellemere C., Belser C., Besnard-Gonnet M., RA Bartol-Mavel D., Boutard M., Briez-Silla S., Combette S., RA Dufosse-Laurent V., Ferron C., Lechaplais C., Louesse C., Muselet D., RA Magdelenat G., Pateau E., Petit E., Sirvain-Trukniewicz P., Trybou A., RA Vega-Czarny N., Bataille E., Bluet E., Bordelais I., Dubois M., RA Dumont C., Guerin T., Haffray S., Hammadi R., Muanga J., Pellouin V., RA Robert D., Wunderle E., Gauguet G., Roy A., Sainte-Marthe L., RA Verdier J., Verdier-Discala C., Hillier L.W., Fulton L., McPherson J., RA Matsuda F., Wilson R., Scarpelli C., Gyapay G., Wincker P., Saurin W., RA Quetier F., Waterston R., Hood L., Weissenbach J.; RT "The DNA sequence and analysis of human chromosome 14."; RL Nature 421:601-607(2003). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND VARIANT PRO-2027. RA Mural R.J., Istrail S., Sutton G., Florea L., Halpern A.L., RA Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., RA Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., RA Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R., RA Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., RA Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., RA Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., RA Venter J.C.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 443-2610, AND VARIANT RP PRO-2027. RC TISSUE=Brain; RX PubMed=10574461; DOI=10.1093/dnares/6.5.329; RA Hirosawa M., Nagase T., Ishikawa K., Kikuno R., Nomura N., Ohara O.; RT "Characterization of cDNA clones selected by the GeneMark analysis RT from size-fractionated cDNA libraries from human brain."; RL DNA Res. 6:329-336(1999). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1652-2610, AND VARIANT RP PRO-2027. RC TISSUE=Testis; RX PubMed=17974005; DOI=10.1186/1471-2164-8-399; RA Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., RA Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., RA Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., RA Ottenwaelder B., Poustka A., Wiemann S., Schupp I.; RT "The full-ORF clone resource of the German cDNA consortium."; RL BMC Genomics 8:399-399(2007). RN [6] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 2091-2610. RC TISSUE=Muscle, and Urinary bladder; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [7] RP INTERACTION WITH IGSF1. RX PubMed=12421765; DOI=10.1101/gr.406902; RA Nakayama M., Kikuno R., Ohara O.; RT "Protein-protein interactions between large proteins: two-hybrid RT screening using a functionally classified library composed of long RT cDNAs."; RL Genome Res. 12:1773-1784(2002). RN [8] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1488, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Cervix carcinoma; RX PubMed=18669648; DOI=10.1073/pnas.0805139105; RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., RA Elledge S.J., Gygi S.P.; RT "A quantitative atlas of mitotic phosphorylation."; RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008). RN [9] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=19413330; DOI=10.1021/ac9004309; RA Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., RA Mohammed S.; RT "Lys-N and trypsin cover complementary parts of the phosphoproteome in RT a refined SCX-based approach."; RL Anal. Chem. 81:4493-4501(2009). RN [10] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Cervix carcinoma; RX PubMed=20068231; DOI=10.1126/scisignal.2000475; RA Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L., RA Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., RA Mann M.; RT "Quantitative phosphoproteomics reveals widespread full RT phosphorylation site occupancy during mitosis."; RL Sci. Signal. 3:RA3-RA3(2010). RN [11] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21269460; DOI=10.1186/1752-0509-5-17; RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., RA Buerckstuemmer T., Bennett K.L., Superti-Furga G., Colinge J.; RT "Initial characterization of the human central proteome."; RL BMC Syst. Biol. 5:17-17(2011). RN [12] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-631, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21406692; DOI=10.1126/scisignal.2001570; RA Rigbolt K.T., Prokhorova T.A., Akimov V., Henningsen J., RA Johansen P.T., Kratchmarova I., Kassem M., Mann M., Olsen J.V., RA Blagoev B.; RT "System-wide temporal characterization of the proteome and RT phosphoproteome of human embryonic stem cell differentiation."; RL Sci. Signal. 4:RS3-RS3(2011). RN [13] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-2318, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Liver; RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., RA Wang L., Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human RT liver phosphoproteome."; RL J. Proteomics 96:253-262(2014). RN [14] RP STRUCTURE BY NMR OF 1266-1338. RG RIKEN structural genomics initiative (RSGI); RT "Solution structure of MIB-HERC2 domain in HECT domain containing RT protein 1."; RL Submitted (OCT-2006) to the PDB data bank. CC -!- FUNCTION: Probable E3 ubiquitin-protein ligase which accepts CC ubiquitin from an E2 ubiquitin-conjugating enzyme in the form of a CC thioester and then directly transfers the ubiquitin to targeted CC substrates. May be required for development of the head mesenchyme CC and neural tube closure (By similarity). {ECO:0000250}. CC -!- PATHWAY: Protein modification; protein ubiquitination. CC -!- SUBUNIT: Interacts with IGSF1. {ECO:0000269|PubMed:12421765}. CC -!- SIMILARITY: Belongs to the UPL family. K-HECT subfamily. CC {ECO:0000305}. CC -!- SIMILARITY: Contains 4 ANK repeats. {ECO:0000255|PROSITE- CC ProRule:PRU00023}. CC -!- SIMILARITY: Contains 1 HECT (E6AP-type E3 ubiquitin-protein CC ligase) domain. {ECO:0000255|PROSITE-ProRule:PRU00104}. CC -!- SIMILARITY: Contains 1 MIB/HERC2 domain. {ECO:0000255|PROSITE- CC ProRule:PRU00749}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AY254380; AAP13073.1; -; mRNA. DR EMBL; AL121808; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL136418; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CH471078; EAW65950.1; -; Genomic_DNA. DR EMBL; CH471078; EAW65952.1; -; Genomic_DNA. DR EMBL; AB032957; BAA86445.2; -; mRNA. DR EMBL; AL110222; CAB53681.1; -; mRNA. DR EMBL; BC011658; AAH11658.2; -; mRNA. DR EMBL; BC063686; AAH63686.1; -; mRNA. DR CCDS; CCDS41939.1; -. DR PIR; T14761; T14761. DR RefSeq; NP_056197.3; NM_015382.3. DR RefSeq; XP_005267559.2; XM_005267502.2. DR UniGene; Hs.708017; -. DR PDB; 2DK3; NMR; -; A=1266-1338. DR PDB; 2LC3; NMR; -; A=1879-1966. DR PDB; 3DKM; X-ray; 1.60 A; A=1271-1341. DR PDBsum; 2DK3; -. DR PDBsum; 2LC3; -. DR PDBsum; 3DKM; -. DR ProteinModelPortal; Q9ULT8; -. DR SMR; Q9ULT8; 1266-1338, 1879-1966. DR BioGrid; 117359; 38. DR DIP; DIP-31669N; -. DR IntAct; Q9ULT8; 12. DR MINT; MINT-6783566; -. DR STRING; 9606.ENSP00000382269; -. DR PhosphoSite; Q9ULT8; -. DR BioMuta; HECTD1; -. DR DMDM; 313104227; -. DR MaxQB; Q9ULT8; -. DR PaxDb; Q9ULT8; -. DR PeptideAtlas; Q9ULT8; -. DR PRIDE; Q9ULT8; -. DR Ensembl; ENST00000399332; ENSP00000382269; ENSG00000092148. DR Ensembl; ENST00000553700; ENSP00000450697; ENSG00000092148. DR GeneID; 25831; -. DR KEGG; hsa:25831; -. DR UCSC; uc001wrc.1; human. DR CTD; 25831; -. DR GeneCards; HECTD1; -. DR HGNC; HGNC:20157; HECTD1. DR HPA; HPA002929; -. DR neXtProt; NX_Q9ULT8; -. DR PharmGKB; PA134989284; -. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR HOGENOM; HOG000018061; -. DR HOVERGEN; HBG067533; -. DR InParanoid; Q9ULT8; -. DR KO; K12231; -. DR PhylomeDB; Q9ULT8; -. DR TreeFam; TF323674; -. DR Reactome; R-HSA-983168; Antigen processing: Ubiquitination & Proteasome degradation. DR UniPathway; UPA00143; -. DR ChiTaRS; HECTD1; human. DR EvolutionaryTrace; Q9ULT8; -. DR GenomeRNAi; 25831; -. DR NextBio; 47125; -. DR PRO; PR:Q9ULT8; -. DR Proteomes; UP000005640; Chromosome 14. DR Bgee; Q9ULT8; -. DR CleanEx; HS_HECTD1; -. DR ExpressionAtlas; Q9ULT8; baseline and differential. DR Genevisible; Q9ULT8; HS. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0001779; P:natural killer cell differentiation; IEA:Ensembl. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IEA:Ensembl. DR GO; GO:0001843; P:neural tube closure; IEA:Ensembl. DR GO; GO:0051865; P:protein autoubiquitination; IEA:Ensembl. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IEA:Ensembl. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IEA:Ensembl. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IEA:Ensembl. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 1: Evidence at protein level; KW 3D-structure; ANK repeat; Complete proteome; Ligase; Phosphoprotein; KW Polymorphism; Reference proteome; Repeat; Ubl conjugation pathway. FT CHAIN 1 2610 E3 ubiquitin-protein ligase HECTD1. FT /FTId=PRO_0000083945. FT REPEAT 395 424 ANK 1. FT REPEAT 426 455 ANK 2. FT REPEAT 459 491 ANK 3. FT REPEAT 579 612 ANK 4. FT DOMAIN 1266 1338 MIB/HERC2. {ECO:0000255|PROSITE- FT ProRule:PRU00749}. FT DOMAIN 2151 2610 HECT. {ECO:0000255|PROSITE- FT ProRule:PRU00104}. FT REGION 2029 2103 K-box. FT COMPBIAS 1350 1649 Ser-rich. FT ACT_SITE 2579 2579 Glycyl thioester intermediate. FT {ECO:0000255|PROSITE-ProRule:PRU00104}. FT MOD_RES 631 631 Phosphoserine. FT {ECO:0000244|PubMed:21406692}. FT MOD_RES 640 640 Phosphoserine. FT {ECO:0000250|UniProtKB:Q69ZR2}. FT MOD_RES 1384 1384 Phosphoserine. FT {ECO:0000250|UniProtKB:Q69ZR2}. FT MOD_RES 1488 1488 Phosphoserine. FT {ECO:0000244|PubMed:18669648}. FT MOD_RES 1567 1567 Phosphoserine. FT {ECO:0000250|UniProtKB:Q69ZR2}. FT MOD_RES 2318 2318 Phosphoserine. FT {ECO:0000244|PubMed:24275569}. FT VARIANT 656 656 Q -> H (in dbSNP:rs11620816). FT /FTId=VAR_059666. FT VARIANT 2027 2027 L -> P (in dbSNP:rs1315794). FT {ECO:0000269|PubMed:10574461, FT ECO:0000269|PubMed:17974005, FT ECO:0000269|Ref.3}. FT /FTId=VAR_067707. FT CONFLICT 561 561 K -> Q (in Ref. 1; AAP13073). FT {ECO:0000305}. FT CONFLICT 603 603 L -> I (in Ref. 1; AAP13073). FT {ECO:0000305}. FT CONFLICT 611 613 FLD -> YKH (in Ref. 1; AAP13073). FT {ECO:0000305}. FT CONFLICT 653 653 K -> Q (in Ref. 1; AAP13073). FT {ECO:0000305}. FT CONFLICT 894 894 E -> K (in Ref. 1; AAP13073). FT {ECO:0000305}. FT CONFLICT 927 944 SMDLDMKQDCSQLVERIN -> VSIFRATKQKQNEVPKVIL FT S (in Ref. 1; AAP13073). {ECO:0000305}. FT CONFLICT 951 951 S -> T (in Ref. 1; AAP13073). FT {ECO:0000305}. FT CONFLICT 1281 1281 I -> T (in Ref. 2; BAA86445). FT {ECO:0000305}. FT HELIX 1270 1272 {ECO:0000244|PDB:2DK3}. FT STRAND 1279 1282 {ECO:0000244|PDB:3DKM}. FT TURN 1289 1292 {ECO:0000244|PDB:3DKM}. FT STRAND 1299 1301 {ECO:0000244|PDB:3DKM}. FT STRAND 1309 1314 {ECO:0000244|PDB:3DKM}. FT TURN 1315 1317 {ECO:0000244|PDB:2DK3}. FT STRAND 1319 1325 {ECO:0000244|PDB:3DKM}. FT HELIX 1326 1328 {ECO:0000244|PDB:3DKM}. FT STRAND 1332 1334 {ECO:0000244|PDB:3DKM}. FT HELIX 1883 1886 {ECO:0000244|PDB:2LC3}. FT HELIX 1896 1902 {ECO:0000244|PDB:2LC3}. FT STRAND 1905 1908 {ECO:0000244|PDB:2LC3}. FT HELIX 1910 1920 {ECO:0000244|PDB:2LC3}. FT HELIX 1923 1928 {ECO:0000244|PDB:2LC3}. FT HELIX 1935 1941 {ECO:0000244|PDB:2LC3}. FT HELIX 1944 1957 {ECO:0000244|PDB:2LC3}. FT TURN 1960 1962 {ECO:0000244|PDB:2LC3}. SQ SEQUENCE 2610 AA; 289384 MW; 27E56401E07E158C CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSTTADS KLSNQVSTIV SLLSTLCRGS PVVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDT NKDEEECNEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MIHFCSEALL KEVCDSDVGH NLPTILVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDI FLDQLARLGV ISKVSTLAGP SSDDENEEES KPEKEDEPQE DAKELQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARGQVKPSTS SQPILSAPGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVRTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESSW ELHTNRQCIE SENTWRDLMK TALENLIVLL KDENTISPYE MCSSGLVQAL LTVLNNSMDL DMKQDCSQLV ERINVFKTAF SENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FVFVRKLREG QNFIFRHQHD FDENGIIYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DNSALNCHSN DDKNAWFAID LGLWVIPSAY TLRHARGYGR SALRNWVFQV SKDGQNWTSL YTHVDDCSLN EPGSTATWPL DPPKDEKQGW RHVRIKQMGK NASGQTHYLS LSGFELYGTV NGVCEDQLGK AAKEAEANLR RQRRLVRSQV LKYMVPGARV IRGLDWKWRD QDGSPQGEGT VTGELHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP DTVASPKPVS STVSGTTQSW SSLVKNNCPD KTSAAAGSSS RKGSSSSVCS VASSSDISLG STKTERRSEI VMEHSIVSGA DVHEPIVVLS SAENVPQTEV GSSSSASTST LTAETGSENA ERKLGPDSSV RTPGESSAIS MGIVSVSSPD VSSVSELTNK EAASQRPLSS SASNRLSVSS LLAAGAPMSS SASVPNLSSR ETSSLESFVR RVANIARTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TPGTTSTVTM STSSVTSSSN VATATTVLSV GQSLSNTLTT SLTSTSSESD TGQEAEYSLY DFLDSCRAST LLAELDDDED LPEPDEEDDE NEDDNQEDQE YEEVMILRRP SLQRRAGSRS DVTHHAVTSQ LPQVPAGAGS RPIGEQEEEE YETKGGRRRT WDDDYVLKRQ FSALVPAFDP RPGRTNVQQT TDLEIPPPGT PHSELLEEVE CTPSPRLALT LKVTGLGTTR EVELPLTNFR STIFYYVQKL LQLSCNGNVK SDKLRRIWEP TYTIMYREMK DSDKEKENGK MGCWSIEHVE QYLGTDELPK NDLITYLQKN ADAAFLRHWK LTGTNKSIRK NRNCSQLIAA YKDFCEHGTK SGLNQGAIST LQSSDILNLT KEQPQAKAGN GQNSCGVEDV LQLLRILYIV ASDPYSRISQ EDGDEQLQFT FPPDEFTSKK ITTKILQQIE EPLALASGAL PDWCEQLTSK CPFLIPFETR QLYFTCTAFG ASRAIVWLQN RREATVERTR TTSSVRRDDP GEFRVGRLKH ERVKVPRGES LMEWAENVMQ IHADRKSVLE VEFLGEEGTG LGPTLEFYAL VAAEFQRTDL GAWLCDDNFP DDESRHVDLG GGLKPPGYYV QRSCGLFTAP FPQDSDELER ITKLFHFLGI FLAKCIQDNR LVDLPISKPF FKLMCMGDIK SNMSKLIYES RGDRDLHCTE SQSEASTEEG HDSLSVGSFE EDSKSEFILD PPKPKPPAWF NGILTWEDFE LVNPHRARFL KEIKDLAIKR RQILSNKGLS EDEKNTKLQE LVLKNPSGSG PPLSIEDLGL NFQFCPSSRI YGFTAVDLKP SGEDEMITMD NAEEYVDLMF DFCMHTGIQK QMEAFRDGFN KVFPMEKLSS FSHEEVQMIL CGNQSPSWAA EDIINYTEPK LGYTRDSPGF LRFVRVLCGM SSDERKAFLQ FTTGCSTLPP GGLANLHPRL TVVRKVDATD ASYPSVNTCV HYLKLPEYSS EEIMRERLLA ATMEKGFHLN // ID HECD1_MOUSE Reviewed; 2618 AA. AC Q69ZR2; DT 10-AUG-2010, integrated into UniProtKB/Swiss-Prot. DT 10-AUG-2010, sequence version 2. DT 11-NOV-2015, entry version 88. DE RecName: Full=E3 ubiquitin-protein ligase HECTD1; DE EC=6.3.2.-; DE AltName: Full=HECT domain-containing protein 1; DE AltName: Full=Protein open mind; GN Name=Hectd1; Synonyms=Kiaa1131, Opm; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C57BL/6J; RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112; RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., RA She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., RA Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., RA Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., RA Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., RA Lindblad-Toh K., Eichler E.E., Ponting C.P.; RT "Lineage-specific biology revealed by a finished genome assembly of RT the mouse."; RL PLoS Biol. 7:E1000112-E1000112(2009). RN [2] RP NUCLEOTIDE SEQUENCE [MRNA] OF 1045-2618. RC TISSUE=Thymus; RX PubMed=15368895; DOI=10.1093/dnares/11.3.205; RA Okazaki N., Kikuno R., Ohara R., Inamoto S., Koseki H., Hiraoka S., RA Saga Y., Seino S., Nishimura M., Kaisho T., Hoshino K., Kitamura H., RA Nagase T., Ohara O., Koga H.; RT "Prediction of the coding sequences of mouse homologues of KIAA gene: RT IV. The complete nucleotide sequences of 500 mouse KIAA-homologous RT cDNAs identified by screening of terminal sequences of cDNA clones RT randomly sampled from size-fractionated libraries."; RL DNA Res. 11:205-218(2004). RN [3] RP FUNCTION, DEVELOPMENTAL STAGE, AND DISRUPTION PHENOTYPE. RX PubMed=17442300; DOI=10.1016/j.ydbio.2007.03.018; RA Zohn I.E., Anderson K.V., Niswander L.; RT "The Hectd1 ubiquitin ligase is required for development of the head RT mesenchyme and neural tube closure."; RL Dev. Biol. 306:208-221(2007). RN [4] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Liver; RX PubMed=17242355; DOI=10.1073/pnas.0609836104; RA Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.; RT "Large-scale phosphorylation analysis of mouse liver."; RL Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007). RN [5] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-641; SER-1389 AND RP SER-1572, AND IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE RP ANALYSIS]. RC TISSUE=Brain, Brown adipose tissue, Heart, Kidney, Liver, Lung, RC Pancreas, Spleen, and Testis; RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001; RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R., RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.; RT "A tissue-specific atlas of mouse protein phosphorylation and RT expression."; RL Cell 143:1174-1189(2010). CC -!- FUNCTION: Probable E3 ubiquitin-protein ligase which accepts CC ubiquitin from an E2 ubiquitin-conjugating enzyme in the form of a CC thioester and then directly transfers the ubiquitin to targeted CC substrates. Involved in development of the head mesenchyme and CC neural tube closure. {ECO:0000269|PubMed:17442300}. CC -!- PATHWAY: Protein modification; protein ubiquitination. CC -!- SUBUNIT: Interacts with IGSF1. {ECO:0000250}. CC -!- DEVELOPMENTAL STAGE: Ubiquitously expressed throughout early CC development of the embryo. {ECO:0000269|PubMed:17442300}. CC -!- DISRUPTION PHENOTYPE: Perinatal lethality, exencephaly, impaired CC neural fold elevation, abnormal head mesenchyme morhology and CC defects in eye and cranial vault morphology. CC {ECO:0000269|PubMed:17442300}. CC -!- SIMILARITY: Contains 4 ANK repeats. {ECO:0000255|PROSITE- CC ProRule:PRU00023}. CC -!- SIMILARITY: Contains 1 HECT (E6AP-type E3 ubiquitin-protein CC ligase) domain. {ECO:0000255|PROSITE-ProRule:PRU00104}. CC -!- SIMILARITY: Contains 1 MIB/HERC2 domain. {ECO:0000255|PROSITE- CC ProRule:PRU00749}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC159644; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AC157213; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AK173106; BAD32384.1; -; mRNA. DR UniGene; Mm.249391; -. DR ProteinModelPortal; Q69ZR2; -. DR SMR; Q69ZR2; 1271-1343, 1884-1971. DR STRING; 10090.ENSMUSP00000046766; -. DR MaxQB; Q69ZR2; -. DR PaxDb; Q69ZR2; -. DR PRIDE; Q69ZR2; -. DR Ensembl; ENSMUST00000179265; ENSMUSP00000136449; ENSMUSG00000035247. DR MGI; MGI:2384768; Hectd1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR HOGENOM; HOG000018061; -. DR HOVERGEN; HBG067533; -. DR InParanoid; Q69ZR2; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; Q69ZR2; -. DR TreeFam; TF323674; -. DR Reactome; R-MMU-983168; Antigen processing: Ubiquitination & Proteasome degradation. DR UniPathway; UPA00143; -. DR ChiTaRS; Hectd1; mouse. DR PRO; PR:Q69ZR2; -. DR Proteomes; UP000000589; Chromosome 12. DR Bgee; Q69ZR2; -. DR ExpressionAtlas; Q69ZR2; baseline and differential. DR Genevisible; Q69ZR2; MM. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IDA:MGI. DR GO; GO:0001892; P:embryonic placenta development; IMP:MGI. DR GO; GO:0001779; P:natural killer cell differentiation; IMP:MGI. DR GO; GO:1903077; P:negative regulation of protein localization to plasma membrane; IMP:MGI. DR GO; GO:0001843; P:neural tube closure; IMP:UniProtKB. DR GO; GO:0051865; P:protein autoubiquitination; IMP:MGI. DR GO; GO:0070534; P:protein K63-linked ubiquitination; IDA:MGI. DR GO; GO:0060708; P:spongiotrophoblast differentiation; IMP:MGI. DR GO; GO:0060707; P:trophoblast giant cell differentiation; IMP:MGI. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 1: Evidence at protein level; KW ANK repeat; Complete proteome; Ligase; Phosphoprotein; KW Reference proteome; Repeat; Ubl conjugation pathway. FT CHAIN 1 2618 E3 ubiquitin-protein ligase HECTD1. FT /FTId=PRO_0000396127. FT REPEAT 396 425 ANK 1. FT REPEAT 427 456 ANK 2. FT REPEAT 460 492 ANK 3. FT REPEAT 580 613 ANK 4. FT DOMAIN 1271 1343 MIB/HERC2. {ECO:0000255|PROSITE- FT ProRule:PRU00749}. FT DOMAIN 2156 2618 HECT. {ECO:0000255|PROSITE- FT ProRule:PRU00104}. FT REGION 2034 2108 K-box. {ECO:0000250}. FT COMPBIAS 496 499 Poly-Lys. FT COMPBIAS 1355 1654 Ser-rich. FT COMPBIAS 1750 1757 Poly-Glu. FT ACT_SITE 2587 2587 Glycyl thioester intermediate. FT {ECO:0000255|PROSITE-ProRule:PRU00104}. FT MOD_RES 632 632 Phosphoserine. FT {ECO:0000250|UniProtKB:Q9ULT8}. FT MOD_RES 641 641 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 1389 1389 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 1493 1493 Phosphoserine. FT {ECO:0000250|UniProtKB:Q9ULT8}. FT MOD_RES 1572 1572 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 2323 2323 Phosphoserine. FT {ECO:0000250|UniProtKB:Q9ULT8}. FT CONFLICT 1070 1070 F -> S (in Ref. 2; BAD32384). FT {ECO:0000305}. FT CONFLICT 2472 2474 Missing (in Ref. 2; BAD32384). FT {ECO:0000305}. SQ SEQUENCE 2618 AA; 290086 MW; 8A06C9F973B11AFA CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLTF IRDSGHLVHK DTLHSAMAVV SRLCGKMEPQ DSSLEICVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTVSGP SSACKPGRST TGAPSAAADS KLSNQVSTIV SLLSTLCRGS PLVTHDLLRS ELPDSIESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK SSAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAAFEV NFMDDVGQTL LNWASAFGTQ EMVEFLCERG ADVNRGQRSS SLHYAACFGR PQVAKTLLRH GANPDLRDED GKTPLDKARE RGHSEVVAIL QSPGDWMCPV NKGDDKKKKD TNKDEEECNE PRGDPEMAPL YLKRLLPVFA QTFQHTMLPS IRKASLALIR KMIHFCSEAL LKEVCDSDVG HNLPTTLVEI TATVLDQEDD DDGHLLALQI IRDLVDKGGD IFLDQLARLG VISKVSALAG PSSDDENEEE SKPEKEDEPQ EDAKELQQGK PYHWRDWSII RGRDCLYIWS DAAALELSNG SNGWFRFILD GKLATMYSSG SPEGGSDSSE SRSEFLEKLQ RARGQVKPST SSQPILSAPG PTKLTVGNWS LTCLKEGEIA IHNSDGQQAT ILKEDLPGFV FESNRGTKHS FTAETSLGSE FVTGWTGKRG RKLKSKLEKT KQKVRTMARD LYDDHFKAVE SMPRGVVVTL RNIATQLESS WELHTNRQCI EGENTWRDLM KTALENLIVL LKDENTISPY EMCSSGLVQA LLTVLNNVSI FRATKQKQNE VLVERINVFK TAFSESEDDE SYSRPAVALI RKLIAVLESI ERLPLHLYDT PGSTYNLQIL TRRLRFRLER APGETSLIDR TGRMLKMEPL ATVESLEQYL LKMVAKQWYD FDRSSFVFVR KLREGQNFIF RHQHDFDENG IIYWIGTNAK TAYEWVNPAA YGLVVVTSSE GRNLPYGRLE DILSRDNSAL NCHSNDDKNA WFAIDLGVWV IPSAYTLRHA RGYGRSALRN WVFQVSKDGQ NWTSLYTHVD DCSLNEPGST ATWPLDPAKD EKQGWRHVRL KQMGKNASGQ THYLSLSGFE LYGTVNGVCE DQLGKAAKEA EANLRRQRRL VRSQVLKYMV PGARVIRGLD WKWRDQDGSP QGEGTVTGEL HNGWIDVTWD AGGSNSYRMG AEGKFDLKLA PGYDPDTVAS PKPVSSTVSG TTQSWSSLVK NNCPDKTSAA AGSSSRKGSS SSVCSVASSS DISLASTKTE RRSEIVMEHS IVSGADVHEP IVVLSSAENV PQTEVGSSSS ASTSTLTAET GSENAERKLG PDSSVRAPGE SSAISMGIVS VSSPDVSSVS ELTNKEAASQ RPLSSSASNR LSVSSLLAAG APMSSSASVP NLSSRETSSL ESFVRRVANI ARTNATNNMN LSRSSSDNNT NTLGRNVMST ATSPLMGAQS FPNLTTPGTT STVTMSTSSV TSSSNVATAT TVLSVGQSLS NTLTTSLTST SSESDTGQEA EYSLYDFLDS CRASTLLAEL DDDEDLPEPD EEDDENEDDN QEDQEYEEVM ILRRPSLQRR AGSRSDVTHH VVTSQLPQVP SGAGSRPVGE QEEEEYETKG GRRRAWDDDY VLKRQFSALV PAFDPRPGRT NVQQTTDLEI PPPGTPHSEL LEEVECTPSP RLALTLKVTG LGTTREVELP LTNFRSTIFY YVQKLLQLSC NGNVKSDKLR RIWEPTYTIM YREMKDSDKE KENGKMGCWS IEHVEQYLGT DELPKNDLIT YLQKNADAAF LRHWKLTGTN KSIRKNRNCS QLIAAYKDFC EHGTKSGLNQ GAISSLQSSD ILNLTKEQPQ AKAGNGQSPC GVEDVLQLLR ILYIVASDPY SRISQEDGDE QPQFTFPPDE FTSKKITTKI LQQIEEPLAL ASGALPDWCE QLTSKCPFLI PFETRQLYFT CTAFGASRAI VWLQNRREAT VERTRTTSSV RRDDPGEFRV GRLKHERVKV PRGESLMEWA ENVMQIHADR KSVLEVEFLG EEGTGLGPTL EFYALVAAEF QRTDLGTWLC DDNFPDDESR HVDLGGGLKP PGYYVQRSCG LFTAPFPQDS DELERITKLF HFLGIFLAKC IQDNRLVDLP ISKPFFKLMC MGDIKSNMSK LIYESRGDRD LHCTESQSEA STEEGHDSLS VGSFEEDSKS EFILDPPKPK PPAWFNGILT WEDFELVNPH RARFLKEIKD LAIKRRQILG NKSLSEDEKN TKLQELVLRN PSGSGPPLSI EDLGLNFQFC PSSRIYGFTA VDLKPSGEDE MITMDNAEEY VDLMFDFCMH TGIQKQMEAF RGNVDGFNKV FPMEKLSSFS HEEVQMILCG NQSPSWAAED IINYTEPKLG YTRDSPGFLR FVRVLCGMSS DERKAFLQFT TGCSTLPPGG LANLHPRLTV VRKVDATDAS YPSVNTCVHY LKLPEYSSEE IMRERLLAAT MEKGFHLN // ID I0FQL1_MACMU Unreviewed; 720 AA. AC I0FQL1; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=SUN domain-containing protein 2 isoform b {ECO:0000313|EMBL:AFI36737.1}; GN Name=SUN2 {ECO:0000313|EMBL:AFI36737.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFI36737.1}; RN [1] {ECO:0000313|EMBL:AFI36737.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Testis {ECO:0000313|EMBL:AFI36737.1}; RX PubMed=25319552; DOI=10.1186/1745-6150-9-20; RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., RA Pandey S., Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., RA Tharp G.K., Marcais G., Roberts M., Ferguson B., Fox H.S., RA Treangen T., Salzberg S.L., Yorke J.A., Norgren R.B.Jr.; RT "A new rhesus macaque assembly and annotation for next-generation RT sequencing analyses."; RL Biol. Direct 9:20-20(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JV046666; AFI36737.1; -; mRNA. DR EMBL; JV046667; AFI36738.1; -; mRNA. DR STRING; 9544.ENSMMUP00000009544; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 217 238 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 276 296 FT COILED 355 375 FT COILED 377 404 FT COILED 407 434 FT COILED 481 501 SQ SEQUENCE 720 AA; 80569 MW; 2D56E6B4E9176FCE CRC64; MSRRSQRLTR YSQGDDDGSS SSGGSSVAGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDAHTSYY SESLVRESYI GTAFPPRSAL EELHGDADWG EDLRVRRRRG TGGSESSRAS GLVGRKAAED FLGSSSGYSS EDDYMGYSDA DQQSSGSRLW NAVSRAGSLL WMVATSPGRL FRLLYWWAGT TWYRLTTAAS LLDVFVLTRR FSSLKTFLWF LLPLLLLTCL TYGAWYFYPY GLQTFHPALV SWWAAKDSRR QDEGWESRDS SHFQAEQRVM SRVHSLERRL EALAAEFSSN WQKEAMRLER LELRQGAPGQ GGGGGLSHED TLALLEGLVS RHEAALKEDF RREAAARIQE ELSALRAEHQ QDSEDLFKKI VRASQESEAR IQQLKSEWQS MTQESFRESS VKELRRLEDQ LAGLQQELVA LALKQSLVAD EVGLLPQQIQ AVRDDVESQF PAWISQFLAR GGGGRVGLLQ REEMQAQLRE LESKILTHVA EMQGKSAREA AASLGMTLQK EGVIGVTEEQ VHRIVKQALQ RYSEDRIGLA DYALESGGAS VISTRCSETY ETKTALLSLF GIPLWYHSQS PRAILQPDVH PGNCWAFQGP QGFAVVRLSA RIRPTAVTLE HVPKALSPNS TISSAPKDFA IFGFDEDLQQ EGTLLGKFTY DQDGEPIQTF HFQAPSMATY QVVELRILTN WGHPEYTCIY RFRVHGEPAH // ID I0FSE5_MACMU Unreviewed; 1253 AA. AC I0FSE5; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Protein osteopotentia homolog isoform 1 {ECO:0000313|EMBL:AFI37371.1}; GN Name=C1orf9 {ECO:0000313|EMBL:AFI37371.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFI37371.1}; RN [1] {ECO:0000313|EMBL:AFI37371.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Orbital {ECO:0000313|EMBL:AFJ71548.1}, and RC Testis {ECO:0000313|EMBL:AFI37371.1}; RX PubMed=25319552; DOI=10.1186/1745-6150-9-20; RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., RA Pandey S., Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., RA Tharp G.K., Marcais G., Roberts M., Ferguson B., Fox H.S., RA Treangen T., Salzberg S.L., Yorke J.A., Norgren R.B.Jr.; RT "A new rhesus macaque assembly and annotation for next-generation RT sequencing analyses."; RL Biol. Direct 9:20-20(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JV047300; AFI37371.1; -; mRNA. DR EMBL; JV636208; AFJ71548.1; -; mRNA. DR STRING; 9544.ENSMMUP00000018073; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 FT CHAIN 30 1253 FT /FTId=PRO_5003626523. FT COILED 935 955 {ECO:0000256|SAM:Coils}. FT COILED 985 1005 {ECO:0000256|SAM:Coils}. FT COILED 1191 1211 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1253 AA; 139203 MW; 938BBFAC52078ECD CRC64; MKKHRRALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFPKKD EREGPINAES LGKSGSNLPV SPEEHKLKDD SIVDVQNTES KKLSPPVVET LPTVDLHEES SNAVVDSETV ENISSSSTSE ITPISKLDEI EKSGTIPIAK PSETEQSETD CDVGEALDAS APIEQPSFVS PPDSLVGQHI ENVSSSHGKG KITKSEFESK VSASEQDGGD QKSALNASDN VKNESSDYTK PGDIDPTSVT SPKDPEDIPT FDEWKKKVME VEKEKSQSMH PSSNGGSHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LVSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYHSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKSISEN ATATAAPKMP ESTPVSTPVP SPAYVTTEVD TNDMELSTPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWFE SETQIFCSEL TTICCISSFS EYIYKWCSVR VALYWQRSRT ALSKGKDYLV SAQPPLLPAE SVDISVLQPL SGELENKNIE REAETVVLGD LSSSMHQDDL VNHTVDAVEL EPSHSQTLSQ SLLLDITPEI NPLPKIEVSE SVEYEAGHIT SQVIPQESSV EIDNEAEQKS ESFSSIEKPS VTYETNKVNE VVDNIIKEDV NSMQIFTKLS ETIVPPINTA TVPDNEDGEA KMNVADTAKQ TLISVVDSSS LPEVKEEEQS PEDALLRGLQ RTATDFYAEL QNSTDLGYAN GNLVHGSNQK ESVFMRLNNR IKALEVNMSL SGRYLEELSQ RYRKQMEEMQ KAFNKTIVKL QNTSRIAEEQ DQRQTEAIQL LQAQLTNMTQ LVSNLSATVA ELKREVSDRQ SYLVISLVLC VVLGLMLCMQ RCRNTSQFDG DYISKLPKSN QYPSPKRCFS SYDDMNLKRR TSFPLMRSKS LQLTGKEVDP NDLYIVEPLK FSPEKKKKRC KYKIEKIETI KPAEPLHPIA NGDIKGRKPF TNQRDFSNIG EVYHSSYKGP PSEGSSETSS QSEESYFCGI SACTSLCNGQ SQKTKTEKRA LKRRRSKVQD QGKLIKTLIQ TKSGSLPSLH DIIKGNKEIT VGTFGVTAVS GHI // ID I0FSE6_MACMU Unreviewed; 1246 AA. AC I0FSE6; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Protein osteopotentia homolog isoform 1 {ECO:0000313|EMBL:AFI37372.1}; GN Name=C1orf9 {ECO:0000313|EMBL:AFI37372.1}; OS Macaca mulatta (Rhesus macaque). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFI37372.1}; RN [1] {ECO:0000313|EMBL:AFI37372.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Testis {ECO:0000313|EMBL:AFI37372.1}; RX PubMed=25319552; DOI=10.1186/1745-6150-9-20; RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., RA Pandey S., Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., RA Tharp G.K., Marcais G., Roberts M., Ferguson B., Fox H.S., RA Treangen T., Salzberg S.L., Yorke J.A., Norgren R.B.Jr.; RT "A new rhesus macaque assembly and annotation for next-generation RT sequencing analyses."; RL Biol. Direct 9:20-20(2014). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; JV047301; AFI37372.1; -; mRNA. DR STRING; 9544.ENSMMUP00000018073; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1246 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003627365. FT COILED 928 948 {ECO:0000256|SAM:Coils}. FT COILED 978 998 {ECO:0000256|SAM:Coils}. FT COILED 1184 1204 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1246 AA; 138279 MW; 993CC22B275858E9 CRC64; MKKHRRALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFPKKD EREGPINAES LGKSGSNLPV SPEEHKLKDD SIVDVQNTES KKLSPPVVET LPTVDLHEES SNAVVDSETV ENISSSSTSE ITPISKLDEI EKSGTIPIAK PSETEQSETD CDVGEALDAS APIEQPSFVS PPDSLVGQHI ENVSSSHGKG KITKSEFESK VSASEQDGGD QKSALNASDN VKNESSDYTK PGDIDPTSVT SPKDPEDIPT FDEWKKKVME VEKEKSQSMH PSSNGGSHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KVELVSHFGS EHFCPLSLIR VFGTSMVEEY EEIADSQYHS ERQELFDEDY DYPLDYNTGE DKSSKNLLGS ATNAILNMVN IAANILGAKT EDLTEGNKSI SENATATAAP KMPESTPVST PVPSPAYVTT EVDTNDMELS TPDTPKESPI VQLVQEEEEE ASPSTVTLLG SGEQEDESSP WFESETQIFC SELTTICCIS SFSEYIYKWC SVRVALYWQR SRTALSKGKD YLVSAQPPLL PAESVDISVL QPLSGELENK NIEREAETVV LGDLSSSMHQ DDLVNHTVDA VELEPSHSQT LSQSLLLDIT PEINPLPKIE VSESVEYEAG HITSQVIPQE SSVEIDNEAE QKSESFSSIE KPSVTYETNK VNEVVDNIIK EDVNSMQIFT KLSETIVPPI NTATVPDNED GEAKMNVADT AKQTLISVVD SSSLPEVKEE EQSPEDALLR GLQRTATDFY AELQNSTDLG YANGNLVHGS NQKESVFMRL NNRIKALEVN MSLSGRYLEE LSQRYRKQME EMQKAFNKTI VKLQNTSRIA EEQDQRQTEA IQLLQAQLTN MTQLVSNLSA TVAELKREVS DRQSYLVISL VLCVVLGLML CMQRCRNTSQ FDGDYISKLP KSNQYPSPKR CFSSYDDMNL KRRTSFPLMR SKSLQLTGKE VDPNDLYIVE PLKFSPEKKK KRCKYKIEKI ETIKPAEPLH PIANGDIKGR KPFTNQRDFS NIGEVYHSSY KGPPSEGSSE TSSQSEESYF CGISACTSLC NGQSQKTKTE KRALKRRRSK VQDQGKLIKT LIQTKSGSLP SLHDIIKGNK EITVGTFGVT AVSGHI // ID I0YVU7_9CHLO Unreviewed; 279 AA. AC I0YVU7; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 10. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EIE22516.1}; GN ORFNames=COCSUDRAFT_83473 {ECO:0000313|EMBL:EIE22516.1}; OS Coccomyxa subellipsoidea C-169. OC Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; OC Coccomyxaceae; Coccomyxa. OX NCBI_TaxID=574566 {ECO:0000313|EMBL:EIE22516.1, ECO:0000313|Proteomes:UP000007264}; RN [1] {ECO:0000313|EMBL:EIE22516.1, ECO:0000313|Proteomes:UP000007264} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C-169 {ECO:0000313|EMBL:EIE22516.1, RC ECO:0000313|Proteomes:UP000007264}; RX PubMed=22630137; DOI=10.1186/gb-2012-13-5-r39; RA Blanc G., Agarkova I., Grimwood J., Kuo A., Brueggeman A., Dunigan D., RA Gurnon J., Ladunga I., Lindquist E., Lucas S., Pangilinan J., RA Proschold T., Salamov A., Schmutz J., Weeks D., Yamada T., RA Claverie J.M., Grigoriev I., Van Etten J., Lomsadze A., Borodovsky M.; RT "The genome of the polar eukaryotic microalga coccomyxa subellipsoidea RT reveals traits of cold adaptation."; RL Genome Biol. 13:R39-R39(2012). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EIE22516.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AGSI01000010; EIE22516.1; -; Genomic_DNA. DR RefSeq; XP_005647060.1; XM_005647003.1. DR GeneID; 17040386; -. DR KEGG; csl:COCSUDRAFT_83473; -. DR KO; K19347; -. DR Proteomes; UP000007264; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007264}; KW Reference proteome {ECO:0000313|Proteomes:UP000007264}. FT COILED 51 92 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 279 AA; 30818 MW; F7D07D47D9C1897C CRC64; MAWFDFDVVR PKVPHQEEGL LLLAVLTSWL TSRFSQPDLT KRQLSTILTS VEQADRRYNA LVKALDSAQG QVHTTEDRLE STQKKLDALE QSFQGGSDKD TKLSGKVEPS LIVNMAEQRM EKLLARFAAD KTYLLTPGGP IPGECLALNG SKGYVDIKLR ETIVPTALTY EHVPTSIAYD IRSAPQDMSA TGSLSEAFRF SPKGKGPVLS LLERRRGVGP TDLGEFVYDP SQGALNTVAL DGTIPADQIR LKVESNYGHP DYTCLYRVRI HGRVPKEGE // ID I1C5P8_RHIO9 Unreviewed; 480 AA. AC I1C5P8; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 16-SEP-2015, entry version 10. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EIE83778.1}; GN ORFNames=RO3G_08483 {ECO:0000313|EMBL:EIE83778.1}; OS Rhizopus delemar (strain RA 99-880 / ATCC MYA-4621 / FGSC 9543 / NRRL OS 43880) (Mucormycosis agent) (Rhizopus arrhizus var. delemar). OC Eukaryota; Fungi; Fungi incertae sedis; Mucoromycotina; Mucorales; OC Mucorineae; Rhizopodaceae; Rhizopus. OX NCBI_TaxID=246409 {ECO:0000313|EMBL:EIE83778.1, ECO:0000313|Proteomes:UP000009138}; RN [1] {ECO:0000313|EMBL:EIE83778.1, ECO:0000313|Proteomes:UP000009138} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=RA 99-880 / ATCC MYA-4621 / FGSC 9543 / NRRL 43880 RC {ECO:0000313|Proteomes:UP000009138}; RX PubMed=19578406; DOI=10.1371/journal.pgen.1000549; RA Ma L.-J., Ibrahim A.S., Skory C., Grabherr M.G., Burger G., Butler M., RA Elias M., Idnurm A., Lang B.F., Sone T., Abe A., Calvo S.E., RA Corrochano L.M., Engels R., Fu J., Hansberg W., Kim J.-M., RA Kodira C.D., Koehrsen M.J., Liu B., Miranda-Saavedra D., O'Leary S., RA Ortiz-Castellanos L., Poulter R., Rodriguez-Romero J., RA Ruiz-Herrera J., Shen Y.-Q., Zeng Q., Galagan J., Birren B.W., RA Cuomo C.A., Wickes B.L.; RT "Genomic analysis of the basal lineage fungus Rhizopus oryzae reveals RT a whole-genome duplication."; RL PLoS Genet. 5:E1000549-E1000549(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH476737; EIE83778.1; -; Genomic_DNA. DR InParanoid; I1C5P8; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000009138; Unassembled WGS sequence. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000009138}; KW Reference proteome {ECO:0000313|Proteomes:UP000009138}. SQ SEQUENCE 480 AA; 55461 MW; F2FC566649776F32 CRC64; MMTSSTPIKP FSSFEEWQQR IDTRQRPRSP KVVGQQVIDS IDGGLSDDLG AMFENIMSHQ KPNNVYEEQQ YISPIGNKPK KQEDNLKYLK ERFNYASVDC AATVRKANKE AKGAQSILFE SKDQYLLNRC SANKFVIINL CEQIRVDTIV MANFEFFSST FKDFKVYGSA KYPSDDWWLL GQWQARNTRD LQVFRVPESP WTNYIKIEFL THYGHEYYCP LSLVRVHGMS MMELYTNIES NDDEIPTPEH LWPAEIREQI IQPQYDIVNT SESFPIKVEE EDVPIVIPPV VNDTEEMPTE EQMEMPEENI EKIEKTVTTS AVTTISGEST CVQRNTIMPL PINKSSQELI EQPLMIDSND DHHEVNMTSM GSPVTSTVSH TSTQETSTSN LTEDNRPMIT QHKIHHSKET TQESIYKTIM KRLNVLEHNM TLSQRFLDEQ NKVLNDVFLE MERKHQEQLI VLIEHLNGTA SQKIETMASE // ID I1CPD1_RHIO9 Unreviewed; 583 AA. AC I1CPD1; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 13. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EIE90311.1}; GN ORFNames=RO3G_15022 {ECO:0000313|EMBL:EIE90311.1}; OS Rhizopus delemar (strain RA 99-880 / ATCC MYA-4621 / FGSC 9543 / NRRL OS 43880) (Mucormycosis agent) (Rhizopus arrhizus var. delemar). OC Eukaryota; Fungi; Fungi incertae sedis; Mucoromycotina; Mucorales; OC Mucorineae; Rhizopodaceae; Rhizopus. OX NCBI_TaxID=246409 {ECO:0000313|EMBL:EIE90311.1, ECO:0000313|Proteomes:UP000009138}; RN [1] {ECO:0000313|EMBL:EIE90311.1, ECO:0000313|Proteomes:UP000009138} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=RA 99-880 / ATCC MYA-4621 / FGSC 9543 / NRRL 43880 RC {ECO:0000313|Proteomes:UP000009138}; RX PubMed=19578406; DOI=10.1371/journal.pgen.1000549; RA Ma L.-J., Ibrahim A.S., Skory C., Grabherr M.G., Burger G., Butler M., RA Elias M., Idnurm A., Lang B.F., Sone T., Abe A., Calvo S.E., RA Corrochano L.M., Engels R., Fu J., Hansberg W., Kim J.-M., RA Kodira C.D., Koehrsen M.J., Liu B., Miranda-Saavedra D., O'Leary S., RA Ortiz-Castellanos L., Poulter R., Rodriguez-Romero J., RA Ruiz-Herrera J., Shen Y.-Q., Zeng Q., Galagan J., Birren B.W., RA Cuomo C.A., Wickes B.L.; RT "Genomic analysis of the basal lineage fungus Rhizopus oryzae reveals RT a whole-genome duplication."; RL PLoS Genet. 5:E1000549-E1000549(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH476746; EIE90311.1; -; Genomic_DNA. DR InParanoid; I1CPD1; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000009138; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009138}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009138}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 102 118 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 124 148 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 160 178 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 42 87 {ECO:0000256|SAM:Coils}. FT COILED 202 222 {ECO:0000256|SAM:Coils}. FT COILED 259 279 {ECO:0000256|SAM:Coils}. FT COILED 316 350 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 583 AA; 67564 MW; C91C588C59291285 CRC64; MELKSISKKP YAFNSHPEDD TSIDEGYEDF DAYDELELMV NHPELEEAMR RMIQQREQLQ QMEEQLLAQH KQQRKNQSNN ETQQEAQEQP EGSLIRYYAS KWLEFIIFIF FMIYWIIKEP IIRTVTFFTM IVSSLLVTPA IFVWSKLFDQ WTPNNELRKR ITSLSTGLLF AYLAYLAYPR VQPYIPTYHW THATVNPPTA DLTNIIRHIS RWEERVNQLS DKQVRHEALY NDLSSKVNSE LQSIRNQIKQ STSNLIQRIS SQQNNIEGIN SQLEQSTSNL LQRMSNQQGD IEAIANQLEQ STTNFLQKFT HQQGDIENLS NQHSNIANQY NDVANQYDSL TNQHSNLANE HSSIINQQEA LVQKLNEIEK MLSSVEWGHS QGSWPDLTQQ EIQKYITDKV NQFIPHETAD FALESRGARV IHTMTSKTFQ PMKPWLQHIR RITGVSSRLR TIPEMALQPQ TYPGECWSME GTSGSLAILL SQPVHLESIT IEYPTPEIMS FNMSTAPKNI QILGIKDFKH HPESTVSLGL VQYDIYKNQA IQNFLLDSNN DVFEAVIIKI LSNWGNLLHT DLYRVRLHGA PPV // ID I1F6N9_AMPQE Unreviewed; 1936 AA. AC I1F6N9; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:PAC:15711734}; OS Amphimedon queenslandica (Sponge). OC Eukaryota; Metazoa; Porifera; Demospongiae; Haplosclerida; Niphatidae; OC Amphimedon. OX NCBI_TaxID=400682 {ECO:0000313|EnsemblMetazoa:PAC:15711734, ECO:0000313|Proteomes:UP000007879}; RN [1] {ECO:0000313|EnsemblMetazoa:PAC:15711734} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Lucas S., Shapiro H., Lindquist E., Tice H., Dalin E., RA Glavina del Rio T., Bruce D., Barry K., Pitluck S., Srivastava M., RA Simakov O., Chapman J., Mitros T., Hellsten U., Putnam N.H., Fahey B., RA Gauthier M., Larroux C., Richards G.S., Stanke M., Adamska M., RA Darling A., Dacre M., Degnan S.M., Zhai Y., Adamski M., Calcino A., RA Cummins S.F., Goodstein D.M., Harris C., Shu S., Woodcroft B., RA Leys S.P., Manning G., Degnan B.M., Rokhsar D.S.; RT "The genome of the haplosclerid demosponge Amphimedon queenslandica RT and the evolution of animal complexity."; RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblMetazoa:PAC:15711734} RP IDENTIFICATION. RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 400682.PAC_15711734; -. DR EnsemblMetazoa; PAC:15711734; PAC:15711734; Aqu1.213206. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; I1F6N9; -. DR OMA; NRQCIEG; -. DR Proteomes; UP000007879; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 2. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007879}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007879}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 1936 AA; 217233 MW; 65707D8C0F04CB7D CRC64; MDLEPETLLE WLSIGDGADR DIQLVALEQL CMLLLMADNV DKCFESCPPR TFIPALCKIF MDPNAPDNVV EIYEPVIKCF MTLIDRFIRR GHDVNPLIDK GLIHELTKRL AAVSDSSSSS PQSVSIIVNL LSTLIRGSSS TANAVLRSNL PDSIKQAVKG DERCVLDVLR LIELLLILLF EGRKALPKNL QPLASRQIES FPGDASNRHV IEAIRAGNNE EFFEAMESGI DINHMDDVGQ TLLNWASAFG TLEMRLLQFG ADTELRDEDG HTPLDKAQER GDEWYQQCLE IFDNSDNVLR DDDDDDDSDS FDSDSDAESK IKDMISSTAA TGTGPASSGG NKVISGNIIE PSGQTTAAAN NEENKNNGKE DASSDKVDPE LIPVYMKLLL PLLVEVFHSS LSQTLRKECL RLVCKMFPFV SKEVLQDICS DSESQRSFPG QITEVLTSTL EIEDDLDGQL AALTILDELL GKVPDTFLEQ FLRLGLPVHV SNIAVPPKEA EEPSTEETDS GPLSLKESEV LQDATELTPL VPYQWKDWVF IRSSECLFLW NEHIVLELSR VSNGWFRYLM DKKVATMYSS GSVEGSSSSL GGKSDFIVKM QLCCSQVPSD AVLYPILSTP SPVKLKIGHW SLSCEKEGEI VIHNAESSQI SSLAWKIQQK YFTSNSEGTR KVAFELQLIL KNIEDTCLAH ESDVSLDSGM GWKEELQFGL QALSDLLKEE STLSAFEVHS SSLVQVLLHC ISGQFSEEPV PAQKINERFN LFKEVFSDAD KNACLSEESN ISPPSVSLVK KLIAVLEAVE RLPLLCHDAP GTPLNLQIIQ RKLHFKLERA PDEGDLRDFS GKTFKTEPLV TIKGLEKFLS GRAAKQWYDF ERTSYHFVSV LQKMKSPLVF QHVSDFDENG IFYWIGSNAR TCDWVNPAAH HIVVVSSSDG RILPYGNLDD ILCRDDTPAN CHTKDDRNSW FAVDTGVWFF PTHYSLRHSR GYGNRSALRS WDFQVSKDGV TWTTVYSHVN DNSLNEPGST ASWSLSPPTD PEGWRHLRLI LTGPNASGHT HYLSLSGLEV YGEVRGLADN ELGKAAKEQE RQLYQKRRFV KEHIMKKLHI GARVVRGVDW KWRDQDGIPP VPGTVIGELR NGWVEVQWDH GSANSYRMGA EGKYDLELTG EEPAIPPPPE SEETTSNTPP APDNVSDEDD DDPVHQKTWD DDCILKQSFS ALVSAFDPRP GQTNVPQIQD FVIPLPGSVN QTVDDSIDAK FKRVKLALFV KYQGLELIME DSSKPICHYV QRLLQHLPKG ERKRRVWDET FTLVYRDSSH VISSGQWLGP VSYVASSLLS GVISKEDVIL FIRKNGKDPL SLASDSLPDW CYKLTGHYSF LFPFETREMF FLSTAFGTSR TVIWLQKCCD EVLERIRGGA LKKEEQYEYH IGRLRQDRVH IPRKEDDILL WASSVLHAHA ERKSVLEVEF LNEEGTGLGP SLEFYALVSA EFQKSSLGMW LTNDRSSLQH NDMSRQVDIG LGVKPPGYYV QRSCGLFPAP VPSSSPEFDR ICKHFELLGL FLAKCLQDGR RVDIPLSESF LKLMCFKQVI TEDPGTLPSI TRPSEEERRD DDVTLNSSDN AINNNPTSFS NDTNTITGQE ATGKEKIMMD EERRKESSAA KDASKDDAAL MNDKVHSSKT SGNTVPWFTG ILTMGDLVTI DPYRGKFLVQ LQDLVQRKIE LSEMEMTQSV DGLLLDDGSQ LSDLMLNFTY APSSSSYGYE WYDLTDNGSA TELDNSNAEE YLALTKDFVL ESGIRKQLES FKTGFDRVFS MNKLQIFSPY ELRLLLCGEQ SPSWTREDIL KYTVPKYGYN SDSHGYQRFV NVLVELDSDE RKAFVQFITG CSSLPPGGLA NLHPRLTVVR KDSKDDNMFP SVNTCVHYLK LPEYSSERIL KTKLMEATFE KGFHLN // ID I1FZU9_AMPQE Unreviewed; 435 AA. AC I1FZU9; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:PAC:15721663}; OS Amphimedon queenslandica (Sponge). OC Eukaryota; Metazoa; Porifera; Demospongiae; Haplosclerida; Niphatidae; OC Amphimedon. OX NCBI_TaxID=400682 {ECO:0000313|EnsemblMetazoa:PAC:15721663, ECO:0000313|Proteomes:UP000007879}; RN [1] {ECO:0000313|EnsemblMetazoa:PAC:15721663} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Lucas S., Shapiro H., Lindquist E., Tice H., Dalin E., RA Glavina del Rio T., Bruce D., Barry K., Pitluck S., Srivastava M., RA Simakov O., Chapman J., Mitros T., Hellsten U., Putnam N.H., Fahey B., RA Gauthier M., Larroux C., Richards G.S., Stanke M., Adamska M., RA Darling A., Dacre M., Degnan S.M., Zhai Y., Adamski M., Calcino A., RA Cummins S.F., Goodstein D.M., Harris C., Shu S., Woodcroft B., RA Leys S.P., Manning G., Degnan B.M., Rokhsar D.S.; RT "The genome of the haplosclerid demosponge Amphimedon queenslandica RT and the evolution of animal complexity."; RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblMetazoa:PAC:15721663} RP IDENTIFICATION. RG EnsemblMetazoa; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 400682.PAC_15721663; -. DR EnsemblMetazoa; PAC:15721663; PAC:15721663; Aqu1.223135. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; I1FZU9; -. DR Proteomes; UP000007879; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007879}; KW Reference proteome {ECO:0000313|Proteomes:UP000007879}. FT COILED 59 79 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 435 AA; 48598 MW; FCBCA8C2FC54012A CRC64; MGKENKEEKE EVPSTVTEET MVDNSTGAKE VELVTTETEE TTPLDNSLEL EKKEKGKYNE STEERKEEEE GDLERREVNI ETIELTKEAE GDKQGENDST TLSDQDKDTS SAQDEATNSS SEDHTSSIND TESSEPDDIV TFEEFKNRAS QELNQPIKQP QGIIHKGNSN NFASYECGAK VVATNPEAKN SHAILTGNKD EYMLNPCSAE IWFVIELCEL VSVNKFEIGS FELFSSIPES FAVFTSESYP TSNWESISSF QMRNERGVQE FPLADTVYAK YIKIVMLSHF GSEHYCPISM VSVYGTTMME EYELSETQRT QGGKRDSVED EGRITITDTD NTDSTVKPNG PEANNPLLAA TGAIISVIGK AVNGLMGKEK KERMNHNIDS EDERRQLAKS TRPDDLHQVS FDEARSCVTV EEGRKDTGQQ LSNEK // ID I1HEA8_BRADI Unreviewed; 440 AA. AC I1HEA8; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:BRADI2G09990.1}; GN Name=BRADI2G09990 {ECO:0000313|EnsemblPlants:BRADI2G09990.1}; OS Brachypodium distachyon (Purple false brome) (Trachynia distachya). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Pooideae; Brachypodieae; Brachypodium. OX NCBI_TaxID=15368 {ECO:0000313|EnsemblPlants:BRADI2G09990.1, ECO:0000313|Proteomes:UP000008810}; RN [1] {ECO:0000313|EnsemblPlants:BRADI2G09990.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Bd21 {ECO:0000313|EnsemblPlants:BRADI2G09990.1}; RX PubMed=20148030; DOI=10.1038/nature08747; RG International Brachypodium Initiative; RT "Genome sequencing and analysis of the model grass Brachypodium RT distachyon."; RL Nature 463:763-768(2010). RN [2] {ECO:0000313|EnsemblPlants:BRADI2G09990.1} RP IDENTIFICATION. RC STRAIN=cv. Bd21 {ECO:0000313|EnsemblPlants:BRADI2G09990.1}; RG EnsemblPlants; RL Submitted (NOV-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_003566816.1; XM_003566768.2. DR STRING; 15368.BRADI2G09990.1; -. DR EnsemblPlants; BRADI2G09990.1; BRADI2G09990.1; BRADI2G09990. DR GeneID; 100843043; -. DR KEGG; bdi:100843043; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; I1HEA8; -. DR KO; K19347; -. DR OMA; RVSGWYQ; -. DR Proteomes; UP000008810; Chromosome 2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008810}; KW Reference proteome {ECO:0000313|Proteomes:UP000008810}. FT COILED 168 195 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 440 AA; 46932 MW; 22F5D09D027A53DF CRC64; MPSPSLSAAA VANPVSPLAL DPSPAASRPA AAATAALRKR SVLLLDQRPH PSTPTSRAAT AAAPPLSQAR RKRGLSSSGR PRWQTALSVA AKNAALLAVL LYLGDQAWRW AHPAPPALLD DVALAAYNAR VDDVEASLVR ALRALQVQVE AVNRKIDGEV GAARGHLAAL LEEKRLALEA QLSRLESRTD ELNDSLGGLK QMEFLRKDEF ETFLNEIKES LGPDSGSEVD LDQLRLVAKE IAMREIEKHA ADGIGRVDYA VGSAGGRVMR HSEAYDAGKR GGLLSALPFG GGDKGDPSQK ILQPSFGEPG QCLPLKGSSG FVEIQLRKGI IPDAITLEHV SKDVAYDMST APKDCRVSGW YQGPPTETPP SHAAKMSTLT EFTYDLAKNN VQTFDITVAD VSVVNMVRLD FASNHGSSAL TCIYRIRVHG HEPVTPGISS // ID I1HLY3_BRADI Unreviewed; 450 AA. AC I1HLY3; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:BRADI2G36230.1}; GN Name=BRADI2G36230 {ECO:0000313|EnsemblPlants:BRADI2G36230.1}; OS Brachypodium distachyon (Purple false brome) (Trachynia distachya). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Pooideae; Brachypodieae; Brachypodium. OX NCBI_TaxID=15368 {ECO:0000313|EnsemblPlants:BRADI2G36230.1, ECO:0000313|Proteomes:UP000008810}; RN [1] {ECO:0000313|EnsemblPlants:BRADI2G36230.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Bd21 {ECO:0000313|EnsemblPlants:BRADI2G36230.1}; RX PubMed=20148030; DOI=10.1038/nature08747; RG International Brachypodium Initiative; RT "Genome sequencing and analysis of the model grass Brachypodium RT distachyon."; RL Nature 463:763-768(2010). RN [2] {ECO:0000313|EnsemblPlants:BRADI2G36230.1} RP IDENTIFICATION. RC STRAIN=cv. Bd21 {ECO:0000313|EnsemblPlants:BRADI2G36230.1}; RG EnsemblPlants; RL Submitted (NOV-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_003568952.1; XM_003568904.2. DR STRING; 15368.BRADI2G36230.1; -. DR EnsemblPlants; BRADI2G36230.1; BRADI2G36230.1; BRADI2G36230. DR GeneID; 100831344; -. DR KEGG; bdi:100831344; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; I1HLY3; -. DR KO; K19347; -. DR OMA; VKHSEPF; -. DR Proteomes; UP000008810; Chromosome 2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008810}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008810}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 112 135 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 450 AA; 49599 MW; 7EE1365185F95113 CRC64; MSASTAAIPT TNTNGNHALS VDSHSSQDVR RRTAVVAKKI ATAELLTERG VNGVSDDKIT GKKDIGHTIR GESVIEKPKY SSEGRKDAFA SATTAEHRKK SATKQEKAKW EIALSVLMKL CLLISAIAWM GQVFWRWQNG DLSFTALDLE SRLSKVEGFK KTTKMLQVQL DILDKKLGNE IGKAKTDAGK QFEDKGNKLE AKMKTLEGKT DILDKSLAEL RDMGFVSRKE FNEIVSQVKK KKGANSDISL DDVRIIAKEI VEMEITRHAA DGLGMVDYAL GSGGGKVVKH SEPFKKAKSI LPRRSEAHKM LEPSFGQPGE CFALEGSSGF VEIKLRTGII PEAVTLEHVD QSVAYDRSSA PKDFQVSGWY QGPEDDSDKQ PRTTVNLGEF SYDLQKSNAQ TFQLDRTTAD ARVINTVRLD FSSNHGNSEL TCIYRFRVHG NEPGSLGTWA // ID I1HP44_BRADI Unreviewed; 475 AA. AC I1HP44; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 12. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:BRADI2G42670.1}; GN Name=BRADI2G42670 {ECO:0000313|EnsemblPlants:BRADI2G42670.1}; OS Brachypodium distachyon (Purple false brome) (Trachynia distachya). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Pooideae; Brachypodieae; Brachypodium. OX NCBI_TaxID=15368 {ECO:0000313|EnsemblPlants:BRADI2G42670.1, ECO:0000313|Proteomes:UP000008810}; RN [1] {ECO:0000313|EnsemblPlants:BRADI2G42670.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Bd21 {ECO:0000313|EnsemblPlants:BRADI2G42670.1}; RX PubMed=20148030; DOI=10.1038/nature08747; RG International Brachypodium Initiative; RT "Genome sequencing and analysis of the model grass Brachypodium RT distachyon."; RL Nature 463:763-768(2010). RN [2] {ECO:0000313|EnsemblPlants:BRADI2G42670.1} RP IDENTIFICATION. RC STRAIN=cv. Bd21 {ECO:0000313|EnsemblPlants:BRADI2G42670.1}; RG EnsemblPlants; RL Submitted (NOV-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 15368.BRADI2G42670.1; -. DR EnsemblPlants; BRADI2G42670.1; BRADI2G42670.1; BRADI2G42670. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; I1HP44; -. DR OMA; AVLKIMM; -. DR Proteomes; UP000008810; Chromosome 2. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008810}; KW Reference proteome {ECO:0000313|Proteomes:UP000008810}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 30 {ECO:0000256|SAM:SignalP}. FT CHAIN 31 475 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5003643008. SQ SEQUENCE 475 AA; 52197 MW; 4A408A10BC91E854 CRC64; MDGGLREVSV SVVFSVWCIL FLLRSQFLHS QTDPSDFDGE HGKRDNHCKV MPLEAYIFPA DNVSSPTCQS SSSPHHHQEV PPSNATGSNS SSEAAFVELD EFRILEGKAD NDTARHHQRV AVSGGASVTH RLEPSGAEYN YAAASKGAKV LAHNKEAKGA ANILVGDKDR YLRNPCSANN KFVVVELSEE TLVHTIALAN LEHYSSNFKD LELYGSLSYP AESWELLGRF AAENAKHAQR FVLPEPRWTR YLRLRLVSHY GSGFYCILSY FQVYGVDAVE QMLQDFIANH SSEGVDAPNA DARKDNNGRN DTAVGTPVDA KVDSGTRRND STSTDVVKNN ASKGGGAVDT KPPPQGKEQG KQASSSTGRI HSDAVIKILM QKMRSLEQGL LTLEDYTKVI SHRYGAKLPD LHNGLSQTTK ALDKMKADVK DLVEWKNNVA RDLGELKDWK SSVTGKLDDL IRENSAMRQV LALAL // ID I1HTX2_BRADI Unreviewed; 613 AA. AC I1HTX2; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 14. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:BRADI2G56660.1}; GN Name=BRADI2G56660 {ECO:0000313|EnsemblPlants:BRADI2G56660.1}; OS Brachypodium distachyon (Purple false brome) (Trachynia distachya). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Pooideae; Brachypodieae; Brachypodium. OX NCBI_TaxID=15368 {ECO:0000313|EnsemblPlants:BRADI2G56660.1, ECO:0000313|Proteomes:UP000008810}; RN [1] {ECO:0000313|EnsemblPlants:BRADI2G56660.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Bd21 {ECO:0000313|EnsemblPlants:BRADI2G56660.1}; RX PubMed=20148030; DOI=10.1038/nature08747; RG International Brachypodium Initiative; RT "Genome sequencing and analysis of the model grass Brachypodium RT distachyon."; RL Nature 463:763-768(2010). RN [2] {ECO:0000313|EnsemblPlants:BRADI2G56660.1} RP IDENTIFICATION. RC STRAIN=cv. Bd21 {ECO:0000313|EnsemblPlants:BRADI2G56660.1}; RG EnsemblPlants; RL Submitted (NOV-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_003564754.1; XM_003564706.2. DR STRING; 15368.BRADI2G56660.1; -. DR EnsemblPlants; BRADI2G56660.1; BRADI2G56660.1; BRADI2G56660. DR GeneID; 100840902; -. DR KEGG; bdi:100840902; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; I1HTX2; -. DR OMA; YGSASYC; -. DR Proteomes; UP000008810; Chromosome 2. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008810}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008810}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 32 50 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 554 574 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 595 612 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 484 511 {ECO:0000256|SAM:Coils}. FT COILED 526 553 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 613 AA; 68117 MW; B0998690F247D32F CRC64; MQKSRRDLMK RRAAAAAQEQ SVGAAAGRKR RLYGFSASLV VASWVALLLL NTLIGHGDGQ HDGGGPAVAL PVAGSTVNAS SVSPDVVHRG DEDNLAVSDD TCVKIDENVT ISAETRLQED EQCSTDDVPS EDMEALSKDD QIELSEDQGD SPFLTNVDSG APPAEKGNGE DVPKSARLSR VVPPGLDEFK TRAIAERGKD DSSQTGHVIH RREPSGKLYN YASAAKGAKV LDFNKEAKGA ANILDKDKDK YLRNPCSAEG KFVIIELSEE TLVDTIAIAN FEHYSSNLKE FEMLSSLVYP TENWETLGRF TVANAKHAQN FTFPEPKWAR YLKFNLLNHY GSASYCTLSM FEVYGMDAVE KMLENLIPVE NKNVESDDKL KEPIDQTPWK EPNGGKESSE EPLDEDEFEL EDDKTNGDSP RNGANDQIVE TRTLQAGRIP GDTVLKVLMQ KVQSLDVSFS VLERYLEELN SRYGQIFKDF DSEIDSKDAL LEKIKLELKQ LQISKDDFAK EIEGIISWKL VASSQLNQLL LDNAILRSEF ERFREKQVDL ENRSFAVIFL SFVFGCLAIG KLSIGMIFNI GRLYDLEKFD RVKSGWLVLL FSSCIIASIL VIQ // ID I1KFX8_SOYBN Unreviewed; 543 AA. AC I1KFX8; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:GLYMA06G48190.3}; OS Glycine max (Soybean) (Glycine hispida). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; OC Phaseoleae; Glycine; Soja. OX NCBI_TaxID=3847 {ECO:0000313|EnsemblPlants:GLYMA06G48190.3, ECO:0000313|Proteomes:UP000008827}; RN [1] {ECO:0000313|EnsemblPlants:GLYMA06G48190.3} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:GLYMA06G48190.3}; RX PubMed=20075913; DOI=10.1038/nature08670; RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W., RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., RA May G.D., Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., RA Sandhu D., Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., RA Goodstein D., Barry K., Futrell-Griggs M., Abernathy B., Du J., RA Tian Z., Zhu L., Gill N., Joshi T., Libault M., Sethuraman A., RA Zhang X.-C., Shinozaki K., Nguyen H.T., Wing R.A., Cregan P., RA Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.C., RA Jackson S.A.; RT "Genome sequence of the palaeopolyploid soybean."; RL Nature 463:178-183(2010). RN [2] {ECO:0000313|EnsemblPlants:GLYMA06G48190.3} RP IDENTIFICATION. RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:GLYMA06G48190.3}; RG EnsemblPlants; RL Submitted (MAY-2013) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 3847.GLYMA06G48190.4; -. DR EnsemblPlants; GLYMA06G48190.3; GLYMA06G48190.3; GLYMA06G48190. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000008827; Chromosome 6. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008827}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008827}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 25 47 Helical. FT COILED 513 543 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 543 AA; 61761 MW; 57C736E97476E9A8 CRC64; MQRSRKALLE RRAIQKATSG RNYVYLYKVS LSLVFVLWGL VFLFSLWTSH GHGYGDHESR EVPVGVSNWN EDEHRQCKKS NSADEYLTKE TDDVYIPSET FCSDGAKTDG LIGESLSSGE SINRVETGYK ENYISPDTEE HEVERSKSAA KHQNDVQKYN HLSQAMPLGL DEFKSRAIGS KIKSGTNPSG SVIHRLEPGG AEYNYASASK GAKVLASNKE ARGASDILSR NKDKYLRNPC SSEEKFVVIE LSEETLVKTI EIANFEHHSS NFKEFELYGS LVYPTDAWIF LGNFTASNVK QAQRFVLEEQ KWMRYIKLNL QSHYGSEFYC TLSIVEVYGV DAIERMLEDL IYAQDKPFAS GEGNGEKRVA SPLSNAAKAD NVRPNTITGI NSDPASEISS ENQEAIIVKR NVPDPVEEIR QQVGRMPGDT VLKILMQKVR YLDLNLSVLE QYMEDLNSRY INIFKEYSKD MGEKDLLLEK IKEEISRFLE RQDVMMKEFS DLDSWRSHFS VQLDHVLRDN AVLRSEVEKV RENQVSLENK VVS // ID I1KZR2_SOYBN Unreviewed; 541 AA. AC I1KZR2; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 20. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:GLYMA09G00440.1}; OS Glycine max (Soybean) (Glycine hispida). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; OC Phaseoleae; Glycine; Soja. OX NCBI_TaxID=3847 {ECO:0000313|EnsemblPlants:GLYMA09G00440.1, ECO:0000313|Proteomes:UP000008827}; RN [1] {ECO:0000313|EnsemblPlants:GLYMA09G00440.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:GLYMA09G00440.1}; RX PubMed=20075913; DOI=10.1038/nature08670; RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W., RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., RA May G.D., Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., RA Sandhu D., Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., RA Goodstein D., Barry K., Futrell-Griggs M., Abernathy B., Du J., RA Tian Z., Zhu L., Gill N., Joshi T., Libault M., Sethuraman A., RA Zhang X.-C., Shinozaki K., Nguyen H.T., Wing R.A., Cregan P., RA Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.C., RA Jackson S.A.; RT "Genome sequence of the palaeopolyploid soybean."; RL Nature 463:178-183(2010). RN [2] {ECO:0000313|EnsemblPlants:GLYMA09G00440.1} RP IDENTIFICATION. RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:GLYMA09G00440.1}; RG EnsemblPlants; RL Submitted (MAY-2013) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_003534427.1; XM_003534379.2. DR STRING; 3847.GLYMA09G00440.1; -. DR EnsemblPlants; GLYMA09G00440.1; GLYMA09G00440.1; GLYMA09G00440. DR GeneID; 100783254; -. DR KEGG; gmx:100783254; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; I1KZR2; -. DR OMA; PESEDAH; -. DR Proteomes; UP000008827; Chromosome 9. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008827}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008827}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 476 496 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 517 539 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 413 433 {ECO:0000256|SAM:Coils}. FT COILED 455 475 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 541 AA; 61186 MW; 5715FE521B055C35 CRC64; MQRSREALLQ RRRAKTQLSL SLLFVLWPLI FLFSRAHGYT PTPSVRLSNW KEDKHRQCKT SNSANKCLLK ETHDYILPNY KEDCDTPTVV DAQMSDHLPW AVPLGLDEFK SRAISSKIKS GTSGSSGSVM HRVEPGGAEY NYASASMGAK LLGSNKEAKG ASNILSRDKD KYLRNPCSAE DKFVIIELSE ETLVDTIEIA NFEHHSSNLK AFELLGSLSF PTDVWVFLGN FTASNVRHAQ RFVLQQPKWV RYLKLNLQSH YGSEFYCTLS VVEVYGVDAV ERMLEDLIHT QDNLLAPGDG NADKMTVSPH PNPPESEDAH QNTFGGINSY PASDISSANH EKLNSNVPDP VEEIRQQVGR MPGDTVLKIL MQKVRTLDLN LFVLERYMED LNTRYVNIFK EYSKDIGGKD ILIQNIKEDI RNLVDQQDAI TKDGSDLKSW KSHISMQFGH LLRDNAVLRS EVNEVRRKQA SLENKGVLVF LVCCIFSMLV ILRLSLDMAT SVYRVLQSVN RTDCSRKFCA VSSSWFLLLL NCIIIIFILT L // ID I1KZR3_SOYBN Unreviewed; 448 AA. AC I1KZR3; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 15. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:GLYMA09G00440.2}; OS Glycine max (Soybean) (Glycine hispida). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; OC Phaseoleae; Glycine; Soja. OX NCBI_TaxID=3847 {ECO:0000313|EnsemblPlants:GLYMA09G00440.2, ECO:0000313|Proteomes:UP000008827}; RN [1] {ECO:0000313|EnsemblPlants:GLYMA09G00440.2} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:GLYMA09G00440.2}; RX PubMed=20075913; DOI=10.1038/nature08670; RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W., RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., RA May G.D., Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., RA Sandhu D., Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., RA Goodstein D., Barry K., Futrell-Griggs M., Abernathy B., Du J., RA Tian Z., Zhu L., Gill N., Joshi T., Libault M., Sethuraman A., RA Zhang X.-C., Shinozaki K., Nguyen H.T., Wing R.A., Cregan P., RA Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.C., RA Jackson S.A.; RT "Genome sequence of the palaeopolyploid soybean."; RL Nature 463:178-183(2010). RN [2] {ECO:0000313|EnsemblPlants:GLYMA09G00440.2} RP IDENTIFICATION. RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:GLYMA09G00440.2}; RG EnsemblPlants; RL Submitted (MAY-2013) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 3847.GLYMA09G00440.1; -. DR EnsemblPlants; GLYMA09G00440.2; GLYMA09G00440.2; GLYMA09G00440. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000008827; Chromosome 9. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008827}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008827}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 383 403 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 424 446 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 320 340 {ECO:0000256|SAM:Coils}. FT COILED 362 382 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 448 AA; 50298 MW; 094D58664F232DB8 CRC64; MSDHLPWAVP LGLDEFKSRA ISSKIKSGTS GSSGSVMHRV EPGGAEYNYA SASMGAKLLG SNKEAKGASN ILSRDKDKYL RNPCSAEDKF VIIELSEETL VDTIEIANFE HHSSNLKAFE LLGSLSFPTD VWVFLGNFTA SNVRHAQRFV LQQPKWVRYL KLNLQSHYGS EFYCTLSVVE VYGVDAVERM LEDLIHTQDN LLAPGDGNAD KMTVSPHPNP PESEDAHQNT FGGINSYPAS DISSANHEKL NSNVPDPVEE IRQQVGRMPG DTVLKILMQK VRTLDLNLFV LERYMEDLNT RYVNIFKEYS KDIGGKDILI QNIKEDIRNL VDQQDAITKD GSDLKSWKSH ISMQFGHLLR DNAVLRSEVN EVRRKQASLE NKGVLVFLVC CIFSMLVILR LSLDMATSVY RVLQSVNRTD CSRKFCAVSS SWFLLLLNCI IIIFILTL // ID I1M0G8_SOYBN Unreviewed; 464 AA. AC I1M0G8; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 18. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:GLYMA13G25480.1}; OS Glycine max (Soybean) (Glycine hispida). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; OC Phaseoleae; Glycine; Soja. OX NCBI_TaxID=3847 {ECO:0000313|EnsemblPlants:GLYMA13G25480.1, ECO:0000313|Proteomes:UP000008827}; RN [1] {ECO:0000313|EnsemblPlants:GLYMA13G25480.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:GLYMA13G25480.1}; RX PubMed=20075913; DOI=10.1038/nature08670; RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W., RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., RA May G.D., Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., RA Sandhu D., Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., RA Goodstein D., Barry K., Futrell-Griggs M., Abernathy B., Du J., RA Tian Z., Zhu L., Gill N., Joshi T., Libault M., Sethuraman A., RA Zhang X.-C., Shinozaki K., Nguyen H.T., Wing R.A., Cregan P., RA Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.C., RA Jackson S.A.; RT "Genome sequence of the palaeopolyploid soybean."; RL Nature 463:178-183(2010). RN [2] {ECO:0000313|EnsemblPlants:GLYMA13G25480.1} RP IDENTIFICATION. RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:GLYMA13G25480.1}; RG EnsemblPlants; RL Submitted (MAY-2013) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_003542777.1; XM_003542729.2. DR STRING; 3847.GLYMA13G25480.1; -. DR PRIDE; I1M0G8; -. DR EnsemblPlants; GLYMA13G25480.1; GLYMA13G25480.1; GLYMA13G25480. DR GeneID; 100796587; -. DR KEGG; gmx:100796587; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; I1M0G8; -. DR KO; K19347; -. DR OMA; YETEMAF; -. DR Proteomes; UP000008827; Chromosome 13. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008827}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008827}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 107 128 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 178 227 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 464 AA; 50489 MW; 074C144A821B7C3C CRC64; MSASTVSITA ANPGTRRRPV IATEKKTATS LELLANDVAV SPAVATSGDG ATGRDLSHHS IRGEALLDRA PRDLTPAKKV AGAGPNSASG PPRRTRKPAA KSEKPRWLTL VSIFGKNLVL LVVLAGLVQL IRRMSLKSGD AAAGGFAGFS EFEGRISDVE GLLKKTAKMI QVQVDVVDKK IEDEVRGLRK ELNEKIEEKG VILESGLKKL EAKNEELEKY LSELKGENWL SKEEFEKFVE EVRSVKGSGY EGGGLDEIRE FARGVIEKEI EKHAADGLGR VDYALASGGG TVVKHSEVFD LGRGNWFLKS ARNGVNPNAE KMLKPSFGEP GQCFPLKDTR GFVQIRLRTA IIPEAVTLEH VAKSVAYDRS SAPKDCRVSG WLQEHNADSA IDTEKMHLLS EFTYDLEKSN AQTFNVLNSA ASGVINMVRL DFTSNHGSPS HTCIYRFRVH GHEPDSVSMM ALES // ID I1MII0_SOYBN Unreviewed; 462 AA. AC I1MII0; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 09-JAN-2013, sequence version 2. DT 11-NOV-2015, entry version 21. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:GLYMA15G35240.1}; OS Glycine max (Soybean) (Glycine hispida). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; OC Phaseoleae; Glycine; Soja. OX NCBI_TaxID=3847 {ECO:0000313|EnsemblPlants:GLYMA15G35240.1, ECO:0000313|Proteomes:UP000008827}; RN [1] {ECO:0000313|EnsemblPlants:GLYMA15G35240.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:GLYMA15G35240.1}; RX PubMed=20075913; DOI=10.1038/nature08670; RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W., RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., RA May G.D., Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., RA Sandhu D., Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., RA Goodstein D., Barry K., Futrell-Griggs M., Abernathy B., Du J., RA Tian Z., Zhu L., Gill N., Joshi T., Libault M., Sethuraman A., RA Zhang X.-C., Shinozaki K., Nguyen H.T., Wing R.A., Cregan P., RA Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.C., RA Jackson S.A.; RT "Genome sequence of the palaeopolyploid soybean."; RL Nature 463:178-183(2010). RN [2] {ECO:0000313|EnsemblPlants:GLYMA15G35240.1} RP IDENTIFICATION. RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:GLYMA15G35240.1}; RG EnsemblPlants; RL Submitted (MAY-2013) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR RefSeq; XP_003546680.1; XM_003546632.2. DR RefSeq; XP_006598048.1; XM_006597985.1. DR RefSeq; XP_006598049.1; XM_006597986.1. DR STRING; 3847.GLYMA15G35240.1; -. DR EnsemblPlants; GLYMA15G35240.1; GLYMA15G35240.1; GLYMA15G35240. DR GeneID; 100813866; -. DR KEGG; gmx:100813866; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; I1MII0; -. DR KO; K19347; -. DR OMA; MEIARHS; -. DR Proteomes; UP000008827; Chromosome 15. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008827}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008827}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 105 130 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 176 225 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 462 AA; 50602 MW; 373A7854957DA3C6 CRC64; MSASTVSITA ANPGARRRPV IATEKKTATN LELLANDVAV SPAVATSGDG ATGRDLSHHS VRGEALLDRT PRDLAPAKKV AGGNSSSVPP RRARKLSAKA EKPRWLTLVS IFGKNMVLLV VLAGLVQLIW RMSLKSGDGM AGGYVGFSEF EGRISDVEGL LKKTAKMIQV QVDVVDKKIE DEVRGLRREL NEKIEEKGEI LENGLKKMEA KNEELERYLS ELKGEDWLSK EEFEKFVDEV RSVKGSGYEG GGLDEIREFA RGVIVKEIEK HAADGLGRVD YALASSGGAV VKHSEVFDLV RGNWFLKSAR NGVHPNAEKM LKPSFGEPGQ CFPLKDSRGF VQIRLRTAII PEAVTLEHVA KSVAYDRSSA PKDCRVSGWL QEHNADSAIN TEKMHLLAEF TYDLEKSNAQ TFNVLNSAAS GVINTVRLDF TSNHGSPSHT CIYRFRVHGH EPDSVSMLAQ EL // ID I1NTW1_ORYGL Unreviewed; 625 AA. AC I1NTW1; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:ORGLA01G0330100.1}; OS Oryza glaberrima (African rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=4538 {ECO:0000313|EnsemblPlants:ORGLA01G0330100.1, ECO:0000313|Proteomes:UP000007306}; RN [1] {ECO:0000313|EnsemblPlants:ORGLA01G0330100.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=IRGC 96717 {ECO:0000313|EnsemblPlants:ORGLA01G0330100.1}; RA Wing R.A., Yu Y., Rounsley S., Reddy-Marri P., Goicoechea J.L., RA Sisneros N., Lee S., Song X., Angelova A., Kudrna D.P., de Baynast K., RA Zuccolo A.; RT "The complete genome of Oryza glaberrima."; RL Submitted (JUN-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblPlants:ORGLA01G0330100.1} RP IDENTIFICATION. RG EnsemblPlants; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 4538.ORGLA01G0330100.1; -. DR EnsemblPlants; ORGLA01G0330100.1; ORGLA01G0330100.1; ORGLA01G0330100. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OMA; YGSASYC; -. DR Proteomes; UP000007306; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007306}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007306}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 42 60 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 566 586 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 607 624 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 503 523 {ECO:0000256|SAM:Coils}. FT COILED 538 565 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 625 AA; 69846 MW; 0A0AE4D336993A4B CRC64; MQRSRRALLK RKAAAAAKEE EEEAGVGVAT AAAAGRRRRR RLYGFSVSLV VACWVVLLLL NPLVGHGNGQ RDEGIFADEG SSDPSFDSVE PTLSEGSVDS VVQQENGENH ALPGDSCAKP DENHVLSEET LLEKDQLCSN DEAQGDGMDA LPKDNVDQGE NLPRTDDDSV VHPEGEVESE GVPRPARLSR VVPPGLDEFK TRAIAERGKG VPSGQPGNVI HRREPSGKLY NYASAAKGAK VLEFNKEAKG ASNILDKDKD KYLRNPCSAE GKFVIIELSE ETLVDTIAIA NFEHYSSNLK EFEMLSSLNY PTDSWETLGR FTVANAKIAQ NFTFPEPKWA RYLKLNLLSH YGSEFYCTLS MLEVYGMDAV EKMLENLIPV ENKRLEPDDK MKEPVDQQTQ LKEPTEGKES SHEPLDEDEF ELEDDKLNGD SSKNGAHDQV TETRPIQAGR IPGDTVLKVL MQKVQSLDVS FTVLERYLEE LNSRYGQIFK DFDADIDTKD ALLEKIKLEL KHLESSKDDF AKEIEGILSW KLVASSQLNQ LLLDNVRIRS ELERFREKQA DLENRSFAVI FLSFVFGCLA IAKLSIGMIF NTCRLYNFEK FDRVKSGWLV LLFSSCIIAS ILIIQ // ID I1PTX1_ORYGL Unreviewed; 453 AA. AC I1PTX1; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:ORGLA05G0081900.1}; OS Oryza glaberrima (African rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=4538 {ECO:0000313|EnsemblPlants:ORGLA05G0081900.1, ECO:0000313|Proteomes:UP000007306}; RN [1] {ECO:0000313|EnsemblPlants:ORGLA05G0081900.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=IRGC 96717 {ECO:0000313|EnsemblPlants:ORGLA05G0081900.1}; RA Wing R.A., Yu Y., Rounsley S., Reddy-Marri P., Goicoechea J.L., RA Sisneros N., Lee S., Song X., Angelova A., Kudrna D.P., de Baynast K., RA Zuccolo A.; RT "The complete genome of Oryza glaberrima."; RL Submitted (JUN-2010) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblPlants:ORGLA05G0081900.1} RP IDENTIFICATION. RG EnsemblPlants; RL Submitted (JUN-2015) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR STRING; 4538.ORGLA05G0081900.1; -. DR EnsemblPlants; ORGLA05G0081900.1; ORGLA05G0081900.1; ORGLA05G0081900. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR OMA; VKHSEPF; -. DR Proteomes; UP000007306; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007306}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007306}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 113 136 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 190 224 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 453 AA; 50383 MW; B2D50ADBA337CB52 CRC64; MSVSTAAVPT ANTNGNHALS MDSHSSQDVR RRTVVVARKK ASPELLADGG FNGTSSVDKI TDKKDLSHTI RGESVLGKSK YPLEARKDAI ASAAAADRRK KSGAKQEKAK WEIALSVLMK LCLLISAVAW MGQLFWRWQN GDLSFTTLDM ESRLSKVEGF KKTTKMLQVQ LDILDKKLGN EIDKTRRDIT KQFEDKGNKL EIKMKALEGK TDKLDKSLAE LRDMGFVSKK EFDEIVEQLK KKKGLDGTVG DISLDDIRLF AKEIVEMEIE RHAADGLGMV DYALASGGGK VVKHSEAFRK AKSFMPSRNS LLEPAKKMLE PSFGQPGECF ALQGSSGYVE IKLRTGIIPE AVSLEHVDKS VAYDRSSAPK DFQVSGWYEG PEDDSDKESR VVTNLGEFSY DLEKNNVQTF QLERTADSRV INMVRLDFSS NHGNSELTCI YRFRMHGREP GSP // ID I1S7G2_GIBZE Unreviewed; 656 AA. AC I1S7G2; DT 13-JUN-2012, integrated into UniProtKB/TrEMBL. DT 13-JUN-2012, sequence version 1. DT 11-NOV-2015, entry version 17. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ESU11685.1}; GN ORFNames=FGSG_12785 {ECO:0000313|EMBL:ESU11685.1}; OS Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC 9075 / NRRL 31084) OS (Wheat head blight fungus) (Fusarium graminearum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Nectriaceae; OC Fusarium. OX NCBI_TaxID=229533 {ECO:0000313|EMBL:ESU11685.1}; RN [1] {ECO:0000313|EMBL:ESU11685.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=PH-1 {ECO:0000313|EMBL:ESU11685.1}; RG The Broad Institute Genome Sequencing Platform; RA Birren B., Lander E., Galagan J., Nusbaum C., Devon K., Ma L.-J., RA Jaffe D., Butler J., Alvarez P., Gnerre S., Grabherr M., Kleber M., RA Mauceli E., Brockman W., MacCallum I.A., Young S., LaButti K., RA DeCaprio D., Crawford M., Koehrsen M., Engels R., Montgomery P., RA Pearson M., Howarth C., Larson L., White J., O'Leary S., Kodira C., RA Zeng Q., Yandava C., Alvarado L., Kistler C., Xu J.-R., Trail F.; RT "Genome Sequence of Fusarium graminearum (Gibberella zeae)."; RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DS231665; ESU11685.1; -; Genomic_DNA. DR RefSeq; XP_011324261.1; XM_011325959.1. DR EnsemblFungi; ESU11685; ESU11685; FGSG_12785. DR GeneID; 23559597; -. DR KEGG; fgr:FGSG_12785; -. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR InParanoid; I1S7G2; -. DR OrthoDB; EOG7P8PJ5; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 100 120 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 192 212 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 656 AA; 75260 MW; 5C0A87F2EEE3B00D CRC64; MAADPWYFEW MRFRGWLRFI FWPPYWFGRG GGDPFDFTFN DDIDESEESP TEWGRLFNPM TYLLVLKRWF DGIMDRVFRF IDRLSGIQIG QVQRSFAERI AWAVALIGFA FFLLMGSGAL HHIPEIPDID YLKPSVSWPS TDGFSFGNII PSVPTVSWPS WPSWSRDSDD LPFYDPFAMD DVVIPDDHKR ALDALKNQAE IHKKALKRLE TILPRIVHMD LVKGRPSIKP EFWHALQDHL RESGSFLNLD NKRGNYEISS EQQWKAIVAR LGKDPTFKGK LDGIVGNAIQ DRLPNFWDTW FRNNNAVLEP LVEKAMAKKQ TAGSGAAFDQ KLSKIVSDQL RKQNQTAVSR DDFLAHLRDD LTKHESQVQA EFSRLKSDMD NHIKESIRTA KMMAPQTMSN TEMKQLIRKI VHQTLTDVSL TAVAKSKIHA HWHSDLKYQV NFFGIGAGAT METHYTAPDW NPYTARTTEK DALALGLTGI HPRPRIEVLL PWQEEGDRWC GSHAVDSDGR PHGVGVSIHL GHLVIPENIA VEHIHPNATL DPDARPRHIE VFAKFEFKEE QELVRDYSSN KFPENINGWN FNPSPLPDSF VKITQFEYQG DELNEGVHVH HINDEFANLG IPTDHVIIRA MSNYGAPDHT CFYRVRLFGR PVDELS // ID O22992_ARATH Unreviewed; 466 AA. AC O22992; DT 01-JAN-1998, integrated into UniProtKB/TrEMBL. DT 01-JAN-1998, sequence version 1. DT 14-OCT-2015, entry version 55. DE SubName: Full=T19F6.21 protein {ECO:0000313|EMBL:AAB63627.1}; GN Name=T19F6.21 {ECO:0000313|EMBL:AAB63627.1}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702; RN [1] {ECO:0000313|EMBL:AAB63627.1} RP NUCLEOTIDE SEQUENCE. RA Rounsley S.D., Lin X., Ketchum K.A., Crosby M.L., Brandon R.C., RA Spriggs T.A., Mason T.M., Kerlavage A.R., Adams M.D., Somerville C.R., RA Venter J.C.; RT "Arabidopsis thaliana chromosome IV BAC T19F6 genomic sequence."; RL Submitted (JUL-1997) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:AAB63627.1} RP NUCLEOTIDE SEQUENCE. RA Lin X.; RL Submitted (APR-1999) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC002343; AAB63627.1; -; Genomic_DNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 406 425 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 446 465 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 296 316 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 466 AA; 52802 MW; C1362320A9B88251 CRC64; MLIKETNLGS FNLGGIFQRL LYCVPLKKSI YFVDRIGNYT DGSVSKTLNS TSSVFPQATE KENNFCLLRK GQLQDVYEHV LVNNALLICK VVLPERRISK KTLEARDPRY VNLEDKSLKV NGSSQLVNNG TRYRLEPDGN GYNYASAMKG AKVVDHNKEA KGASNVLGKD HDKYLRNPCS VSDKYVVIEL AEETLVDTVR IANFEHYSSN PKEFSLSGSL SFPSDMWTPA GSFAAANVKQ IQSFRLPEPK XTDQIGKETE AQKKKDDVVK TINIIGDKKY EVKEKHNVLK VMMQKVKLIE MNLSLLEDSV KKMNDKQPEV SLEMKKTLVL VEKSKADIRE ITEWKGKMKL PMNLIFFEQE KELRDLELWK TLVASRVESL ARGNSALRLD VEKIVKEQAN LESKELGVLL ISLFFVVLAT IRLVSTRLWA FLGMSITDKA RSLWPDSGWV MILLSSSIMI FIHLLS // ID O23133_ARATH Unreviewed; 639 AA. AC O23133; DT 01-JAN-1998, integrated into UniProtKB/TrEMBL. DT 01-JAN-1998, sequence version 1. DT 11-NOV-2015, entry version 64. DE SubName: Full=Putative uncharacterized protein F19G10.15 {ECO:0000313|EMBL:AAB72170.1}; GN Name=F19G10.15 {ECO:0000313|EMBL:AAB72170.1}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702; RN [1] {ECO:0000313|EMBL:AAB72170.1} RP NUCLEOTIDE SEQUENCE. RA Federspiel N.A., Palm C.J., Conway A.B., Kurtz D.B., Conway A.R., RA Au M., Araujo R., Buehler E., Dewar K., Feng J., Kim C., Li Y., RA Oji O., Osborne B.I., Shinn P., Sun H., Toriumi M., Vyotskaia V., RA Yu G., Ecker J., Theologis A., Davis R.W.; RL Submitted (APR-1997) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AF000657; AAB72170.1; -; Genomic_DNA. DR PIR; H86362; H86362. DR ProteinModelPortal; O23133; -. DR STRING; 3702.AT1G22882.1; -. DR PaxDb; O23133; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 466 486 {ECO:0000256|SAM:Coils}. FT COILED 522 560 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 639 AA; 71773 MW; 2D25ACCC1DC4BBD6 CRC64; MLRTNSCVLV DEPLNDSMGM ADPDDGQSDE KVVPFDGPLS LASASVDVTS DLSRNDDVNL SEESEDKEQE AEISSTVSGN DIESKDTYLL KQSEINKKDT GIDAGSKYDD FPKKSEINNT GTWNDTEGKD DNNFLKQSQL NKTGTGNDTE SSDNEFLEQN QMNKTVLGNG TEINVSKVDQ PSRAVPLGLD EFKSRASNSR NKSLSDQVSG VIHRMEPGGK EYNYASASKG AKVLSSNKEA KGAASILSRD NDKYLRNPCS TEGKFVVVEL SEETLVNTIK IANFEHYSSN LKEFELQGTL VYPTDTWVHM GNFTASNVKH EQNFTLLEPK WVRYLKLNFI SHYGSEFYCT LSLIEVYGVD AVERMLEDLI SVQDNKNAYK PREGDSEHKE KPMQQIESLE GDDGADKSTH REKEKEAPPE NMLAKTEASM AKSSNKLSEP VEEMRHHQPG SRMPGDTVLK ILMQKLRSLD LNLSILERYL EELNLRYGNI FKEMDREAGV REKAIVALRL DLEGMKERQE GMVSEAEEMK EWRKRVEAEM EKAEKEKENI RQSLEQIVFD TMKSSYIVYR KIEKNGHNSF ADVSKSENFF CLEKLIIQCG AGNNLRLDCV SHPSPPPPHR SMAPPIFVPP STSHKGQGP // ID O61794_CAEEL Unreviewed; 802 AA. AC O61794; DT 01-AUG-1998, integrated into UniProtKB/TrEMBL. DT 01-OCT-2001, sequence version 2. DT 11-NOV-2015, entry version 86. DE SubName: Full=SUn (SUN) domain Containing Ossification factor homolog {ECO:0000313|EMBL:CCD65168.1}; GN Name=suco-1 {ECO:0000313|EMBL:CCD65168.1, GN ECO:0000313|WormBase:R12E2.2}; GN ORFNames=CELE_R12E2.2 {ECO:0000313|EMBL:CCD65168.1}, GN R12E2.2 {ECO:0000313|WormBase:R12E2.2}; OS Caenorhabditis elegans. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6239 {ECO:0000313|EMBL:CCD65168.1, ECO:0000313|Proteomes:UP000001940}; RN [1] {ECO:0000313|Proteomes:UP000001940} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Bristol N2 {ECO:0000313|Proteomes:UP000001940}; RX PubMed=9851916; DOI=10.1126/science.282.5396.2012; RG The C. elegans sequencing consortium; RT "Genome sequence of the nematode C. elegans: a platform for RT investigating biology."; RL Science 282:2012-2018(1998). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BX284601; CCD65168.1; -; Genomic_DNA. DR PIR; T33098; T33098. DR RefSeq; NP_491321.1; NM_058920.4. DR UniGene; Cel.17379; -. DR ProteinModelPortal; O61794; -. DR STRING; 6239.R12E2.2.1; -. DR PaxDb; O61794; -. DR EnsemblMetazoa; R12E2.2; R12E2.2; WBGene00020031. DR GeneID; 172011; -. DR KEGG; cel:CELE_R12E2.2; -. DR UCSC; R12E2.2; c. elegans. DR CTD; 172011; -. DR WormBase; R12E2.2; CE28764; WBGene00020031; suco-1. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR InParanoid; O61794; -. DR OMA; ERCEETQ; -. DR NextBio; 873653; -. DR Proteomes; UP000001940; Chromosome I. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001940}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001940}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 20 {ECO:0000256|SAM:SignalP}. FT CHAIN 21 802 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004159182. FT TRANSMEM 646 668 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 802 AA; 90644 MW; 439762F6DE687CCC CRC64; MKLKRLLIFA VLLVLPNINA NQEVSFVKHW KDILLTDGEH SPICALSIDG CARSAPYNVS KKIMKASVNA SEKDSIPEKS IDNFDEWTKK RRDAVANQNG QNQKTIEPTQ VARHDEVSIP LPPITRPPRN FASRECGAKI IAANPEAENA KAVVNEKDVD DYMRNPCQSA KEKFIVIELC EAIQIKKIAI GNFELFASRP KTIQVFISER YPPLANWISV GPFHLQDHHK NLQTFDVPNT NVYAKYVRIN LEDHYGKEHY CIVSVVNVMG STLADEYDKE EAAAHLLNVI EEVKEEPVTT PPPSEQKMQT QLPVPPKSPN QTVSARMKSV DFRQLKSVCS QCSVGKVSNL ICHILPIPTR VKKNDLVKPP NNLKTSTIKP AVTDKRDLKT EIGLWAERSR HSNFEQSRRR NLATIQRLHP KDVQKISTAP DVPTPKAENI EKPTEKPSEE VKPPAREQPQ VSLPPKPKSE PILPAGGSTN QRELVLMKLS KRIAAVELNL TLSSEYLSEL SKQYVSQMSG YQQELKETRK ASKKTAQTVE AMMRSKMNGV KRELRDLRQS VYLLQKLENS RYNNVQSEMS RQVLMSSCHI SSNVPPSPTI ARLPLIIPAL NRKLENFTNF EERMKKIYET AKSVMFGSLT WNTDHLIVAL ISFNIMALSF LFAGVFYIHR RNKERCEETQ IIVKNELRAR IAKVGIENRK LISKGMRRAE LAVTAAVSSA LKIEKTSSNR SAMTELETAL ANLFEAQQTR IEEQFEQNQK MLRDALSERN GSRFDDTLSV EDSESSSETE HSKEDTPTLN AD // ID SUCO_HUMAN Reviewed; 1254 AA. AC Q9UBS9; B2RNU4; Q9BQB9; Q9BXQ2; Q9UL04; DT 11-SEP-2007, integrated into UniProtKB/Swiss-Prot. DT 01-MAY-2000, sequence version 1. DT 11-NOV-2015, entry version 103. DE RecName: Full=SUN domain-containing ossification factor; DE AltName: Full=Membrane protein CH1; DE AltName: Full=Protein osteopotentia homolog; DE AltName: Full=SUN-like protein 1; DE Flags: Precursor; GN Name=SUCO; Synonyms=C1orf9, CH1, OPT, SLP1; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), AND TISSUE SPECIFICITY. RC TISSUE=Testis; RX PubMed=10673381; DOI=10.1006/bbrc.1999.2016; RA Roesok O., Pedeutour F., Odeberg J., Lundeberg J., Aasheim H.-C.; RT "The C1orf9 gene encodes a putative transmembrane member of a novel RT protein family."; RL Biochem. Biophys. Res. Commun. 267:855-862(2000). RN [2] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 2). RC TISSUE=Mammary gland; RA Chen L.-C., Cheung J., Moore D., Ljung B.-M., Kuo W.-L., Collins C., RA Gray J.W., Smith H.S.; RT "Cloning of an overexpressed gene on chromosome 1 in breast cancer RT defines a new gene family."; RL Submitted (OCT-1998) to the EMBL/GenBank/DDBJ databases. RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). RA Rhodes S.; RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases. RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16710414; DOI=10.1038/nature04727; RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., RA Dunham A., Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., RA Jones M.C., Gillson C., Searle S., Zhou Y., Kokocinski F., RA McDonald L., Evans R., Phillips K., Atkinson A., Cooper R., Jones C., RA Hall R.E., Andrews T.D., Lloyd C., Ainscough R., Almeida J.P., RA Ambrose K.D., Anderson F., Andrew R.W., Ashwell R.I.S., Aubin K., RA Babbage A.K., Bagguley C.L., Bailey J., Beasley H., Bethel G., RA Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J., Buckley D., RA Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y., Clarke G., RA Clee C., Cobley V., Collier R.E., Corby N., Coville G.J., Davies J., RA Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H., RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L., RA Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J., RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., RA Hammond S., Harrison E.S.I., Hart E., Haugen E., Heath P.D., RA Holmes S., Holt K., Howden P.J., Hunt A.R., Hunt S.E., Hunter G., RA Isherwood J., James R., Johnson C., Johnson D., Joy A., Kay M., RA Kershaw J.K., Kibukawa M., Kimberley A.M., King A., Knights A.J., RA Lad H., Laird G., Lawlor S., Leongamornlert D.A., Lloyd D.M., RA Loveland J., Lovell J., Lush M.J., Lyne R., Martin S., RA Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W., McLaren S., RA Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N., RA Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V., RA Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J., RA Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E., RA Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., RA Subramanian S., Sycamore N., Tracey A., Tromans A., Van Helmond Z., RA Wall M., Wallis J.M., White S., Whitehead S.L., Wilkinson J.E., RA Willey D.L., Williams H., Wilming L., Wray P.W., Wu Z., Coulson A., RA Vaudin M., Sulston J.E., Durbin R.M., Hubbard T., Wooster R., RA Dunham I., Carter N.P., McVean G., Ross M.T., Harrow J., Olson M.V., RA Beck S., Rogers J., Bentley D.R.; RT "The DNA sequence and biological annotation of human chromosome 1."; RL Nature 441:315-321(2006). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., RA Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., RA Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., RA Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R., RA Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., RA Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., RA Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., RA Venter J.C.; RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases. RN [6] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). RC TISSUE=Brain; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [7] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 22-244. RX PubMed=11158380; DOI=10.1093/oxfordjournals.molbev.a003795; RA Yu N., Zhao Z., Fu Y.-X., Sambuughin N., Ramsay M., Jenkins T., RA Leskinen E., Patthy L., Jorde L.B., Kuromori T., Li W.-H.; RT "Global patterns of human DNA sequence variation in a 10-kb region on RT chromosome 1."; RL Mol. Biol. Evol. 18:214-222(2001). RN [8] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Cervix carcinoma; RX PubMed=18669648; DOI=10.1073/pnas.0805139105; RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., RA Elledge S.J., Gygi S.P.; RT "A quantitative atlas of mitotic phosphorylation."; RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008). CC -!- FUNCTION: Required for bone modeling during late embryogenesis. CC Regulates type I collagen synthesis in osteoblasts during their CC postnatal maturation (By similarity). {ECO:0000250}. CC -!- SUBCELLULAR LOCATION: Rough endoplasmic reticulum membrane CC {ECO:0000250}; Single-pass type I membrane protein {ECO:0000250}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=1; CC IsoId=Q9UBS9-1; Sequence=Displayed; CC Name=2; CC IsoId=Q9UBS9-2; Sequence=VSP_027921, VSP_027922, VSP_027923; CC Note=No experimental confirmation available.; CC -!- TISSUE SPECIFICITY: Highly expressed in pancreas and testis and to CC a lower extent in prostate, ovary, heart, thymus, small intestine CC and spleen. {ECO:0000269|PubMed:10673381}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AJ250075; CAB57360.1; -; mRNA. DR EMBL; AF097535; AAF04619.1; -; mRNA. DR EMBL; AL035291; CAA22894.1; -; mRNA. DR EMBL; Z96050; CAI19034.1; -; Genomic_DNA. DR EMBL; Z94054; CAI19034.1; JOINED; Genomic_DNA. DR EMBL; Z94054; CAI19460.1; -; Genomic_DNA. DR EMBL; Z96050; CAI19460.1; JOINED; Genomic_DNA. DR EMBL; BC137125; AAI37126.1; -; mRNA. DR EMBL; CH471067; EAW90931.1; -; Genomic_DNA. DR EMBL; AF310265; AAK28742.1; -; Genomic_DNA. DR EMBL; AF310266; AAK28743.1; -; Genomic_DNA. DR EMBL; AF310267; AAK28744.1; -; Genomic_DNA. DR EMBL; AF310268; AAK28745.1; -; Genomic_DNA. DR EMBL; AF310269; AAK28746.1; -; Genomic_DNA. DR EMBL; AF310270; AAK28747.1; -; Genomic_DNA. DR EMBL; AF310271; AAK28748.1; -; Genomic_DNA. DR EMBL; AF310272; AAK28749.1; -; Genomic_DNA. DR EMBL; AF310273; AAK28750.1; -; Genomic_DNA. DR EMBL; AF310274; AAK28751.1; -; Genomic_DNA. DR EMBL; AF310275; AAK28752.1; -; Genomic_DNA. DR EMBL; AF310276; AAK28753.1; -; Genomic_DNA. DR EMBL; AF310277; AAK28754.1; -; Genomic_DNA. DR EMBL; AF310278; AAK28755.1; -; Genomic_DNA. DR EMBL; AF310279; AAK28756.1; -; Genomic_DNA. DR EMBL; AF310280; AAK28757.1; -; Genomic_DNA. DR EMBL; AF310281; AAK28758.1; -; Genomic_DNA. DR EMBL; AF310282; AAK28759.1; -; Genomic_DNA. DR EMBL; AF310283; AAK28760.1; -; Genomic_DNA. DR EMBL; AF310284; AAK28761.1; -; Genomic_DNA. DR EMBL; AF310285; AAK28762.1; -; Genomic_DNA. DR EMBL; AF310286; AAK28763.1; -; Genomic_DNA. DR EMBL; AF310287; AAK28764.1; -; Genomic_DNA. DR EMBL; AF310288; AAK28765.1; -; Genomic_DNA. DR EMBL; AF310289; AAK28766.1; -; Genomic_DNA. DR EMBL; AF310290; AAK28767.1; -; Genomic_DNA. DR EMBL; AF310291; AAK28768.1; -; Genomic_DNA. DR EMBL; AF310292; AAK28769.1; -; Genomic_DNA. DR EMBL; AF310293; AAK28770.1; -; Genomic_DNA. DR EMBL; AF310294; AAK28771.1; -; Genomic_DNA. DR EMBL; AF310295; AAK28772.1; -; Genomic_DNA. DR EMBL; AF310296; AAK28773.1; -; Genomic_DNA. DR EMBL; AF310297; AAK28774.1; -; Genomic_DNA. DR EMBL; AF310298; AAK28775.1; -; Genomic_DNA. DR EMBL; AF310299; AAK28776.1; -; Genomic_DNA. DR EMBL; AF310300; AAK28777.1; -; Genomic_DNA. DR EMBL; AF310301; AAK28778.1; -; Genomic_DNA. DR EMBL; AF310302; AAK28779.1; -; Genomic_DNA. DR EMBL; AF310303; AAK28780.1; -; Genomic_DNA. DR EMBL; AF310304; AAK28781.1; -; Genomic_DNA. DR EMBL; AF310305; AAK28782.1; -; Genomic_DNA. DR EMBL; AF310306; AAK28783.1; -; Genomic_DNA. DR EMBL; AF310307; AAK28784.1; -; Genomic_DNA. DR EMBL; AF310308; AAK28785.1; -; Genomic_DNA. DR EMBL; AF310309; AAK28786.1; -; Genomic_DNA. DR EMBL; AF310310; AAK28787.1; -; Genomic_DNA. DR EMBL; AF310311; AAK28788.1; -; Genomic_DNA. DR EMBL; AF310312; AAK28789.1; -; Genomic_DNA. DR EMBL; AF310313; AAK28790.1; -; Genomic_DNA. DR EMBL; AF310314; AAK28791.1; -; Genomic_DNA. DR EMBL; AF310315; AAK28792.1; -; Genomic_DNA. DR EMBL; AF310316; AAK28793.1; -; Genomic_DNA. DR EMBL; AF310317; AAK28794.1; -; Genomic_DNA. DR EMBL; AF310318; AAK28795.1; -; Genomic_DNA. DR EMBL; AF310319; AAK28796.1; -; Genomic_DNA. DR EMBL; AF310320; AAK28797.1; -; Genomic_DNA. DR EMBL; AF310321; AAK28798.1; -; Genomic_DNA. DR EMBL; AF310322; AAK28799.1; -; Genomic_DNA. DR EMBL; AF310323; AAK28800.1; -; Genomic_DNA. DR EMBL; AF310324; AAK28801.1; -; Genomic_DNA. DR EMBL; AF310325; AAK28802.1; -; Genomic_DNA. DR CCDS; CCDS1303.1; -. [Q9UBS9-1] DR CCDS; CCDS65726.1; -. [Q9UBS9-2] DR PIR; JC7185; JC7185. DR RefSeq; NP_001269679.1; NM_001282750.1. DR RefSeq; NP_001269680.1; NM_001282751.1. DR RefSeq; NP_055098.1; NM_014283.4. [Q9UBS9-1] DR RefSeq; NP_057311.3; NM_016227.3. [Q9UBS9-2] DR UniGene; Hs.204559; -. DR ProteinModelPortal; Q9UBS9; -. DR BioGrid; 119536; 4. DR STRING; 9606.ENSP00000263688; -. DR DMDM; 74761893; -. DR MaxQB; Q9UBS9; -. DR PaxDb; Q9UBS9; -. DR PRIDE; Q9UBS9; -. DR Ensembl; ENST00000263688; ENSP00000263688; ENSG00000094975. [Q9UBS9-1] DR Ensembl; ENST00000367723; ENSP00000356696; ENSG00000094975. [Q9UBS9-2] DR Ensembl; ENST00000608151; ENSP00000477484; ENSG00000094975. [Q9UBS9-2] DR GeneID; 51430; -. DR KEGG; hsa:51430; -. DR UCSC; uc001giq.4; human. [Q9UBS9-1] DR UCSC; uc009wwd.3; human. [Q9UBS9-2] DR CTD; 51430; -. DR GeneCards; SUCO; -. DR HGNC; HGNC:1240; SUCO. DR HPA; HPA047251; -. DR neXtProt; NX_Q9UBS9; -. DR PharmGKB; PA25621; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR HOGENOM; HOG000070169; -. DR HOVERGEN; HBG107549; -. DR InParanoid; Q9UBS9; -. DR OMA; SSPWFES; -. DR PhylomeDB; Q9UBS9; -. DR TreeFam; TF105817; -. DR GenomeRNAi; 51430; -. DR NextBio; 54995; -. DR PRO; PR:Q9UBS9; -. DR Proteomes; UP000005640; Chromosome 1. DR Bgee; Q9UBS9; -. DR CleanEx; HS_C1orf9; -. DR ExpressionAtlas; Q9UBS9; baseline and differential. DR Genevisible; Q9UBS9; HS. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0016020; C:membrane; ISS:UniProtKB. DR GO; GO:0005791; C:rough endoplasmic reticulum; ISS:UniProtKB. DR GO; GO:0030867; C:rough endoplasmic reticulum membrane; IEA:UniProtKB-SubCell. DR GO; GO:0007275; P:multicellular organismal development; IEA:UniProtKB-KW. DR GO; GO:0001503; P:ossification; IEA:UniProtKB-KW. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; ISS:UniProtKB. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; ISS:UniProtKB. DR GO; GO:0046850; P:regulation of bone remodeling; ISS:UniProtKB. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Alternative splicing; Coiled coil; Complete proteome; KW Developmental protein; Endoplasmic reticulum; Glycoprotein; Membrane; KW Osteogenesis; Reference proteome; Signal; Transmembrane; KW Transmembrane helix. FT SIGNAL 1 29 {ECO:0000255}. FT CHAIN 30 1254 SUN domain-containing ossification FT factor. FT /FTId=PRO_5000065707. FT TRANSMEM 1011 1031 Helical. {ECO:0000255}. FT DOMAIN 284 453 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 909 1009 {ECO:0000255}. FT CARBOHYD 202 202 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 236 236 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 524 524 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 928 928 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 955 955 N-linked (GlcNAc...). {ECO:0000255}. FT VAR_SEQ 1 1 M -> MRGFLARPFLSTNQHLAQWGSPLPQGKGLVQLPSQH FT TRHSRPFHELCSKEENSATVPKLISLVVSSETIDFSNKTMD FT SRRDWEREKRILEGKLQLPKALARTQRARDEGRAWTSRWLQ FT RRRSPESCEAPLSAPLWGPQRGLPGREPLRSRSASAIALRT FT IGHILALLLRLLHLGLGSGGCREDVPPSGRGKKEEKM (in FT isoform 2). {ECO:0000303|Ref.2}. FT /FTId=VSP_027921. FT VAR_SEQ 60 96 Missing (in isoform 2). FT {ECO:0000303|Ref.2}. FT /FTId=VSP_027922. FT VAR_SEQ 421 427 Missing (in isoform 2). FT {ECO:0000303|Ref.2}. FT /FTId=VSP_027923. FT CONFLICT 452 452 S -> N (in Ref. 2; AAF04619). FT {ECO:0000305}. FT CONFLICT 811 811 V -> M (in Ref. 2; AAF04619). FT {ECO:0000305}. SQ SEQUENCE 1254 AA; 139430 MW; 4EBA1ABCC27DAAB1 CRC64; MKKHRRALAL VSCLFLCSLV WLPSWRVCCK ESSSASASSY YSQDDNCALE NEDVQFQKKD EREGPINAES LGKSGSNLPI SPKEHKLKDD SIVDVQNTES KKLSPPVVET LPTVDLHEES SNAVVDSETV ENISSSSTSE ITPISKLDEI EKSGTIPIAK PSETEQSETD CDVGEALDAS APIEQPSFVS PPDSLVGQHI ENVSSSHGKG KITKSEFESK VSASEQGGGD PKSALNASDN LKNESSDYTK PGDIDPTSVA SPKDPEDIPT FDEWKKKVME VEKEKSQSMH ASSNGGSHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQLDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERNVQSFP LDEQMYAKYV KMFIKYIKVE LLSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYHSERQ ELFDEDYDYP LDYNTGEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGNKSISEN ATATAAPKMP ESTPVSTPVP SPEYVTTEVH THDMEPSTPD TPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSPWFE SETQIFCSEL TTICCISSFS EYIYKWCSVR VALYRQRSRT ALSKGKDYLV LAQPPLLLPA ESVDVSVLQP LSGELENTNI EREAETVVLG DLSSSMHQDD LVNHTVDAVE LEPSHSQTLS QSLLLDITPE INPLPKIEVS ESVEYEAGHI PSPVIPQESS VEIDNETEQK SESFSSIEKP SITYETNKVN ELMDNIIKED VNSMQIFTKL SETIVPPINT ATVPDNEDGE AKMNIADTAK QTLISVVDSS SLPEVKEEEQ SPEDALLRGL QRTATDFYAE LQNSTDLGYA NGNLVHGSNQ KESVFMRLNN RIKALEVNMS LSGRYLEELS QRYRKQMEEM QKAFNKTIVK LQNTSRIAEE QDQRQTEAIQ LLQAQLTNMT QLVSNLSATV AELKREVSDR QSYLVISLVL CVVLGLMLCM QRCRNTSQFD GDYISKLPKS NQYPSPKRCF SSYDDMNLKR RTSFPLMRSK SLQLTGKEVD PNDLYIVEPL KFSPEKKKKR CKYKIEKIET IKPEEPLHPI ANGDIKGRKP FTNQRDFSNM GEVYHSSYKG PPSEGSSETS SQSEESYFCG ISACTSLCNG QSQKTKTEKR ALKRRRSKVQ DQGKLIKTLI QTKSGSLPSL HDIIKGNKEI TVGTFGVTAV SGHI // ID SUCO_MOUSE Reviewed; 1250 AA. AC Q8C341; Q3TAG8; Q3V3T1; Q8CE34; DT 11-SEP-2007, integrated into UniProtKB/Swiss-Prot. DT 11-SEP-2007, sequence version 3. DT 11-NOV-2015, entry version 74. DE RecName: Full=SUN domain-containing ossification factor; DE AltName: Full=Membrane protein CH1; DE AltName: Full=Protein osteopotentia; DE AltName: Full=SUN-like protein 1; DE Flags: Precursor; GN Name=Suco; Synonyms=Opt; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). RC STRAIN=C57BL/6J, and NOD; TISSUE=Cecum, Lung, Skin, and Spleen; RX PubMed=16141072; DOI=10.1126/science.1112014; RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E., RA Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., RA Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., RA Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., RA di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., RA Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., RA Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., RA Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., RA Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., RA Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., RA Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., RA Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., RA Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., RA Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., RA Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., RA Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., RA Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., RA Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., RA Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., RA Hayashizaki Y.; RT "The transcriptional landscape of the mammalian genome."; RL Science 309:1559-1563(2005). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C57BL/6J; RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112; RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., RA She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., RA Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., RA Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., RA Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., RA Lindblad-Toh K., Eichler E.E., Ponting C.P.; RT "Lineage-specific biology revealed by a finished genome assembly of RT the mouse."; RL PLoS Biol. 7:E1000112-E1000112(2009). RN [3] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Pancreas, and Testis; RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001; RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R., RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.; RT "A tissue-specific atlas of mouse protein phosphorylation and RT expression."; RL Cell 143:1174-1189(2010). RN [4] RP DISRUPTION PHENOTYPE, TISSUE SPECIFICITY, DEVELOPMENTAL STAGE, RP FUNCTION, GLYCOSYLATION, AND SUBCELLULAR LOCATION. RX PubMed=20440000; DOI=10.1083/jcb.201003006; RA Sohaskey M.L., Jiang Y., Zhao J.J., Mohr A., Roemer F., Harland R.M.; RT "Osteopotentia regulates osteoblast maturation, bone formation, and RT skeletal integrity in mice."; RL J. Cell Biol. 189:511-525(2010). CC -!- FUNCTION: Required for bone modeling during late embryogenesis. CC Regulates type I collagen synthesis in osteoblasts during their CC postnatal maturation. {ECO:0000269|PubMed:20440000}. CC -!- SUBCELLULAR LOCATION: Rough endoplasmic reticulum membrane CC {ECO:0000269|PubMed:20440000}; Single-pass type I membrane protein CC {ECO:0000269|PubMed:20440000}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=1; CC IsoId=Q8C341-1; Sequence=Displayed; CC Name=2; CC IsoId=Q8C341-2; Sequence=VSP_027924; CC -!- TISSUE SPECIFICITY: Present in chondrocytes, osteoblasts, CC osteoclasts and osteocytes (at protein level). CC {ECO:0000269|PubMed:20440000}. CC -!- DEVELOPMENTAL STAGE: Expressed at E9.5 and E13.5. CC {ECO:0000269|PubMed:20440000}. CC -!- PTM: N-glycosylated. {ECO:0000269|PubMed:20440000}. CC -!- DISRUPTION PHENOTYPE: Most mice die neonatally from respiratory CC distress (50% on a mixed C57BL6/CD1 background and 100% on an CC inbred C57BL6/129Ola background). Surviving mice fail to thrive CC and show significantly reduced body weight, skeletal deformities CC and spontaneous fractures. More than 80% die by postnatal day 10, CC and none survives to weaning. {ECO:0000269|PubMed:20440000}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK029097; BAC26295.1; -; mRNA. DR EMBL; AK033720; BAE43286.1; -; mRNA. DR EMBL; AK087029; BAC39786.2; -; mRNA. DR EMBL; AK171856; BAE42700.1; -; mRNA. DR EMBL; AC164414; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR CCDS; CCDS48415.1; -. [Q8C341-2] DR RefSeq; XP_006496823.1; XM_006496760.1. [Q8C341-1] DR UniGene; Mm.170002; -. DR ProteinModelPortal; Q8C341; -. DR STRING; 10090.ENSMUSP00000044815; -. DR PhosphoSite; Q8C341; -. DR MaxQB; Q8C341; -. DR PaxDb; Q8C341; -. DR PRIDE; Q8C341; -. DR Ensembl; ENSMUST00000048377; ENSMUSP00000044815; ENSMUSG00000040297. [Q8C341-2] DR GeneID; 226551; -. DR CTD; 51430; -. DR MGI; MGI:2138346; Suco. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR HOVERGEN; HBG107549; -. DR InParanoid; Q8C341; -. DR OMA; SSPWFES; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR PRO; PR:Q8C341; -. DR Proteomes; UP000000589; Chromosome 1. DR Bgee; Q8C341; -. DR CleanEx; MM_AI848100; -. DR ExpressionAtlas; Q8C341; baseline and differential. DR Genevisible; Q8C341; MM. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0016020; C:membrane; IDA:UniProtKB. DR GO; GO:0005791; C:rough endoplasmic reticulum; IDA:UniProtKB. DR GO; GO:0030867; C:rough endoplasmic reticulum membrane; IEA:UniProtKB-SubCell. DR GO; GO:0007275; P:multicellular organismal development; IEA:UniProtKB-KW. DR GO; GO:0001503; P:ossification; IEA:UniProtKB-KW. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; IMP:UniProtKB. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; IMP:UniProtKB. DR GO; GO:0046850; P:regulation of bone remodeling; IMP:UniProtKB. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Alternative splicing; Coiled coil; Complete proteome; KW Developmental protein; Endoplasmic reticulum; Glycoprotein; Membrane; KW Osteogenesis; Reference proteome; Signal; Transmembrane; KW Transmembrane helix. FT SIGNAL 1 19 {ECO:0000255}. FT CHAIN 20 1250 SUN domain-containing ossification FT factor. FT /FTId=PRO_0000302718. FT TRANSMEM 1007 1027 Helical. {ECO:0000255}. FT DOMAIN 283 452 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 905 1005 {ECO:0000255}. FT CARBOHYD 201 201 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 235 235 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 523 523 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 924 924 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 951 951 N-linked (GlcNAc...). {ECO:0000255}. FT VAR_SEQ 285 285 S -> SLSTG (in isoform 2). FT {ECO:0000303|PubMed:16141072}. FT /FTId=VSP_027924. FT CONFLICT 4 4 Y -> N (in Ref. 1; BAE42700). FT {ECO:0000305}. FT CONFLICT 299 299 T -> K (in Ref. 1; BAC26295). FT {ECO:0000305}. SQ SEQUENCE 1250 AA; 139169 MW; 8713115267891629 CRC64; MKKYRRALAL VSCLSLCSLV WLPSWHVCCK ESSSASTSYY SQDDNCAIGS EDTQFQKKNE REEPSNAELS GKSNSYLTIS PEGNKIKDDY TVDVQDLETT KLSLPVVEAL PTVDLHEESS SVVVGSETIE NSSSSSTSER TPVSELDEVE KSGTLSIAKP GEVEQPEADC DAGEAPDADA PVEQPAFVSP PESLVGQHIE NVSSSHGKEK VTKSEFESKV SVSEQDGGDP KSALNTSDTL KNESSDYTKP GETDPTSVTS PKDPEDIPTF DEWKKKVMEV EKEKSQSLHP SSNGGPHATK KVQKNRNNYA SVECGAKILA ANPEAKSTSA ILIENMDLYM LNPCSTKIWF VIELCEPIQV KQFDIANYEL FSSTPKDFLV SISDRYPTNK WIKLGTFHGR DERNVQSFPL DEQMYAKYVK MFIKYIKVEL LSHFGSEHFC PLSLIRVFGT SMVEEYEEIA DSQYQSERQE LFDEDYDYPL DYNTVEDKSS KNLLGSATNA ILNMVNIAAN ILGAKTEDLT EGNKSISENA TATTEPKMTE STRVSTPVPS PEYVIKEVHT HDREPSTSDP PKESPIVQLV QEEEEEASPS TVTLLGSGEQ EDESSSWFES ETHILCSELT SICCISSFSE YIYKWCSVRI ALYRQRSRTV SKGKDFVPPQ PSLLLPVESV EVSVPQPPSG DVDSENMERE AETVDLDDLS SVHQGHLINH TVDTIELEPS YPQTLSQSLL LDVTPEMNSL SKVEGSESVK SEGGYIPSQL MTQESSVEFD DKTEKKTESF SSAEKLSVIY ETSKVNEVMD NTVKEDILST EVVTKFPETV VPPPMNTATV PEGESVETKP SIADTLKHTV TPVMDPSLPE VKEDEQSPED ALLRGLQRTA TDFYAELQNS TDLGYGNGNL VHGSNQKESV FMRLNNRIKA LEVNMSLSGR YLEELSQRYR KQMEEMQKAF NKTIVKLQNT SRIAEEQDQR QTEAIHLLQA QLTNMTQLVS NLSATVAELK REVSDRQSYL VMSLVLCVVL GLMLCMQRCR TTSQFDGDYI SKLPKSNQYP SPKRCFSSYD DMNLKRRTSF PLIRSKSLQF TGKEVDPNDL YIVEPLKFSP EKKKKRCKYK TEKIETIKPA DPLHPIANGD IKGRKPFTNQ RDFSNMGEVY HSSYKGPPSE GSSETSSQSE ESYFCGISAC TSLCNGQTQK TKTEKRALKR RRSKVQDQGK LIKALIQTKS GSLPSLHDII KGNKEITVGA FGVTAVSGHI // ID SUCO_RAT Reviewed; 1253 AA. AC Q710E6; DT 11-SEP-2007, integrated into UniProtKB/Swiss-Prot. DT 05-JUL-2004, sequence version 1. DT 11-NOV-2015, entry version 64. DE RecName: Full=SUN domain-containing ossification factor; DE AltName: Full=Membrane protein CH1; DE AltName: Full=Protein osteopotentia homolog; DE AltName: Full=SUN-like protein 1; DE Flags: Precursor; GN Name=Suco; Synonyms=Dd25, Opt; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA]. RC STRAIN=Wistar; TISSUE=Brain; RA Verlaet M., Lakaye B., Grisar T.; RT "Expression of mRNA encoding C1orf9 in brain structures of Rat."; RL Submitted (NOV-2001) to the EMBL/GenBank/DDBJ databases. CC -!- FUNCTION: Required for bone modeling during late embryogenesis. CC Regulates type I collagen synthesis in osteoblasts during their CC postnatal maturation (By similarity). {ECO:0000250}. CC -!- SUBCELLULAR LOCATION: Rough endoplasmic reticulum membrane CC {ECO:0000250}; Single-pass type I membrane protein {ECO:0000250}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AJ421447; CAD13342.1; -; mRNA. DR RefSeq; NP_955435.1; NM_199403.1. DR UniGene; Rn.228937; -. DR ProteinModelPortal; Q710E6; -. DR STRING; 10116.ENSRNOP00000033216; -. DR PhosphoSite; Q710E6; -. DR PaxDb; Q710E6; -. DR PRIDE; Q710E6; -. DR GeneID; 360863; -. DR KEGG; rno:360863; -. DR UCSC; RGD:735185; rat. DR CTD; 51430; -. DR RGD; 735185; Suco. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOVERGEN; HBG107549; -. DR InParanoid; Q710E6; -. DR PhylomeDB; Q710E6; -. DR NextBio; 674408; -. DR PRO; PR:Q710E6; -. DR Proteomes; UP000002494; Unplaced. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0016020; C:membrane; ISS:UniProtKB. DR GO; GO:0005791; C:rough endoplasmic reticulum; ISS:UniProtKB. DR GO; GO:0030867; C:rough endoplasmic reticulum membrane; IEA:UniProtKB-SubCell. DR GO; GO:0007275; P:multicellular organismal development; IEA:UniProtKB-KW. DR GO; GO:0001503; P:ossification; IEA:UniProtKB-KW. DR GO; GO:0032967; P:positive regulation of collagen biosynthetic process; ISS:UniProtKB. DR GO; GO:0045669; P:positive regulation of osteoblast differentiation; ISS:UniProtKB. DR GO; GO:0046850; P:regulation of bone remodeling; ISS:UniProtKB. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil; Complete proteome; Developmental protein; KW Endoplasmic reticulum; Glycoprotein; Membrane; Osteogenesis; KW Reference proteome; Signal; Transmembrane; Transmembrane helix. FT SIGNAL 1 19 {ECO:0000255}. FT CHAIN 20 1253 SUN domain-containing ossification FT factor. FT /FTId=PRO_0000302719. FT TRANSMEM 1012 1032 Helical. {ECO:0000255}. FT DOMAIN 284 453 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 910 1010 {ECO:0000255}. FT CARBOHYD 202 202 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 236 236 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 929 929 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 956 956 N-linked (GlcNAc...). {ECO:0000255}. SQ SEQUENCE 1253 AA; 139260 MW; 4E21F2371612F8E1 CRC64; MKKYRRALAL VSCLSLCSLV WLPSWHVCCK ESSSASTSYY SQDDNCAVGS EDIQFQKKNE REEPSNAKVS EKSNSYLTIS PEENKLKDDY TVDECKIWKQ SKLSLPVVEA LPTVDSHEES SSVVVGSENI ENSSSSSTSE TSPISKLDEI ENSGTLSVAK PGDTEQPEAD CDAGEAADAD ASVEQPAFVS APESLVGQHI ENVSSSHGKE KVTKSEFESK VSVSEQDGGD PKSALNASDT LKNESSDYTK PRETDPTSVT SPKDPEDIPT FDEWKKKVME VEKEKSQSLH PSSNGGPHAT KKVQKNRNNY ASVECGAKIL AANPEAKSTS AILIENMDLY MLNPCSTKIW FVIELCEPIQ VKQFDIANYE LFSSTPKDFL VSISDRYPTN KWIKLGTFHG RDERTVQSFP LDEQMYAKYV KMFIKYIKVE LLSHFGSEHF CPLSLIRVFG TSMVEEYEEI ADSQYQSERQ ELFDEDYDYP LDYNTVEDKS SKNLLGSATN AILNMVNIAA NILGAKTEDL TEGDKSISEN ATATTEPKMP ESTGVSTPVP SPEYIIKEVH THDTEPPTSD PPKESPIVQL VQEEEEEASP STVTLLGSGE QEDESSSWFE SETQILCSEL TSICCISSFS EYLYKWCSVR IALYRQHSRT VSKGKDVSPQ PSLLPPVDSV EVSVLQPPSG NVDKEDMERE LETVALDDLS SVHQAHVRNH TVDTVELEPS YPQTLSQSLP LDVTPEMDSL STVEGSESVK SEGGHKPSQV MPQESSVEFD DETEKKPESF SSVAKLSVIY ETSKVNEVMD GPVKEDIVST HVVTKFPETK FPETVAPPPI NTAAVPESEG METKPSLADT LKHVVTPVTD PSLPEVKEDE QSPDDALLRG LQRTATDFYA ELQNSTDLGY GNGNLVHGSN QKESVFMRLN NRIKALEVNM SLSGRYLEEL SQRYRKQMEE MQKAFNKTIV KLQNTSRIAE EQDQRQTEAI HLLQAQLTNM TQIVSNLSAT VAELKREVSD RQSYLVMSLV LCVVLGLMLC MQRCRNTSQF DGDYTSKLPK SNQYPSPKRC FSSYDDMNLK RRTSFPLIRS KSLQFTGKED PNDLYIVEPL KFSPEKKKKR CKYKTEKIET IKPADPLHPI ANGDIKGRKP FTNQRDFSSM GEVYHSSYKG PPSEGSSETS SQSEESYFCG ISACTSLCNG QTQKTKLRRG LKRRRSKVQD QGKLIKALIQ TKSGSLPSLH DIIKGNKEIT VGAFGVTAVS GHI // ID Q0CAA5_ASPTN Unreviewed; 816 AA. AC Q0CAA5; DT 17-OCT-2006, integrated into UniProtKB/TrEMBL. DT 17-OCT-2006, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAU30516.1}; GN ORFNames=ATEG_09379 {ECO:0000313|EMBL:EAU30516.1}; OS Aspergillus terreus (strain NIH 2624 / FGSC A1156). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=341663 {ECO:0000313|EMBL:EAU30516.1, ECO:0000313|Proteomes:UP000007963}; RN [1] {ECO:0000313|Proteomes:UP000007963} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=NIH 2624 / FGSC A1156 {ECO:0000313|Proteomes:UP000007963}; RA Birren B.W., Lander E.S., Galagan J.E., Nusbaum C., Devon K., Henn M., RA Ma L.-J., Jaffe D.B., Butler J., Alvarez P., Gnerre S., Grabherr M., RA Kleber M., Mauceli E.W., Brockman W., Rounsley S., Young S.K., RA LaButti K., Pushparaj V., DeCaprio D., Crawford M., Koehrsen M., RA Engels R., Montgomery P., Pearson M., Howarth C., Larson L., Luoma S., RA White J., Alvarado L., Kodira C.D., Zeng Q., Oleary S., Yandava C., RA Denning D.W., Nierman W.C., Milne T., Madden K.; RT "Annotation of the Aspergillus terreus NIH2624 genome."; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH476607; EAU30516.1; -; Genomic_DNA. DR RefSeq; XP_001218001.1; XM_001218000.1. DR EnsemblFungi; CADATEAT00007791; CADATEAP00007791; CADATEAG00007791. DR GeneID; 4353813; -. DR EuPathDB; FungiDB:ATEG_09379; -. DR HOGENOM; HOG000172520; -. DR OMA; RNTREVQ; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000007963; Unassembled WGS sequence. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007963}; KW Reference proteome {ECO:0000313|Proteomes:UP000007963}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 27 {ECO:0000256|SAM:SignalP}. FT CHAIN 28 816 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004170107. SQ SEQUENCE 816 AA; 87702 MW; 14CA04CD693976F7 CRC64; MRALAWATAT CLPLAALLFF GIDSTTAESL PPICLTRNWR DVEARFIQWP TCVETRWDSN TPTPAVDPLS VTVSEPSTPP STTTSTAPTP SDDRPDADAD TDSLLDNANF LSFEDWKKQN LAKVGQSAEN VGGSRRAGAA GTEDRRRPTG INNALDSLGE DVEIELDFGG FGADTADTAK PTAWGAQVPV ETKGDGGDAR AKGGSGGPAG DESPAQGVAR RGKDAGTTCK ERFNYASFDC AATVLKTNPE CKGSSSVLIE NKDSYMLNEC RANNKFLILE LSIFRTFRVS VSDRYPAKPD QWRELGVYEA RNTREVQAFA VGNPLIWARY LRIEFLTHYG NEFYCPVSLI RVHGTTMLEE YKHDGEAGRG DDEVSEPESV EVNPSAGQLD EKEPQPPATA ESVVSESTDG STAGSRTGNI CPNPAVEVEA RLWGAALVPP GAPAAQADAD VAESTSHTKE PPAVDVPPKA PADEQKVTVP AGGNADSSSA ASTTTVAESA THNTTVEADS KPSASKEEQA APPEATRATG TQPPSPNPTT QESFFKSVNK RLQMLESNST LSLLYIEEQS RILRDAFNKV EKRQLAKTST FLEGLNVTVL NELKQFREQY DQVWKSVALE FEHQRIQYHQ EIYSISAQLG VLADELVFQK RVSVIQSIMV LFCFALVLFS RGTVSSYIDF PSVQSMVARS YSLRSSSPPF GSPSVSPSST RPASSYRGGT TAGAGHRRNI SEDSQDAAPL SPTIAYSPPT PTSDDASSPV EAEKDESALS MPEVTPSHLR SRSSPPVMDG GREADEGCSE ESESEEEEGD AGVELR // ID Q0CJD9_ASPTN Unreviewed; 691 AA. AC Q0CJD9; DT 17-OCT-2006, integrated into UniProtKB/TrEMBL. DT 17-OCT-2006, sequence version 1. DT 11-NOV-2015, entry version 26. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EAU33956.1}; GN ORFNames=ATEG_06195 {ECO:0000313|EMBL:EAU33956.1}; OS Aspergillus terreus (strain NIH 2624 / FGSC A1156). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=341663 {ECO:0000313|EMBL:EAU33956.1, ECO:0000313|Proteomes:UP000007963}; RN [1] {ECO:0000313|Proteomes:UP000007963} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=NIH 2624 / FGSC A1156 {ECO:0000313|Proteomes:UP000007963}; RA Birren B.W., Lander E.S., Galagan J.E., Nusbaum C., Devon K., Henn M., RA Ma L.-J., Jaffe D.B., Butler J., Alvarez P., Gnerre S., Grabherr M., RA Kleber M., Mauceli E.W., Brockman W., Rounsley S., Young S.K., RA LaButti K., Pushparaj V., DeCaprio D., Crawford M., Koehrsen M., RA Engels R., Montgomery P., Pearson M., Howarth C., Larson L., Luoma S., RA White J., Alvarado L., Kodira C.D., Zeng Q., Oleary S., Yandava C., RA Denning D.W., Nierman W.C., Milne T., Madden K.; RT "Annotation of the Aspergillus terreus NIH2624 genome."; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH476601; EAU33956.1; -; Genomic_DNA. DR RefSeq; XP_001215373.1; XM_001215373.1. DR EnsemblFungi; CADATEAT00002527; CADATEAP00002527; CADATEAG00002527. DR GeneID; 4321621; -. DR EuPathDB; FungiDB:ATEG_06195; -. DR HOGENOM; HOG000176993; -. DR OMA; CWCSAPR; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000007963; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000007963}; KW Reference proteome {ECO:0000313|Proteomes:UP000007963}. SQ SEQUENCE 691 AA; 75343 MW; 831EA90F37F6FAF5 CRC64; MPPKRASTRR AGAIARSSQV HNSPSYLPSV TSPETNNPAL PDIPVKHSFA YGSSSTPLLP RKLTAKPNLG LVEVADAIDK DIKTAQDREQ DQPQTKTRSR KASASASRSP VRRSRREPTP DQIQLLDSLR EVTVSPHPEA GHSTPTPTPP IPHSLSTMSS PATRNLTNPQ HQDLQAEQLY PSPLLRFGSP TRDVSLSSPQ FGSSMDNESL ISWSVERDIH GDDLQRVHPT TARTDPKMKN ITAPPRRFSG LAFANDTIEE EEPESEHPAS KSPSPKRATQ AQPTTDFESS EPRPLSRSRS RSDSRQQEPM TSSAPVRTII PDRIVGEASY SEPIAPRQEH TMRDTPATTS FAFSPPKGAV MRLVASAIVA ILSIVAVYSF GDNLAELSSN IGSPLSWGRG LSHIDLNSTG LEAVNSLSTQ VLRLGAQVSS LSRDVRNMRA EVDNVAAAPT TIVQRLPAVP EVARVNFLSI GMGPIIDPYT TSPTAGRTPT LLEKAASLFF RTPRRGPMRP IMALVNWEEV GDCWCSAPRS GVSQLSVLLG RHIVPEEVVV EHIPKGATIR PEVAPQDMEL WAQFSVVDTS AASSGQPLRP LPASQMPEGF SLHETIMGAL RVAYKGEPES AYSDDKLLGA SFYRIGKWRY DIDAPNNIQA FALDAIIDVP AIRVGRVVFR VKSNWGANET CIYRLKLYGH M // ID Q0JH95_ORYSJ Unreviewed; 607 AA. AC Q0JH95; DT 03-OCT-2006, integrated into UniProtKB/TrEMBL. DT 03-OCT-2006, sequence version 1. DT 11-NOV-2015, entry version 52. DE SubName: Full=Os01g0876400 protein {ECO:0000313|EMBL:BAF06883.1}; DE Flags: Fragment; GN OrderedLocusNames=Os01g0876400 {ECO:0000313|EMBL:BAF06883.1}; OS Oryza sativa subsp. japonica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39947 {ECO:0000313|EMBL:BAF06883.1, ECO:0000313|Proteomes:UP000000763}; RN [1] {ECO:0000313|Proteomes:UP000000763} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=16100779; DOI=10.1038/nature03895; RG International rice genome sequencing project (IRGSP); RT "The map-based sequence of the rice genome."; RL Nature 436:793-800(2005). RN [2] {ECO:0000313|Proteomes:UP000000763} RP GENOME REANNOTATION. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=18089549; DOI=10.1093/nar/gkm978; RG The rice annotation project (RAP); RT "The rice annotation project database (RAP-DB): 2008 update."; RL Nucleic Acids Res. 36:D1028-D1033(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AP008207; BAF06883.1; -; Genomic_DNA. DR RefSeq; NP_001044969.1; NM_001051504.1. DR UniGene; Os.26261; -. DR ProteinModelPortal; Q0JH95; -. DR STRING; 39947.LOC_Os01g65520.1; -. DR PaxDb; Q0JH95; -. DR PRIDE; Q0JH95; -. DR EnsemblPlants; OS01T0876400-01; OS01T0876400-01; OS01G0876400. DR GeneID; 4325081; -. DR KEGG; osa:4325081; -. DR Gramene; Q0JH95; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; Q0JH95; -. DR Proteomes; UP000000763; Chromosome 1. DR ExpressionAtlas; Q0JH95; baseline and differential. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000763}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000763}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 548 568 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 589 606 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 478 505 {ECO:0000256|SAM:Coils}. FT COILED 527 547 {ECO:0000256|SAM:Coils}. FT NON_TER 607 607 {ECO:0000313|EMBL:BAF06883.1}. SQ SEQUENCE 607 AA; 68638 MW; CDEA17BD48195848 CRC64; RGRRRRRRRR RRRRQGWAWL PRRLYGFSVS LVVACWVVLL LLNPLVGHGN GQRDEGIFAD EGSSDPSFDS VEPTLSEGSV DSVVQQENGE NHALPGDSCA KPDENHVLSE ETLLEKDQLC SNDEAQGDSM DALPKDNVDQ GENLPRTDDD SVVHPEGEVE SEGVPRPARL SRVVPPGLDE FKTRAIAERG KGVPSGQPGN VIHRREPSGK LYNYASAAKG AKVLEFNKEA KGASNILDKD KDKYLRNPCS AEGKFVIIEL SEETLVDTIA IANFEHYSSN LKEFEMLSSL NYPTDSWETL GRFTVANAKI AQNFTFPEPK WARYLKLNLL SHYGSEFYCT LSMLEVYGMD AVEKMLENLI PVENKRLEPD DKMKEPVDQQ TQLKEPTEGK ESSHEPLDED EFELEDDKLN GDSSKNGAHD QVTETRPIQA GRIPGDTVLK VLMQKVQSLD VSFSVLERYL EELNSRYGQI FKDFDADIDT KDALLEKIKL ELKHLERSKD DFAKEIEGIL SWKLVASSQL NQLLLDNVII RSELERFREK QADLENRSFA VIFLSFVFGC LAIAKLSIGM IFNTCRLYNF EKFDRVKSGW LVLLFSSCII ASILIIQ // ID Q0JLH7_ORYSJ Unreviewed; 537 AA. AC Q0JLH7; DT 03-OCT-2006, integrated into UniProtKB/TrEMBL. DT 03-OCT-2006, sequence version 1. DT 11-NOV-2015, entry version 50. DE SubName: Full=Os01g0599900 protein {ECO:0000313|EMBL:BAF05401.1}; GN OrderedLocusNames=Os01g0599900 {ECO:0000313|EMBL:BAF05401.1}; OS Oryza sativa subsp. japonica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39947 {ECO:0000313|EMBL:BAF05401.1, ECO:0000313|Proteomes:UP000000763}; RN [1] {ECO:0000313|Proteomes:UP000000763} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=16100779; DOI=10.1038/nature03895; RG International rice genome sequencing project (IRGSP); RT "The map-based sequence of the rice genome."; RL Nature 436:793-800(2005). RN [2] {ECO:0000313|Proteomes:UP000000763} RP GENOME REANNOTATION. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=18089549; DOI=10.1093/nar/gkm978; RG The rice annotation project (RAP); RT "The rice annotation project database (RAP-DB): 2008 update."; RL Nucleic Acids Res. 36:D1028-D1033(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AP008207; BAF05401.1; -; Genomic_DNA. DR RefSeq; NP_001043487.1; NM_001050022.1. DR UniGene; Os.91180; -. DR ProteinModelPortal; Q0JLH7; -. DR STRING; 39947.LOC_Os01g41600.1; -. DR PaxDb; Q0JLH7; -. DR GeneID; 4324592; -. DR KEGG; osa:4324592; -. DR Gramene; Q0JLH7; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; Q0JLH7; -. DR Proteomes; UP000000763; Chromosome 1. DR ExpressionAtlas; Q0JLH7; baseline. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000763}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000763}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 30 {ECO:0000256|SAM:SignalP}. FT CHAIN 31 537 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004174547. FT TRANSMEM 475 494 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 518 536 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 453 473 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 537 AA; 58551 MW; 6BAFA4BBA55CE887 CRC64; MDGGLREVSL SVVFSVWCLL FLLRSQFLHS QTDPSDFYDD VEDGMRENYC KVMPLEAYIF PTEYNASAAA PTCQPSLHPP DQPQQETDHR SLEPFNNTTG GKSSAEAAAL DELDEFRSRI LQGKAENGRV PDGATPAAHR LEPSGAEYNY AAASKGAKVL AHNREAKGAA NILGGDKDRY LRNPCSADDK FVDVELSEET LVRTIGLANL EHYSSNFRDF ELYGSPSYPA PAEEWELLGR FTADNAKHAQ RFVLPDPRWT RYLRLRLATH YGSGFYCILS YLEVYGIDAV EQMLQEIISG SGADTDASAA AKAEEGGDGG TLRNDTAQVN ARLDGVGGGG GSAAGRNDSA GDGAGAKNNG SRMTVAGDGK PAAAGRFHGD AVLKIMMQKM RSLELGLSTL EDYTKALNHR YGAKLPDLHT GLSQTTMALD RMKADVRDLV EWKGNVAKDL GELKEWRSNV EEMRSIQETM QNKELAVLSI SLFFACLALF KLACDRVLFL FTRKGAAAAE RMCGASKGWI LVLASSSFTT FLVLLYN // ID Q0U2J1_PHANO Unreviewed; 757 AA. AC Q0U2J1; DT 05-SEP-2006, integrated into UniProtKB/TrEMBL. DT 05-SEP-2006, sequence version 1. DT 11-NOV-2015, entry version 39. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAT78667.1}; GN ORFNames=SNOG_14042 {ECO:0000313|EMBL:EAT78667.1}; OS Phaeosphaeria nodorum (strain SN15 / ATCC MYA-4574 / FGSC 10173) OS (Glume blotch fungus) (Septoria nodorum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Phaeosphaeriaceae; Parastagonospora. OX NCBI_TaxID=321614 {ECO:0000313|Proteomes:UP000001055}; RN [1] {ECO:0000313|Proteomes:UP000001055} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=SN15 / ATCC MYA-4574 / FGSC 10173 RC {ECO:0000313|Proteomes:UP000001055}; RX PubMed=18024570; DOI=10.1105/tpc.107.052829; RA Hane J.K., Lowe R.G.T., Solomon P.S., Tan K.-C., Schoch C.L., RA Spatafora J.W., Crous P.W., Kodira C.D., Birren B.W., Galagan J.E., RA Torriani S.F.F., McDonald B.A., Oliver R.P.; RT "Dothideomycete-plant interactions illuminated by genome sequencing RT and EST analysis of the wheat pathogen Stagonospora nodorum."; RL Plant Cell 19:3347-3368(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH445353; EAT78667.1; -; Genomic_DNA. DR RefSeq; XP_001804241.1; XM_001804189.1. DR EnsemblFungi; SNOT_14042; SNOT_14042; SNOG_14042. DR GeneID; 5981164; -. DR KEGG; pno:SNOG_14042; -. DR InParanoid; Q0U2J1; -. DR KO; K19347; -. DR OMA; VANYELH; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001055; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001055}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001055}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 301 322 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 757 AA; 84170 MW; 3AB7D76E9047B91C CRC64; MSQLGGMGAT PARRSARLSQ AGSVTGQSVV TTMTNGGTRQ RKKGPLTKVK PRKSNAYGAS GRVGAAEELS ISATGFAQAF QNQRGEAAAR DEEDEEEDDV DELGDGTPRM SGALNGRTPQ RSSSPELEAP TPMPPGLSFM DSEDLAPSEN DLSASVGNTS KSFGPVHEAG MLFRPLRQVA ARSPSAEVFA KPLWQTNRTR HRQTQANVQE EEDVEVAVEI EPVRQQPPKQ VTPAPRKVID QSASQPLAEE REGPTSLRSK PRKQPSQQRA ADPHLDEWLG NVEPGLKDDD KWFWERYFNP LFWTLVCAAG LLAICFGLVQ LMTSKHEPNS ITRPGMVTAF GNRISNAYYD VADWVMPAER PEKKKQNMDE FKRGDGTIDD NYLWSRMKKI YNEFDSRFEN MDDTIAKLNQ HLPEYMVVRT LPDGRREVTD EFWNALISKA ESSGGDAEWT EFLKRNEDKL RDIFGGTSTG TPSDMRPEAV SREEFMNLVQ RHYDTMAAQV DEKVYQAIQG QASQIKAIAQ AEAKKAMIDS IRLHTLAQSN LLTNYELNLQ KANHFSPGLG AVVIPTLTSA TFLDSPSPWG SLGRLLFSRY RNPPKAALDR WEEPGDCWCA APNPMMSGQA QLTVALARPV YPQQVTIEHL PMSMMPSKKI TNAPRTIELW VETDQSVESQ GSHREGSCQE GPAGWSCLGS FRYNIHASNH QQTFDLDVPS PVPVSKAMLR VTSNWGADHT CLYRVRLHGK DAAEDHQYEV RLNDPVQ // ID Q0UGC2_PHANO Unreviewed; 988 AA. AC Q0UGC2; DT 05-SEP-2006, integrated into UniProtKB/TrEMBL. DT 05-SEP-2006, sequence version 1. DT 11-NOV-2015, entry version 44. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAT83384.1}; GN ORFNames=SNOG_09192 {ECO:0000313|EMBL:EAT83384.1}; OS Phaeosphaeria nodorum (strain SN15 / ATCC MYA-4574 / FGSC 10173) OS (Glume blotch fungus) (Septoria nodorum). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Phaeosphaeriaceae; Parastagonospora. OX NCBI_TaxID=321614 {ECO:0000313|Proteomes:UP000001055}; RN [1] {ECO:0000313|Proteomes:UP000001055} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=SN15 / ATCC MYA-4574 / FGSC 10173 RC {ECO:0000313|Proteomes:UP000001055}; RX PubMed=18024570; DOI=10.1105/tpc.107.052829; RA Hane J.K., Lowe R.G.T., Solomon P.S., Tan K.-C., Schoch C.L., RA Spatafora J.W., Crous P.W., Kodira C.D., Birren B.W., Galagan J.E., RA Torriani S.F.F., McDonald B.A., Oliver R.P.; RT "Dothideomycete-plant interactions illuminated by genome sequencing RT and EST analysis of the wheat pathogen Stagonospora nodorum."; RL Plant Cell 19:3347-3368(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH445338; EAT83384.1; -; Genomic_DNA. DR RefSeq; XP_001799493.1; XM_001799441.1. DR EnsemblFungi; SNOT_09192; SNOT_09192; SNOG_09192. DR GeneID; 5976394; -. DR KEGG; pno:SNOG_09192; -. DR InParanoid; Q0UGC2; -. DR OMA; RNTREVQ; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001055; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001055}; KW Reference proteome {ECO:0000313|Proteomes:UP000001055}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 26 {ECO:0000256|SAM:SignalP}. FT CHAIN 27 988 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004177756. SQ SEQUENCE 988 AA; 107037 MW; DA037355F726A2E0 CRC64; MPAIGASLRK WTLCLLLCSV PTPAWTQSTN DTAVEHAGAT ASLSTTTPRP SSLASTSAST VRSFIHSETT CPFRTINYIT HTLPQQCAKT AWSAPGEAAS TDVPAIEAGA TPSQQIAIPR AEEAVKSDIQ DERHSTPTAV PQAQETGAGA FTDAPAGGTA SEESESELET ESPLDNANFL SFEEWKKRNL EQVGQSPENV QGRAAPSNQA TRRRPVNVNA LDSLGDEAEI EIDFSGFGPP LDGGDIANSN QAGLRGADNT KAVEDEKKAT PSSWTLSKDA GKTCKERFNY ASFDCAATVL KTNKQAKSAT SVLVENKDSY MLNECSANNK FLIVELCDDI LVDTVVLANY EFFSSMFRHF RVSVSDRYPV KMERWKTLAT FEARNSRDLQ PFLITEPQIW ARYLRIEFLT QYGNEFYCPL SLLRVHGTTM MEQFRREEEE ARGVDDDVDL EAEQEVVKPA VDSGPIPADQ VPILPLKDDD AKPSSTVDVH AQTTETSNEA ADSRSTDLSH ETVQSSTASA VGTPSNVSTV RDTNVDTAGS SSVSTPVDNS PSSSSMVESG ASTGESNSSD LASSNGRDTS NTQASSSATD SPASASIATE AKSISKASQT TNDAAPSPSS NTASKAASNN TVGNPHSQSQ PRSSSTQPNP ATPSTQESFF KSIHKRLLYL EANSTLSLQY IEEQSRILRD AFKKVEKRQL VKTEKFLDHL NSTVMQELKS FRTMYDQLWQ STVIETESMK ERHKSEMSEI GTRLTLMADE LVWQKRMAVV QSTLLLLCLG LVLFVRSGTL GSAADVPIVQ QLGSKYNSFF ESSPPHSPPE SGLARRRRTF KNMWRSDTSA GLSDHQSDGV NALSDAETDG ARSPIDVEYS PPTPTTPAAL AKFDVRNGIP EETPSPDDQA KRLEVLETQS GPATPNGTRD SRPSWEEVDR AVDQLKAEKN GQLSPHGERK GKERKGKKRS PLRRAQSSYD GLADGDGTVD TGDEGLFS // ID Q0WQI7_ARATH Unreviewed; 443 AA. AC Q0WQI7; DT 05-SEP-2006, integrated into UniProtKB/TrEMBL. DT 05-SEP-2006, sequence version 1. DT 11-NOV-2015, entry version 34. DE SubName: Full=Putative uncharacterized protein At1g71360 {ECO:0000313|EMBL:BAF00612.1}; GN OrderedLocusNames=At1g71360 {ECO:0000313|EMBL:BAF00612.1}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702 {ECO:0000313|EMBL:BAF00612.1}; RN [1] {ECO:0000313|EMBL:BAF00612.1} RP NUCLEOTIDE SEQUENCE. RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Morosawa T., RA Kamiya A., Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., RA Kohara Y., Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., RA Akiyama K., Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., RA Hayashizaki Y., Shinozaki K.; RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs."; RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK228709; BAF00612.1; -; mRNA. DR STRING; 3702.AT1G71360.1; -. DR PaxDb; Q0WQI7; -. DR PRIDE; Q0WQI7; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 25 47 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 393 417 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 423 441 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 364 391 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 443 AA; 49208 MW; 902FCF24236C4B8D CRC64; MQRSRRALLV RRRVSETTSN GRNRFYKVSL SLVFLIWGLV FLSTLWISHV DGDKGRSLVD SVEKGEPDDE RADETAESVD ATSLESTSVH SNPGLSSDVD IAAAGESKGS ETILKQLEVD NTIVIVGNVT ESKDNVPMKQ SEINNNTVPG NDTETTGSKL DQLSRAVPLG LDEFKSRASN SRDKSLSGQV TGVIHRMEPG GKEYNYAAAS KGAKVLSSNK EAKGASSIIC RDKDKYLRNP CSTEGKFVVI ELSEETLVNT IKIANFEHYS SNLKDFEILG TLVYPTDTWV HLGNFTALNM KHEQNFTFAD PKWVRYLKLN LLSHYGSEFY CTLSLLEVYG VDAVERMLED LISIQDKNIL KLQEGDTENE KEKVKERLEQ VLERLEWMEK KGVVVFTICV GFGTIAVVAV VFGMGIVRAE KQGGLAWLLL LISSTFVMFI LSL // ID Q11P94_CYTH3 Unreviewed; 505 AA. AC Q11P94; DT 22-AUG-2006, integrated into UniProtKB/TrEMBL. DT 22-AUG-2006, sequence version 1. DT 11-NOV-2015, entry version 67. DE SubName: Full=Possible outer membrane protein {ECO:0000313|EMBL:ABG60769.1}; GN Name=yiaD {ECO:0000313|EMBL:ABG60769.1}; GN OrderedLocusNames=CHU_3536 {ECO:0000313|EMBL:ABG60769.1}; OS Cytophaga hutchinsonii (strain ATCC 33406 / NCIMB 9469). OC Bacteria; Bacteroidetes; Cytophagia; Cytophagales; Cytophagaceae; OC Cytophaga. OX NCBI_TaxID=269798 {ECO:0000313|EMBL:ABG60769.1, ECO:0000313|Proteomes:UP000001822}; RN [1] {ECO:0000313|EMBL:ABG60769.1, ECO:0000313|Proteomes:UP000001822} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 33406 / NCIMB 9469 {ECO:0000313|Proteomes:UP000001822}; RX PubMed=17400776; DOI=10.1128/AEM.00225-07; RA Xie G., Bruce D.C., Challacombe J.F., Chertkov O., Detter J.C., RA Gilna P., Han C.S., Lucas S., Misra M., Myers G.L., Richardson P., RA Tapia R., Thayer N., Thompson L.S., Brettin T.S., Henrissat B., RA Wilson D.B., McBride M.J.; RT "Genome sequence of the cellulolytic gliding bacterium Cytophaga RT hutchinsonii."; RL Appl. Environ. Microbiol. 73:3536-3546(2007). CC -!- SIMILARITY: Belongs to the ompA family. CC {ECO:0000256|RuleBase:RU003859}. CC -!- SIMILARITY: Contains OmpA-like domain. CC {ECO:0000256|SAAS:SAAS00077540}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CP000383; ABG60769.1; -; Genomic_DNA. DR RefSeq; WP_011586876.1; NC_008255.1. DR ProteinModelPortal; Q11P94; -. DR STRING; 269798.CHU_3536; -. DR EnsemblBacteria; ABG60769; ABG60769; CHU_3536. DR KEGG; chu:CHU_3536; -. DR PATRIC; 21597947; VBICytHut34013_3518. DR eggNOG; ENOG4108ZG0; Bacteria. DR eggNOG; COG2885; LUCA. DR OrthoDB; EOG6PP9QB; -. DR BioCyc; CHUT269798:GJ83-3529-MONOMER; -. DR Proteomes; UP000001822; Chromosome. DR GO; GO:0009279; C:cell outer membrane; IEA:InterPro. DR GO; GO:0016021; C:integral component of membrane; IEA:InterPro. DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-KW. DR Gene3D; 2.60.120.260; -; 1. DR Gene3D; 3.30.1330.60; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR006664; OMP_bac. DR InterPro; IPR006665; OmpA/MotB_C. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00691; OmpA; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PRINTS; PR01021; OMPADOMAIN. DR SUPFAM; SSF103088; SSF103088; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51123; OMPA_2; 1. PE 3: Inferred from homology; KW Cell membrane {ECO:0000256|RuleBase:RU003859}; KW Complete proteome {ECO:0000313|Proteomes:UP000001822}; KW Membrane {ECO:0000256|RuleBase:RU003859}; KW Reference proteome {ECO:0000313|Proteomes:UP000001822}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 21 {ECO:0000256|SAM:SignalP}. FT CHAIN 22 505 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004180583. SQ SEQUENCE 505 AA; 56477 MW; 9D944D170DFAB0B8 CRC64; MIRKQIGFLF IFLLSVSFGH ANKINALALK NGCSIIQKPS TFFTPTPYAA KINDWSVFGL LDQNAATGWC SGATSKVPYV FVFELSEDFL ISNFAFNTFC QKEYKNISAK DIKVEYSMTS AKSGFLPAAT YLLEENKYNS FDIAPVKARW IKLTILSNYG NLQWTELMEF EAWGTFVTPG VTAASITGVW NTNFDWVSIN KVANGTIYGC YKWKNGEIYL TQVSRKTYTF AWKQNDDEKL SGWCLLVLNK EGTKMNGIWG YGTDTTKFGY WEFSKLQSTP YACSNDAIAA AGIVKKEPVK TEPKLNVMIE IIDKSSTKPI DGHIDIYSQA TSVSVISKEG LYSTDISVAP YVIVKTFLPS YYPTLDTFVI TAAEQKALYA TRIIELSKLS SGTNILLHNV LFERTSYELL SSSLPALDQL VTVMNQYPGM IIELSGHTDN VGSAKKNMEL SKNRVESAKK YLVSKGISAD RIKSVGYGSK YPVASNDGEQ TRKYNRRVEL RIITM // ID Q171B1_AEDAE Unreviewed; 2844 AA. AC Q171B1; DT 25-JUL-2006, integrated into UniProtKB/TrEMBL. DT 25-JUL-2006, sequence version 1. DT 11-NOV-2015, entry version 69. DE SubName: Full=AAEL007705-PA {ECO:0000313|EMBL:EAT40581.1}; GN ORFNames=AAEL007705 {ECO:0000313|EMBL:EAT40581.1}; OS Aedes aegypti (Yellowfever mosquito) (Culex aegypti). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Aedini; Aedes; Stegomyia. OX NCBI_TaxID=7159 {ECO:0000313|EMBL:EAT40581.1, ECO:0000313|Proteomes:UP000008820}; RN [1] {ECO:0000313|EMBL:EAT40581.1, ECO:0000313|Proteomes:UP000008820} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=LVPib12 {ECO:0000313|Proteomes:UP000008820}; RX PubMed=17510324; DOI=10.1126/science.1138878; RA Nene V., Wortman J.R., Lawson D., Haas B.J., Kodira C.D., Tu Z.J., RA Loftus B.J., Xi Z., Megy K., Grabherr M., Ren Q., Zdobnov E.M., RA Lobo N.F., Campbell K.S., Brown S.E., Bonaldo M.F., Zhu J., RA Sinkins S.P., Hogenkamp D.G., Amedeo P., Arensburger P., RA Atkinson P.W., Bidwell S.L., Biedler J., Birney E., Bruggner R.V., RA Costas J., Coy M.R., Crabtree J., Crawford M., DeBruyn B., RA DeCaprio D., Eiglmeier K., Eisenstadt E., El-Dorry H., Gelbart W.M., RA Gomes S.L., Hammond M., Hannick L.I., Hogan J.R., Holmes M.H., RA Jaffe D., Johnston S.J., Kennedy R.C., Koo H., Kravitz S., RA Kriventseva E.V., Kulp D., Labutti K., Lee E., Li S., Lovin D.D., RA Mao C., Mauceli E., Menck C.F., Miller J.R., Montgomery P., Mori A., RA Nascimento A.L., Naveira H.F., Nusbaum C., O'Leary S.B., Orvis J., RA Pertea M., Quesneville H., Reidenbach K.R., Rogers Y.-H.C., Roth C.W., RA Schneider J.R., Schatz M., Shumway M., Stanke M., Stinson E.O., RA Tubio J.M.C., Vanzee J.P., Verjovski-Almeida S., Werner D., RA White O.R., Wyder S., Zeng Q., Zhao Q., Zhao Y., Hill C.A., RA Raikhel A.S., Soares M.B., Knudson D.L., Lee N.H., Galagan J., RA Salzberg S.L., Paulsen I.T., Dimopoulos G., Collins F.H., Bruce B., RA Fraser-Liggett C.M., Severson D.W.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316:1718-1723(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH477460; EAT40581.1; -; Genomic_DNA. DR RefSeq; XP_001652833.1; XM_001652783.1. DR UniGene; Aae.13858; -. DR SMR; Q171B1; 1363-1431. DR STRING; 7159.AAEL007705-PA; -. DR GeneID; 5569522; -. DR KEGG; aag:AaeL_AAEL007705; -. DR VectorBase; AAEL007705; Aedes aegypti. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR HOGENOM; HOG000018061; -. DR InParanoid; Q171B1; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; Q171B1; -. DR Proteomes; UP000008820; Unassembled WGS sequence. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008820}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000008820}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1346 1366 {ECO:0000256|SAM:Coils}. FT COILED 2593 2620 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2844 AA; 311339 MW; EC44982A8A2169BE CRC64; MTSETRAACH RKTNRPTVSL SSRSDDVRMG DVDPETLLEW LSMGQGDERD MQLIALEQLC MLLLMSDNVD RCFESCPPRT FLPALCKIFL DELAPENVLE VTARAITYYL DVSSECTRRI VAIDGAIKAI CNRLVVADLE SRTSRDLAEQ CIKVLELICT REAGAVFEGG GLNCVLSFIR DNGSQIHKDT LHSAMAVVSR LCTKVEPQGA NVQTCVESLS TLLQHEDPLV ADGALKCFAS VADRFTRKGV DPAPLAEYGL VTELLNRLSN AAGPQTSALT AGASSNATPA AQVNSQETTG TVQLSSSAPK TQGAAEAGRS SQSIATTISL LSTLCRGSPS ITHDLLRSNL PDAMERAFKG DERCVLDCMR LADLILLLLF EGRQALGRVG GSSGQLAPRV RRADSSTERT HRQLIDCIRS KDTEALIEAI ESGGIDVNCM DDVGQTLLNW ASAFGTLEMV EFLCDKGADV NKGQRSSSLH YAACFGRPGI AKVLLKHGAN PDLRDEDGKT PLDKARERPD EGHREVASIL QSPGEWMTAA TRSDLKTDPE DGESEPRGDP EMAPVYLKFF LPIFCKTFQS TMLASVRRSS LGLIKKMIQY VQPEMLSRLC SSEGLQSHEH SLGTLLVEVV ASVLDNEDDE DGHLVVLTII EELMSKTQDD FLDHFARLGV FSKVQALMGE NGFSGGEGDN NDVIKSQDEL KPSATGAQDA SVVSTTGSSS SGTATTNTVE DAKEILPGKA YHWRDWSICR GRDCLYVWSD SAALELSNGS NGWFRFILDG KLATMYSSGS PENGSDSSEN RGEFLEKLQR ARGAVRQGTP SQPILSSPSL SRIVVGNWVL QSQKEHQLHI NNSEGHQVTI LQDELPGFIF ESNRGTKHTF TAETTLGPDF AAGWINTKKK KMRSKAEAQK YQVKNIARDL YNRYFKAAQA VPRGAVAKLC TIVRQIEAAL EEQCAPKASS LLQVRIQPDA SKATNSTWQE KLYNALHDLV QLLNDDGVIS AYEMHSSGLV QSLVAVLSRN YWELEMNRSK ANKYQKQRIS IFKKCMYGDT KNGKNTASIL VQKLVSVLES IEKLPVHMYD SPGGSYGLQI LTKRLSFRLE RAACEQTLFD RTGRNLKMEP LATVGQLNKY LLKMVAKQWY DMDRSSFLYL KKLKEAKPGS VHFKHQHDFD ENGLVYYIGT NGKTTEWVNP AQYGLVTVTS SEGKQLPYGK LEDILSRDSV SVNCHTKDNK KSWFAIDLGM FIIPTAYTLR HARGYGRSAL RNWMFQMSKD GINWATMLTH SDDKSLAEPG STCTWPLECS AEEQQGWRHV RIHQNGRNAS GQTHYLSLSG FEIYGKVVSV CDDMGKAAAK ENEARLRKER RQIRAQLKYI TQGARVVRGV DWHWDDQDGS PPGEGTVTGE IHNGWIDVKW DHGLRNSYRM GAEGKYDLKL SNSENLTAPY DMNNSGAGLV PINAGSVSSA KKVYDKSLNV LTSRKSSSTP SLPEATENKS SVASTEQATS VDNLAWKQAV EVIAENVLSC ARSDLANTSG GSSSNDLSAP GSNNNNLNNQ EVSVIVHSLG ERGNIPDLSQ INTSTSTLVS DLATITENLT LSDNIKNNIS TGSSTQFVSN FGTQLAACSS SSSSEDNNKT NNINETNNKI NLNNSSSSSS GKTAYLPTKL DVLDKMREGV DMLRNNTNNL LSSELLTQSN LLSSVKIALP TPQQPQQTGA TAPAGPFFVA STSTSSTSTL PSGDKIVGDV KFNNTINNNT ASGGVTKKVL NEVKQQEPDD RDIANNLKNN TIVVDSVEVG AAGSSKESTA ESPSAAVTVN PMSVSVPNLT SNENNASQEV QTPPGLLETF AAIARRRTSG SSVSHHVNPS SNASTASSGT SGTNNTQPNN QLSSLGSNIQ AANSSFFPRG QNSVTSLVKL ALSSNFHSGL LSTAQSYPSL SSSSSNNANS VAVSGGTNNN SASGVGSNNG SVQSGNVQAV LNPALTMSLT STSSDSEQVS LEDFLEQCRA PTLLGDLDDD EDMEDENDDD ENEDEYEEVG NTLLQVMVSR NLLSFMDEET LENRLAAAGK RKSWDDEFVL KRQFSALIPA FDPRPGRTNV NQTSDLEIPA PGSSTDSSHP SSSHSEHAPL PQPSLALLLR GPNINGVSDV EIPLSQPDWT IFRAVQELIL QTNMTKQDKF RKIWQPTYTI VYREASSSAG LGSCRGEDFS SGEEGRATPV VSMFSQRSGG STLSPSSPIP GTPLNPTATA HCTVDDVLQL LGQLNAINRT LTSSPSNNDK NLIPDMESNN LNADIFLSKK ITNKLQQQIQ DPLVLASGSL PKWCEDFNQS CPFLFPFETR QLYFNCTAFG ASRSIVWLQS QRDVNLERQR APGLSPRHAD QHEFRVGRLK HERVKVPRGE NLLEWAQQVM KVHCNRKSVL EVEFVGEEGT GLGPTLEFYA LVAAELQRSD LGMWLCDDEP KLIEDEIDLG EGSKPIGYYV RRSTGLFPAP LPQESEICDF VSNYFWFLGV FLAKVLQDGR LVDLPLSNSF LQLLCNNKSL ARGSLSELSS KSALHDDVMI SSLMSEESDR DLVDSYQSKL ANDGCWYDGI LSQENLQEID PIRYEFLKEL QELVQQKQNI EQNDDLTSEE KLLQISELKF NTKTGSVALE DLALTFTYLP SSKNYGYQSA DLIPNGSNID VTINNVEEYC NLTINFCLQE GIAKQLTAFH RGFCEVFPLN KLAAFTSEEI RKMLCGEQNP EWTREDIMTY TEPKLGYSKE SPGFLRFVNV LMGMNASERK AFLQFTTGCS SLPPGGLANL HPRLTVVRKV DAGEGSYPSV NTCVHYLKLP DYPNEEILRE RLLTATKEKG FHLN // ID Q171Z6_AEDAE Unreviewed; 655 AA. AC Q171Z6; DT 25-JUL-2006, integrated into UniProtKB/TrEMBL. DT 25-JUL-2006, sequence version 1. DT 11-NOV-2015, entry version 41. DE SubName: Full=AAEL007475-PA {ECO:0000313|EMBL:EAT40842.1}; GN ORFNames=AAEL007475 {ECO:0000313|EMBL:EAT40842.1}; OS Aedes aegypti (Yellowfever mosquito) (Culex aegypti). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Aedini; Aedes; Stegomyia. OX NCBI_TaxID=7159 {ECO:0000313|EMBL:EAT40842.1, ECO:0000313|Proteomes:UP000008820}; RN [1] {ECO:0000313|EMBL:EAT40842.1, ECO:0000313|Proteomes:UP000008820} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=LVPib12 {ECO:0000313|Proteomes:UP000008820}; RX PubMed=17510324; DOI=10.1126/science.1138878; RA Nene V., Wortman J.R., Lawson D., Haas B.J., Kodira C.D., Tu Z.J., RA Loftus B.J., Xi Z., Megy K., Grabherr M., Ren Q., Zdobnov E.M., RA Lobo N.F., Campbell K.S., Brown S.E., Bonaldo M.F., Zhu J., RA Sinkins S.P., Hogenkamp D.G., Amedeo P., Arensburger P., RA Atkinson P.W., Bidwell S.L., Biedler J., Birney E., Bruggner R.V., RA Costas J., Coy M.R., Crabtree J., Crawford M., DeBruyn B., RA DeCaprio D., Eiglmeier K., Eisenstadt E., El-Dorry H., Gelbart W.M., RA Gomes S.L., Hammond M., Hannick L.I., Hogan J.R., Holmes M.H., RA Jaffe D., Johnston S.J., Kennedy R.C., Koo H., Kravitz S., RA Kriventseva E.V., Kulp D., Labutti K., Lee E., Li S., Lovin D.D., RA Mao C., Mauceli E., Menck C.F., Miller J.R., Montgomery P., Mori A., RA Nascimento A.L., Naveira H.F., Nusbaum C., O'Leary S.B., Orvis J., RA Pertea M., Quesneville H., Reidenbach K.R., Rogers Y.-H.C., Roth C.W., RA Schneider J.R., Schatz M., Shumway M., Stanke M., Stinson E.O., RA Tubio J.M.C., Vanzee J.P., Verjovski-Almeida S., Werner D., RA White O.R., Wyder S., Zeng Q., Zhao Q., Zhao Y., Hill C.A., RA Raikhel A.S., Soares M.B., Knudson D.L., Lee N.H., Galagan J., RA Salzberg S.L., Paulsen I.T., Dimopoulos G., Collins F.H., Bruce B., RA Fraser-Liggett C.M., Severson D.W.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316:1718-1723(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH477442; EAT40842.1; -; Genomic_DNA. DR RefSeq; XP_001658391.1; XM_001658341.1. DR STRING; 7159.AAEL007475-PA; -. DR EnsemblMetazoa; AAEL007475-RA; AAEL007475-PA; AAEL007475. DR GeneID; 5569218; -. DR KEGG; aag:AaeL_AAEL007475; -. DR VectorBase; AAEL007475; Aedes aegypti. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000230547; -. DR InParanoid; Q171Z6; -. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; Q171Z6; -. DR Proteomes; UP000008820; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000008820}; KW Reference proteome {ECO:0000313|Proteomes:UP000008820}. SQ SEQUENCE 655 AA; 73746 MW; 94DAA9C7C2BB0B79 CRC64; MRKALSDHWI NILDFRAILL ADEDQTIVLP ASKRATIALA SLSHMLPSMD GQDLDSIKTS LFQKLDFRET SFSWRSLLPS FSLSFGGSGN GGSDYEHLKS GLQKTLSKQE YDDLMRHIDA YIEGLLTEKY LKKEQEEARA KQVISPEVTV HVASVVRENL QAYNYRLSQA DVDAVAEKVR LELLASYPNV FNPKMEEGGK KAETLGISQE NLLEIQKLVK QQISITNNNF VISDQQLEDI LKKILSSSQL VGLIDARVLL QMKAPQQQQQ ESQAALVENL KNEINEIKMH FTEKLATSSL MIEDNINLLK QDQSRLAEQV NSYRIENDEK YVQLMSDIDA RLAAVKQEQF AGLNKIIKKN IVTILGMNVK EDFSDADLKA WISNLFVAKD YLESRLKEYQ EGVSALIQQE MERSAATLMR DVSEKIQKEV LVTIQSRESN VDATARKTSS QSGGGGLNED DVRRIVRDAL RIYDADKTGL VDYALESAGG QILSTRCTEN YQTHSAQMSI FGIPLWYPTN TPRTVISPTM QPGQCWAFAG FPGYLVIQLN SDIVVTGFSL EHISKLLAPN GQIDSAPKNF SVWGLATEND QEPIQLGNYQ YLDNGAALQY FPVDDPTRPE LVGRTFRIVE LRIETNHGNA RYTCLYRFRV HGERA // ID Q17IW1_AEDAE Unreviewed; 1361 AA. AC Q17IW1; DT 25-JUL-2006, integrated into UniProtKB/TrEMBL. DT 25-JUL-2006, sequence version 1. DT 11-NOV-2015, entry version 46. DE SubName: Full=AAEL002235-PA {ECO:0000313|EMBL:EAT46632.1}; GN ORFNames=AAEL002235 {ECO:0000313|EMBL:EAT46632.1}; OS Aedes aegypti (Yellowfever mosquito) (Culex aegypti). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Culicinae; Aedini; Aedes; Stegomyia. OX NCBI_TaxID=7159 {ECO:0000313|EMBL:EAT46632.1, ECO:0000313|Proteomes:UP000008820}; RN [1] {ECO:0000313|EMBL:EAT46632.1, ECO:0000313|Proteomes:UP000008820} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=LVPib12 {ECO:0000313|Proteomes:UP000008820}; RX PubMed=17510324; DOI=10.1126/science.1138878; RA Nene V., Wortman J.R., Lawson D., Haas B.J., Kodira C.D., Tu Z.J., RA Loftus B.J., Xi Z., Megy K., Grabherr M., Ren Q., Zdobnov E.M., RA Lobo N.F., Campbell K.S., Brown S.E., Bonaldo M.F., Zhu J., RA Sinkins S.P., Hogenkamp D.G., Amedeo P., Arensburger P., RA Atkinson P.W., Bidwell S.L., Biedler J., Birney E., Bruggner R.V., RA Costas J., Coy M.R., Crabtree J., Crawford M., DeBruyn B., RA DeCaprio D., Eiglmeier K., Eisenstadt E., El-Dorry H., Gelbart W.M., RA Gomes S.L., Hammond M., Hannick L.I., Hogan J.R., Holmes M.H., RA Jaffe D., Johnston S.J., Kennedy R.C., Koo H., Kravitz S., RA Kriventseva E.V., Kulp D., Labutti K., Lee E., Li S., Lovin D.D., RA Mao C., Mauceli E., Menck C.F., Miller J.R., Montgomery P., Mori A., RA Nascimento A.L., Naveira H.F., Nusbaum C., O'Leary S.B., Orvis J., RA Pertea M., Quesneville H., Reidenbach K.R., Rogers Y.-H.C., Roth C.W., RA Schneider J.R., Schatz M., Shumway M., Stanke M., Stinson E.O., RA Tubio J.M.C., Vanzee J.P., Verjovski-Almeida S., Werner D., RA White O.R., Wyder S., Zeng Q., Zhao Q., Zhao Y., Hill C.A., RA Raikhel A.S., Soares M.B., Knudson D.L., Lee N.H., Galagan J., RA Salzberg S.L., Paulsen I.T., Dimopoulos G., Collins F.H., Bruce B., RA Fraser-Liggett C.M., Severson D.W.; RT "Genome sequence of Aedes aegypti, a major arbovirus vector."; RL Science 316:1718-1723(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH477236; EAT46632.1; -; Genomic_DNA. DR RefSeq; XP_001654972.1; XM_001654922.1. DR UniGene; Aae.18838; -. DR STRING; 7159.AAEL002235-PA; -. DR EnsemblMetazoa; AAEL002235-RA; AAEL002235-PA; AAEL002235. DR GeneID; 5573936; -. DR KEGG; aag:AaeL_AAEL002235; -. DR VectorBase; AAEL002235; Aedes aegypti. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000044781; -. DR InParanoid; Q17IW1; -. DR OMA; FEAFETD; -. DR OrthoDB; EOG7MPRDC; -. DR PhylomeDB; Q17IW1; -. DR Proteomes; UP000008820; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008820}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008820}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 7 28 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 911 949 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1361 AA; 149075 MW; 1A90DFBBB3FB8B25 CRC64; MRTSLCYTYC TLLIVSLVSS GLLFILVASE ATKPLNHLQS NDAQPEQHHS PLVQQEREQQ PVELGGTGKA PPDTAQIVEN GKQPTKGWES ESGSDDGGTG QSSDRSKLNS GGVKVQEGGS KKRHGLGKTS ARRTSSRSRK RDILTEKPAK LEEPPDLKEV IFLDALSKNS GVTNSGLQDI PGTPSVITLV DQQASELNIE SRLEENIEAT LKNISHQLIS SPVDEPEEQQ ATLDQQQPQE EELPEDATEV QAAPEQAASP EEKIQDTGGG EIGKEEQPGV VQQVIPNVTF DPQHAGSAEE NPMPVFSEWA QKQMAEAEKK HGENVNASAM KRNSKPPGNK MPPLKLRAKN YAAPDCGAKI IASNPEAQST GSVLTSHKDE YLLNPCTSKI WFVVELCEPV QAERVELANF ELFSSSPKEF SVSVSNRFPT RDWSNVGQFT AKDERDVQSF NLHPHLFGKF VRVEIHSHYN SEHYCPVSLF RVYGTSEFEA FETDNTPSLA EDGEEDDDVE LTIEGLNLIE EILGSEENMD GGDPTKNKKA NILKSAGEAV MNIVKKAAEA LGKTNENDSL RNETESINGS IGDNQLGSIG VVAPQHPLYC FSLAFQPLCV SCPPELRNRL ERTLNCKYNL LTSLLSIKSI STSFDESQHL LCANFLGFNL KREQVSGSYN IEQSILSWLP ADVLASFCNL QAYEQGMIET RMKSADTSVK MPERPTSTEP PSTTQAEPQN DTDDQPTLTV IPSVTAPVTV EPSSPSPPNP EQQPSPSLDD VNMFNVGVPQ DTDSVTAKHT EEPSATIPTT STTTTVPVVD STQERDDSGL LDQQNSGSSM EDLDNLLLDQ QMDGLSAGSG SIPTITTTST TTPSPGAPVA QKGQPESVFL RLSNRIKALE RNMSLSGQYL EELSRRYRKQ VEELQHSYAK TLHDIEEQNR RMRDSETQLR EENERLRENF YTFRDSILSW KNIALAVGGF LVVQVVVVYA MIRSCASGGR RADRDEIARE LEHMGDQKPP VKGKLLRRRS IDGVMGVADE KAIGSLKKKR PSEEALNISG TYENLLIAEN GGGDSAKVER KKKNKQRKIS APSMVQQTNG NGKVKRASSV EPSPVGKIGK AELVRTESAP EPRRHSPEKP KPDENNRIEE LPLLEDNDEF IIPTAMDLSY NEFVPDSTSE TVNQTNGMIS SSSSIDSKST NKSGKGRRLS SPAFFKSSLL RSSRKSSGKK STPSQNVSSA SSNTSSTGSS NVRININVHN VNRTEDELSS LPTDDAQTTS STLNTPVSHQ HSWEWYKLKK SSSQDKVTKR KSKSESPEVE TTRHNNGSIN GSEDHRLRTS VSFNGSTGSS EKKIGGGTGG GSFRRLFRKV F // ID Q292X5_DROPS Unreviewed; 621 AA. AC Q292X5; DT 04-APR-2006, integrated into UniProtKB/TrEMBL. DT 14-OCT-2008, sequence version 2. DT 11-NOV-2015, entry version 40. DE SubName: Full=GA15001 {ECO:0000313|EMBL:EAL24736.2}; GN Name=Dpse\GA15001 {ECO:0000313|EMBL:EAL24736.2}; GN ORFNames=Dpse_GA15001 {ECO:0000313|EMBL:EAL24736.2}, GN GA15001 {ECO:0000313|FlyBase:FBgn0075026}; OS Drosophila pseudoobscura pseudoobscura (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=46245 {ECO:0000313|EMBL:EAL24736.2, ECO:0000313|Proteomes:UP000001819}; RN [1] {ECO:0000313|EMBL:EAL24736.2, ECO:0000313|Proteomes:UP000001819} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MV2-25 / Tucson 14011-0121.94 RC {ECO:0000313|Proteomes:UP000001819}; RX PubMed=15632085; DOI=10.1101/gr.3059305; RA Richards S., Liu Y., Bettencourt B.R., Hradecky P., Letovsky S., RA Nielsen R., Thornton K., Hubisz M.J., Chen R., Meisel R.P., RA Couronne O., Hua S., Smith M.A., Zhang P., Liu J., Bussemaker H.J., RA van Batenburg M.F., Howells S.L., Scherer S.E., Sodergren E., RA Matthews B.B., Crosby M.A., Schroeder A.J., Ortiz-Barrientos D., RA Rives C.M., Metzker M.L., Muzny D.M., Scott G., Steffen D., RA Wheeler D.A., Worley K.C., Havlak P., Durbin K.J., Egan A., Gill R., RA Hume J., Morgan M.B., Miner G., Hamilton C., Huang Y., Waldron L., RA Verduzco D., Clerc-Blankenburg K.P., Dubchak I., Noor M.A.F., RA Anderson W., White K.P., Clark A.G., Schaeffer S.W., Gelbart W.M., RA Weinstock G.M., Gibbs R.A.; RT "Comparative genome sequencing of Drosophila pseudoobscura: RT chromosomal, gene, and cis-element evolution."; RL Genome Res. 15:1-18(2005). RN [2] {ECO:0000313|EMBL:EAL24736.2, ECO:0000313|Proteomes:UP000001819} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MV2-25 {ECO:0000313|EMBL:EAL24736.2}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM000071; EAL24736.2; -; Genomic_DNA. DR RefSeq; XP_001360162.2; XM_001360125.2. DR EnsemblMetazoa; FBtr0278860; FBpp0277298; FBgn0075026. DR GeneID; 4803414; -. DR KEGG; dpo:Dpse_GA15001; -. DR FlyBase; FBgn0075026; Dpse\GA15001. DR InParanoid; Q292X5; -. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR Proteomes; UP000001819; Chromosome 3. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001819}; KW Reference proteome {ECO:0000313|Proteomes:UP000001819}. FT COILED 126 153 {ECO:0000256|SAM:Coils}. FT COILED 246 266 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 621 AA; 70225 MW; 3A11FEB1948EC7DA CRC64; MRNAWSLLQE DQKLSYVQHV QALLPLPLTL LATCRGHLSS ATASLKSLLE LPLVPRAPEE AETIKSNMAG IEQSIRKALT AEEYENILNH VNSYVQQLVE LKLQQQQQQY TQREQQLSPQ QIHIIVQMMK QNLQDFTAKA ELSEQDLNDL AAKLKLELQR SGDWQPEARL STANLEEINR LIKAEVNLQE SHYTLLLEKI DWGALLERIL GSPKLADFVD GRINLALQEE KVLKDGSGSH ATEQEIDRLN KEIAFIKLAL SDNQAENTNL QQSISRLKIG QEDLLERMQQ HELASDQRFS LLLAEIETKL AALNDSQFVL LNKQVKLSLV EILGFKQSAM GSGKDGAKLD DIDLQNWVRS VFVAKDYLEQ QLLELNKRTN NNIRDEIERS SIVLMSEISE RLKREILLAV EAKHNESSSS LEGEIGEEAV RQIVKAVLAT YDADKTGLVD FALESAGGQI LSTRCTESYQ TKTAQISVFG IPLWYPTNTP RVAISPNVQP GECWAFQGFP GFLVLKLNSL VYVTGFTLEH IPKSLSPTGR IDSAPRNFTV WGLEHEKDQD PVLFGEYEYQ DNGASLQYFT LQNLEIQRPY EIVELRIETN HGQPTYTCLY RFRVHGKPPA S // ID Q29MX3_DROPS Unreviewed; 690 AA. AC Q29MX3; DT 04-APR-2006, integrated into UniProtKB/TrEMBL. DT 14-OCT-2008, sequence version 2. DT 11-NOV-2015, entry version 32. DE SubName: Full=GA19706 {ECO:0000313|EMBL:EAL33570.2}; GN Name=Dpse\GA19706 {ECO:0000313|EMBL:EAL33570.2}; GN ORFNames=Dpse_GA19706 {ECO:0000313|EMBL:EAL33570.2}, GN GA19706 {ECO:0000313|FlyBase:FBgn0079702}; OS Drosophila pseudoobscura pseudoobscura (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=46245 {ECO:0000313|EMBL:EAL33570.2, ECO:0000313|Proteomes:UP000001819}; RN [1] {ECO:0000313|EMBL:EAL33570.2, ECO:0000313|Proteomes:UP000001819} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MV2-25 / Tucson 14011-0121.94 RC {ECO:0000313|Proteomes:UP000001819}; RX PubMed=15632085; DOI=10.1101/gr.3059305; RA Richards S., Liu Y., Bettencourt B.R., Hradecky P., Letovsky S., RA Nielsen R., Thornton K., Hubisz M.J., Chen R., Meisel R.P., RA Couronne O., Hua S., Smith M.A., Zhang P., Liu J., Bussemaker H.J., RA van Batenburg M.F., Howells S.L., Scherer S.E., Sodergren E., RA Matthews B.B., Crosby M.A., Schroeder A.J., Ortiz-Barrientos D., RA Rives C.M., Metzker M.L., Muzny D.M., Scott G., Steffen D., RA Wheeler D.A., Worley K.C., Havlak P., Durbin K.J., Egan A., Gill R., RA Hume J., Morgan M.B., Miner G., Hamilton C., Huang Y., Waldron L., RA Verduzco D., Clerc-Blankenburg K.P., Dubchak I., Noor M.A.F., RA Anderson W., White K.P., Clark A.G., Schaeffer S.W., Gelbart W.M., RA Weinstock G.M., Gibbs R.A.; RT "Comparative genome sequencing of Drosophila pseudoobscura: RT chromosomal, gene, and cis-element evolution."; RL Genome Res. 15:1-18(2005). RN [2] {ECO:0000313|EMBL:EAL33570.2, ECO:0000313|Proteomes:UP000001819} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MV2-25 {ECO:0000313|EMBL:EAL33570.2}; RX PubMed=17994087; DOI=10.1038/nature06341; RG Drosophila 12 genomes consortium; RT "Evolution of genes and genomes on the Drosophila phylogeny."; RL Nature 450:203-218(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH379060; EAL33570.2; -; Genomic_DNA. DR RefSeq; XP_001356506.2; XM_001356470.2. DR EnsemblMetazoa; FBtr0282002; FBpp0280440; FBgn0079702. DR GeneID; 4817292; -. DR KEGG; dpo:Dpse_GA19706; -. DR FlyBase; FBgn0079702; Dpse\GA19706. DR InParanoid; Q29MX3; -. DR OMA; WTSELAR; -. DR OrthoDB; EOG7VQJCX; -. DR Proteomes; UP000001819; Partially assembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001819}; KW Reference proteome {ECO:0000313|Proteomes:UP000001819}. SQ SEQUENCE 690 AA; 74390 MW; 8FABA2FFDCC35E60 CRC64; MAAAVAVAQA KRQNNQSAAR QGLESGRHKL VRGSPTAVGA FDSSKDAPGP PGNSHFVPKE PGAEVAKWTS ELARKVDVLM ADVRQLKDNC FKMMSMPIRN SMQFPNGLKN TLLMQNPNLV QSPNGNKNGI HDATAIKQQN SSRDLNSVNN QNLSHDRNSH KDSSQYPNSN QDPNFYKDQY YKQDAYPNEQ SYSDQHRYSK QDSYANEGAY SNQDPYLDQQ RYSNEGPYSN QHNYHTESLP VGYNRLNFAS DELGASIVSV EASPIGHSGI FKRLLGLEFS SNPPVNMLRP SLSPGACFGY RGVRAIAIIH LAKEIIVDTI TLSHPPKDMM PNLCENAPKD FKVIGIKPNY NEKEPLGQFT YHNHANRRTE IYRIDNKSTF RRLVLEFYSN HGGQFTCIYR VEVYGSLPAP DPQGNERGRG KDHGKGDLHA EGADNGQGGD VSGQSDLSVP EAVRGPVDSR GTEDSTGPLY TSTPRDSGRP EDVSGGDLRR GDDVRGQKQV CGRDGEICEK PGCKRCAPRD SSGPVDSSVP GYSSGPDSVR GPVDSRGTVD SSGPVDSSGP RDSSRLGDSS APDESTGPLY TSTPRDSSRP GDSSGPESVR GPVDSSGPVD SSGPRDSSRL GDSSAPDEST GPLYTSTPRD SSRPGDSSRP GDSSGPEDVS GGDVRGQNEG CGRDGGSCKN PGCKNCRREL // ID Q2H8T8_CHAGB Unreviewed; 945 AA. AC Q2H8T8; DT 21-MAR-2006, integrated into UniProtKB/TrEMBL. DT 21-MAR-2006, sequence version 1. DT 11-NOV-2015, entry version 29. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:EAQ91431.1}; GN ORFNames=CHGG_03366 {ECO:0000313|EMBL:EAQ91431.1}; OS Chaetomium globosum (strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC OS 6347 / NRRL 1970) (Soil fungus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae; OC Chaetomium. OX NCBI_TaxID=306901 {ECO:0000313|Proteomes:UP000001056}; RN [1] {ECO:0000313|Proteomes:UP000001056} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970 RC {ECO:0000313|Proteomes:UP000001056}; RG The Broad Institute Genome Sequencing Platform; RA Birren B.W., Lander E.S., Galagan J.E., Devon K., Nusbaum C., RA Ma L.-J., Jaffe D.B., Butler J., Alvarez P., Gnerre S., Grabherr M., RA Kleber M., Mauceli E.W., Brockman W., Rounsley S., Young S.K., RA LaButti K., Pushparaj V., DeCaprio D., Crawford M., Koehrsen M., RA Engels R., Montgomery P., Pearson M., Howarth C., Kodira C.D., RA Yandava C., Zeng Q., Alvarado L., Oleary S., Untereiner W.; RT "Annotation of the Chaetomium globosum CBS 148.51 genome."; RL Submitted (MAR-2005) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CH408030; EAQ91431.1; -; Genomic_DNA. DR RefSeq; XP_001229882.1; XM_001229881.1. DR STRING; 306901.XP_001229882.1; -. DR EnsemblFungi; EAQ91431; EAQ91431; CHGG_03366. DR GeneID; 4388518; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; Q2H8T8; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001056; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001056}; KW Reference proteome {ECO:0000313|Proteomes:UP000001056}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 27 {ECO:0000256|SAM:SignalP}. FT CHAIN 28 945 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004208645. SQ SEQUENCE 945 AA; 103576 MW; 208351E4C234DF0E CRC64; MRASVIWAKP LTPALVVLGL HVLAAYGSRS GSETAVTTAP AATEVCESRT INYITHTLPQ QCLRTSWTSP TPAVTPDDST IQPAVTTTAT TATETPGSTQ DNETVQEQDA QEELAASSFM SFEEWKEMML RKSGQDPANI KSQKQREHRE RDPSMNSGDV DSFGEEGEIA LDFDALAEKV SEITSSTDKA TPKTDEAAKE EQIFYDDGKT QYYRSKDAGK TCKERFSYAS FDAGATVLKT SARAKNAKAI LVENKDSYML LECRAKNKFV IVELSDDILV DTVVLANFEF FSSMIRKFRV SVSDRYPVKL DKWVDLGTFE ARNARDMQAF LIEHPQIYTK YLRIEFLSHW GNEFYCPVSL LRVHGTRMLD TWKEPNHDDE PEQIQGSTQE PAAEPQQMQE PTNSENPSVV ETEEIAPRAS IEMGLSPWRP LFQGNFSLQV CELRSPTTAD PTPVESGSVI PPKKPAAAPD SVTPRPSAPS VDDAPRSRPG NSSSSEPASP GASVSQGHSG GAAASSAPST PPQANDNTAS NSGQKQSDGK GDTAENSSTT TTTPRNKTSS VSSASASPTV QESFFKTMTK RLQLLESNTS LSLQYIEEQS RFLQEVLLKM ERKQITRVDS FLDTLNKTVL SELRNVRTQY DQIWQSTVLA LETQREQSQG EIVALTSRLN ILADEVVFQK RMAILQSVLL LACLVLVIFS RGGLAALDSA SFPNFPPAST GTTYRRYGYA RSDSLSGISM PSSSPPHPGP NGQPPNGAAA AQNYSNNNNN NNNGNNLATS ALPRHLYPTS SSASFRDKTL PLTPPSEYSR ESTPATSRLN HLHHSPEPRF YEKQDPDQDQ DPEQEPGQEQ EQEQEYGTPS RPRPRRRHTA SAALTTMPAS DADAEVSSIE INPGMTEEIA SPPRRGEEEK GQEVQEEGKP PLLRARSSQL GGLRKPLPAL PEDPS // ID Q2U5Z7_ASPOR Unreviewed; 732 AA. AC Q2U5Z7; DT 24-JAN-2006, integrated into UniProtKB/TrEMBL. DT 24-JAN-2006, sequence version 1. DT 11-NOV-2015, entry version 34. DE SubName: Full=Predicted protein {ECO:0000313|EMBL:BAE63018.1}; GN ORFNames=AO090120000440 {ECO:0000313|EMBL:BAE63018.1}; OS Aspergillus oryzae (strain ATCC 42149 / RIB 40) (Yellow koji mold). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=510516 {ECO:0000313|EMBL:BAE63018.1, ECO:0000313|Proteomes:UP000006564}; RN [1] {ECO:0000313|Proteomes:UP000006564} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 42149 / RIB 40 {ECO:0000313|Proteomes:UP000006564}; RX PubMed=16372010; DOI=10.1038/nature04300; RA Machida M., Asai K., Sano M., Tanaka T., Kumagai T., Terai G., RA Kusumoto K., Arima T., Akita O., Kashiwagi Y., Abe K., Gomi K., RA Horiuchi H., Kitamoto K., Kobayashi T., Takeuchi M., Denning D.W., RA Galagan J.E., Nierman W.C., Yu J., Archer D.B., Bennett J.W., RA Bhatnagar D., Cleveland T.E., Fedorova N.D., Gotoh O., Horikawa H., RA Hosoyama A., Ichinomiya M., Igarashi R., Iwashita K., Juvvadi P.R., RA Kato M., Kato Y., Kin T., Kokubun A., Maeda H., Maeyama N., RA Maruyama J., Nagasaki H., Nakajima T., Oda K., Okada K., Paulsen I., RA Sakamoto K., Sawano T., Takahashi M., Takase K., Terabayashi Y., RA Wortman J.R., Yamada O., Yamagata Y., Anazawa H., Hata Y., Koide Y., RA Komori T., Koyama Y., Minetoki T., Suharnan S., Tanaka A., Isono K., RA Kuhara S., Ogasawara N., Kikuchi H.; RT "Genome sequencing and analysis of Aspergillus oryzae."; RL Nature 438:1157-1161(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AP007166; BAE63018.1; -; Genomic_DNA. DR ProteinModelPortal; Q2U5Z7; -. DR EnsemblFungi; CADAORAT00011568; CADAORAP00011333; CADAORAG00011568. DR HOGENOM; HOG000176993; -. DR OMA; CWCSAPR; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000006564; Chromosome 5. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006564}; KW Reference proteome {ECO:0000313|Proteomes:UP000006564}. SQ SEQUENCE 732 AA; 80305 MW; CD1DEF7257EC175D CRC64; MPPKRASTRR AGAVTRTSER GTPSYIPNMS SPDARNPALP DIPTKQSFAY GSSTTPILPR ELSAKPRMNL AEMATNIDEG RRVAQDRDFD RPHMNTRSRR QSISASLSPV RRSRREPTPD QLQLLDSLRE ATMSPNPNGQ DHAEQSTPTP TPPIPHTLST ASSPATESLT NPKYPVLTTD QLYPSPLLRY GSPARNAISL SSPNFATSID NESVVSWNVE RDIHEDDLQR TRPNGYLDGP HGKNITAPPR RFSGFAFAQE PIEELDEPTT QLSITKSRSP EAVAADIQPK PEPLSEPEMV PSPESTPSPE PEPEPEREPE PEPEREPERQ PTPPRAPALA PRTASKPELS SAPTRTIIPS NPIREASFDE STHESTSPLR ERVKSNVRSV GNAAVGLQKG LPIKPVSLVV LAVVSILTAC FFGDQISSIS SSIGSRLPLY GSPFRDLNAT ALQAVHGLSN QVVRLGEEVS SLSKEVDVIK SEVEHIPAPS TIVQPIPAQE TPKTNFLSIG MGVLVDPYNT SPTSGRSAGF LQKLHSRFLP SSSQQQPEPP LAALTPWQDV GECWCSKPRS GMSQLALHLG REIVPEEVVI EHIPKGASIR PEVAPRDMEL WAQFQIVDES NPDSPPSPNP SRTSGILSEE LSLHNHIIDT LRLAYKDEPE GAYSNDELLG PSFYRVGQWT YDLHASNHIQ KFELDAIIDV PAIRVNKVAF RVKSNWGGND TCLYRLKLYG HI // ID Q2UTT7_ASPOR Unreviewed; 648 AA. AC Q2UTT7; DT 24-JAN-2006, integrated into UniProtKB/TrEMBL. DT 24-JAN-2006, sequence version 1. DT 14-OCT-2015, entry version 36. DE SubName: Full=Uncharacterized conserved protein {ECO:0000313|EMBL:BAE55028.1}; GN ORFNames=AO090009000597 {ECO:0000313|EMBL:BAE55028.1}; OS Aspergillus oryzae (strain ATCC 42149 / RIB 40) (Yellow koji mold). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=510516 {ECO:0000313|EMBL:BAE55028.1, ECO:0000313|Proteomes:UP000006564}; RN [1] {ECO:0000313|Proteomes:UP000006564} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 42149 / RIB 40 {ECO:0000313|Proteomes:UP000006564}; RX PubMed=16372010; DOI=10.1038/nature04300; RA Machida M., Asai K., Sano M., Tanaka T., Kumagai T., Terai G., RA Kusumoto K., Arima T., Akita O., Kashiwagi Y., Abe K., Gomi K., RA Horiuchi H., Kitamoto K., Kobayashi T., Takeuchi M., Denning D.W., RA Galagan J.E., Nierman W.C., Yu J., Archer D.B., Bennett J.W., RA Bhatnagar D., Cleveland T.E., Fedorova N.D., Gotoh O., Horikawa H., RA Hosoyama A., Ichinomiya M., Igarashi R., Iwashita K., Juvvadi P.R., RA Kato M., Kato Y., Kin T., Kokubun A., Maeda H., Maeyama N., RA Maruyama J., Nagasaki H., Nakajima T., Oda K., Okada K., Paulsen I., RA Sakamoto K., Sawano T., Takahashi M., Takase K., Terabayashi Y., RA Wortman J.R., Yamada O., Yamagata Y., Anazawa H., Hata Y., Koide Y., RA Komori T., Koyama Y., Minetoki T., Suharnan S., Tanaka A., Isono K., RA Kuhara S., Ogasawara N., Kikuchi H.; RT "Genome sequencing and analysis of Aspergillus oryzae."; RL Nature 438:1157-1161(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AP007150; BAE55028.1; -; Genomic_DNA. DR ProteinModelPortal; Q2UTT7; -. DR EnsemblFungi; CADAORAT00004021; CADAORAP00003951; CADAORAG00004021. DR HOGENOM; HOG000172520; -. DR OMA; YVIVELS; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000006564; Chromosome 1. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000006564}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006564}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 28 {ECO:0000256|SAM:SignalP}. FT CHAIN 29 648 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004217377. FT TRANSMEM 599 619 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 648 AA; 71613 MW; 4D634A4F344C471C CRC64; MVIPAWAAAA WTSLTTALIL PGIPGAAAEN KQALCLARHW SEVEAEFIQW PICVESRWER TAPRITQDTT RSPDQTVSVT VSEGAPSTTA IPAPGGQPDH ELDTDSPLDN SNFLSFEDWK KQNLAKVGQS AENVRGNRHA AGKEDRRRPT GINNALDSLG EDTEIDLDFG GFGAEASDAA KPTSWGSSIP TAGITGTAAG ASAGDMEAAV SADLRKGASR GKDAGTTCKE RFNYASFDCA ATVLKTNPEC KGSSSVLVEN KDSYMLNECR AKNKFLILEL CDDILVDTVV LANYEFFSSI FHTFRVSVAD RYPAKTDQWR ELGVYEARNT REIQAFAVEN PLIWARYVKI EFLTHYGNEF YCPLSLVRIH GTTMLEEYKH DGETNRGDEE AAAEALEPSP HPVDVEVKDV AQQPLTTVAL PDEPTNGPTA TIEAQGSCSH HETVRQDAAH ESEIKSVSSP KEESSIPSES VRPSGTQPPS SNPTTQESFF KSVNKRLQML ESNSTLSLLY IEEQSRILRD AFSKVEKRQL AKTSTFLENL NVTVLNELRQ FREQYDQVWK SVALEFEHQR IQYHQEIHSI SAQLGVLADE LVFQKRVSVI QSIMILFCFA LVLFSRVPLG TYIDIPRKKN FEQSAANHSA NSEAEMQK // ID Q2YDP2_BOVIN Unreviewed; 372 AA. AC Q2YDP2; DT 20-DEC-2005, integrated into UniProtKB/TrEMBL. DT 20-DEC-2005, sequence version 1. DT 11-NOV-2015, entry version 61. DE SubName: Full=Sperm associated antigen 4-like {ECO:0000313|EMBL:AAI10132.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSBTAP00000017354}; GN Name=SPAG4L {ECO:0000313|EMBL:AAI10132.1}; GN Synonyms=SUN5 {ECO:0000313|Ensembl:ENSBTAP00000017354}; OS Bos taurus (Bovine). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; OC Pecora; Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913; RN [1] {ECO:0000313|EMBL:AAI10132.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=Crossbred x Angus {ECO:0000313|EMBL:AAI10132.1}; RC TISSUE=Liver {ECO:0000313|EMBL:AAI10132.1}; RA Moore S., Alexander L., Brownstein M., Guan L., Lobo S., Meng Y., RA Tanaguchi M., Wang Z., Yu J., Prange C., Schreiber K., Shenmen C., RA Wagner L., Bala M., Barbazuk S., Barber S., Babakaiff R., Beland J., RA Chun E., Del Rio L., Gibson S., Hanson R., Kirkpatrick R., Liu J., RA Matsuo C., Mayo M., Santos R.R., Stott J., Tsai M., Wong D., RA Siddiqui A., Holt R., Jones S.J., Marra M.A.; RL Submitted (NOV-2005) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSBTAP00000017354, ECO:0000313|Proteomes:UP000009136} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000017354, RC ECO:0000313|Proteomes:UP000009136}; RX PubMed=19393038; DOI=10.1186/gb-2009-10-4-r42; RA Zimin A.V., Delcher A.L., Florea L., Kelley D.R., Schatz M.C., RA Puiu D., Hanrahan F., Pertea G., Van Tassell C.P., Sonstegard T.S., RA Marcais G., Roberts M., Subramanian P., Yorke J.A., Salzberg S.L.; RT "A whole-genome assembly of the domestic cow, Bos taurus."; RL Genome Biol. 10:R42.01-R42.10(2009). RN [3] {ECO:0000313|Ensembl:ENSBTAP00000017354} RP IDENTIFICATION. RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000017354}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; DAAA02036418; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BC110131; AAI10132.1; -; mRNA. DR RefSeq; NP_001039630.1; NM_001046165.2. DR RefSeq; XP_003583016.1; XM_003582968.3. DR UniGene; Bt.54258; -. DR STRING; 9913.ENSBTAP00000017354; -. DR Ensembl; ENSBTAT00000017354; ENSBTAP00000017354; ENSBTAG00000013056. DR GeneID; 100851588; -. DR GeneID; 514087; -. DR KEGG; bta:100851588; -. DR KEGG; bta:514087; -. DR CTD; 140732; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000007503; -. DR HOVERGEN; HBG055206; -. DR OMA; GNPRFTC; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 20871171; -. DR Proteomes; UP000009136; Chromosome 13. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000009136}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000009136}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 102 119 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 156 183 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 372 AA; 42629 MW; 28232CAA8136BA0F CRC64; MPRSSRSPED SCDQPENVAH VSREVRPRRI IQRGRNICRM TEAPLSNASD AFLLPVRINA PGLTQCMLEC VSWITCLACF LRTRAHRVLF NTCRYKLFIH KLIEKAGVLI LCAFGFWMFS VHLPSKMDIW QEDSIDSPLQ SLRMYQEKVR HHTGEIQDLR GHLNQLIAKL QEMEAMSDEQ KMTQKIIKMI QGDYIEKPDF ALKSIGASID FEQTSATYNH DKACSYWNWI RLWNYAQPPD VILEPNVTPG NCWAFSGDRG QVTIRLAQKV YLSNLTLQHI PKTISLSGSL DTAPKDFVIY GVEGSPKEEV FLGAFQFQPE NIIQTFQLQN QPPRTFGAVK VKISSNWGNP RFTCLYRVRV HGSVTLPREQ PN // ID Q32LM1_BOVIN Unreviewed; 433 AA. AC Q32LM1; DT 06-DEC-2005, integrated into UniProtKB/TrEMBL. DT 06-DEC-2005, sequence version 1. DT 11-NOV-2015, entry version 43. DE SubName: Full=Sperm associated antigen 4 {ECO:0000313|EMBL:AAI09515.1}; GN Name=SPAG4 {ECO:0000313|EMBL:AAI09515.1}; OS Bos taurus (Bovine). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; OC Pecora; Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913 {ECO:0000313|EMBL:AAI09515.1}; RN [1] {ECO:0000313|EMBL:AAI09515.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=Crossbred x Angus {ECO:0000313|EMBL:AAI09515.1}; RC TISSUE=Liver {ECO:0000313|EMBL:AAI09515.1}; RA Moore S., Alexander L., Brownstein M., Guan L., Lobo S., Meng Y., RA Tanaguchi M., Wang Z., Yu J., Prange C., Schreiber K., Shenmen C., RA Wagner L., Bala M., Barbazuk S., Barber S., Babakaiff R., Beland J., RA Chun E., Del Rio L., Gibson S., Hanson R., Kirkpatrick R., Liu J., RA Matsuo C., Mayo M., Santos R.R., Stott J., Tsai M., Wong D., RA Siddiqui A., Holt R., Jones S.J., Marra M.A.; RL Submitted (NOV-2005) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC109514; AAI09515.1; -; mRNA. DR RefSeq; NP_001069975.1; NM_001076507.2. DR UniGene; Bt.24857; -. DR STRING; 9913.ENSBTAP00000009127; -. DR PaxDb; Q32LM1; -. DR GeneID; 618468; -. DR KEGG; bta:618468; -. DR CTD; 6676; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000246956; -. DR HOVERGEN; HBG079205; -. DR NextBio; 20901201; -. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 130 153 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 159 184 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 197 231 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 433 AA; 47287 MW; 371F96DE6C6A4C9F CRC64; MRRSPRPGSA ASQHKHTPNF YSDNSNSSVS VTSGDSCGHR SAGPGPGEPE GRRARGSSCG EPALSAGVPG GTTWAGSSRQ KPAPRSHNGQ TACGAATVRG GASVSEEQLD LLPTLDLRQE MPSPQVSKSF LSLLFQVLSV LLSLVGDVLV SVYREVCSIR FLLTAVSLLS LFLAALWWGL LYLAPPLENE PKEMLTLSEY HERVRSQGQQ LQQLQAELVK LHKEMSSVRA ANSERVAQLV FQRLSEDFVQ KPDYALSSVG ASIDLEKTSQ DYEDANTAYF WNRFSFWNFA RPPTVILEPD VFPGNCWAFE GDQGQVVIRL PGRVQLSDIT LQHPPPTVAH TRGANSAPRD FAVYGLQVDG ETEVFLGKFT FDVEKSEIQT FHLQNDPPAA FPKVKIQILS NWGHPRFTCL YRVRAHGIRT SEGAGDSATG GAH // ID Q4DHP7_TRYCC Unreviewed; 465 AA. AC Q4DHP7; DT 13-SEP-2005, integrated into UniProtKB/TrEMBL. DT 13-SEP-2005, sequence version 1. DT 11-NOV-2015, entry version 40. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAN92037.1}; GN ORFNames=Tc00.1047053511491.30 {ECO:0000313|EMBL:EAN92037.1}; OS Trypanosoma cruzi (strain CL Brener). OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; Trypanosoma; OC Schizotrypanum. OX NCBI_TaxID=353153 {ECO:0000313|Proteomes:UP000002296}; RN [1] {ECO:0000313|EMBL:EAN92037.1, ECO:0000313|Proteomes:UP000002296} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CL Brener {ECO:0000313|EMBL:EAN92037.1, RC ECO:0000313|Proteomes:UP000002296}; RX PubMed=16020725; DOI=10.1126/science.1112631; RA El-Sayed N.M.A., Myler P.J., Bartholomeu D.C., Nilsson D., RA Aggarwal G., Tran A.-N., Ghedin E., Worthey E.A., Delcher A.L., RA Blandin G., Westenberger S.J., Caler E., Cerqueira G.C., Branche C., RA Haas B., Anupama A., Arner E., Aslund L., Attipoe P., Bontempi E., RA Bringaud F., Burton P., Cadag E., Campbell D.A., Carrington M., RA Crabtree J., Darban H., da Silveira J.F., de Jong P., Edwards K., RA Englund P.T., Fazelina G., Feldblyum T., Ferella M., Frasch A.C., RA Gull K., Horn D., Hou L., Huang Y., Kindlund E., Klingbeil M., RA Kluge S., Koo H., Lacerda D., Levin M.J., Lorenzi H., Louie T., RA Machado C.R., McCulloch R., McKenna A., Mizuno Y., Mottram J.C., RA Nelson S., Ochaya S., Osoegawa K., Pai G., Parsons M., Pentony M., RA Pettersson U., Pop M., Ramirez J.L., Rinta J., Robertson L., RA Salzberg S.L., Sanchez D.O., Seyler A., Sharma R., Shetty J., RA Simpson A.J., Sisk E., Tammi M.T., Tarleton R., Teixeira S., RA Van Aken S., Vogt C., Ward P.N., Wickstead B., Wortman J., White O., RA Fraser C.M., Stuart K.D., Andersson B.; RT "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas RT disease."; RL Science 309:409-415(2005). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAN92037.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAHK01000468; EAN92037.1; -; Genomic_DNA. DR RefSeq; XP_813888.1; XM_808795.1. DR ProteinModelPortal; Q4DHP7; -. DR STRING; 353153.XP_813888.1; -. DR PaxDb; Q4DHP7; -. DR EnsemblProtists; EAN92037; EAN92037; Tc00.1047053511491.30. DR GeneID; 3545350; -. DR KEGG; tcr:511491.30; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR Proteomes; UP000002296; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002296}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002296}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 430 451 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 349 376 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 465 AA; 52135 MW; DFD1CF42E01A4A26 CRC64; MKTDRLITLL VLALMAVTCG FLLLNARKRG GLEENPPWKS LSFTTNYASA YLGATLTDFS RACKGASSVL NEDKTKYMIC NCEASRKLFT VQLIREIEVR SVMLVNLEHF SSGVKKFMLL GSKKYPTSEW RVLGEFEASP WRGTQHFDVP TQEPVRFLRF LWVTSHGDDS WCTLTVFKVF GVDVLETLTE DYGGDLDNLL HSLPEKALPA PPVVVKPSPP FFTEVDARTD KAADWLHPLK EKEKLNGGAE IENLFRQVDA IIAVGSNSSG SGGTDDMCTD GDSSGCCDHN EVVENNSTTC TLHEKEAHLS RSRCKCFSRT NLFSRMALSP RPFQGGAALQ IIMQMSKHLK VLQQELEETF ARQRDTERRL EKAETALGNV GKNFRDALRL SCNYRERLID MKKEMDVMNS RLLIALESIN QKGSDVTLRI WVVCFNVVAL IALFASCVAP YSSISSRARR FTQRG // ID Q4N889_THEPA Unreviewed; 707 AA. AC Q4N889; DT 02-AUG-2005, integrated into UniProtKB/TrEMBL. DT 02-AUG-2005, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAN33819.1}; GN OrderedLocusNames=TP01_0581 {ECO:0000313|EMBL:EAN33819.1}; OS Theileria parva (East coast fever infection agent). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida; OC Theileriidae; Theileria. OX NCBI_TaxID=5875 {ECO:0000313|EMBL:EAN33819.1, ECO:0000313|Proteomes:UP000001949}; RN [1] {ECO:0000313|EMBL:EAN33819.1, ECO:0000313|Proteomes:UP000001949} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Muguga {ECO:0000313|EMBL:EAN33819.1, RC ECO:0000313|Proteomes:UP000001949}; RX PubMed=15994558; DOI=10.1126/science.1110439; RA Gardner M.J., Bishop R., Shah T., de Villiers E.P., Carlton J.M., RA Hall N., Ren Q., Paulsen I.T., Pain A., Berriman M., Wilson R.J.M., RA Sato S., Ralph S.A., Mann D.J., Xiong Z., Shallom S.J., Weidman J., RA Jiang L., Lynn J., Weaver B., Shoaibi A., Domingo A.R., Wasawo D., RA Crabtree J., Wortman J.R., Haas B., Angiuoli S.V., Creasy T.H., Lu C., RA Suh B., Silva J.C., Utterback T.R., Feldblyum T.V., Pertea M., RA Allen J., Nierman W.C., Taracha E.L.N., Salzberg S.L., White O.R., RA Fitzhugh H.A., Morzaria S., Venter J.C., Fraser C.M., Nene V.; RT "Genome sequence of Theileria parva, a bovine pathogen that transforms RT lymphocytes."; RL Science 309:134-137(2005). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAN33819.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAGK01000001; EAN33819.1; -; Genomic_DNA. DR RefSeq; XP_766102.1; XM_761009.1. DR STRING; 333668.XP_766102.1; -. DR EnsemblProtists; EAN33819; EAN33819; TP01_0581. DR GeneID; 3502633; -. DR KEGG; tpv:TP01_0581; -. DR EuPathDB; PiroplasmaDB:TP01_0581; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000135636; -. DR InParanoid; Q4N889; -. DR Proteomes; UP000001949; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001949}; KW Reference proteome {ECO:0000313|Proteomes:UP000001949}. SQ SEQUENCE 707 AA; 82996 MW; 56B163FA8C5F17B1 CRC64; MSLFKSRGNS FEKVPKTPKK SKIEYFFRFF LFILLFFFFL KKVALVNKSR LKLQDNFRVY ESKLSTSRIK AKPFDPDLYS ARIDFASEEF GTKIIAHSKG LSKVSRILED DSGSYMLTPC NTSDWFVLSF PESILIQEIS FLSFEYYSSS YKNIRISITG TYPSGKWLTL GELETDPSRN EIFDLSVVCN TDKNDCWGKY LKVELLDYHR LELNYYCSIT KMMVYGITAV EYLETEISDD SSLYNSFAQP NVTGYPSITY ETTGNTCDDM VGSVVNKQLD TAVDTDTAAD SVMVTTKKEK EAEREVCEIG PLHESPMKKL SDLKCFAKTS EVLNPTFVYD IKNVILNSFY KFLMRLALRK DKNLHNKKLI HRVLKIGKFK RYCGTNLLFQ FDCNRFITNL LLDKYSVYYA NILDQVRLPS RHLWFESVIF YFFKNKRHVI GGIMNDVGIK EVPILVCTES MGLISKFKCY FYFNLKSFST LLFTQFTEGY RIHGFKPRTG SLSHNLRDKL YLFYVDNRIN IMSVPKFKGL TIDDESLVRR YDYKVKDKYI STVTVIDNKL YINVSSSYST YFPFDVDSSN QKLDKELIKS NTSLYRFDFL SKITRYKDNL NRLNKSKRYG KKAQFDEKEK FKHRNFYQQD ENTKSTDSKT HKHVLLKLSE RIKSLEFLTN KLSNKIYQVE NLLNFYIKRQ LYSNQQVTNY YLQLLTY // ID Q4Q7S7_LEIMA Unreviewed; 587 AA. AC Q4Q7S7; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 49. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAJ05833.1}; GN ORFNames=LMJF_30_0320 {ECO:0000313|EMBL:CAJ05833.1}; OS Leishmania major. OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; OC Leishmaniinae; Leishmania. OX NCBI_TaxID=5664 {ECO:0000313|Proteomes:UP000000542}; RN [1] {ECO:0000313|EMBL:CAJ05833.1, ECO:0000313|Proteomes:UP000000542} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=MHOM/IL/81/Friedlin {ECO:0000313|Proteomes:UP000000542}; RX PubMed=16020728; DOI=10.1126/science.1112680; RA Ivens A.C., Peacock C.S., Worthey E.A., Murphy L., Aggarwal G., RA Berriman M., Sisk E., Rajandream M.A., Adlem E., Aert R., Anupama A., RA Apostolou Z., Attipoe P., Bason N., Bauser C., Beck A., Beverley S.M., RA Bianchettin G., Borzym K., Bothe G., Bruschi C.V., Collins M., RA Cadag E., Ciarloni L., Clayton C., Coulson R.M.R., Cronin A., RA Cruz A.K., Davies R.M., De Gaudenzi J., Dobson D.E., Duesterhoeft A., RA Fazelina G., Fosker N., Frasch A.C., Fraser A., Fuchs M., Gabel C., RA Goble A., Goffeau A., Harris D., Hertz-Fowler C., Hilbert H., Horn D., RA Huang Y., Klages S., Knights A., Kube M., Larke N., Litvin L., RA Lord A., Louie T., Marra M., Masuy D., Matthews K., Michaeli S., RA Mottram J.C., Mueller-Auer S., Munden H., Nelson S., Norbertczak H., RA Oliver K., O'neil S., Pentony M., Pohl T.M., Price C., Purnelle B., RA Quail M.A., Rabbinowitsch E., Reinhardt R., Rieger M., Rinta J., RA Robben J., Robertson L., Ruiz J.C., Rutter S., Saunders D., RA Schaefer M., Schein J., Schwartz D.C., Seeger K., Seyler A., Sharp S., RA Shin H., Sivam D., Squares R., Squares S., Tosato V., Vogt C., RA Volckaert G., Wambutt R., Warren T., Wedler H., Woodward J., Zhou S., RA Zimmermann W., Smith D.F., Blackwell J.M., Stuart K.D., Barrell B.G., RA Myler P.J.; RT "The genome of the kinetoplastid parasite, Leishmania major."; RL Science 309:436-442(2005). RN [2] {ECO:0000313|EMBL:CAJ05833.1, ECO:0000313|Proteomes:UP000000542} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Friedlin {ECO:0000313|EMBL:CAJ05833.1}; RX PubMed=22038252; DOI=10.1101/gr.122945.111; RA Rogers M.B., Hilley J.D., Dickens N.J., Wilkes J., Bates P.A., RA Depledge D.P., Harris D., Her Y., Herzyk P., Imamura H., Otto T.D., RA Sanders M., Seeger K., Dujardin J.C., Berriman M., Smith D.F., RA Hertz-Fowler C., Mottram J.C.; RT "Chromosome and gene copy number variation allow major structural RT change between species and strains of Leishmania."; RL Genome Res. 21:2129-2142(2011). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; FR796426; CAJ05833.1; -; Genomic_DNA. DR RefSeq; XP_001684621.1; XM_001684569.1. DR ProteinModelPortal; Q4Q7S7; -. DR STRING; 5664.LmjF.30.0320; -. DR EnsemblProtists; LmjF.30.0320:mRNA; LmjF.30.0320:pep; LmjF.30.0320. DR GeneID; 5653550; -. DR KEGG; lma:LMJF_30_0320; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000258883; -. DR InParanoid; Q4Q7S7; -. DR OMA; CTITSFQ; -. DR Proteomes; UP000000542; Chromosome 30. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000542}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000542}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 519 539 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 464 491 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 587 AA; 62518 MW; 2D280E2A74D0328D CRC64; MWPQERLMVL CTLLLLYMLF SAPLELLTVF WHCTTSTAVG ISSPSPLKRL STGSVPGLST NYASLYLGAA VVSMEPSSCH GGAALISESV DTYVLCPCGA PRKQFVVQLI RDVQVRSVMV RNAEHFSSGV RNFTLLGSLQ YPTSTWLVLG HFEAEQRRGR QYFDVAPRSR VRFIKLQWAT SYGPEPWCTI TSFQVYGIDV LETLTRYDGG DDLLAGEYAA VASGGLRDTP DVHRSHPPEL PPTPGEVAAL SLAGSSAVPS RNDATSAKGS TTPAVSIDEL AAGMWDGAAA TVGASRGADA DDLLLAPVDV GASAETGPLS QPGADATRSS PTNSIALAAT APALQSLNCS AAQPIGWNTS LKCTITDLAA LWGPCAVATC GASDCTAVVI PTSAPALSVR TPSSKGLTAS RSIYQSAAGS LLTNLLRQQR STHHELTLLM QRERHLAQEL NRTRILFSDF YARSKAMERE ANEYRDRLHG LQSKLQLLQE RFLLREHSSC CGEGGGADRS GSSMMRSDTA VAVASFVLLA LTVILMFMYS SSSSRSVVGP PSGWGRYYNI GRGSGGVALG GGNGPPLWPR QQRGRAR // ID Q4RGE8_TETNG Unreviewed; 271 AA. AC Q4RGE8; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Chromosome 18 SCAF15100, whole genome shotgun sequence {ECO:0000313|EMBL:CAG12534.1}; DE Flags: Fragment; GN ORFNames=GSTENG00034837001 {ECO:0000313|EMBL:CAG12534.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAG12534.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAG12534.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01015100; CAG12534.1; -; Genomic_DNA. DR STRING; 99883.ENSTNIP00000022019; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 31 58 {ECO:0000256|SAM:Coils}. FT NON_TER 271 271 {ECO:0000313|EMBL:CAG12534.1}. SQ SEQUENCE 271 AA; 30305 MW; 98B1FCD5DBE0A2B5 CRC64; MFEEMQRRNA ATEEARLLVF QQQLSGFRAV AARLVHRIRS LEAQNLKLTK EWQLLQQRPD GSSVSPELQQ HVDGLFRKLA AELDVLANRG GSSEDQRPVA DRMADFALES QGASVISSRC SQTYTCPSPS LTLFGIPLWS SYRSPRTAIQ GSPITAGTCW SFAGAEGTLA VSLSHPVKIT HVTVDHLSRY NSPTGDIKSA PKDLEVYGMK TRAGEGTFLG RFRYDKLGES TQTFSLPKPT EEVYEMVELR VLSNWGQKEY TCLYRFRVHG Q // ID Q4RJM4_TETNG Unreviewed; 959 AA. AC Q4RJM4; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 34. DE SubName: Full=Chromosome 3 SCAF15037, whole genome shotgun sequence {ECO:0000313|EMBL:CAG11408.1}; DE Flags: Fragment; GN ORFNames=GSTENG00033370001 {ECO:0000313|EMBL:CAG11408.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAG11408.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAG11408.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01015037; CAG11408.1; -; Genomic_DNA. DR HOVERGEN; HBG104132; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 407 431 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 438 454 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 629 649 {ECO:0000256|SAM:Coils}. FT NON_TER 959 959 {ECO:0000313|EMBL:CAG11408.1}. SQ SEQUENCE 959 AA; 106655 MW; 699E425E862C5EC1 CRC64; MTMDFSQLHT YTPPQCAPEN TGYTYSLSSS YSTAALEFEQ EHQIAAVYES PRMSRRSLRL QGHYSVDYSH SQSTTRRETR TLRNKKQQSG SGGLSLSLSQ AATPRKTLSF SAVNTPVNSG IFQESSTATD AALFTGLDES HLRQRTVTTT NTFTCVDGQA GRRICSDHSS GVNGDTSASK AHASLTNGYI CKDCSFPSQK IDTFITQSSS SSSQLAQSSS DILSSSSSSS PFTSVYSRDR SRRSKTGVLR SICNTCVHYS KQSLAPLVSL VTVVFSGVLW LGSQARASTG KGVLASFTNS LRQAMSSSLS QLCQFKETTL HWFLGGRNDE REERVLTHSS FCGSMNVKGL VTEDAAHLKL NGSLCHCLLW PGHCVLRSGK VLGCGAVRAL RSLLSLLWMF LTAPGRGLLW FLATGWYQLV SLMTVLNVFF LTQCLPRLWR LLLLLLPFLL LLDLERLAHI ERQLALLGAQ LKQTDHKQDE RHGNILELYN SLKDQLHTRT DRESLGVWVS SLLDQRVGVL QGELEQEHAQ RLQSEEQQES QQRGQATRLA EIEVLLNTLA AKTQQLSCMI GVFDRRCSRS RNSSSRRNGK ARERQSPQRP RHFLSVLFKA QTLTAHLSAF SRAVKQEDHA ALLLDVQRLE AELGKIRQDL QAVVGCRGKC EQLDTLKDTA TQVSAQVRKE LQTLFFGSGG TGELPESLLH WLSQRYVSSP DLQALLASLE MSILRNVSLQ LEHSRVSTLG EAESQAKAIF HTVSGAVQHT AATEGLPEEQ VKIIVQNALR LYSQDRTGLV DYALESGGGS ILSTRCSETY ETKTALMSLF GLPLWYFSQS PRVVIQPDVY PGNCWAFKGS QGYLVIRLSL KIVPTSFCLE HIPRTLSPTG NITSAPRDFT VFGLDDEYQE EGKLLGQYTY QEDGDALQMF PVQEQNDKSF QIIEMRVLSN WGHQEYTCLY RFRVHGNPQ // ID Q4SP17_TETNG Unreviewed; 900 AA. AC Q4SP17; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 43. DE SubName: Full=Chromosome 15 SCAF14542, whole genome shotgun sequence {ECO:0000313|EMBL:CAF97615.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSTNIP00000010688}; DE Flags: Fragment; GN Name=SUCO {ECO:0000313|Ensembl:ENSTNIP00000010688}; GN ORFNames=GSTENG00015035001 {ECO:0000313|EMBL:CAF97615.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAF97615.1, ECO:0000313|Ensembl:ENSTNIP00000010688} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAF97615.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|Ensembl:ENSTNIP00000010688} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01014542; CAF97615.1; -; Genomic_DNA. DR STRING; 99883.ENSTNIP00000010688; -. DR Ensembl; ENSTNIT00000010869; ENSTNIP00000010688; ENSTNIG00000007868. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR HOVERGEN; HBG107549; -. DR OMA; KIWFIIE; -. DR OrthoDB; EOG7MPRDC; -. DR TreeFam; TF105817; -. DR Proteomes; UP000007303; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007303}; KW Reference proteome {ECO:0000313|Proteomes:UP000007303}. FT COILED 689 709 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:CAF97615.1}. SQ SEQUENCE 900 AA; 98420 MW; E269FB2BBEBE4074 CRC64; DLSSPVTSDT DSSVSCKDPE DIPTFDEWKR KVMEVEKEKT QSVHTASSGS SHTGKKVQKN FNNYASVECG AKILGSNPEA KSTSAILMEN MDMYMLNPCS NKIWFIIELC EPVQVKQLDI ANFELFSSTP KDFVVSISDR YPTNKWQKLG TFHARDERTV QSFPLDEHLY AKYVKMFAKY IKVELLSHFG SEHFCPLSLL RVFGTSMVEE YELIADPPER PDDLDDDFDY PPGYTPEVKL SKNLIGSAKE VIVHLFLNSA LLISLCSPES EGGSISADPT IPAAVAPSSE TPEAPTTDLS DQPLPHVEEQ AVLPLEKDEE EPISSTITLL DKEEEPDDEK EKGGFRGRNQ GIPVHCSVPS FSSFCSCNAS LQEYLHQQCS ASLSNKRKCQ TVHQKQTIPS IETPAWQRPL LPSGWQEPQQ PHSEEQEQAA EPEPESQASS SEAPQPPENT ASSKDSILEL PLLEPSQSSN LPKHSVTDSS SAKPTPGVET PLLASGEPGK KPDVLAEERH IEPSAPPSGS SHVQPTVSIT ADESSVVSAE ETLQADVSQS DTNTPDQTDQ ILSPPTSLFY PDPPLLNEAD SVSPEVPNLV PDLSVEPEPS SGHPGITATK TEDISEDAST SAPSAAAPPV SSSLPTSPSL SDIYADPPNG TEQNGNPVHG SSQKESVFMR LNNRIKALEV NMSLSGRYLE QLSQRYRKQM EEMQKAFNKT VIKLQNTSRI AEEQDQRQTE SIQLLQGQLE NMTRLVLNLS VRVSQLQVEV SERQNYLVLS LVLCFCVGVL LCANHCRLAT GPPNTEAEPP VAKSYSYCCA ERQFSSCDEP SLKRSASYPL IHSESFQLAA TEGKQTRRQR AAQRGPHTQT TPPSVASPQP AAESATACVQ PRARLRSRPP DAGVPSPDVL // ID Q4SVY2_TETNG Unreviewed; 1003 AA. AC Q4SVY2; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 39. DE SubName: Full=Chromosome 1 SCAF13708, whole genome shotgun sequence {ECO:0000313|EMBL:CAF95200.1}; DE Flags: Fragment; GN ORFNames=GSTENG00011750001 {ECO:0000313|EMBL:CAF95200.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAF95200.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAF95200.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01013708; CAF95200.1; -; Genomic_DNA. DR HOGENOM; HOG000136550; -. DR HOVERGEN; HBG107549; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 349 369 {ECO:0000256|SAM:Coils}. FT COILED 663 683 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:CAF95200.1}. SQ SEQUENCE 1003 AA; 110778 MW; 7EA6192F531E033A CRC64; GDVTDPSVPS KEDIPTFDEW KKQVMEVEME KSQSLYTSTT GSPHSAKKVQ KSFKNNYASV ECGAKILAAN SEAKSTSAIL KENMDLYMLN PCSNKIWFVI ELCEPIQVKQ LDIANFELFS STPKDFLVSI SDRYPTNKWV KLGTFHARDE RIVQSFPLDE QLFAKYLKMF IKYIKIELLS HFGSEHFCPL SLIRVFGTSM VEEYEEIADS QYLSERMEYL DEDYDYPPGY QLAEDNPNGS KNLLGSATNA ILNMVNNIAA NVLGATPELE GGTESEGNTT TGGDKKESTE AFPDSALLES AELEHSASQE NASDSSGLSS ATKDSQDDTQ IVTLVEEEEE EEEPRQSTVT LMEEEGEEEE DRRQEETRDA DRNQSDSHIY CPLFSSLSLS CMASLPELLH RWCSARLAKE RLRSLRRRQL GIQTHTHPAP NTPSPIHTPL LIPVPAPTPV TEELSQTETV LKLEVPLMPQ NDVKMAEVHI AQPNTPDTHT PELNVLLEPS RTVIPTHGFS DTQSFSVGLT STNEVKVLPP VKEVAQATVS TPPLQVASIP ETQPAVVASP TLTPSKEQLD PVMQGGDPQR VDDVTDEDLL SSGGNGNVQR TATDFYAELQ NGGESNAGAA NGNGMLLNGG AVHGSSQKES VFMRLNNRIK ALEMNMSLSS RYLEELSQRY RKQMEEMQRA FNKTIIKLQN TSRIAQEQDQ KQTDSIQVLQ SQLVNITKLM LNLTTTVGQL QREVSDRQSY LVVSLVLCLF LGLLLFLQCC CRSSPSTSSD TAPIPRSNHY PSPKRCFSSY DDMNLKRRMT CPIIHSNSLP LCCSEVGPDD LYIVEPLKFS PEKKKKRKSK SLDKVDLLKE YYPPAPLING APKCNGFHPC LSLQPLPPSP PPLPLPSPCP SLPPSVEEVS SPSKESPSEP SSSPVNSEES HTSGLALQTA AYMSASQCNG HGLTLSMQQL ATMSRQEKRS LKRRKSRPAE MPFSAVPSLQ QLIKGNKEIS VGTIGVTAVT GHF // ID Q4SXB7_TETNG Unreviewed; 141 AA. AC Q4SXB7; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 31. DE SubName: Full=Chromosome undetermined SCAF12572, whole genome shotgun sequence {ECO:0000313|EMBL:CAF94715.1}; DE Flags: Fragment; GN ORFNames=GSTENG00011006001 {ECO:0000313|EMBL:CAF94715.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAF94715.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAF94715.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01012572; CAF94715.1; -; Genomic_DNA. DR STRING; 99883.ENSTNIP00000008662; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOVERGEN; HBG073494; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 6 26 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:CAF94715.1}. FT NON_TER 141 141 {ECO:0000313|EMBL:CAF94715.1}. SQ SEQUENCE 141 AA; 15783 MW; 5057D28EAD3E2C26 CRC64; VPTERCENTM MHIENLRTEL NDVTKKLNYQ LPDPNYWTNY ALESHGAKVY KKQSSNTYEK IEGLKIFGIQ LFSKVGPASV IQGQHPPIPG NCWSFPGSHG NLFIELSHMV TVSHVTLDHV PSSVVPADTI SSAPRQFSVY V // ID Q4T056_TETNG Unreviewed; 163 AA. AC Q4T056; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Chromosome undetermined SCAF11339, whole genome shotgun sequence {ECO:0000313|EMBL:CAF93726.1}; DE Flags: Fragment; GN ORFNames=GSTENG00009509001 {ECO:0000313|EMBL:CAF93726.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAF93726.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAF93726.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01011339; CAF93726.1; -; Genomic_DNA. DR STRING; 99883.ENSTNIP00000007985; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOVERGEN; HBG108520; -. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; FT NON_TER 1 1 {ECO:0000313|EMBL:CAF93726.1}. FT NON_TER 163 163 {ECO:0000313|EMBL:CAF93726.1}. SQ SEQUENCE 163 AA; 18171 MW; E18F9C1E1899E23D CRC64; GASVITSRCS QTYTSASPRL TVFGIPLFTL SRGPRTVIQG SLKHPGECWS FVGSKGTLAV SLSHPIRITH VTMEHAQRSH SPTGEIKSAP RDFEVYGIRT QPEKETFLGN FTYDQFGEPS QTFALKDPGE EAYQAVELHV LTNWGQQEYT CLYRFRVHGH MAP // ID Q4T7Z2_TETNG Unreviewed; 628 AA. AC Q4T7Z2; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 33. DE SubName: Full=Chromosome 2 SCAF7940, whole genome shotgun sequence {ECO:0000313|EMBL:CAF90990.1}; DE Flags: Fragment; GN ORFNames=GSTENG00005485001 {ECO:0000313|EMBL:CAF90990.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAF90990.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAF90990.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01007940; CAF90990.1; -; Genomic_DNA. DR STRING; 99883.ENSTNIP00000006168; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 306 326 {ECO:0000256|SAM:Coils}. FT NON_TER 628 628 {ECO:0000313|EMBL:CAF90990.1}. SQ SEQUENCE 628 AA; 69482 MW; E9E03B62DC89E963 CRC64; MSRRSLRLDD GLLDRNLPHS SSSFSVGGTG WRSSRLLKCR GPRQLSLSCT ESLLMDSPHK MNSSDASLLS SLVEDSSIQD NTLVDSIWGL DHETDPKESS IIADQSTILA DHTLIGPDDC ATSQPVQAVT RFYCQGCEPS TKEHSGLVSP SSCTPAPKAS EPGCPGSSTI YSRARKRKTH RAPARHCGVM NLKESSQNQD LRPNGALCDR PRAPREVVVD EESRLERLEQ RVMVLWEQVE AAGRWAEQRH REVMQLYTEL LQGGGGGGRG EAWLTGLMEQ QLQRFRTLLD TKRRQTRQRQ SGTSRVGQLE LQLQALAAQT EDLQSKQEAV SVGVAPQLHN DVLAQVARLE MALEDISSQI RDEIHTRIHG SPLMGRGVAS AKTAPSESFL RWVSQRYVSR ADLRVALASL ELSILQTIGK KTEGNVGDGT KAALSREDVH VIVENALRRF SEDRTGMPDF ALESGGGSIL STRCSETYRT KVALLSLFGF PLWYFSQSPR AVIQPDVHPG NCWAFRGSSG FLVIRLSMPI FPTAITLEHT PKALSPSGKM HSAPRDFSVY GLDDENQERG HLLGVYTYDQ DGDAVQTFTV SEVYERPFQL VEVQVTSNWG QPDYTCLYRI RVHGTPAD // ID Q4TBT5_TETNG Unreviewed; 1534 AA. AC Q4TBT5; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 53. DE SubName: Full=Chromosome undetermined SCAF7089, whole genome shotgun sequence {ECO:0000313|EMBL:CAF89647.1}; DE Flags: Fragment; GN ORFNames=GSTENG00003619001 {ECO:0000313|EMBL:CAF89647.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAF89647.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAF89647.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01007089; CAF89647.1; -; Genomic_DNA. DR HOVERGEN; HBG067533; -. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 2. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 5. DR PROSITE; PS50237; HECT; 2. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 279 299 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:CAF89647.1}. SQ SEQUENCE 1534 AA; 170596 MW; 9ECC333AB2A0E6C1 CRC64; QILTRRLRFR LERAPGETAL IDRTGRMLKM EPLATVESLE QYLLKMVRKG EHRTHVTAAA DRAPLSSAVL LQVAKQWYDF ERSSFVFVRK LREGQSFTFR HQHDFDENGI IYWVGTNAKT AYEWVNPAAY GLVVVTSSEG RNLPYGRLED ILSRDSSALN CHTNDDKNAW FAVDLGLWVL PSAYTLRHAR GYGRSALRNW VFQVSKDGQN WTTLYTHVDD CSLNEPGSTA TWPLDPSKEE KQGWRHIRIK QMGKNASGQT HYLSLSGLEL YGTVTAVCED QLGKAVKEAE ANLRRQRRLF RSQVMKYIVP GARVVRGIDW KWRDQDGNPP GEGTVTGEAH NGWIDVTWDA GGSNSYRMGA EGKFDLKLAP GYDPESAATA PSPKPVSSTV SGPSSMVGPS SMPVTNSGTT TTTAWSASST SASLQQQQQS WSSLLPPGRG APMSSSASVP NLSSREASLM ESFVRRAPNM SRTNATNNMN LSRSSSDNNT NTLGRNALTT ATSLMGAQSF PNLTTTGTTS TVTMSTSIVT SSNNVATATT GLSVGQLLSN TLTTSLTSTS SESDTGQEAE FSLYDFLDSC RANTPEEEEY ETKGGRRRTW DDDFVLKRQF SALVPAFDPR PGRTNVQQTT DLEIPPPGSA HTHTHTHTHT HTHPLSFSPR SECCNQGPVS SAGSPRSEVQ EEVECAPSPH LSLTLKVAGL GTSREVELPL SNYKSTIFFY VQRLLQLSCS GAVKTDKLRR IWEPTYTIMY RELKDADKEK ESAKTDVCEH GTGFSARSGV LSPGSLLASQ SGEILGVARE MAQAKAGCSQ NACGVEDVLQ LLRILYIIGG DSASNTRTMQ EGRPRPRLRG GTFLVLWGPQ PSPCFQTLRS CSSTRLQRSS PARRSPPRSC SRSRSALWRH ASWWQPPVGH ADACLCLQEP LALASGALPD WCEQLTAKCP FLIPFETRQL YFTCTAFGAS RAIVWLQNRR EATMERSRPS TTVRRDDPGE FRVGRLKHER VKVPRGEAMM EWAESVMQLH ADRKSVLEVE FQGEEGTGLG PTLEFYALVA AEFQRTSLGI WLCDDDFPDD ESRQVDLGGG LKPPGFYVQR SCGLFPAAFP QDSEELERIA KLFHFLGIFL AKCIQDNRLV DLPLSQPFFK LLCMGDIKST WSRQLYQSCS FPPGQEPERL HLQPFLLLSE SEASTEESQE TYSVGSFDED SKSEFIMDPP KPKPPAWYHG ILTWDDFQLV NPHRASFLKE LKELAMKRRQ ILSSKSLSED EKNTRLQDLM LRNPLGSGPP LSIEDLGLNF QFCPSSKVHG FSALDLKPNG DNEVRRPSAQ VPGKAEQPDE PMVTMENAEE YVELMFDLCM HTGIQKQMEA FRGKRHVTSR RGDGLPQCVD LLFACLSSFS EGFNRVFQME KMSSFSHKEV QMILCGNQSP SWTADDIINY TEPKLGYTRD SPGFLRFVRV LCGMSSDERK AFLQFTTGCS TLPPGGLANL HPRLTIVRKV DATDSSYPSV NTCVHYLKLP EYTSEDIMRE RLLAATMEKG FHLN // ID Q4TD70_TETNG Unreviewed; 511 AA. AC Q4TD70; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 37. DE SubName: Full=Chromosome undetermined SCAF6467, whole genome shotgun sequence {ECO:0000313|EMBL:CAF89162.1}; GN ORFNames=GSTENG00002957001 {ECO:0000313|EMBL:CAF89162.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAF89162.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAF89162.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01006467; CAF89162.1; -; Genomic_DNA. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 2. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 262 283 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 313 333 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 511 AA; 57984 MW; 954588688D4EC0A8 CRC64; MHIEKLRTKL NDVKRKLNHQ LPDPNFWTNF ALESHGAKVY KKQSSNTYEK IEGLKIFGIQ IFSKVGPASV IQGQQPPIPG NCWSFSGSHG NLFIELSHMV TVSHVTLDHV PSSVVPADTI SSAPRQFSVY VGIKQSCRRP SGDRDEDVLW FFFLSHRDFN RFMIHQSNLL PPGGATRQGL RSSLRLREKG YYDKEGKPTI SYKEQIYRVF KQRCKHKGVR KPNTDSTDRY DTDYIFDYFD DSSSLGSSMS SKSNATPPTW TMVWSLVFFF LLGLALPIIF FGVTRIISFI RPSGSEQTLI SSPVYLPVPT KMCENIMMHI EKLRTELNDV KRKLNHQLPD PNFWTNFALE SHGAKVYKKQ SSNTYEKIEG LTIFGIQIFS KVGPASVIQG QHPPIPGNCW SFPGSHGNLF IELSHMVTVS HVTLDHVSSS VVPADTISSA PRQFSVYGRQ RLDDRAVHLG KFTYDLEGNP TQTFAVKVYD TIAFKYIDLQ IDSNYGHADY TCFYGFRVHG L // ID Q4THL6_TETNG Unreviewed; 251 AA. AC Q4THL6; DT 19-JUL-2005, integrated into UniProtKB/TrEMBL. DT 19-JUL-2005, sequence version 1. DT 14-OCT-2015, entry version 36. DE SubName: Full=Chromosome undetermined SCAF2846, whole genome shotgun sequence {ECO:0000313|EMBL:CAF87616.1}; GN ORFNames=GSTENG00000500001 {ECO:0000313|EMBL:CAF87616.1}; OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon OS nigroviridis). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Tetraodon. OX NCBI_TaxID=99883; RN [1] {ECO:0000313|EMBL:CAF87616.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15496914; DOI=10.1038/nature03025; RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., RA Nicaud S., Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., RA Dasilva C., Salanoubat M., Levy M., Boudet N., Castellano S., RA Anthouard V., Jubin C., Castelli V., Katinka M., Vacherie B., RA Biemont C., Skalli Z., Cattolico L., Poulain J., De Berardinis V., RA Cruaud C., Duprat S., Brottier P., Coutanceau J.-P., Gouzy J., RA Parra G., Lardier G., Chapple C., McKernan K.J., McEwan P., Bosak S., RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J., RA Lindblad-Toh K., Birren B., Nusbaum C., Kahn D., Robinson-Rechavi M., RA Laudet V., Schachter V., Quetier F., Saurin W., Scarpelli C., RA Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.; RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals RT the early vertebrate proto-karyotype."; RL Nature 431:946-957(2004). RN [2] {ECO:0000313|EMBL:CAF87616.1} RP NUCLEOTIDE SEQUENCE. RG Genoscope; RG Whitehead Institute Centre for Genome Research; RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAE01002846; CAF87616.1; -; Genomic_DNA. DR HOVERGEN; HBG073494; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 52 72 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 251 AA; 28332 MW; EE5FA8C22A58F593 CRC64; MVWSLVFFFL LGLALPIIFF GVTRIISFIR PSGSEQTLIS SPVYLPVPTE RCENTMMRIE NLRTELNDVT KKLKLNQLPD PNYWTNYALQ SHGAKVYKKQ SSKTYEKIEG FKIFGIQLFS KVGPASVIQG QHPPIPGNCW SFPGSHGNLF IELSHTVTVS HVTLDHVSSS VVPADTISSA PRQFSVYGRQ RLDDRAVHLG KFTYDLEGNP TQTFAVKVYD TMAFKYIDLQ IDSNYGHADY TCLYGFRVHG L // ID Q4UHC1_THEAN Unreviewed; 848 AA. AC Q4UHC1; DT 05-JUL-2005, integrated into UniProtKB/TrEMBL. DT 05-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 36. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CAI73518.1}; GN ORFNames=TA20180 {ECO:0000313|EMBL:CAI73518.1}; OS Theileria annulata. OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida; OC Theileriidae; Theileria. OX NCBI_TaxID=5874 {ECO:0000313|EMBL:CAI73518.1, ECO:0000313|Proteomes:UP000001950}; RN [1] {ECO:0000313|EMBL:CAI73518.1, ECO:0000313|Proteomes:UP000001950} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Ankara {ECO:0000313|Proteomes:UP000001950}; RX PubMed=15994557; DOI=10.1126/science.1110418; RA Pain A., Renauld H., Berriman M., Murphy L., Yeats C.A., Weir W., RA Kerhornou A., Aslett M., Bishop R., Bouchier C., Cochet M., RA Coulson R.M.R., Cronin A., de Villiers E.P., Fraser A., Fosker N., RA Gardner M., Goble A., Griffiths-Jones S., Harris D.E., Katzer F., RA Larke N., Lord A., Maser P., McKellar S., Mooney P., Morton F., RA Nene V., O'Neil S., Price C., Quail M.A., Rabbinowitsch E., RA Rawlings N.D., Rutter S., Saunders D., Seeger K., Shah T., Squares R., RA Squares S., Tivey A., Walker A.R., Woodward J., Dobbelaere D.A.E., RA Langsley G., Rajandream M.A., McKeever D., Shiels B., Tait A., RA Barrell B.G., Hall N.; RT "Genome of the host-cell transforming parasite Theileria annulata RT compared with T. parva."; RL Science 309:131-133(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR940347; CAI73518.1; -; Genomic_DNA. DR RefSeq; XP_954195.1; XM_949102.1. DR STRING; 353154.XP_954195.1; -. DR EnsemblProtists; CAI73518; CAI73518; TA20180. DR GeneID; 3863847; -. DR KEGG; tan:TA20180; -. DR EuPathDB; PiroplasmaDB:TA20180; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000135636; -. DR InParanoid; Q4UHC1; -. DR Proteomes; UP000001950; Chromosome 1 part 1. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000001950}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001950}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 811 830 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 848 AA; 100204 MW; 7B3CDF0162CB8D9E CRC64; MSLHNSRGNS FKGIPKKPKI EKVALINKSH LKLQDNFSAY ESKLTTSRIK GKPFNPDLYS LKIDFASEEF GTKIIAHSKT LSKVSKILED DSCSYMLTPC NTSDWFVLSF PESILIQEIS FLSFEYYSSL YKKIRISITS SYPSGKWLTL GELETDPSRN EIFDLSVVCN TDKNDCWGKY LKVELLDYHK LELNYYCSIT KMMVFGITAV EYLETEISDD SSLYNTFIQP NGTGYPPITH DTNTVDESSV VSSKHLTGMV TKGSDVDSIV VTSKNEPPVD KEVCELNVFD ESPIKKLSDL KCFGKSSEVL NPTFVYDIKN VIVKSFYKFL MRLALRKDDN FHNKKLIYRV LKIGKIKKYC GTNFLFQFDC NRFITHLLLE KYSVYYSNIL DRIKLPSRHL WHVISRIMND VGIKEVPILV CTESIGFISK FKCYFYFNLK SFSTLLFTQF TEGYRIHTIN SRNSRLNHNL LSKLYFFYVD NKITVMTAPK FRGLTVDDES LVKRYDYKVK DKYISTVTVI NNKLYVNMSS SYSTHFPFDI DPPDPKLDNE LIKANTSLYK DNLNRLNMSK RFDQKVKFHE KEKFKHHNYY QDENTKSTDS KTHKHVLLKL SERIKSLEFL TNKLSNKIYQ LENLLNFYIK RQFYYNQHVE LERELNDEFE VILKVLDIKK YKLVVRDISS LRNALRKTEY YYKRIKQYNE YETSYLYIKT RDLLRLKKSL CKLTNCTSER KSFKRCLFLC KLSKFYKLFN KRKLAKSIAN YGCKCVHSTH FHNLSLDQRD EWLYMDRNDN YNVLLILNDL VCFFRDKFSG YFNFYFLFLL YICTQIFWIC KFNSHDKKIR KLLILKNN // ID Q4WTX9_ASPFU Unreviewed; 840 AA. AC Q4WTX9; DT 05-JUL-2005, integrated into UniProtKB/TrEMBL. DT 17-APR-2007, sequence version 2. DT 11-NOV-2015, entry version 47. DE SubName: Full=Sad1/UNC domain protein {ECO:0000313|EMBL:EAL91947.2}; GN ORFNames=AFUA_5G06480 {ECO:0000313|EMBL:EAL91947.2}; OS Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC OS A1100) (Aspergillus fumigatus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=330879 {ECO:0000313|EMBL:EAL91947.2, ECO:0000313|Proteomes:UP000002530}; RN [1] {ECO:0000313|EMBL:EAL91947.2, ECO:0000313|Proteomes:UP000002530} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100 RC {ECO:0000313|Proteomes:UP000002530}; RX PubMed=16372009; DOI=10.1038/nature04332; RA Nierman W.C., Pain A., Anderson M.J., Wortman J.R., Kim H.S., RA Arroyo J., Berriman M., Abe K., Archer D.B., Bermejo C., Bennett J.W., RA Bowyer P., Chen D., Collins M., Coulsen R., Davies R., Dyer P.S., RA Farman M.L., Fedorova N., Fedorova N.D., Feldblyum T.V., Fischer R., RA Fosker N., Fraser A., Garcia J.L., Garcia M.J., Goble A., RA Goldman G.H., Gomi K., Griffith-Jones S., Gwilliam R., Haas B.J., RA Haas H., Harris D.E., Horiuchi H., Huang J., Humphray S., Jimenez J., RA Keller N., Khouri H., Kitamoto K., Kobayashi T., Konzack S., RA Kulkarni R., Kumagai T., Lafton A., Latge J.-P., Li W., Lord A., RA Lu C., Majoros W.H., May G.S., Miller B.L., Mohamoud Y., Molina M., RA Monod M., Mouyna I., Mulligan S., Murphy L.D., O'Neil S., Paulsen I., RA Penalva M.A., Pertea M., Price C., Pritchard B.L., Quail M.A., RA Rabbinowitsch E., Rawlins N., Rajandream M.A., Reichard U., RA Renauld H., Robson G.D., Rodriguez de Cordoba S., Rodriguez-Pena J.M., RA Ronning C.M., Rutter S., Salzberg S.L., Sanchez M., RA Sanchez-Ferrero J.C., Saunders D., Seeger K., Squares R., Squares S., RA Takeuchi M., Tekaia F., Turner G., Vazquez de Aldana C.R., Weidman J., RA White O., Woodward J.R., Yu J.-H., Fraser C.M., Galagan J.E., Asai K., RA Machida M., Hall N., Barrell B.G., Denning D.W.; RT "Genomic sequence of the pathogenic and allergenic filamentous fungus RT Aspergillus fumigatus."; RL Nature 438:1151-1156(2005). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAL91947.2}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAHF01000003; EAL91947.2; -; Genomic_DNA. DR RefSeq; XP_753985.2; XM_748892.2. DR EnsemblFungi; CADAFUAT00006558; CADAFUAP00006558; CADAFUAG00006558. DR GeneID; 3511152; -. DR KEGG; afm:AFUA_5G06480; -. DR EuPathDB; FungiDB:Afu5g06480; -. DR HOGENOM; HOG000172520; -. DR InParanoid; Q4WTX9; -. DR OMA; RNTREVQ; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002530; Chromosome 5. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IEA:EnsemblFungi. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IEA:EnsemblFungi. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002530}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000002530}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 25 {ECO:0000256|SAM:SignalP}. FT CHAIN 26 840 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004246634. FT TRANSMEM 687 704 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 840 AA; 91681 MW; E430593525473B86 CRC64; MLSGRCVGAF LINLAALALL TAGQAVIREQ SQPLCLARGW RDTEAEFIRW PVCIETRWSR SGITVAGGPS VTMSSISSNS PPTSKASHTA VQQHSGKEQE QDTDSPLDNA KFLSFEDWKK QNLAKVGQSA ENVGGNRRSG VTGNESRRPT GISNALDSLG EDAEIELDFG GFGADAPEAA RPPSFGSGVQ VGKSAGSVDS KTGGDANGPS PGMIRSGSSR RKDAGTTCKE RFNYASFDCA ATVLKTNPEC QGSSSVLIEN KDSYMLNECR AKNKFLILEL CDDILVDTVV LANYEFFSSI FHTFRVSVSD RYPAKPDQWR ELGVFEARNS REVQAFAVEN PLIWARYLKI EFLTHYGNEF YCPLSLIRVH GTTMLEEYKH DGEASRVDDE IVDETLEPDH AVTAAIAEPS ENSSDLGAEN RESMRRKLQD GLQDACPNPA QGLERLLANY LDSEICSVQA RPTRTAGQER ADAAVQHDSP STDTTPPGPE ASGPIVPGAG NGTKFAPDAR RAAGQSGADG NPLPASMATM SEPVQHDTTS EADQKSTASS QEEQVPSVDS AKFSATQPPS PNPTTQESFF KSVNKRLQML ESNSTLSLLY IEEQSRILRD AFSKVEKRQL SKTSTFLENL NVTVMNELRQ FREQYDQVWK TVALEFETQR IQYHQEIFSL SAQLGVLADE LVFQKRVAVI QSIMVLFCFG LVLFSRGAMS SYMEFPSVQN MVSRSYSLRS SSPPFSSPSM SPSSTRPSFT YRSRHRRNGT DDTQDSAPSP TISYSPPTPN SETSVPLESI EKQESPPSPG DLELPDIELP QFRSQSSPPV LKSGEDSDGE ISKTSGSMEV // ID Q4WVS5_ASPFU Unreviewed; 732 AA. AC Q4WVS5; DT 05-JUL-2005, integrated into UniProtKB/TrEMBL. DT 05-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 39. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAL91301.1}; GN ORFNames=AFUA_5G13150 {ECO:0000313|EMBL:EAL91301.1}; OS Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC OS A1100) (Aspergillus fumigatus). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=330879 {ECO:0000313|EMBL:EAL91301.1, ECO:0000313|Proteomes:UP000002530}; RN [1] {ECO:0000313|EMBL:EAL91301.1, ECO:0000313|Proteomes:UP000002530} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100 RC {ECO:0000313|Proteomes:UP000002530}; RX PubMed=16372009; DOI=10.1038/nature04332; RA Nierman W.C., Pain A., Anderson M.J., Wortman J.R., Kim H.S., RA Arroyo J., Berriman M., Abe K., Archer D.B., Bermejo C., Bennett J.W., RA Bowyer P., Chen D., Collins M., Coulsen R., Davies R., Dyer P.S., RA Farman M.L., Fedorova N., Fedorova N.D., Feldblyum T.V., Fischer R., RA Fosker N., Fraser A., Garcia J.L., Garcia M.J., Goble A., RA Goldman G.H., Gomi K., Griffith-Jones S., Gwilliam R., Haas B.J., RA Haas H., Harris D.E., Horiuchi H., Huang J., Humphray S., Jimenez J., RA Keller N., Khouri H., Kitamoto K., Kobayashi T., Konzack S., RA Kulkarni R., Kumagai T., Lafton A., Latge J.-P., Li W., Lord A., RA Lu C., Majoros W.H., May G.S., Miller B.L., Mohamoud Y., Molina M., RA Monod M., Mouyna I., Mulligan S., Murphy L.D., O'Neil S., Paulsen I., RA Penalva M.A., Pertea M., Price C., Pritchard B.L., Quail M.A., RA Rabbinowitsch E., Rawlins N., Rajandream M.A., Reichard U., RA Renauld H., Robson G.D., Rodriguez de Cordoba S., Rodriguez-Pena J.M., RA Ronning C.M., Rutter S., Salzberg S.L., Sanchez M., RA Sanchez-Ferrero J.C., Saunders D., Seeger K., Squares R., Squares S., RA Takeuchi M., Tekaia F., Turner G., Vazquez de Aldana C.R., Weidman J., RA White O., Woodward J.R., Yu J.-H., Fraser C.M., Galagan J.E., Asai K., RA Machida M., Hall N., Barrell B.G., Denning D.W.; RT "Genomic sequence of the pathogenic and allergenic filamentous fungus RT Aspergillus fumigatus."; RL Nature 438:1151-1156(2005). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAL91301.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAHF01000003; EAL91301.1; -; Genomic_DNA. DR RefSeq; XP_753339.1; XM_748246.1. DR EnsemblFungi; CADAFUAT00006081; CADAFUAP00006081; CADAFUAG00006081. DR GeneID; 3511495; -. DR KEGG; afm:AFUA_5G13150; -. DR EuPathDB; FungiDB:Afu5g13150; -. DR HOGENOM; HOG000176993; -. DR InParanoid; Q4WVS5; -. DR OMA; CWCSAPR; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000002530; Chromosome 5. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002530}; KW Reference proteome {ECO:0000313|Proteomes:UP000002530}. FT COILED 433 453 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 732 AA; 81519 MW; 1447A2EF59EF9053 CRC64; MPPKRTRRAG AAARSEASII FGHSSPSVSN QPLPDVPTQP SWAYGSPAAP VLPRRLVAKD IGLAEVAESI DQTIRDAEKR DRRNDPDEAN DTDDRPHMNT RSRRRPSAAN ASPVRRRTKR EPTPDQVQLL DALREATVSP NQRNGENETQ AERSTATPTP PIPHTLSTMS SPTSQILPDP KYPSLPIEQL YPSPLQRIGS PTRNDASLEM SQNTGIDDNE SVISWMVERD IHDDDLQRTR SARYRREPVG KNITAPPRRF SGLAFANETI VEEDEPDSRL SVSKTPQEST VESEAQSDHQ TESDQPLVSL EPQKEPPPQV EEVSSAPART IIPNFFTKDQ SFNNSTTQPS DQSFTDHARS TAADSFIPRI SVSLPWTQIL RLSGAILLTA ISLLTIYSFS DSIANIPHDI ASHFPFRNPA PSISLNISDI EALNSLNNQV MRLGAQVSSI SKELSVVKSE VKNVGGPTTI IEPVKVPKKP NFLSIGTGVL IDPRMTSPTY GEKKSRLPKW LRDRASVWGE APRPKPNPPL TALVPWDSVG DCWCSAPRNG VSQLALHLSR PIVPEEVVVE HIPKHATLNP GAAPKDMELW VQYTINKSTS GELPTDAGSA GWYKSYLNWL LSFESGVLET EYQSPMLSER FSLHDYIMSY LRPAYHNEPE SAYWNATTLG PTFYRVGKWK YDIHGQHHVQ EFSLDAIIDQ PDIRVDRVAF RVNSNWGANF TCFYRLKLYG HL // ID Q4X6G2_PLACH Unreviewed; 390 AA. AC Q4X6G2; DT 05-JUL-2005, integrated into UniProtKB/TrEMBL. DT 05-JUL-2005, sequence version 1. DT 11-NOV-2015, entry version 30. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CAH87514.1}; DE Flags: Fragment; GN ORFNames=PC302506.00.0 {ECO:0000313|EMBL:CAH87514.1}; OS Plasmodium chabaudi. OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Vinckeia). OX NCBI_TaxID=5825 {ECO:0000313|Proteomes:UP000002509}; RN [1] {ECO:0000313|EMBL:CAH87514.1, ECO:0000313|Proteomes:UP000002509} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AS {ECO:0000313|Proteomes:UP000002509}; RX PubMed=15637271; DOI=10.1126/science.1103717; RA Hall N., Karras M., Raine J.D., Carlton J.M., Kooij T.W.A., RA Berriman M., Florens L., Janssen C.S., Pain A., Christophides G.K., RA James K., Rutherford K., Harris B., Harris D., Churcher C.M., RA Quail M.A., Ormond D., Doggett J., Trueman H.E., Mendoza J., RA Bidwell S.L., Rajandream M.A., Carucci D.J., Yates J.R. III, RA Kafatos F.C., Janse C.J., Barrell B.G., Turner C.M.R., Waters A.P., RA Sinden R.S.; RT "A comprehensive survey of the Plasmodium life cycle by genomic, RT transcriptomic, and proteomic analyses."; RL Science 307:82-86(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAJ01009520; CAH87514.1; -; Genomic_DNA. DR RefSeq; XP_740990.1; XM_735897.1. DR STRING; 5825.PCHAS_130640; -. DR GeneID; 3494069; -. DR KEGG; pcb:PC302506.00.0; -. DR EuPathDB; PlasmoDB:PCHAS_130640; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000282163; -. DR InParanoid; Q4X6G2; -. DR Proteomes; UP000002509; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000002509}; KW Reference proteome {ECO:0000313|Proteomes:UP000002509}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 390 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004246096. FT NON_TER 390 390 {ECO:0000313|EMBL:CAH87514.1}. SQ SEQUENCE 390 AA; 45849 MW; 7A35578281FC0F28 CRC64; MIWWFLISVN FLFFLIKSFF TPKGTIYLND LNSTQNNVET EKYILGENYN LTSLKLKVDF GSLDTGTKII EHSSGIINIK SIQQYDYDSY MLTPCDSDIW WIYSFSDFIH IEKIGLVSLE HYASNFKVIE ILGSDTYPAT KWKKLGKIST NFTKSFELFN IYDHCKNYDE DNCWVKYLKF IVLSHHDIEQ NYYCTLTHLQ IFASSGVDML SDKIYSDDSA NQIESDPENS DEQNKIEIQE QENAENLEDS NKDKVLNHIK KQMHSKEEDS NKLPSKDNYY KDAISSDNFH NDKPFHNNIH NDSARYKNSK QNAHYTNYAH YLSVEKDPFD TNLLEKELTQ SKLIDTDLIK KELMDTELIE DELLNYEFIE KDIKITAFNE ENIFDNITQQ // ID Q4YHT7_PLABA Unreviewed; 378 AA. AC Q4YHT7; DT 05-JUL-2005, integrated into UniProtKB/TrEMBL. DT 05-JUL-2005, sequence version 1. DT 14-OCT-2015, entry version 30. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CAI02429.1}; DE Flags: Fragment; GN ORFNames=PB300740.00.0 {ECO:0000313|EMBL:CAI02429.1}; OS Plasmodium berghei (strain Anka). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Vinckeia). OX NCBI_TaxID=5823 {ECO:0000313|Proteomes:UP000007720}; RN [1] {ECO:0000313|EMBL:CAI02429.1, ECO:0000313|Proteomes:UP000007720} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ANKA {ECO:0000313|Proteomes:UP000007720}; RX PubMed=15637271; DOI=10.1126/science.1103717; RA Hall N., Karras M., Raine J.D., Carlton J.M., Kooij T.W.A., RA Berriman M., Florens L., Janssen C.S., Pain A., Christophides G.K., RA James K., Rutherford K., Harris B., Harris D., Churcher C.M., RA Quail M.A., Ormond D., Doggett J., Trueman H.E., Mendoza J., RA Bidwell S.L., Rajandream M.A., Carucci D.J., Yates J.R. III, RA Kafatos F.C., Janse C.J., Barrell B.G., Turner C.M.R., Waters A.P., RA Sinden R.S.; RT "A comprehensive survey of the Plasmodium life cycle by genomic, RT transcriptomic, and proteomic analyses."; RL Science 307:82-86(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAI01004968; CAI02429.1; -; Genomic_DNA. DR RefSeq; XP_673187.1; XM_668095.1. DR GeneID; 3421592; -. DR KEGG; pbe:PB300740.00.0; -. DR EuPathDB; PlasmoDB:PBANKA_130320; -. DR InParanoid; Q4YHT7; -. DR Proteomes; UP000007720; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007720}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007720}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 16 36 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 253 273 {ECO:0000256|SAM:Coils}. FT NON_TER 378 378 {ECO:0000313|EMBL:CAI02429.1}. SQ SEQUENCE 378 AA; 44851 MW; 390B93336799728F CRC64; MLLGLVHIIR YVHPNLIYHI FTILCVIYIF FFFYLFPEKY ILGENYNLAS LKLKVDFGSL DTGTKIIEHS NGIINIKSIQ QYDYDSYMLT PCDSDIWWIY SFSDFIHIEK IGLVSLEHYA SNFKVIEILG SDTYPATKWK KLGKISTNFT KSFEIFNIYD HCKNYDEDNC WVKYLKFIVL SHHNIEKNYY CTLTHLQIFA SSGVDMLSDK IYSDDNINHI ESDPENSDEH KKIKIQEQDN VGHLEVLYED QFLKHIKKQI HSKEEDSKEL DSKDNYYKDA INPDNFHNDK PLHNNIHNSD IRYKNSNKNA HYTNYAHYLS IEKDLFDTNL LEKELTQSKL IDTDLIKKEL MDTELIEDEL LNYEFIEKDI KINSFNEE // ID Q4YRY2_PLABA Unreviewed; 381 AA. AC Q4YRY2; DT 05-JUL-2005, integrated into UniProtKB/TrEMBL. DT 05-JUL-2005, sequence version 1. DT 14-OCT-2015, entry version 30. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CAH99225.1}; DE Flags: Fragment; GN ORFNames=PB000076.03.0 {ECO:0000313|EMBL:CAH99225.1}; OS Plasmodium berghei (strain Anka). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Vinckeia). OX NCBI_TaxID=5823 {ECO:0000313|Proteomes:UP000007720}; RN [1] {ECO:0000313|EMBL:CAH99225.1, ECO:0000313|Proteomes:UP000007720} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ANKA {ECO:0000313|Proteomes:UP000007720}; RX PubMed=15637271; DOI=10.1126/science.1103717; RA Hall N., Karras M., Raine J.D., Carlton J.M., Kooij T.W.A., RA Berriman M., Florens L., Janssen C.S., Pain A., Christophides G.K., RA James K., Rutherford K., Harris B., Harris D., Churcher C.M., RA Quail M.A., Ormond D., Doggett J., Trueman H.E., Mendoza J., RA Bidwell S.L., Rajandream M.A., Carucci D.J., Yates J.R. III, RA Kafatos F.C., Janse C.J., Barrell B.G., Turner C.M.R., Waters A.P., RA Sinden R.S.; RT "A comprehensive survey of the Plasmodium life cycle by genomic, RT transcriptomic, and proteomic analyses."; RL Science 307:82-86(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAI01002685; CAH99225.1; -; Genomic_DNA. DR RefSeq; XP_674452.1; XM_669360.1. DR GeneID; 3422897; -. DR KEGG; pbe:PB000076.03.0; -. DR EuPathDB; PlasmoDB:PBANKA_130320; -. DR InParanoid; Q4YRY2; -. DR Proteomes; UP000007720; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007720}; KW Reference proteome {ECO:0000313|Proteomes:UP000007720}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 381 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004247247. FT COILED 256 276 {ECO:0000256|SAM:Coils}. FT NON_TER 381 381 {ECO:0000313|EMBL:CAH99225.1}. SQ SEQUENCE 381 AA; 45019 MW; 6FC18FABD273A8E7 CRC64; MIWWFLISIN FLFFLIKSFF TPKGTIYLND LNNTQNNAAT EKYILGENYN LASLKLKVDF GSLDTGTKII EHSNGIINIK SIQQYDYDSY MLTPCDSDIW WIYSFSDFIH IEKIGLVSLE HYASNFKVIE ILGSDTYPAT KWKKLGKIST NFTKSFEIFN IYDHCKNYDE DNCWVKYLKF IVLSHHNIEK NYYCTLTHLQ IFASSGVDML SDKIYSDDNI NHIESDPENS DEHKKIKIQE QDNVGHLEVL YEDQFLKHIK KQIHSKEEDS KELDSKDNYY KDAINPDNFH NDKPLHNNIH NSDIRYKNSN KNAHYTNYAH YLSIEKDLFD TNLLEKELTQ SKLIDTDLIK KELMDTELIE DELLNYEFIE KDIKINSFNE E // ID Q4YSS3_PLABA Unreviewed; 841 AA. AC Q4YSS3; DT 05-JUL-2005, integrated into UniProtKB/TrEMBL. DT 05-JUL-2005, sequence version 1. DT 14-OCT-2015, entry version 31. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:CAH98934.1}; GN ORFNames=PB001544.02.0 {ECO:0000313|EMBL:CAH98934.1}; OS Plasmodium berghei (strain Anka). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Vinckeia). OX NCBI_TaxID=5823 {ECO:0000313|Proteomes:UP000007720}; RN [1] {ECO:0000313|EMBL:CAH98934.1, ECO:0000313|Proteomes:UP000007720} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ANKA {ECO:0000313|Proteomes:UP000007720}; RX PubMed=15637271; DOI=10.1126/science.1103717; RA Hall N., Karras M., Raine J.D., Carlton J.M., Kooij T.W.A., RA Berriman M., Florens L., Janssen C.S., Pain A., Christophides G.K., RA James K., Rutherford K., Harris B., Harris D., Churcher C.M., RA Quail M.A., Ormond D., Doggett J., Trueman H.E., Mendoza J., RA Bidwell S.L., Rajandream M.A., Carucci D.J., Yates J.R. III, RA Kafatos F.C., Janse C.J., Barrell B.G., Turner C.M.R., Waters A.P., RA Sinden R.S.; RT "A comprehensive survey of the Plasmodium life cycle by genomic, RT transcriptomic, and proteomic analyses."; RL Science 307:82-86(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CAAI01002551; CAH98934.1; -; Genomic_DNA. DR RefSeq; XP_678909.1; XM_673817.1. DR GeneID; 3427500; -. DR KEGG; pbe:PB001544.02.0; -. DR EuPathDB; PlasmoDB:PBANKA_143090; -. DR HOGENOM; HOG000281004; -. DR InParanoid; Q4YSS3; -. DR Proteomes; UP000007720; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007720}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000007720}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 168 192 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 280 321 {ECO:0000256|SAM:Coils}. FT COILED 488 508 {ECO:0000256|SAM:Coils}. FT COILED 581 601 {ECO:0000256|SAM:Coils}. FT COILED 619 639 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 841 AA; 98778 MW; 7BF78E59563A0688 CRC64; MTIHTGNNRR HAKDEKNELK SKPKGRKGNS DNEANYNIDP HENEDNSLIQ VLHSYEDFQN NNMNYGKLKP CKTKETKFGR VRKTLIKIFS LSDLEVNESN KNYFNRKKNG HSNVLLTSRM AMWNKIENNN DPMYDLKIES SHNKSFVNII ANYISMFLND ILNDRKGISY IAIFMIVLSI LITCISGFIT IFNDAKIDLD SWGIIPSKNS YDGVNKFMGY LKLGEEGSSK ETINQNKDTQ QNSKAWKFQE LFDGMKNKIN ESINLNFINK KESGNVSETY SNIKNKQKEL ENNFKRIEAH LKKMESKLKE LQNDIISNTS DIDYFKNDSK KEVENIKKKL QYNYQLFQNK FIDYLKIIDD IKIDVSEKKK AIFNEIENKV HANQITIEEG ISNKIEHQKN YFFEKFSKLE KQMEEIEISI ANKTYSNFEN NEFSKNGGGE KKNVYIYADK QIEDAKKITD EENKKLLTEY KRKQIETEEE NINRNNYISK KLAIIEDIRK ELDILKERTE ASKTFLDKVF PNLELKMLKN VENKIKYYLE IYKKDIINEL TETTVISNEE KYKNMAIKQE KFQKEFFKKI NIQINSQIKN IKEELNKSID NALHSKEFKN DKELIKKINQ TNYNTIETLQ EKVDELYNEF ILDYNQIDWA LESLGAKIVY KMTSYPLNKN DFIEKFLNQI VSFLPSEEIY GMVKPMGKDP SIILKPSNFP GDCFSFKGNT GKVTIHLPAT INVTSVSIQH VHENISNNAN ATPKYFSVYG VVDLNWPENF DESNIDYNDF KNSSLYSCLH KEYGILYPNE ILEKWIKHNK NPSVIHIGDF YFDRKKRIST YQTKHCFPFK R // ID Q501S0_DANRE Unreviewed; 987 AA. AC Q501S0; DT 07-JUN-2005, integrated into UniProtKB/TrEMBL. DT 07-JUN-2005, sequence version 1. DT 11-NOV-2015, entry version 65. DE SubName: Full=Zgc:92151 {ECO:0000313|EMBL:AAH95900.1}; GN Name=sun1 {ECO:0000313|ZFIN:ZDB-GENE-050522-551}; GN Synonyms=zgc:92151 {ECO:0000313|EMBL:AAH95900.1}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955 {ECO:0000313|EMBL:AAH95900.1}; RN [1] {ECO:0000313|EMBL:AAH95900.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Whole {ECO:0000313|EMBL:AAH95900.1}; RG NIH - Zebrafish Gene Collection (ZGC) project; RL Submitted (MAY-2005) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC095900; AAH95900.1; -; mRNA. DR UniGene; Dr.105339; -. DR STRING; 7955.ENSDARP00000104532; -. DR PaxDb; Q501S0; -. DR KEGG; dre:553188; -. DR CTD; 23353; -. DR ZFIN; ZDB-GENE-050522-551; sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG104132; -. DR KO; K19347; -. DR PhylomeDB; Q501S0; -. DR NextBio; 20879987; -. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 358 378 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 422 440 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 447 471 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 621 641 {ECO:0000256|SAM:Coils}. FT COILED 659 686 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 987 AA; 109694 MW; 0246B776C3BA1C69 CRC64; MDFSRLHTYT PPHCTPDNTG YTYSLSSSYS TAALEFEKEH KINPVYDSPK MSRRSLRLQT SSGLYDNSFT EVAGNHSVGS YKRTNTSTTT TTSSSSSVSR SVRGRRQQQD SSIYESQSVT GTPQSTSDLS FTSTDASLIS NLLDQSTLRQ SSTTETYSAT RRRRAVNRSL LENGNVSKTE AHANLANGYF CKDCSFHAEG NEKETSYSVP YSTSESAAYQ TTEAADATMT TMTTSLNSVD GAAHDSYCGS VNVRDVVTAD HLNLNGSLCD DCKGKQHMEM NTERKHYSYI HRVLTVLWAV VTYTGNVLHR VCQGFGSAGA FVSRKMKSVV GLAVCSPGDI CKEKQHMEMN TERKHYSYIH RMLTVLWAVV SYTGYGLLRV CRGFGSAGAF VSRKLKSILW FAVCSPGKAA TGAFWWLGTG WYQLVALMSL INVFLLTRCL PKLLKLLLFL LPFLLLFGLW YLGLPIALSF LPAVNLTEWK TSVTSFASLP ALPSFPSFPS LPALPSFTKE PLLKEQDVPP LVVAQAASDS INSERLALLE QRVSALWESV RQGELKAKQQ HEEALGLTQS LQEQIKTQTD RESLGLWVTE LLQPKFTALE GDMKTETLSR AETEEQHIQH QNILEARLAE LEVLLQNLNS RTEDIHLSQQ TPVQAPVSVG VSQEKHEALL SEVQRLEAEL GRIRGDLQGV MGCQGKCDRL DTIHETVSAQ VKEQLYALLY GRDRGEAVIP EPLLPWLASQ YTSTSDLTAT LVTLERSILG NLSLQLQESK HQQASAETVT QTVAHTAEAA GMSEEQVQLI VQRALKLYSE DRTGQVDYAL ESGGGSVLST RCSETYETKT ALMSLFGIPL WYFSQSPRVV IQPDMYPGNC WAFKGSQGYL VIRLSLRVIP NGFCLEHIPK SLSPSGNISS APRRFSVYGL DDEYQDEGKL LGDYTYQEDG DSLQNFPVME ENDKAFQIIE MRVLSNWGHP EYTCLYRFRV HGKPHAQ // ID Q55XS2_CRYNB Unreviewed; 900 AA. AC Q55XS2; DT 24-MAY-2005, integrated into UniProtKB/TrEMBL. DT 24-MAY-2005, sequence version 1. DT 11-NOV-2015, entry version 42. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAL22413.1}; GN OrderedLocusNames=CNBB2920 {ECO:0000313|EMBL:EAL22413.1}; OS Cryptococcus neoformans var. neoformans serotype D (strain B-3501A) OS (Filobasidiella neoformans). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. OX NCBI_TaxID=283643 {ECO:0000313|EMBL:EAL22413.1, ECO:0000313|Proteomes:UP000001435}; RN [1] {ECO:0000313|Proteomes:UP000001435} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=B-3501A {ECO:0000313|Proteomes:UP000001435}; RX PubMed=15653466; DOI=10.1126/science.1103773; RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., Bruno D., RA Vamathevan J., Miranda M., Anderson I.J., Fraser J.A., Allen J.E., RA Bosdet I.E., Brent M.R., Chiu R., Doering T.L., Donlin M.J., RA D'Souza C.A., Fox D.S., Grinberg V., Fu J., Fukushima M., Haas B.J., RA Huang J.C., Janbon G., Jones S.J.M., Koo H.L., Krzywinski M.I., RA Kwon-Chung K.J., Lengeler K.B., Maiti R., Marra M.A., Marra R.E., RA Mathewson C.A., Mitchell T.G., Pertea M., Riggs F.R., Salzberg S.L., RA Schein J.E., Shvartsbeyn A., Shin H., Shumway M., Specht C.A., RA Suh B.B., Tenney A., Utterback T.R., Wickes B.L., Wortman J.R., RA Wye N.H., Kronstad J.W., Lodge J.K., Heitman J., Davis R.W., RA Fraser C.M., Hyman R.W.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307:1321-1324(2005). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAL22413.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEY01000010; EAL22413.1; -; Genomic_DNA. DR RefSeq; XP_777060.1; XM_771967.1. DR STRING; 283643.XP_777060.1; -. DR EnsemblFungi; EAL22413; EAL22413; CNBB2920. DR GeneID; 4934385; -. DR KEGG; cnb:CNBB2920; -. DR EuPathDB; FungiDB:CNBB2920; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001435; Chromosome 2. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 3. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001435}. FT COILED 460 498 {ECO:0000256|SAM:Coils}. FT COILED 522 549 {ECO:0000256|SAM:Coils}. FT COILED 642 662 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 900 AA; 99130 MW; 4EC9FA4510AFAC3D CRC64; MPPRTVAPRP SPARSTRSAR SLTREQAREE DDWEAESMTS GSFKVPRSRS KKAGNAIGLK DTSVNIAAAF HAAQTGHLPP PSHPNNSVSS NSSSSRSLQV PRAISPAEQL AQSARALSPV RFFLRPTEED GDDYTSFSSV GIENAGELNT SGEGESYDYR QEEEYVRQAQ QQKMAKAKAR AEASASLKNR RVKALDEDMP YRPAEEDTVS LASSDSGGGE EGVVRNGALQ GRAGTRGKRL ERGEGYLGMG LGIQPRRRRK SRKNGMDGDE SEEEGTPGTA RAWTPAVEVD GHKRSPTPLQ LLRGRSPMMD RKSPVPLGAY QQRRRPSDIR TIVTNVLHGV VIGLQFVVEL GTTVLYRIIV RPLEKAFGSS KGFVRRAKAD WWKWLGILLG ISLALRFLDN AFRTKGIYTA PDAPPSTIDE MSIRLTSLEH ATATLSDLLR AISEGDNELH QSAIAMKSKI DEMEDAVSAE RKRVEGVRGE LKNEKVIMQS EIDKLRSEIH ILSSQIGKHE NSISSDRSAK SLQGVEREIT QLKSRMEQVE QNVHAALEDG RLVAAVERIL PQWMPIRTDS QGDFVVEPAF WTEMKKVMVG KGEVEQIVRR LIGEAGVSDN KIKESPVDEH KVVEWMENAF DRHVQGGVWV TREEFTSTLN EKLQELARET AEKPISKRPA APSMVTIKSS KGEDLTSLFN SLIDTALLRY SKDTIARADY ALFTAGARVI PHLTSDTFTL QKASAFGKLL WASKDVQGRP PATALHPDTS VGSCWPIKGS EGSLGVMLVD RVVVSDVTIE HAPRELALDI ATAPKVVKVL GLVDYAEGLE KLAEYRATHQ ADLNNEEDTN YLPLGTFTYD PSSYSHIQTF PVSSDIVDLG IRIGVVVFKI ESNWGGDLTC LYRVRVHGNA // ID Q585N7_TRYB2 Unreviewed; 491 AA. AC Q585N7; D6XHQ7; DT 10-MAY-2005, integrated into UniProtKB/TrEMBL. DT 10-MAY-2005, sequence version 1. DT 11-NOV-2015, entry version 62. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AAZ11736.1}; GN ORFNames=Tb927.6.1740 {ECO:0000313|EMBL:AAX79690.1}; OS Trypanosoma brucei brucei (strain 927/4 GUTat10.1). OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; Trypanosoma. OX NCBI_TaxID=185431 {ECO:0000313|Proteomes:UP000008524}; RN [1] {ECO:0000313|EMBL:AAX79690.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=GUTat10.1 {ECO:0000313|EMBL:AAX79690.1}; RA El-Sayed N.M., Khalak H., Adams M.D.; RL Submitted (NOV-1999) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:AAX79690.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=GUTat10.1 {ECO:0000313|EMBL:AAX79690.1}; RA Ghedin E., Blandin G., Bartholomeu D., Caler E., Haas B., Hannick L., RA Shallom J., Hou L., Djikeng A., Feldblyum T., Hostetler J., RA Johnson J., Jones K., Koo H.L., Larkin C., Pai G., Peterson J., RA Khalak H.G., Salzberg S., Simpson A.J., Tallon L., Van Aken S., RA Wanless D., White O., Wortman J., Fraser C.M., El-Sayed N.M.A.; RL Submitted (NOV-1999) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EMBL:AAZ11736.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=927/4 GUTat10.1 {ECO:0000313|EMBL:AAZ11736.1}; RX PubMed=16020724; DOI=10.1126/science.1112181; RA El-Sayed N.M., Myler P.J., Blandin G., Berriman M., Crabtree J., RA Aggarwal G., Caler E., Renauld H., Worthey E.A., Hertz-Fowler C., RA Ghedin E., Peacock C., Bartholomeu D.C., Haas B.J., Tran A.N., RA Wortman J.R., Alsmark U.C., Angiuoli S., Anupama A., Badger J., RA Bringaud F., Cadag E., Carlton J.M., Cerqueira G.C., Creasy T., RA Delcher A.L., Djikeng A., Embley T.M., Hauser C., Ivens A.C., RA Kummerfeld S.K., Pereira-Leal J.B., Nilsson D., Peterson J., RA Salzberg S.L., Shallom J., Silva J.C., Sundaram J., Westenberger S., RA White O., Melville S.E., Donelson J.E., Andersson B., Stuart K.D., RA Hall N.; RT "Comparative genomics of trypanosomatid parasitic protozoa."; RL Science 309:404-409(2005). RN [4] {ECO:0000313|EMBL:AAZ11736.1, ECO:0000313|Proteomes:UP000008524} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=927/4 GUTat10.1 {ECO:0000313|EMBL:AAZ11736.1, RC ECO:0000313|Proteomes:UP000008524}; RX PubMed=16020726; DOI=10.1126/science.1112642; RA Berriman M., Ghedin E., Hertz-Fowler C., Blandin G., Renauld H., RA Bartholomeu D.C., Lennard N.J., Caler E., Hamlin N.E., Haas B., RA Bohme U., Hannick L., Aslett M.A., Shallom J., Marcello L., Hou L., RA Wickstead B., Alsmark U.C.M., Arrowsmith C., Atkin R.J., Barron A.J., RA Bringaud F., Brooks K., Carrington M., Cherevach I., RA Chillingworth T.J., Churcher C., Clark L.N., Corton C.H., Cronin A., RA Davies R.M., Doggett J., Djikeng A., Feldblyum T., Field M.C., RA Fraser A., Goodhead I., Hance Z., Harper D., Harris B.R., Hauser H., RA Hostetler J., Ivens A., Jagels K., Johnson D., Johnson J., Jones K., RA Kerhornou A.X., Koo H., Larke N., Landfear S., Larkin C., Leech V., RA Line A., Lord A., Macleod A., Mooney P.J., Moule S., Martin D.M., RA Morgan G.W., Mungall K., Norbertczak H., Ormond D., Pai G., RA Peacock C.S., Peterson J., Quail M.A., Rabbinowitsch E., RA Rajandream M.A., Reitter C., Salzberg S.L., Sanders M., Schobel S., RA Sharp S., Simmonds M., Simpson A.J., Tallon L., Turner C.M., Tait A., RA Tivey A.R., Van Aken S., Walker D., Wanless D., Wang S., White B., RA White O., Whitehead S., Woodward J., Wortman J., Adams M.D., RA Embley T.M., Gull K., Ullu E., Barry J.D., Fairlamb A.H., RA Opperdoes F., Barrell B.G., Donelson J.E., Hall N., Fraser C.M., RA Melville S.E., El-Sayed N.M.A.; RT "The genome of the African trypanosome Trypanosoma brucei."; RL Science 309:416-422(2005). RN [5] {ECO:0000313|EMBL:AAX79690.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=GUTat10.1 {ECO:0000313|EMBL:AAX79690.1}; RA Haas B., Blandin G., El-Sayed N.; RL Submitted (APR-2005) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC013353; AAX79690.1; -; Genomic_DNA. DR EMBL; CP000069; AAZ11736.1; -; Genomic_DNA. DR RefSeq; XP_845295.1; XM_840202.1. DR EnsemblProtists; AAZ11736; AAZ11736; Tb927.6.1740. DR GeneID; 3657810; -. DR KEGG; tbr:Tb927.6.1740; -. DR EuPathDB; TriTrypDB:Tb927.6.1740; -. DR Proteomes; UP000008524; Chromosome 6. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008524}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008524}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 455 479 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 367 394 FT COILED 416 436 SQ SEQUENCE 491 AA; 55403 MW; D29D2777F82B63D8 CRC64; MKRYYLPAAA FVVVVAAAYA STFFRPSEPG WRDEKPHERS KGFTTNYASA YLGATLTDFS PECLDASSVL NEDNEKYMLC PCNTQRKYFT VQLIRGIEVR IMTLVSQEHF SSRVKNFTVL GSSRYPTNEW RVLGHFKADP WRGTQHFDVA NQQPVRFLRF LWATSYGEHS WCALTTFKVF GVDVLETLTE DYTVSVEEQQ QHEQEQEHSI PPTPLTEPLI IVSPPQDDKH TAIGIDYGTS GAGVTAAVIS TVEDHHETNS RSPGGGLLKH SNYEGNLCVD LNGCKDDGSK TKKCNGTTFN SMYLDTIAQR YCSTVLPPEN ASRTCLPHER NLYVIHLLSF CVSRVALSNK ITALSKPHTS SSVLLMLAQM SKQIKTLQQE VVDLNSRHKD MELKAAQREI TLQWLGMQVK DFKRSNNENR DKLQDVMKQI EVLKSKLSLQ LHLGQNCEDD SLVRVMVVGS LTLSLFSSVL SCITVRTFYR PRRRTSATHL G // ID Q5APM8_CANAL Unreviewed; 558 AA. AC Q5APM8; DT 26-APR-2005, integrated into UniProtKB/TrEMBL. DT 26-APR-2005, sequence version 1. DT 11-NOV-2015, entry version 48. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAL04529.1}; GN ORFNames=CaO19.12200 {ECO:0000313|EMBL:EAL04529.1}, GN CaO19.4738 {ECO:0000313|EMBL:EAL04726.1}, GN orf19.4738 {ECO:0000313|CGD:CAL0004434}; OS Candida albicans (strain SC5314 / ATCC MYA-2876) (Yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Candida/Lodderomyces clade; Candida. OX NCBI_TaxID=237561 {ECO:0000313|EMBL:EAL04529.1, ECO:0000313|Proteomes:UP000000559}; RN [1] {ECO:0000313|EMBL:EAL04529.1, ECO:0000313|Proteomes:UP000000559} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=SC5314 {ECO:0000313|EMBL:EAL04529.1}, and RC SC5314 / ATCC MYA-2876 {ECO:0000313|Proteomes:UP000000559}; RX PubMed=15123810; DOI=10.1073/pnas.0401648101; RA Jones T., Federspiel N.A., Chibana H., Dungan J., Kalman S., RA Magee B.B., Newport G., Thorstenson Y.R., Agabian N., Magee P.T., RA Davis R.W., Scherer S.; RT "The diploid genome sequence of Candida albicans."; RL Proc. Natl. Acad. Sci. U.S.A. 101:7329-7334(2004). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAL04529.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACQ01000002; EAL04529.1; -; Genomic_DNA. DR EMBL; AACQ01000001; EAL04726.1; -; Genomic_DNA. DR RefSeq; XP_723234.1; XM_718141.1. DR RefSeq; XP_723425.1; XM_718332.1. DR EnsemblFungi; EAL04529; EAL04529; CaO19.12200. DR EnsemblFungi; EAL04726; EAL04726; CaO19.4738. DR GeneID; 3634791; -. DR GeneID; 3635041; -. DR KEGG; cal:CaO19.12200; -. DR KEGG; cal:CaO19.4738; -. DR CGD; CAL0004434; orf19.4738. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000559; Unassembled WGS sequence. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000559}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000559}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 20 FT CHAIN 21 558 FT /FTId=PRO_5004253412. FT TRANSMEM 501 518 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 558 AA; 63815 MW; 28DB28917F02A10B CRC64; MIYIWHALVY ISCLSSCVLS QENSSDTSTS LVNKSDLYTP RSNRTLDYSP LIQYIPVFFQ KNYSSIDTPH NNDFLALQSP ATTQTDSNKN SQKNDSVLDE CHFMSFEEWK KQKIESNTTT SNNYSMNGSS ESKSITPSNH SSVISTNVTL MEADGKVYKD KFNFASVDCA ATIMKTNAQA KGASAILKEN KDSYLLNECS VKHKYVIIEL CQDILVDSVV IGNFEFFSSI FKDIRISVSD RFPSQNWKEL GQFTASNIRD VQTFKIENPL IWARYLKLEI LSHYGNEFYC PISIVRVHGK TMMDEFKEDE EGNQHMGAIK EEEPPTPQTI EEDVLLINQT TLNECRVRLP HLQLNEFLKS FNSSNQEFCV PSDAEPQVTT AKTTTAITTQ ESIYKNIMKR LSLLESNATL SLLYIEEQSK LLSTAFSNLE KRQTTNFNTL ISSVNSTLMN QLMVFKESYY ELYEQYGNLF KMQENSHRQL LAETNKKVGL LSSELTFQKR VSIFNSIIII CLLVYVILTR DVAIEYPEDE LNEKSPSPQS KKLSSPFIPI RYKKSKKR // ID Q5AXW2_EMENI Unreviewed; 608 AA. AC Q5AXW2; C8V2P5; DT 26-APR-2005, integrated into UniProtKB/TrEMBL. DT 26-APR-2005, sequence version 1. DT 11-NOV-2015, entry version 50. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAA58267.1}; GN ORFNames=AN6868.2 {ECO:0000313|EMBL:EAA58267.1}, GN ANIA_06868 {ECO:0000313|EMBL:CBF71626.1}; OS Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL OS 194 / M139) (Aspergillus nidulans). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=227321 {ECO:0000313|EMBL:EAA58267.1, ECO:0000313|Proteomes:UP000005890}; RN [1] {ECO:0000313|EMBL:EAA58267.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=FGSC A4; RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., RA Ait-zahra M., Allen N., Allen T., An P., Anderson M., Anderson S., RA Arachchi H., Armbruster J., Bachantsang P., Baldwin J., Barry A., RA Bayul T., Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., RA Borowsky M., Boukhgalter B., Brunache A., Butler J., Calixte N., RA Calvo S., Camarata J., Campo K., Chang J., Cheshatsang Y., Citroen M., RA Collymore A., Considine T., Cook A., Cooke P., Corum B., Cuomo C., RA David R., Dawoe T., Degray S., Dodge S., Dooley K., Dorje P., RA Dorjee K., Dorris L., Duffey N., Dupes A., Elkins T., Engels R., RA Erickson J., Farina A., Faro S., Ferreira P., Fischer H., RA Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G., Gnerre S., RA Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K., Hafez N., RA Hagopian D., Hagos B., Hall J., Hatcher B., Heller A., Higgins H., RA Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E., Iliev I., RA Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M., Karlsson E., RA Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E., Labutti K., RA Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T., RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., RA Lui A., Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., RA Manning J., Marabella R., Maru K., Matthews C., Mauceli E., RA Mccarthy M., Mcdonough S., Mcghee T., Meldrim J., Meneus L., RA Mesirov J., Mihalev A., Mihova T., Mikkelsen T., Mlenga V., Moru K., RA Mozes J., Mulrain L., Munson G., Naylor J., Newes C., Nguyen C., RA Nguyen N., Nguyen T., Nicol R., Nielsen C., Nizzari M., Norbu C., RA Norbu N., O'donnell P., Okoawo O., O'leary S., Omotosho B., RA O'neill K., Osman S., Parker S., Perrin D., Phunkhang P., Piqani B., RA Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V., Raymond C., RA Retta R., Richardson S., Rise C., Rodriguez J., Rogers J., Rogov P., RA Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T., RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C., RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., RA Stetson K., Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., RA Tenzing P., Tesfaye S., Theodore J., Thoulutsang Y., Topham K., RA Towey S., Tsamla T., Tsomo N., Vallee D., Vassiliev H., RA Venkataraman V., Vinson J., Vo A., Wade C., Wang S., Wangchuk T., RA Wangdi T., Whittaker C., Wilkinson J., Wu Y., Wyman D., Yadav S., RA Yang S., Yang X., Yeager S., Yee E., Young G., Zainoun J., Zembeck L., RA Zimmer A., Zody M., Lander E.; RL Submitted (JAN-2004) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:EAA58267.1, ECO:0000313|Proteomes:UP000000560, ECO:0000313|Proteomes:UP000005890} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=FGSC A4 {ECO:0000313|EMBL:EAA58267.1}, and RC FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139 RC {ECO:0000313|Proteomes:UP000000560, RC ECO:0000313|Proteomes:UP000005890}; RX PubMed=16372000; DOI=10.1038/nature04341; RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.-J., Wortman J.R., RA Batzoglou S., Lee S.-I., Bastuerkmen M., Spevak C.C., Clutterbuck J., RA Kapitonov V., Jurka J., Scazzocchio C., Farman M.L., Butler J., RA Purcell S., Harris S., Braus G.H., Draht O., Busch S., D'Enfert C., RA Bouchier C., Goldman G.H., Bell-Pedersen D., Griffiths-Jones S., RA Doonan J.H., Yu J., Vienken K., Pain A., Freitag M., Selker E.U., RA Archer D.B., Penalva M.A., Oakley B.R., Momany M., Tanaka T., RA Kumagai T., Asai K., Machida M., Nierman W.C., Denning D.W., RA Caddick M.X., Hynes M., Paoletti M., Fischer R., Miller B.L., RA Dyer P.S., Sachs M.S., Osmani S.A., Birren B.W.; RT "Sequencing of Aspergillus nidulans and comparative analysis with A. RT fumigatus and A. oryzae."; RL Nature 438:1105-1115(2005). RN [3] {ECO:0000313|Proteomes:UP000000560} RP GENOME REANNOTATION. RC STRAIN=FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139 RC {ECO:0000313|Proteomes:UP000000560}; RX PubMed=19146970; DOI=10.1016/j.fgb.2008.12.003; RA Wortman J.R., Gilsenan J.M., Joardar V., Deegan J., Clutterbuck J., RA Andersen M.R., Archer D., Bencina M., Braus G., Coutinho P., RA von Dohren H., Doonan J., Driessen A.J., Durek P., Espeso E., RA Fekete E., Flipphi M., Estrada C.G., Geysens S., Goldman G., RA de Groot P.W., Hansen K., Harris S.D., Heinekamp T., Helmstaedt K., RA Henrissat B., Hofmann G., Homan T., Horio T., Horiuchi H., James S., RA Jones M., Karaffa L., Karanyi Z., Kato M., Keller N., Kelly D.E., RA Kiel J.A., Kim J.M., van der Klei I.J., Klis F.M., Kovalchuk A., RA Krasevec N., Kubicek C.P., Liu B., Maccabe A., Meyer V., Mirabito P., RA Miskei M., Mos M., Mullins J., Nelson D.R., Nielsen J., Oakley B.R., RA Osmani S.A., Pakula T., Paszewski A., Paulsen I., Pilsyk S., Pocsi I., RA Punt P.J., Ram A.F., Ren Q., Robellet X., Robson G., Seiboth B., RA van Solingen P., Specht T., Sun J., Taheri-Talesh N., Takeshita N., RA Ussery D., vanKuyk P.A., Visser H., van de Vondervoort P.J., RA de Vries R.P., Walton J., Xiang X., Xiong Y., Zeng A.P., Brandt B.W., RA Cornell M.J., van den Hondel C.A., Visser J., Oliver S.G., Turner G.; RT "The 2008 update of the Aspergillus nidulans genome annotation: a RT community effort."; RL Fungal Genet. Biol. 46:S2-13(2009). RN [4] {ECO:0000313|EMBL:CBF71626.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=FGSC A4 {ECO:0000313|EMBL:CBF71626.1}; RA Russo Wortman J., Mabey Gilsenan J., Joardar V., Deegan J., RA Clutterbuck J., Andersen M.R., Archer D., Bencina M., Braus G., RA Coutinho P., von Dohren H., Doonan J., Driessen A.J.M., Durek P., RA Espeso E., Fekete E., Flipphi M., Garcia Estrada C., Geysens S., RA Goldman G., de Groot P.W.J., Hansen K., Harris S.D., Heinekamp T., RA Helmstaedt K., Henrissat B., Hofmann G., Homan T., Horio T., RA Horiuchi H., James S., Jones M., Karaffa L., Karanyi Z., Kato M., RA Keller N., Kelly D.E., Kiel J.A.K.W., Kim J-M., van der Klei I.J., RA Klis F.M., Kovalchuk A., Krasevec N., Kubicek C.P., Liu B., RA MacCabe A., Meyer V., Mirabito P., Miskei M., Mos M., Mullins J., RA Nelson D.R., Nielsen J., Oakley B.R., Osmani S.A., Pakula T., RA Paszewski A., Paulsen I., Pilsyk S., Posci I., Punt P.J., Ram A.F.J., RA Ren Q., Robellet X., Robson G., Seiboth B., van Solingen P., RA Specht T., Sun J., Taheri-Talesh N., Takeshita N., Ussery D., RA vanKuyk P.A., Visser H., van der Vondervoot P.J.I., de Vries R.P., RA Walton J., Xiang X., Xiong Y., Ping Zeng A., Brandt B.W., RA Cornell M.J., van den Hondel C.A.M.J.J., Visser J., Oliver S.G., RA Turner G.; RT "The 2008 update of the Aspergillus nidulans genome annotation: A RT community effort."; RL Fungal Genet. Biol. 46:S2-S13(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BN001301; CBF71626.1; -; Genomic_DNA. DR EMBL; AACD01000113; EAA58267.1; -; Genomic_DNA. DR RefSeq; XP_664472.1; XM_659380.1. DR EnsemblFungi; CADANIAT00007668; CADANIAP00007668; CADANIAG00007668. DR EnsemblFungi; EAA58267; EAA58267; AN6868.2. DR GeneID; 2870569; -. DR KEGG; ani:AN6868.2; -. DR HOGENOM; HOG000176993; -. DR OMA; CWCSAPR; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000000560; Chromosome I. DR Proteomes; UP000005890; Partially assembled WGS sequence. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000560}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000560}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 286 304 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 608 AA; 67604 MW; EF69488A44BC6D29 CRC64; MPPKKAATKR TRAANSASPV PASRRSARMS PGLGGSNLPN IPTKTSFAYG SSQTPILPHM LAARPQMNLA EMADSIEEAV QTAKERENSD SPHNMPALST SGTSGTSTRK SAETSPRRTR RQPTPDQVQL LTSLHEASSA TPSTPTRHSF SSGSSVREVA EKQLYPSYMD QLPDQAEVPA DADLQGLGLD NMSVISYNVE RDVHDDDLKR TRSNITAPPR RVSGLDLKHS TILEEDESYI PSPSVDSFSA PAKTIISDHD PRTPLSPHSD DSTSQWEKPK DGWIPWLLRA LIATLVIFGI YSLLGSASSF DAKPIRFNNS DLNALSSQVV NLGAQVSSLS RDMRSVRAEV SNIPAPTTIL QYPSKHGQEI IKTNFLTRGN GVIVDPFLTS PSASRKVTWT QRLYFWLSGD KHMRPQPPLA AMTPWSDFGD CWCSAPKKGV TQLAVLLGQR IVPEDIVVEH LPKEATIRPQ VAPQEMELWA RYRYVGNGRP YKNTWFAFFR RYPKNIAGQD PLPSDQTLVR PSVIEALRLA WRGESDDEFS DDKQLGPDFF RIAKWMYDIN DTNNIQRFPV NAYIDSPDLR VDKVVFRVKS NWGANETCIY RLKLHGKL // ID Q5BB16_EMENI Unreviewed; 1428 AA. AC Q5BB16; C8VMW2; DT 26-APR-2005, integrated into UniProtKB/TrEMBL. DT 26-APR-2005, sequence version 1. DT 11-NOV-2015, entry version 77. DE SubName: Full=Sad1/UNC domain protein (AFU_orthologue AFUA_5G06480) {ECO:0000313|EMBL:CBF86494.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAA63837.1}; GN ORFNames=AN2264.2 {ECO:0000313|EMBL:EAA63837.1}, GN ANIA_02264 {ECO:0000313|EMBL:CBF86494.1}; OS Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL OS 194 / M139) (Aspergillus nidulans). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes; OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus. OX NCBI_TaxID=227321 {ECO:0000313|EMBL:EAA63837.1, ECO:0000313|Proteomes:UP000005890}; RN [1] {ECO:0000313|EMBL:EAA63837.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=FGSC A4; RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., RA Ait-zahra M., Allen N., Allen T., An P., Anderson M., Anderson S., RA Arachchi H., Armbruster J., Bachantsang P., Baldwin J., Barry A., RA Bayul T., Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., RA Borowsky M., Boukhgalter B., Brunache A., Butler J., Calixte N., RA Calvo S., Camarata J., Campo K., Chang J., Cheshatsang Y., Citroen M., RA Collymore A., Considine T., Cook A., Cooke P., Corum B., Cuomo C., RA David R., Dawoe T., Degray S., Dodge S., Dooley K., Dorje P., RA Dorjee K., Dorris L., Duffey N., Dupes A., Elkins T., Engels R., RA Erickson J., Farina A., Faro S., Ferreira P., Fischer H., RA Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G., Gnerre S., RA Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K., Hafez N., RA Hagopian D., Hagos B., Hall J., Hatcher B., Heller A., Higgins H., RA Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E., Iliev I., RA Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M., Karlsson E., RA Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E., Labutti K., RA Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T., RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., RA Lui A., Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., RA Manning J., Marabella R., Maru K., Matthews C., Mauceli E., RA Mccarthy M., Mcdonough S., Mcghee T., Meldrim J., Meneus L., RA Mesirov J., Mihalev A., Mihova T., Mikkelsen T., Mlenga V., Moru K., RA Mozes J., Mulrain L., Munson G., Naylor J., Newes C., Nguyen C., RA Nguyen N., Nguyen T., Nicol R., Nielsen C., Nizzari M., Norbu C., RA Norbu N., O'donnell P., Okoawo O., O'leary S., Omotosho B., RA O'neill K., Osman S., Parker S., Perrin D., Phunkhang P., Piqani B., RA Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V., Raymond C., RA Retta R., Richardson S., Rise C., Rodriguez J., Rogers J., Rogov P., RA Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T., RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C., RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., RA Stetson K., Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., RA Tenzing P., Tesfaye S., Theodore J., Thoulutsang Y., Topham K., RA Towey S., Tsamla T., Tsomo N., Vallee D., Vassiliev H., RA Venkataraman V., Vinson J., Vo A., Wade C., Wang S., Wangchuk T., RA Wangdi T., Whittaker C., Wilkinson J., Wu Y., Wyman D., Yadav S., RA Yang S., Yang X., Yeager S., Yee E., Young G., Zainoun J., Zembeck L., RA Zimmer A., Zody M., Lander E.; RL Submitted (JAN-2004) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:EAA63837.1, ECO:0000313|Proteomes:UP000000560, ECO:0000313|Proteomes:UP000005890} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=FGSC A4 {ECO:0000313|EMBL:EAA63837.1}, and RC FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139 RC {ECO:0000313|Proteomes:UP000000560, RC ECO:0000313|Proteomes:UP000005890}; RX PubMed=16372000; DOI=10.1038/nature04341; RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.-J., Wortman J.R., RA Batzoglou S., Lee S.-I., Bastuerkmen M., Spevak C.C., Clutterbuck J., RA Kapitonov V., Jurka J., Scazzocchio C., Farman M.L., Butler J., RA Purcell S., Harris S., Braus G.H., Draht O., Busch S., D'Enfert C., RA Bouchier C., Goldman G.H., Bell-Pedersen D., Griffiths-Jones S., RA Doonan J.H., Yu J., Vienken K., Pain A., Freitag M., Selker E.U., RA Archer D.B., Penalva M.A., Oakley B.R., Momany M., Tanaka T., RA Kumagai T., Asai K., Machida M., Nierman W.C., Denning D.W., RA Caddick M.X., Hynes M., Paoletti M., Fischer R., Miller B.L., RA Dyer P.S., Sachs M.S., Osmani S.A., Birren B.W.; RT "Sequencing of Aspergillus nidulans and comparative analysis with A. RT fumigatus and A. oryzae."; RL Nature 438:1105-1115(2005). RN [3] {ECO:0000313|Proteomes:UP000000560} RP GENOME REANNOTATION. RC STRAIN=FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139 RC {ECO:0000313|Proteomes:UP000000560}; RX PubMed=19146970; DOI=10.1016/j.fgb.2008.12.003; RA Wortman J.R., Gilsenan J.M., Joardar V., Deegan J., Clutterbuck J., RA Andersen M.R., Archer D., Bencina M., Braus G., Coutinho P., RA von Dohren H., Doonan J., Driessen A.J., Durek P., Espeso E., RA Fekete E., Flipphi M., Estrada C.G., Geysens S., Goldman G., RA de Groot P.W., Hansen K., Harris S.D., Heinekamp T., Helmstaedt K., RA Henrissat B., Hofmann G., Homan T., Horio T., Horiuchi H., James S., RA Jones M., Karaffa L., Karanyi Z., Kato M., Keller N., Kelly D.E., RA Kiel J.A., Kim J.M., van der Klei I.J., Klis F.M., Kovalchuk A., RA Krasevec N., Kubicek C.P., Liu B., Maccabe A., Meyer V., Mirabito P., RA Miskei M., Mos M., Mullins J., Nelson D.R., Nielsen J., Oakley B.R., RA Osmani S.A., Pakula T., Paszewski A., Paulsen I., Pilsyk S., Pocsi I., RA Punt P.J., Ram A.F., Ren Q., Robellet X., Robson G., Seiboth B., RA van Solingen P., Specht T., Sun J., Taheri-Talesh N., Takeshita N., RA Ussery D., vanKuyk P.A., Visser H., van de Vondervoort P.J., RA de Vries R.P., Walton J., Xiang X., Xiong Y., Zeng A.P., Brandt B.W., RA Cornell M.J., van den Hondel C.A., Visser J., Oliver S.G., Turner G.; RT "The 2008 update of the Aspergillus nidulans genome annotation: a RT community effort."; RL Fungal Genet. Biol. 46:S2-13(2009). RN [4] {ECO:0000313|EMBL:CBF86494.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=FGSC A4 {ECO:0000313|EMBL:CBF86494.1}; RA Russo Wortman J., Mabey Gilsenan J., Joardar V., Deegan J., RA Clutterbuck J., Andersen M.R., Archer D., Bencina M., Braus G., RA Coutinho P., von Dohren H., Doonan J., Driessen A.J.M., Durek P., RA Espeso E., Fekete E., Flipphi M., Garcia Estrada C., Geysens S., RA Goldman G., de Groot P.W.J., Hansen K., Harris S.D., Heinekamp T., RA Helmstaedt K., Henrissat B., Hofmann G., Homan T., Horio T., RA Horiuchi H., James S., Jones M., Karaffa L., Karanyi Z., Kato M., RA Keller N., Kelly D.E., Kiel J.A.K.W., Kim J-M., van der Klei I.J., RA Klis F.M., Kovalchuk A., Krasevec N., Kubicek C.P., Liu B., RA MacCabe A., Meyer V., Mirabito P., Miskei M., Mos M., Mullins J., RA Nelson D.R., Nielsen J., Oakley B.R., Osmani S.A., Pakula T., RA Paszewski A., Paulsen I., Pilsyk S., Posci I., Punt P.J., Ram A.F.J., RA Ren Q., Robellet X., Robson G., Seiboth B., van Solingen P., RA Specht T., Sun J., Taheri-Talesh N., Takeshita N., Ussery D., RA vanKuyk P.A., Visser H., van der Vondervoot P.J.I., de Vries R.P., RA Walton J., Xiang X., Xiong Y., Ping Zeng A., Brandt B.W., RA Cornell M.J., van den Hondel C.A.M.J.J., Visser J., Oliver S.G., RA Turner G.; RT "The 2008 update of the Aspergillus nidulans genome annotation: A RT community effort."; RL Fungal Genet. Biol. 46:S2-S13(2009). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BN001307; CBF86494.1; -; Genomic_DNA. DR EMBL; AACD01000037; EAA63837.1; -; Genomic_DNA. DR RefSeq; XP_659868.1; XM_654776.1. DR EnsemblFungi; CADANIAT00008956; CADANIAP00008956; CADANIAG00008956. DR EnsemblFungi; EAA63837; EAA63837; AN2264.2. DR GeneID; 2875195; -. DR KEGG; ani:AN2264.2; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000560; Chromosome VII. DR Proteomes; UP000005890; Partially assembled WGS sequence. DR GO; GO:0003995; F:acyl-CoA dehydrogenase activity; IEA:InterPro. DR GO; GO:0050660; F:flavin adenine dinucleotide binding; IEA:InterPro. DR Gene3D; 1.10.540.10; -; 1. DR InterPro; IPR006091; Acyl-CoA_Oxase/DH_cen-dom. DR InterPro; IPR009075; AcylCo_DH/oxidase_C. DR InterPro; IPR013786; AcylCoA_DH/ox_N. DR InterPro; IPR009100; AcylCoA_DH/oxidase_NM_dom. DR InterPro; IPR007727; Spo12. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00441; Acyl-CoA_dh_1; 1. DR Pfam; PF02770; Acyl-CoA_dh_M; 1. DR Pfam; PF02771; Acyl-CoA_dh_N; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR Pfam; PF05032; Spo12; 1. DR SUPFAM; SSF47203; SSF47203; 1. DR SUPFAM; SSF56645; SSF56645; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000560}; KW Reference proteome {ECO:0000313|Proteomes:UP000000560}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 22 {ECO:0000256|SAM:SignalP}. FT CHAIN 23 1428 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5005364164. FT COILED 1424 1428 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1428 AA; 156189 MW; 56974D7D588AE55D CRC64; MLAPRWLTTA IFVLTYIQKS GGDSQKQPVC LARDWREAVV PLKWPTCVET RWDRWPNGEL TTTPTPASHN NLKSTSGSSS VSVSITVEPG PASSLADHEL DTESPLDNVN FLSFEDWKKQ NLARAGQSAE NIGGNRRAGT AEKDRRRPLG INNALDSLGD DVEIELDFGG FGADASEAAK TATDWVTHVP SRGSGGAQVV PDGGRDTAEA SGQGVPHAGG ERSKDAGTTC KERFNYASFD CAATVLKTNP EAKGSSSVLI ENKDSYMLNE CRAQNKFLIL ELCDDILVDT VVLANYEFFS SIFHTFRVSV ADRYPAKPEQ WKELGIYAAR NTREIQAFAV ENPLIWARYL RIEFLTHYGN EFYCPLSLIR VHGTTMLEEY KHDGEVNRAE EELAGGVAEP ALETETVTED ATKTEVPPPE APSFHVVNSE IRPSKICPKF VTSVELALLG SVNPQTCGIN DTSEESPATE GNKPVLSKTS SSPVIPSAGN AAKAASPEAG DYKASGSSGV NPPNTADTAA SGAASSETDS HNATSDQDTR STAASRDEQG VESIRTTTTQ PPSANPTTQE SFFKSVNKRL QMLETNSSLS LQYIEEQSRI LRDAFNKVEK RQLSKTSTFL ENLNVTVVNE LKQLREQYDQ AWRSVALEFE HQRIQYHQEI HSLSAQLGVL ADEIVFQKRV AVIQSIVVLL CFGLVLFTRG AVGSYIDFPS VQNMVSRSYS LRPSSPILGF GSPPGSPGST RPTSSYRTTP GHRRQVSQDS QDGSVSPTMY APPTPTSDDS RLGPDERGTT SPSPEGARSL AEVAPPLLRS NSSPPDLNGE NEGGCEKNHE DLDSDPECSG FETGGDDTLA ESPVTVMMQQ FQCPTTKETS ITSLKMNSNP LTARSPNTHL AVSNEQDLKT ASSTTDLMDY HRQKLQGKIE NQDKQQASYV SPSDDIMSPC SKKLSDLKGK RFKNVEQDGK GDEFPLSLNP SSATGPRRPW IWYFPAPLPR LAASTPNNQK SNQQDQVEEF VEKECIPAEA LFSAQLGTGE QRWKTNPAVM EELKTKAKKI GLWNMFLPKN HFSQGAGFSN LEYGLMAEYL GKSKVASEAT NNAAPDTGNM EVLAKYGNDQ QKAQWLTPLL EGKIRSAFLM TEPDIASSDA TNIQLDIRRE GNEYVLNGSK WWSSGAGDPR CQIYLVMGKT DPRNPDTYKQ QSVLLVPAST PGITIHRMLS VYGYDDAPHG HGHISFKNVR VPLSAMVLGE GRGFEIIQGR LGPGRIHHAM RTIGAAERAI DWLIARINDD RKKPFGQPLS SHGVILEWLA KSRIEIDAAR LIVLNAAIKI DQGNAKFALK EIAQAKVLVP QTALTVIDRA VQVYGAAGVS QDTPLASLWA MVRTLRIADG PDEVHLQQLG KRENKSRREE VTKRVAWQKE QSDRILTANG FSKLKSLL // ID Q5CU18_CRYPI Unreviewed; 692 AA. AC Q5CU18; DT 12-APR-2005, integrated into UniProtKB/TrEMBL. DT 12-APR-2005, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAK88879.1}; GN ORFNames=cgd2_650 {ECO:0000313|EMBL:EAK88879.1}; OS Cryptosporidium parvum (strain Iowa II). OC Eukaryota; Alveolata; Apicomplexa; Conoidasida; Coccidia; OC Eucoccidiorida; Eimeriorina; Cryptosporidiidae; Cryptosporidium. OX NCBI_TaxID=353152 {ECO:0000313|EMBL:EAK88879.1, ECO:0000313|Proteomes:UP000006726}; RN [1] {ECO:0000313|EMBL:EAK88879.1, ECO:0000313|Proteomes:UP000006726} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Iowa II {ECO:0000313|Proteomes:UP000006726}; RX PubMed=15044751; DOI=10.1126/science.1094786; RA Abrahamsen M.S., Templeton T.J., Enomoto S., Abrahante J.E., Zhu G., RA Lancto C.A., Deng M., Liu C., Widmer G., Tzipori S., Buck G.A., Xu P., RA Bankier A.T., Dear P.H., Konfortov B.A., Spriggs H.F., Iyer L., RA Anantharaman V., Aravind L., Kapur V.; RT "Complete genome sequence of the apicomplexan, Cryptosporidium RT parvum."; RL Science 304:441-445(2004). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAK88879.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEE01000005; EAK88879.1; -; Genomic_DNA. DR RefSeq; XP_626301.1; XM_626301.1. DR EnsemblProtists; EAK88879; EAK88879; cgd2_650. DR GeneID; 3373383; -. DR KEGG; cpv:cgd2_650; -. DR EuPathDB; CryptoDB:cgd2_650; -. DR eggNOG; ENOG410K6QW; Eukaryota. DR eggNOG; ENOG4110FWY; LUCA. DR InParanoid; Q5CU18; -. DR Proteomes; UP000006726; Chromosome 2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006726}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006726}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 105 127 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 297 324 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 692 AA; 81886 MW; 82D770678F97B033 CRC64; MKNKNNKKRI LVEKEKNYHE KANFEIWKTD GIYRNENDRS FMFGNRKLSN MKKMQIDYYP YNEKSKIENE IRGNVLINTR KDTESRKKAS LRIWIYRKLG KILNYIFKII KTLSLFLIIG TFLFIIWNSI NHFGQYGESK MISTQLFDQL IKKEKLVNLI KQKESNGLFS ADYYDETVSF VEFQEKFEEI EDQVIGLNKK IAILAEINEF SKMNRSNLER DFRELINNKT FILENEIIEL NTKLLKEIKS VKNESSTIFK NFNRSLNDIL TKEKNQIIYK NNHPEIESQI QLIIRNLTEH ERFYLKLRND LENISEEVKL IDEAQDVLRS LIWSMNKRNE ELSTFNSENT IGNSQLEALQ GQSSEIGVYL KKILGTKLEM QGYNQVDWAQ SSMGGKVLNP KSNKFCEANK DNNNNSNSII IKSLRYFQDQ LFKFLIQKNH RTTTLKVPIY NSECFEPNQL IKSNQEKTIG NCLLAEVGTN IDIQLSVLIN VTSVGIDHIL FPLQYDNGET VPRKFSVKCI KGSSIEEYEY GHFMYHYPQN NQGLEIFQVN SNHLCNIIRF TIHSSYGDKY FCLYKLRVYG EQVNNNIEFV SEVKRNYFLK LFWKITRKIT IALRNEISNM NIILLNCFNY IKENLKVENK EKIEFEEDNN DIMLDNDVEI DEDKDEYVPN YGNFSESNKF KTKNDKRRRE IE // ID Q5CVA0_CRYPI Unreviewed; 1158 AA. AC Q5CVA0; DT 12-APR-2005, integrated into UniProtKB/TrEMBL. DT 12-APR-2005, sequence version 1. DT 11-NOV-2015, entry version 35. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAK89618.1}; GN ORFNames=cgd8_4850 {ECO:0000313|EMBL:EAK89618.1}; OS Cryptosporidium parvum (strain Iowa II). OC Eukaryota; Alveolata; Apicomplexa; Conoidasida; Coccidia; OC Eucoccidiorida; Eimeriorina; Cryptosporidiidae; Cryptosporidium. OX NCBI_TaxID=353152 {ECO:0000313|EMBL:EAK89618.1, ECO:0000313|Proteomes:UP000006726}; RN [1] {ECO:0000313|EMBL:EAK89618.1, ECO:0000313|Proteomes:UP000006726} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Iowa II {ECO:0000313|Proteomes:UP000006726}; RX PubMed=15044751; DOI=10.1126/science.1094786; RA Abrahamsen M.S., Templeton T.J., Enomoto S., Abrahante J.E., Zhu G., RA Lancto C.A., Deng M., Liu C., Widmer G., Tzipori S., Buck G.A., Xu P., RA Bankier A.T., Dear P.H., Konfortov B.A., Spriggs H.F., Iyer L., RA Anantharaman V., Aravind L., Kapur V.; RT "Complete genome sequence of the apicomplexan, Cryptosporidium RT parvum."; RL Science 304:441-445(2004). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAK89618.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAEE01000003; EAK89618.1; -; Genomic_DNA. DR RefSeq; XP_627369.1; XM_627369.1. DR ProteinModelPortal; Q5CVA0; -. DR STRING; 353152.XP_627369.1; -. DR EnsemblProtists; EAK89618; EAK89618; cgd8_4850. DR GeneID; 3374374; -. DR KEGG; cpv:cgd8_4850; -. DR EuPathDB; CryptoDB:cgd8_4850; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; Q5CVA0; -. DR Proteomes; UP000006726; Chromosome 8. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000006726}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000006726}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 962 982 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 832 852 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1158 AA; 133545 MW; 3F48A7EE0569C778 CRC64; MIKCKVFPTF LVIFGIFYAI TLCSTIQYIG YCENHEFHEI YVRVIEKLYN KNEEITKRDK RNENNKQIII GKKYFNALDS INNQREIQNQ PNLEYLKDEN SIYKTYSQRR SYFTNLNPAD CRINILRRHK IKISRVSVSV MESFDRLIQL VDILSSRKPC PYCKGCNDSR LDSLPIWVSP ILHRQIKRVN SQLDNKLCLI LKKDSKLNSI IREIIFSIKD KKLTNFDGIT SNYITSLSSL VARTKLRRKE NFPLYLLGGI FELMLDDLRK VGIKISSCHP SQFLFKCSWH PIKQNLEISA KLKAVSKSML NIYKSLELES DSTTSNSDKS FKLSFKRKIS GEYYEPFKVS KEQSNNFEES ILNKDIIKIP TSSGSHSSCI STTIGTSVGS NLSSTLGSSH ISPQNKCIPP FLDSMIMLRN DGSPNPDLHI CGSLETIKHK NTRLEEKEEV EIKVSREDKM DEKSKLKVIQ GAPYVSQLAA LHYDYASSSS NSKVLSWSEG VLRPKSVQSS NPDSYLLVPC NKPMWFVIGF QEDIFLEYIA LFSLEYFSSS FREIEISGSL IYPTKQWIPI GILRRNQILP KEMFDLKTLC VKHDEGNHLF DHLVYNIDDH DHIHTSLKGS RVSDIKEDNT LSKASDKIDH RNNRDQSVSG IRKDSIISGG HIKENNLVHG SNPCWVRYIR VRAISYYEEG HYYCHLNRIQ IFGNNVINRL EVEMGGGDRR SSISMSEMEE SVKDVENRLW KRDIERDIHE TINDQKPQIM YFNSSNFNTG TNNKKRKDDD MLGAFEVKRS ESRSLLIKNQ EKDKYIDKRE IFEDHLESIN GRYFTNSKSH PLLSLIDRVK ILEKQLDTVK LERRAIIADF NSSFDDINDS IYRLSNSVKF LQDILLGSNM NNTENEQNDT DQIKFLKLGD IYLIQSLNLV GSYLERFLGK YYSGEAITYL TQLFESIRLS ILNLFGYFYN HFQTLIIALL FTTLFISQII LFKKYISLKR RLHNTLQFFK SYSQSSNASP KNSILLDENL FFQKNLNFKL HSNPVTNTYR FSNNQSNSNT ISTSNSNMNS NLNINDGNNI SVGDITLGNY TPSTIKNNNN NSNLTFESNS VVFQGNETAF LSQTSVENSH KDSEICKIDN TKSQNITTHK QEAKEDFPDI SDTIGRHE // ID Q5DMX1_CUCME Unreviewed; 584 AA. AC Q5DMX1; DT 29-MAR-2005, integrated into UniProtKB/TrEMBL. DT 29-MAR-2005, sequence version 1. DT 14-OCT-2015, entry version 32. DE SubName: Full=Membrane protein-like {ECO:0000313|EMBL:AAU04771.1}; GN Name=MPL {ECO:0000313|EMBL:AAU04771.1}; OS Cucumis melo (Muskmelon). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; OC Benincaseae; Cucumis. OX NCBI_TaxID=3656 {ECO:0000313|EMBL:AAU04771.1}; RN [1] {ECO:0000313|EMBL:AAU04771.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=15902490; DOI=10.1007/s00438-004-1104-7; RA van Leeuwen H., Garcia-Mas J., Coca M., Puigdomenech P., Monfort A.; RT "Analysis of the melon genome in regions encompassing TIR-NBS-LRR RT resistance genes."; RL Mol. Genet. Genomics 273:240-251(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AY582736; AAU04771.1; -; Genomic_DNA. DR RefSeq; XP_008459524.1; XM_008461302.1. DR RefSeq; XP_008459525.1; XM_008461303.1. DR ProteinModelPortal; Q5DMX1; -. DR GeneID; 103498640; -. DR KEGG; cmo:103498640; -. DR PhylomeDB; Q5DMX1; -. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 25 47 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 526 549 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 561 581 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 497 524 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 584 AA; 65454 MW; 37D435AFD37D64CB CRC64; MRKPVGALLH DRRAVRVPIS GRNHLYKVSI SLVFILWGLI FLFSLWISRG DGCQEGSILL PDGVSTTNES KLENNKDSDV LCEPPNGESH CTIHLNNSCS INASSPGSDN EILSSEESSS HIQATTRLPE DESSSTRVKP ESKPPKGDIS SDTVLLGLEE FKSRAFVSRG KSETGQAGNT IHRLEPGGAE YNYASASKGA KVLAFNKEAK GASNILGKDK DKYLRNPCSA EEKFVVIELS EETLVVTIEI ANFEHHSSNL KEFEVHGSLV YPTDVWFKLG NFTAPNAKHA HRFVLKDPKW VRYLKLNFLT HYGSEFYCTL STVEVYGMDA VEMMLEDLIS AQHKPSISDE ATPDKRVIPS QPGPIDEVSH GRELQSLANE EGGDGVDLEL SKSNTPDPVE ESHHQQPGRM PGDTVLKILT QKVRSLDLSL SVLERYLEDL TSKYGNIFKE FDKDIGNNNL LIEKTQEDIR NILKIQDNTD KDLRDLISWK SMVSLQLDGL QRHNSILRSE IERVQKNQTS LENKGIVVFL VCLIFSSFAI FRLFLHIVLR VYERTNNSRK FCCISPSWYL LLLSCCIILF VQSL // ID Q5DTM6_MOUSE Unreviewed; 248 AA. AC Q5DTM6; DT 29-MAR-2005, integrated into UniProtKB/TrEMBL. DT 29-MAR-2005, sequence version 1. DT 11-NOV-2015, entry version 41. DE SubName: Full=MKIAA4118 protein {ECO:0000313|EMBL:BAD90292.1}; DE Flags: Fragment; GN Name=Spag4 {ECO:0000313|MGI:MGI:2444120}; GN Synonyms=mKIAA4118 {ECO:0000313|EMBL:BAD90292.1}; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090 {ECO:0000313|EMBL:BAD90292.1}; RN [1] {ECO:0000313|EMBL:BAD90292.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Fetal brain {ECO:0000313|EMBL:BAD90292.1}; RA Okazaki N., Kikuno R.F., Ohara R., Inamoto S., Nagase T., Ohara O., RA Koga H.; RT "Prediction of the Coding Sequences of Mouse Homologues of KIAA Gene. RT The Complete Nucleotide Sequences of Mouse KIAA-homologous cDNAs RT Identified by Screening of Terminal sequences of cDNA Clones Randomly RT Sampled from Size-Fractionated Libraries.."; RL Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK220494; BAD90292.1; -; mRNA. DR UniGene; Mm.330713; -. DR STRING; 10090.ENSMUSP00000036484; -. DR PaxDb; Q5DTM6; -. DR MGI; MGI:2444120; Spag4. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOVERGEN; HBG079205; -. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 16 43 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:BAD90292.1}. SQ SEQUENCE 248 AA; 27950 MW; 4563172BCAC76C6D CRC64; PHGTHLPNYS QYHHRVHSQG QQLQQLQAEL NKLHKEVSSV RAAHSERVAK LVFQRLNEDF VRKPDYALSS VGASIDLEKT SSDYEDQNTA YFWNRLSFWN YARPPSVILE PDVFPGNCWA FEGDKGQVVI RLPGHVQLSD ITLQHPPPTV AHTGGASSAP RDFAVYGLQA DDETEVFLGK FIFDVQKSEI QTFHLQNDPP SAFPKVKIQI LSNWGHPRFT CLYRVRAHGV RTSEWADDNA TGVTGGPH // ID Q5JX49_HUMAN Unreviewed; 141 AA. AC Q5JX49; DT 15-FEB-2005, integrated into UniProtKB/TrEMBL. DT 15-FEB-2005, sequence version 1. DT 22-JUL-2015, entry version 59. DE SubName: Full=Sperm-associated antigen 4 protein {ECO:0000313|Ensembl:ENSP00000399231}; DE Flags: Fragment; GN Name=SPAG4 {ECO:0000313|Ensembl:ENSP00000399231}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000399231, ECO:0000313|Proteomes:UP000005640}; RN [1] {ECO:0000313|Ensembl:ENSP00000399231, ECO:0000313|Proteomes:UP000005640} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=11780052; DOI=10.1038/414865a; RA Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., RA Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., RA Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., RA Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M., Brown A.J., RA Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., RA Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., RA Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., RA Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., RA Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., RA Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., RA Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., RA Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., RA Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., RA Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., RA Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., RA Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., RA Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., RA Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., RA Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., RA Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., RA Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., RA Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., RA Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., RA Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., RA Rogers J.; RT "The DNA sequence and comparative analysis of human chromosome 20."; RL Nature 414:865-871(2001). RN [2] {ECO:0000313|Ensembl:ENSP00000399231} RP IDENTIFICATION. RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSP00000399231}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AL109827; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR ProteinModelPortal; Q5JX49; -. DR Ensembl; ENST00000430878; ENSP00000399231; ENSG00000061656. DR HGNC; HGNC:11214; SPAG4. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000007503; -. DR HOVERGEN; HBG108520; -. DR NextBio; 35539350; -. DR Proteomes; UP000005640; Chromosome 20. DR Bgee; Q5JX49; -. DR ExpressionAtlas; Q5JX49; baseline and differential. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000005640}; KW Reference proteome {ECO:0000313|Proteomes:UP000005640}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSP00000399231}. SQ SEQUENCE 141 AA; 15618 MW; A8B3688D3D2C76CE CRC64; VFPGNCWAFE GDQGQVVIQL PGRVQLSDIT LQHPPPSVEH TGGANSAPRD FAVFFLLSFF THQGLQVYDE TEVSLGKFTF DVEKSEIQTF HLQNDPPAAF PKVKIQILSN WGHPRFTCLY RVRAHGVRTS EGAEGSAQGP H // ID Q5K8V5_CRYNJ Unreviewed; 718 AA. AC Q5K8V5; Q55LW4; DT 15-FEB-2005, integrated into UniProtKB/TrEMBL. DT 15-FEB-2005, sequence version 1. DT 11-NOV-2015, entry version 65. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AAW46470.1}; GN OrderedLocusNames=CNL04360 {ECO:0000313|EMBL:AAW46470.1}; OS Cryptococcus neoformans var. neoformans serotype D (strain JEC21 / OS ATCC MYA-565) (Filobasidiella neoformans). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. OX NCBI_TaxID=214684 {ECO:0000313|EMBL:AAW46470.1, ECO:0000313|Proteomes:UP000002149}; RN [1] {ECO:0000313|EMBL:AAW46470.1, ECO:0000313|Proteomes:UP000002149} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JEC21 / ATCC MYA-565 {ECO:0000313|Proteomes:UP000002149}; RX PubMed=15653466; DOI=10.1126/science.1103773; RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., Bruno D., RA Vamathevan J., Miranda M., Anderson I.J., Fraser J.A., Allen J.E., RA Bosdet I.E., Brent M.R., Chiu R., Doering T.L., Donlin M.J., RA D'Souza C.A., Fox D.S., Grinberg V., Fu J., Fukushima M., Haas B.J., RA Huang J.C., Janbon G., Jones S.J.M., Koo H.L., Krzywinski M.I., RA Kwon-Chung K.J., Lengeler K.B., Maiti R., Marra M.A., Marra R.E., RA Mathewson C.A., Mitchell T.G., Pertea M., Riggs F.R., Salzberg S.L., RA Schein J.E., Shvartsbeyn A., Shin H., Shumway M., Specht C.A., RA Suh B.B., Tenney A., Utterback T.R., Wickes B.L., Wortman J.R., RA Wye N.H., Kronstad J.W., Lodge J.K., Heitman J., Davis R.W., RA Fraser C.M., Hyman R.W.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307:1321-1324(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE017352; AAW46470.1; -; Genomic_DNA. DR RefSeq; XP_567987.1; XM_567987.1. DR UniGene; Fne.7028; -. DR STRING; 214684.XP_567987.1; -. DR PaxDb; Q5K8V5; -. DR EnsemblFungi; AAW46470; AAW46470; CNL04360. DR GeneID; 3254862; -. DR KEGG; cne:CNL04360; -. DR EuPathDB; FungiDB:CNL04360; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; Q5K8V5; -. DR OMA; CDEIRIE; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000002149; Chromosome 12. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002149}; KW Reference proteome {ECO:0000313|Proteomes:UP000002149}. FT COILED 146 193 {ECO:0000256|SAM:Coils}. FT COILED 348 368 {ECO:0000256|SAM:Coils}. FT COILED 664 684 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 718 AA; 80189 MW; DC750D46F0FAEF72 CRC64; MLTPCRADEH WVVVELCDEI RIEAVEIAVW EFFSGVVREV RVSVGGEDEE DDAEEPGQDD VAGRGHRWKQ VGSFIGKNVR GSQTFSLSQP TSFHRFIRLD FPSYFGSEYY CPVSSLKVYG MNQMEAFKWE QKQLSAVAKD RDRTGNREHE EEERRAKERR EREKKERDER DKQEQREREL DELEKLLHEQ AGRLVPELLT ESGLFSSIDE TAPTNVPTVV SKRDGDSDSP PTNESMATSL IESTSIESTS IESPTSIESP STSYTRAVPP RSDSSESIYA FIIRRLNALE GNSSLVARYI EEQAKVMRSM LKQVQVGWDE WKGEWEDEDR GRWQQERMRQ EDRLGRVLSQ LEQQRIAFDA ERKAIETQLR VLADQLGYER RRGIAQLIIM VVIILLGAAS RSSTMDAILT PLLKEARRRR SDYYHRKSLS GPLAGLHIDM GAGRPPAIIG QARPTSTTPS AHPHRHSSST PTPRLKTSLS RAGSGHRSNT SLKRRGIVPQ VPPSYRSVSS SEFTFSPLSH LPPTSSPSPA NIPNPNPNPR NVRVSFPPPR QTPPPPSVSS RKLAQSAHLH HLHTTAAAAA AREDTERGIT ASMRRRRMRS SLVNDDNEQQ TTVSGLGSGK ADAGGGGGGG GGEEAERVVG AEDNSQGEWG TDDFDTEADD FDTEAEAEAE VSKVEDQVRD KKDSETDRKE QDQLGETEQQ PVREKRGVQG EHVGLARA // ID Q5KM67_CRYNJ Unreviewed; 900 AA. AC Q5KM67; DT 15-FEB-2005, integrated into UniProtKB/TrEMBL. DT 15-FEB-2005, sequence version 1. DT 11-NOV-2015, entry version 48. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AAW41558.1}; GN OrderedLocusNames=CNB02760 {ECO:0000313|EMBL:AAW41558.1}; OS Cryptococcus neoformans var. neoformans serotype D (strain JEC21 / OS ATCC MYA-565) (Filobasidiella neoformans). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. OX NCBI_TaxID=214684 {ECO:0000313|EMBL:AAW41558.1, ECO:0000313|Proteomes:UP000002149}; RN [1] {ECO:0000313|EMBL:AAW41558.1, ECO:0000313|Proteomes:UP000002149} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JEC21 / ATCC MYA-565 {ECO:0000313|Proteomes:UP000002149}; RX PubMed=15653466; DOI=10.1126/science.1103773; RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., Bruno D., RA Vamathevan J., Miranda M., Anderson I.J., Fraser J.A., Allen J.E., RA Bosdet I.E., Brent M.R., Chiu R., Doering T.L., Donlin M.J., RA D'Souza C.A., Fox D.S., Grinberg V., Fu J., Fukushima M., Haas B.J., RA Huang J.C., Janbon G., Jones S.J.M., Koo H.L., Krzywinski M.I., RA Kwon-Chung K.J., Lengeler K.B., Maiti R., Marra M.A., Marra R.E., RA Mathewson C.A., Mitchell T.G., Pertea M., Riggs F.R., Salzberg S.L., RA Schein J.E., Shvartsbeyn A., Shin H., Shumway M., Specht C.A., RA Suh B.B., Tenney A., Utterback T.R., Wickes B.L., Wortman J.R., RA Wye N.H., Kronstad J.W., Lodge J.K., Heitman J., Davis R.W., RA Fraser C.M., Hyman R.W.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307:1321-1324(2005). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE017342; AAW41558.1; -; Genomic_DNA. DR RefSeq; XP_568865.1; XM_568865.1. DR UniGene; Fne.3118; -. DR ProteinModelPortal; Q5KM67; -. DR STRING; 214684.XP_568865.1; -. DR PaxDb; Q5KM67; -. DR EnsemblFungi; AAW41558; AAW41558; CNB02760. DR GeneID; 3255584; -. DR KEGG; cne:CNB02760; -. DR EuPathDB; FungiDB:CNB02760; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; Q5KM67; -. DR KO; K19347; -. DR OMA; YRQEEEY; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000002149; Chromosome 2. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 3. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002149}; KW Reference proteome {ECO:0000313|Proteomes:UP000002149}. FT COILED 460 498 {ECO:0000256|SAM:Coils}. FT COILED 522 549 {ECO:0000256|SAM:Coils}. FT COILED 642 662 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 900 AA; 99077 MW; F924A0F9C4E7C14C CRC64; MPPRTVAPRP SPARSTRSAR SLTREQAREE DDWEAESMTS GSFKVPRSRS KKGGNAIGLK DTSVNIAAAF HAAQTGHLPP PSHPNNSVSS NSSSSRSLQV PRAISPAEQL AQSARALSPV RFFLRPTEED GDDYTSFSSV GIENAGELNT SGEGESYDYR QEEEYVRQAQ QQKMAKAKAR AEASASLKNR RVKALDEDMP YRPAEEDTVS LASSDSGGGE EGVVRNGALQ GRAGTRGKRL ERGEGYLGMG LGIQPRRRRK SRKNGMDGDE SEEEGTPGTG RAWTPAVEVD GHKRSPTPLQ LLRGRSPMMD RKSPVPLGAY QQRRRPSDIR TIVTNVLHGV VIGLQFVVEL GTTVLYRIIV RPLEKAFGSS KGFVRRAKAD WWKWLGILLG ISLALRFLDN AFRTKGIYTA PDAPPSTIDE MSIRLTSLEH ATATLSDLLR AISEGDNELH QSAIAMRSKI DEMEDAVSAE RKRVEGVRGE LKNEKVIMQS EIDKLRSEIH ILSSQIGKQE NSISSDRSAK SLQAVEREIT QLKSRMGQVE QNVHAALEDG RLVAAVERIL PQWMPIRTDS QGDFVVEPAF WTEMKKVMVG KGEVEQIVRR LIGEAGVSDN KIKESPVDEH KVVEWMENAF DRHVQGGVWV TREEFTSTLK EKLQELARET AEKPISKRPA APSMVTIKSS KGEDLTSLFN SLIDTALLRY SKDTIARADY ALFTAGARVI PHLTSDTFTL QKASAFGKLL WASKDVQGRP PATALHPDTS VGSCWPIKGS EGSLGVMLVD RVVVSDVTIE HAPRELALDI ATAPKVVKVL GLVDYAEGLE KLAEYRATHQ ADLNNEEDTN YLPLGTFTYD PSSYSHIQTF PVSSDIVDLG IRIGVVVFKI ESNWGGDLTC LYRVRVHGNA // ID Q5NBL8_ORYSJ Unreviewed; 455 AA. AC Q5NBL8; DT 01-FEB-2005, integrated into UniProtKB/TrEMBL. DT 01-FEB-2005, sequence version 1. DT 11-NOV-2015, entry version 71. DE SubName: Full=Os01g0267600 protein {ECO:0000313|EMBL:BAF04600.1}; DE SubName: Full=Unc-84 homolog B-like {ECO:0000313|EMBL:BAD81138.1}; DE SubName: Full=Uncharacterized protein; GN Name=P0011D01.17 {ECO:0000313|EMBL:BAD81138.1}; GN OrderedLocusNames=Os01g0267600 {ECO:0000313|EMBL:BAF04600.1}; GN ORFNames=OsJ_01226; OS Oryza sativa subsp. japonica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39947 {ECO:0000313|Proteomes:UP000000763}; RN [1] {ECO:0000313|EMBL:BAD81138.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12447438; DOI=10.1038/nature01184; RA Sasaki T., Matsumoto T., Yamamoto K., Sakata K., Baba T., Katayose Y., RA Wu J., Niimura Y., Cheng Z., Nagamura Y., Antonio B.A., Kanamori H., RA Hosokawa S., Masukawa M., Arikawa K., Chiden Y., Hayashi M., RA Okamoto M., Ando T., Aoki H., Arita K., Hamada M., Harada C., RA Hijishita S., Honda M., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., RA Ikeno M., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K., RA Karasawa W., Katagiri S., Kikuta A., Kobayashi N., Kono I., RA Machita K., Maehara T., Mizuno H., Mizubayashi T., Mukai Y., RA Nagasaki H., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M., RA Namiki N., Negishi M., Ohta I., Ono N., Saji S., Sakai K., Shibata M., RA Shimokawa T., Shomura A., Song J., Takazaki Y., Terasawa K., Tsuji K., RA Waki K., Yamagata H., Yamane H., Yoshiki S., Yoshihara R., Yukawa K., RA Zhong H., Iwama H., Endo T., Ito H., Hahn J.H., Kim H.-I., Eun M.-Y., RA Yano M., Jiang J., Gojobori T.; RT "The genome sequence and structure of rice chromosome 1."; RL Nature 420:312-316(2002). RN [2] {ECO:0000313|Proteomes:UP000000763} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=16100779; DOI=10.1038/nature03895; RG International rice genome sequencing project (IRGSP); RT "The map-based sequence of the rice genome."; RL Nature 436:793-800(2005). RN [3] {ECO:0000313|EMBL:BAF04600.1} RP NUCLEOTIDE SEQUENCE. RG International Rice Genome Sequencing Project; RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N., RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y., RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N., RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M., RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M., RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K., RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., RA Kobayashi N., Machita K., Maehara T., Masukawa M., Mizubayashi T., RA Mukai Y., Nagasaki H., Nagata Y., Naito S., Nakashima M., Nakama Y., RA Nakamichi Y., Nakamura M., Meguro A., Negishi M., Ohta I., Ohta T., RA Okamoto M., Ono N., Saji S., Sakaguchi M., Sakai K., Shibata M., RA Shimokawa T., Song J., Takazaki Y., Terasawa K., Tsugane M., Tsuji K., RA Ueda S., Waki K., Yamagata H., Yamamoto M., Yamamoto S., Yamane H., RA Yoshiki S., Yoshihara R., Yukawa K., Zhong H., Yano M., Yuan Q., RA Ouyang S., Liu J., Jones K.M., Gansberger K., Moffat K., Hill J., RA Bera J., Fadrosh D., Jin S., Johri S., Kim M., Overton L., Reardon M., RA Tsitrin T., Vuong H., Weaver B., Ciecko A., Tallon L., Jackson J., RA Pai G., Aken S.V., Utterback T., Reidmuller S., Feldblyum T., RA Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R., Ying K., RA Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J., Weng Q., RA Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X., RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., RA Samain S., Cattolico L., Pelletier E., Couloux A., Segurens B., RA Wincker P., D'Hont A., Scarpelli C., Weissenbach J., Salanoubat M., RA Quetier F., Yu Y., Kim H.R., Rambo T., Currie J., Collura K., Luo M., RA Yang T., Ammiraju J.S.S., Engler F., Soderlund C., Wing R.A., RA Palmer L.E., de la Bastide M., Spiegel L., Nascimento L., Zutavern T., RA O'Shaughnessy A., Dike S., Dedhia N., Preston R., Balija V., RA McCombie W.R., Chow T., Chen H., Chung M., Chen C., Shaw J., Wu H., RA Hsiao K., Chao Y., Chu M., Cheng C., Hour A., Lee P., Lin S., Lin Y., RA Liou J., Liu S., Hsing Y., Raghuvanshi S., Mohanty A., Bharti A.K., RA Gaur A., Gupta V., Kumar D., Ravi V., Vij S., Kapur A., Khurana P., RA Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K., Singh A., Dalal V., RA Srivastava S., Dixit A., Pal A.K., Ghazi I.A., Yadav M., Pandit A., RA Bhargava A., Sureshbabu K., Batra K., Sharma T.R., Mohapatra T., RA Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S., Keizer G., RA Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K., RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., RA Zimmer P.D., Malone G., Dellagostin O., de Oliveira A.C., Bevan M., RA Bancroft I., Minx P., Cordum H., Wilson R., Cheng Z., Jin W., RA Jiang J., Leong S.A., Iwama H., Gojobori T., Itoh T., Niimura Y., RA Fujii Y., Habara T., Sakai H., Sato Y., Wilson G., Kumar K., RA McCouch S., Juretic N., Hoen D., Wright S., Bruskiewich R., Bureau T., RA Miyao A., Hirochika H., Nishikawa T., Kadowaki K., Sugiura M., RA Burr B., Sasaki T.; RT "The map-based sequence of the rice genome."; RL Nature 436:793-800(2005). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038; RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S., RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., RA Cong L., Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., RA Wang J., Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., RA Wang J., Wang X., Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., RA Zhang Z., Bao J., Han Y., Dong L., Ji J., Chen P., Wu S., Liu J., RA Xiao Y., Bu D., Tan J., Yang L., Ye C., Zhang J., Xu J., Zhou Y., RA Yu Y., Zhang B., Zhuang S., Wei H., Liu B., Lei M., Yu H., Li Y., RA Xu H., Wei S., He X., Fang L., Zhang Z., Zhang Y., Huang X., Su Z., RA Tong W., Li J., Tong Z., Li S., Ye J., Wang L., Fang L., Lei T., RA Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F., Xu H., RA Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q., Li W., RA Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J., Gao L., RA Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M., McDermott J., RA Samudrala R., Wang J., Wong G.K.-S., Yang H.; RT "The genomes of Oryza sativa: a history of duplications."; RL PLoS Biol. 3:266-281(2005). RN [5] {ECO:0000313|EMBL:BAF04600.1} RP NUCLEOTIDE SEQUENCE. RG IRGSP(International Rice Genome Sequencing Project); RT "Oryza sativa nipponbare(GA3) genomic DNA, chromosome 1."; RL Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases. RN [6] {ECO:0000313|EMBL:BAF04600.1} RP NUCLEOTIDE SEQUENCE. RG The Rice Annotation Project (RAP); RT "The Second Rice Annotation Project Meeting (RAP2)."; RL Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases. RN [7] {ECO:0000313|EMBL:BAF04600.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=16381971; DOI=10.1093/nar/gkj094; RA Ohyanagi H., Tanaka T., Sakai H., Shigemoto Y., Yamaguchi K., RA Habara T., Fujii Y., Antonio B.A., Nagamura Y., Imanishi T., Ikeo K., RA Itoh T., Gojobori T., Sasaki T.; RT "The Rice Annotation Project Database (RAP-DB): hub for Oryza sativa RT ssp. japonica genome information."; RL Nucleic Acids Res. 34:D741-D744(2006). RN [8] {ECO:0000313|EMBL:BAF04600.1} RP NUCLEOTIDE SEQUENCE. RG The Rice Annotation Project (RAP); RA Itoh T., Tanaka T., Barrero R.A., Yamasaki C., Fujii Y., Hilton P.B., RA Antonio B.A., Aono H., Apweiler R., Bruskiewich R., Bureau T., RA Burr F., Costa de Oliveira A., Fuks G., Habara T., Haberer G., Han B., RA Harada E., Hiraki A.T., Hirochika H., Hoen D., Hokari H., Hosokawa S., RA Hsing Y., Ikawa H., Ikeo K., Imanishi T., Ito Y., Jaiswal P., RA Kanno M., Kawahara Y., Kawamura T., Kawashima H., Khurana J.P., RA Kikuchi S., Komatsu S., Koyanagi K.O., Kubooka H., Lieberherr D., RA Lin Y.C., Lonsdale D., Matsumoto T., Matsuya A., McCombie W.R., RA Messing J., Miyao A., Mulder N., Nagamura Y., Nam J., Namiki N., RA Numa H., Nurimoto S., O'donovan C., Ohyanagi H., Okido T., Oota S., RA Osato N., Palmer L.E., Quetier F., Raghuvanshi S., Saichi N., RA Sakai H., Sakai Y., Sakata K., Sakurai T., Sato F., Sato Y., RA Schoof H., Seki M., Shibata M., Shimizu Y., Shinozaki K., Shinso Y., RA Singh N.K., Smith-White B., Takeda J., Tanino M., Tatusova T., RA Thongjuea S., Todokoro F., Tsugane M., Tyagi A.K., Vanavichit A., RA Wang A., Wing R.A., Yamaguchi K., Yamamoto M., Yamamoto N., Yu Y., RA Zhang H., Zhao Q., Higo K., Burr B., Gojobori T., Sasaki T.; RT "Curated Genome Annotation of Oryza sativa ssp. japonica and RT Comparative Genome Analysis with Arabidopsis thaliana."; RL Genome Res. 17:175-183(2007). RN [9] {ECO:0000313|Proteomes:UP000000763} RP GENOME REANNOTATION. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=18089549; DOI=10.1093/nar/gkm978; RG The rice annotation project (RAP); RT "The rice annotation project database (RAP-DB): 2008 update."; RL Nucleic Acids Res. 36:D1028-D1033(2008). RN [10] {ECO:0000313|EMBL:BAF04600.1} RP NUCLEOTIDE SEQUENCE. RG The Rice Annotation Project (RAP); RA Tanaka T., Antonio B.A., Kikuchi S., Matsumoto T., Nagamura Y., RA Numa H., Sakai H., Wu J., Itoh T., Sasaki T., Aono R., Fujii Y., RA Habara T., Harada E., Kanno M., Kawahara Y., Kawashima H., Kubooka H., RA Matsuya A., Nakaoka H., Saichi N., Sanbonmatsu R., Sato Y., Shinso Y., RA Suzuki M., Takeda J., Tanino M., Todokoro F., Yamaguchi K., RA Yamamoto N., Yamasaki C., Imanishi T., Okido T., Tada M., Ikeo K., RA Tateno Y., Gojobori T., Lin Y.C., Wei F.J., Hsing Y.I., Zhao Q., RA Han B., Kramer M.R., McCombie R.W., Lonsdale D., O'Donovan C.C., RA Whitfield E.J., Apweiler R., Koyanagi K.O., Khurana J.P., RA Raghuvanshi S., Singh N.K., Tyagi A.K., Haberer G., Fujisawa M., RA Hosokawa S., Ito Y., Ikawa H., Shibata M., Yamamoto M., RA Bruskiewich R.M., Hoen D.R., Bureau TE., Namiki N., Ohyanagi H., RA Sakai Y., Nobushima S., Sakata K., Barrero R.A., Sato Y., Souvorov A., RA Smith-White B., Tatusova T., An S., An G., OOta S., Fuks G., RA Messing J., Christie K.R., Lieberherr D., Kim H., Zuccolo A., RA Wing R.A., Nobuta K., Green P.J., Lu C., Meyers BC., Chaparro C., RA Piegu B., Panaud O., Echeverria M.; RT "The Rice Annotation Project Database (RAP-DB): 2008 update."; RL Nucleic Acids Res. 36:D1028-D1033(2008). RN [11] RP NUCLEOTIDE SEQUENCE. RA Wang J., Li R., Fan W., Huang Q., Zhang J., Zhou Y., Hu Y., Zi S., RA Li J., Ni P., Zheng H., Zhang Y., Zhao M., Hao Q., McDermott J., RA Samudrala R., Kristiansen K., Wong G.K.-S.; RT "Improved gene annotation of the rice (Oryza sativa) genomes."; RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AP000969; BAD81138.1; -; Genomic_DNA. DR EMBL; AP008207; BAF04600.1; -; Genomic_DNA. DR EMBL; CM000138; EAZ11359.1; -; Genomic_DNA. DR RefSeq; NP_001042686.1; NM_001049221.1. DR UniGene; Os.24775; -. DR STRING; 39947.LOC_Os01g16220.1; -. DR EnsemblPlants; OS01T0267600-01; OS01T0267600-01; OS01G0267600. DR GeneID; 4324382; -. DR KEGG; osa:4324382; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR OMA; RVSGWYQ; -. DR Proteomes; UP000000763; Chromosome 1. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000763}; KW Reference proteome {ECO:0000313|Proteomes:UP000000763}. FT COILED 180 214 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 455 AA; 48430 MW; BE515678C09DC07F CRC64; MASPSLAAAA ASPLTSLDLA TSPATASRPA AAAASALRKR PVLLLDQRHH PSTPNLDSSA AAAAAAAAAG VAQAQPPPPR RKKAGHTSSS TRPRWQTALS VAAKNAVLLA VLLYVGDLAW RAARPAPPRP VDQAAMAGYD ARVADVEASL ARAFRMLQVQ LEAVDRKIDG EVGAVRGELA ALLEEKRLEL EGQLKRLDAR ADDLSDALGA LKRMEFLRKD EFDKFWNEVK ESLGSGPGTE VDLDQVRALA REITMGEIEK HAADGIGRVD YAVASAGGKV VRHSDAYDAG KRGGFFSSLL SGDTAASPKK ILQPSFGEPG QCFPLQGSSG FVEIKLRKGI VPDAITLEHV SKDVAYDMST APKDCRVSGW YQEAHNEAYS GHAASAKMYV LTEFTYDLDK KNVQTFDITA PDVGIINMVR LDFTSNHGSS ALTCIYRIRV HGHEPVSPGM SVSQS // ID Q5R451_PONAB Unreviewed; 822 AA. AC Q5R451; DT 21-DEC-2004, integrated into UniProtKB/TrEMBL. DT 21-DEC-2004, sequence version 1. DT 11-NOV-2015, entry version 40. DE SubName: Full=Putative uncharacterized protein DKFZp459O152 {ECO:0000313|EMBL:CAH93465.1}; GN Name=DKFZp459O152 {ECO:0000313|EMBL:CAH93465.1}; OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pongo. OX NCBI_TaxID=9601 {ECO:0000313|EMBL:CAH93465.1}; RN [1] {ECO:0000313|EMBL:CAH93465.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Cortex {ECO:0000313|EMBL:CAH93465.1}; RG The German cDNA Consortium; RA Wambutt R., Heubner D., Mewes H.W., Weil B., Amid C., Osanger A., RA Fobo G., Han M., Wiemann S.; RL Submitted (NOV-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR861408; CAH93465.1; -; mRNA. DR ProteinModelPortal; Q5R451; -. DR STRING; 9601.ENSPPYP00000019378; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG104132; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 296 317 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 329 347 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 472 499 {ECO:0000256|SAM:Coils}. FT COILED 512 532 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 822 AA; 91149 MW; A00E5FE35A024649 CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADSGTSSA VSLKNRAART AKQHRSANKS AFSINHVSRQ VTSSGVSHGG TVSLQDAVTR RPPVLDESWI CEQTTVDHFW GLDDDGDLKG GNKAAIQGNG DVGAAAATAH NGFSCSNCSM LSERKDVLTA HPVVPGPVLR VYSRDRNQKC DDCKGKRHLD AHTAAHSQSP RPPGRAGTLR HIWACAGYFL LQILRRIGAA GRAVSRTVWS ALWLAVAAPG KAASGVFWWL GIGWYQFVTL ISWLNVFLLT RCLRNICKFL VLLIPLFLLL AGLSLWGQGD FFSFLPVLNW ASMHRTQRVD DPQDVFKPTT SRLNQPLQGD SEAFPWRWMS GVEQQVASLS GQCHHHGEDL RELTTLLQKL QARVDRMDGG AAGPSASVRD TVGQPLRETD FMAFHQEHEV RVSHLEDILG KLREKSEAIQ KELEQTKQKT ISAVGEQLLP TVEHLQLELD QLKSELSSWR HVKTGCETVD AVRERVDVQV REMVKLLFSE DEEGGSLEQL LQRFSSQFVS KGDLHTMLRD LELQILRNVT HHVSVTKQLP TSEAVVSAVS EAGASGITEA QARAIVNNAL KLYSQDKTGM VDFALESGGG SILSTRCSET YETKTALMSL FGIPLWYFSQ SPRVVIQPDI YPGNCWAFKG SQGYLVVRLS MMIHPAAFTL EHIPKTLSPT GNISSAPKDF AVYGLENEYQ EEGQLLGQFT YDQDGESLQM FQALKRPDDT AFQIVELRIF SNWGHPEYTC LYRFRVHGEP VK // ID Q5R990_PONAB Unreviewed; 822 AA. AC Q5R990; DT 21-DEC-2004, integrated into UniProtKB/TrEMBL. DT 21-DEC-2004, sequence version 1. DT 11-NOV-2015, entry version 46. DE SubName: Full=Putative uncharacterized protein DKFZp459P0725 {ECO:0000313|EMBL:CAH91670.1}; GN Name=DKFZp459P0725 {ECO:0000313|EMBL:CAH91670.1}; OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pongo. OX NCBI_TaxID=9601 {ECO:0000313|EMBL:CAH91670.1}; RN [1] {ECO:0000313|EMBL:CAH91670.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Cortex {ECO:0000313|EMBL:CAH91670.1}; RG The German cDNA Consortium; RA Koehrer K., Beyer A., Mewes H.W., Weil B., Amid C., Osanger A., RA Fobo G., Han M., Wiemann S.; RL Submitted (NOV-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR859501; CAH91670.1; -; mRNA. DR RefSeq; NP_001128825.1; NM_001135353.1. DR ProteinModelPortal; Q5R990; -. DR STRING; 9601.ENSPPYP00000019378; -. DR GeneID; 100189739; -. DR KEGG; pon:100189739; -. DR CTD; 23353; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOVERGEN; HBG104132; -. DR KO; K19347; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 296 317 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 329 347 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 465 499 {ECO:0000256|SAM:Coils}. FT COILED 512 532 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 822 AA; 91045 MW; AED3A84EA422F6FC CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADSGTSSA VSLKNRAART AKQHRSANKS AFSINHVSRQ VTSSGVSHGG TVSLQDAVTR RPPVLDESWI CEQTTVDHFW GLDDDGDLKG GNKAAIQGNG DVGAAAATAH NGFSCSNCSM LSERKDVLTA HPVVPGPVLR VYSRDRNQKC DDCKGKRHLD AHTAAHSQSP RPPGRAGTLR HIWACAGYFL LQILRGIGAA GRAVSRTVWL ALWLAVAAPG KAASGVFWWL GIGWYQFVTL ISWLNVFLLT RCLRNICKFL VLLIPLFLLL AGLSLWGQGD FFSFLPVLNW ASMHRTQRVD DPQDVFKPTT SRLNQPLQGD SEAFPWRWMS GVEQQVASLS GQCHHHGEDL RELTALLQKL QARVDRMDGG AAGPSASVRD TVGQPLRETD FMAFHQEHEV RISHLEDILG KLREKSEAIQ KELEQTKQKT ISAVGEQLLP TVEHLQLELD QLKSELSSWR HVKTGCETVD AVRERVDVQV REMVKLLFSE DEEGGSLEQL LQRFSSQFVS KGDLHTMLRD LELQILRNVT HHVSVTKQLP TSEAVVSAVS EAGASGITEA QARAIVNNAL KLYSQDKTGM VDFALESGGG SILSTRCSET YETKTALMSL FGIPLWYFSQ SPRVVIQPDI YPGNCWAFKG SQGYLVVRLS MMIHPAAFTL EHIPKTLSPT GNISSAPKDF AVYGLENEYL EEGQLLGQFT YDQDGESLQM FQALKRPDDT AFQIVELRIF SNWGHPEYTC LYRFRVHGEP VK // ID Q5S4N2_HUMAN Unreviewed; 154 AA. AC Q5S4N2; DT 21-DEC-2004, integrated into UniProtKB/TrEMBL. DT 21-DEC-2004, sequence version 1. DT 11-NOV-2015, entry version 48. DE SubName: Full=Putative uncharacterized protein {ECO:0000313|EMBL:AAV52793.1}; DE Flags: Fragment; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|EMBL:AAV52793.1}; RN [1] {ECO:0000313|EMBL:AAV52793.1} RP NUCLEOTIDE SEQUENCE. RA Shi Z., Wang H., Feng E., Su G., Huang L.; RT "New sequences related to infection with Shigella flexneri 2a."; RL Submitted (OCT-2004) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AY776160; AAV52793.1; -; mRNA. DR UniGene; Hs.438072; -. DR STRING; 9606.ENSP00000384015; -. DR PaxDb; Q5S4N2; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG102068; -. DR Bgee; Q5S4N2; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; FT NON_TER 1 1 {ECO:0000313|EMBL:AAV52793.1}. SQ SEQUENCE 154 AA; 17679 MW; 6191F214AD004255 CRC64; ETYETKTALM SLFGIPLWYF SQSPRVVIQP DIYPGNCWAF KGSQGYLVVR LSMMIHPAAF TLEHIPKTLS PTGNISSAPK DFAVYGLENE YQEEGQLLGQ FTYDQDGESL QMFQALKRPD DTAFQIVELR IFSNWGHPEY TCLYRFRVHG EPVK // ID Q5TYS7_DANRE Unreviewed; 198 AA. AC Q5TYS7; DT 07-DEC-2004, integrated into UniProtKB/TrEMBL. DT 26-JUN-2013, sequence version 2. DT 11-NOV-2015, entry version 58. DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSDARP00000113239}; DE Flags: Fragment; GN Name=suco {ECO:0000313|Ensembl:ENSDARP00000113239, GN ECO:0000313|ZFIN:ZDB-GENE-030131-2941}; GN Synonyms=si:ch211-184m19.1 {ECO:0000313|ZFIN:ZDB-GENE-030131-2941}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000113239, ECO:0000313|Proteomes:UP000000437}; RN [1] {ECO:0000313|Ensembl:ENSDARP00000113239, ECO:0000313|Proteomes:UP000000437} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000113239, RC ECO:0000313|Proteomes:UP000000437}; RX PubMed=23594743; DOI=10.1038/nature12111; RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., RA Muffato M., Collins J.E., Humphray S., McLaren K., Matthews L., RA McLaren S., Sealy I., Caccamo M., Churcher C., Scott C., Barrett J.C., RA Koch R., Rauch G.J., White S., Chow W., Kilian B., Quintais L.T., RA Guerra-Assuncao J.A., Zhou Y., Gu Y., Yen J., Vogel J.H., Eyre T., RA Redmond S., Banerjee R., Chi J., Fu B., Langley E., Maguire S.F., RA Laird G.K., Lloyd D., Kenyon E., Donaldson S., Sehra H., RA Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M., RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J., RA Clee C., Oliver K., Clark R., Riddle C., Eliott D., Threadgold G., RA Harden G., Ware D., Mortimer B., Kerry G., Heath P., Phillimore B., RA Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S., Pelan S., RA Griffiths G., Smith M., Glithero R., Howden P., Barker N., Stevens C., RA Harley J., Holt K., Panagiotidis G., Lovell J., Beasley H., RA Henderson C., Gordon D., Auger K., Wright D., Collins J., Raisen C., RA Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D., RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S., RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E., RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., RA Babbage A., Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., RA Wray P., Ellington A., Matthews N., Ellwood M., Woodmansey R., RA Clark G., Cooper J., Tromans A., Grafham D., Skuce C., Pandian R., RA Andrews R., Harrison E., Kimberley A., Garnett J., Fosker N., Hall R., RA Garner P., Kelly D., Bird C., Palmer S., Gehring I., Berger A., RA Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M., RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M., RA Rudolph-Geiger S., Teucke M., Osoegawa K., Zhu B., Rapp A., Widaa S., RA Langford C., Yang F., Carter N.P., Harrow J., Ning Z., Herrero J., RA Searle S.M., Enright A., Geisler R., Plasterk R.H., Lee C., RA Westerfield M., de Jong P.J., Zon L.I., Postlethwait J.H., RA Nusslein-Volhard C., Hubbard T.J., Roest Crollius H., Rogers J., RA Stemple D.L.; RT "The zebrafish reference genome sequence and its relationship to the RT human genome."; RL Nature 496:498-503(2013). RN [2] {ECO:0000313|Ensembl:ENSDARP00000113239} RP IDENTIFICATION. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000113239}; RG Ensembl; RL Submitted (MAY-2013) to UniProtKB. CC -!- CAUTION: The sequence shown here is derived from an Ensembl CC automatic analysis pipeline and should be considered as CC preliminary data. {ECO:0000313|Ensembl:ENSDARP00000113239}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BX470128; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BX901920; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR STRING; 7955.ENSDARP00000015683; -. DR PaxDb; Q5TYS7; -. DR Ensembl; ENSDART00000137605; ENSDARP00000113239; ENSDARG00000016532. DR ZFIN; ZDB-GENE-030131-2941; suco. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR GeneTree; ENSGT00390000013502; -. DR Proteomes; UP000000437; Chromosome 20. DR Bgee; Q5TYS7; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000437}; KW Reference proteome {ECO:0000313|Proteomes:UP000000437}. FT NON_TER 1 1 {ECO:0000313|Ensembl:ENSDARP00000113239}. SQ SEQUENCE 198 AA; 21837 MW; 8BF04115E0007971 CRC64; XRDERTVQSF PLDEQLYAKY VKVELLSHFG SEHFCPLSLI RVFGTSMVEE YDEIADSQYT SERAEYLDED YDYPPGYLPS EDKASKNLLG SATNAILNMV NNIAANVLGG KPELEDGAEL EGNVSSGTEN VTQASTETTL TPDPTPTEQP HTLDVLELDP TFVKEDMEAP IPEAPSQAPT VAPAEESRIV ILIEEDEE // ID Q5U2W0_RAT Unreviewed; 757 AA. AC Q5U2W0; DT 07-DEC-2004, integrated into UniProtKB/TrEMBL. DT 07-DEC-2004, sequence version 1. DT 11-NOV-2015, entry version 83. DE SubName: Full=Protein Sun1 {ECO:0000313|Ensembl:ENSRNOP00000045984}; DE SubName: Full=Unc-84 homolog A (C. elegans) {ECO:0000313|EMBL:AAH85844.1}; GN Name=Sun1 {ECO:0000313|Ensembl:ENSRNOP00000045984, GN ECO:0000313|RGD:1359142}; GN Synonyms=Unc84a {ECO:0000313|EMBL:AAH85844.1, GN ECO:0000313|RGD:1359142}; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116 {ECO:0000313|EMBL:AAH85844.1}; RN [1] {ECO:0000313|EMBL:AAH85844.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Testis {ECO:0000313|EMBL:AAH85844.1}; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RA Gerhard D.S., Wagner L., Feingold E.A., Shenmen C.M., Grouse L.H., RA Schuler G., Klein S.L., Old S., Rasooly R., Good P., Guyer M., RA Peck A.M., Derge J.G., Lipman D., Collins F.S., Jang W., Sherry S., RA Feolo M., Misquitta L., Lee E., Rotmistrovsky K., Greenhut S.F., RA Schaefer C.F., Buetow K., Bonner T.I., Haussler D., Kent J., RA Kiekhaus M., Furey T., Brent M., Prange C., Schreiber K., Shapiro N., RA Bhat N.K., Hopkins R.F., Hsie F., Driscoll T., Soares M.B., RA Casavant T.L., Scheetz T.E., Brown-stein M.J., Usdin T.B., RA Toshiyuki S., Carninci P., Piao Y., Dudekula D.B., Ko M.S., RA Kawakami K., Suzuki Y., Sugano S., Gruber C.E., Smith M.R., RA Simmons B., Moore T., Waterman R., Johnson S.L., Ruan Y., Wei C.L., RA Mathavan S., Gunaratne P.H., Wu J., Garcia A.M., Hulyk S.W., Fuh E., RA Yuan Y., Sneed A., Kowis C., Hodgson A., Muzny D.M., McPherson J., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madari A., Young A.C., Wetherby K.D., RA Granite S.J., Kwong P.N., Brinkley C.P., Pearson R.L., Bouffard G.G., RA Blakesly R.W., Green E.D., Dickson M.C., Rodriguez A.C., Grimwood J., RA Schmutz J., Myers R.M., Butterfield Y.S., Griffith M., Griffith O.L., RA Krzywinski M.I., Liao N., Morin R., Morrin R., Palmquist D., RA Petrescu A.S., Skalska U., Smailus D.E., Stott J.M., Schnerch A., RA Schein J.E., Jones S.J., Holt R.A., Baross A., Marra M.A., Clifton S., RA Makowski K.A., Bosak S., Malek J.; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [2] {ECO:0000313|Ensembl:ENSRNOP00000045984, ECO:0000313|Proteomes:UP000002494} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000045984, RC ECO:0000313|Proteomes:UP000002494}; RX PubMed=15057822; DOI=10.1038/nature02426; RG Rat Genome Sequencing Project Consortium; RA Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J., RA Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G., RA Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G., RA Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G., RA Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S., RA Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T., RA Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., RA Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A., RA Doucette-Stamm L., Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., RA Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., RA Zhu B., Marra M., Schein J., Bosdet I., Fjell C., Jones S., RA Krzywinski M., Mathewson C., Siddiqui A., Wye N., McPherson J., RA Zhao S., Fraser C.M., Shetty J., Shatsman S., Geer K., Chen Y., RA Abramzon S., Nierman W.C., Havlak P.H., Chen R., Durbin K.J., Egan A., RA Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J., RA D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R., RA Kalafus K.J., McLeod M.P., Milosavljevic A., Virk D., Volkov A., RA Wheeler D.A., Zhang Z., Bailey J.A., Eichler E.E., Tuzun E., RA Birney E., Mongin E., Ureta-Vidal A., Woodwark C., Zdobnov E., RA Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J., RA Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., RA Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., RA Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., RA Poliakov A., Huebner N., Ganten D., Goesele C., Hummel O., RA Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H., RA Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S., RA Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J., RA Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M., RA Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., RA Webber C., Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., RA Elnitski L., Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., RA Miller W., Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., RA Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., RA Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper G.M., RA Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A., RA Bourque G., Lopez-Otin C., Puente X.S., Chakrabarti K., Chatterji S., RA Dewey C., Pachter L., Bray N., Yap V.B., Caspi A., Tesler G., RA Pevzner P.A., Haussler D., Roskin K.M., Baertsch R., Clawson H., RA Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R., RA Trumbower H., Weirauch M., Cooper D.N., Stenson P.D., Ma B., Brent M., RA Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H., RA Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S., RA Collins F.S.; RT "Genome sequence of the Brown Norway rat yields insights into RT mammalian evolution."; RL Nature 428:493-521(2004). RN [3] {ECO:0000313|Ensembl:ENSRNOP00000045984} RP IDENTIFICATION. RC STRAIN=Brown Norway {ECO:0000313|Ensembl:ENSRNOP00000045984}; RG Ensembl; RL Submitted (JUL-2011) to UniProtKB. RN [4] {ECO:0000213|PubMed:22673903} RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=22673903; RA Lundby A., Secher A., Lage K., Nordsborg N.B., Dmytriyev A., RA Lundby C., Olsen J.V.; RT "Quantitative maps of protein phosphorylation sites across 14 RT different rat organs and tissues."; RL Nat. Commun. 3:876-876(2012). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC127903; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BC085844; AAH85844.1; -; mRNA. DR RefSeq; NP_001007148.1; NM_001007147.1. DR UniGene; Rn.100642; -. DR STRING; 10116.ENSRNOP00000045984; -. DR Ensembl; ENSRNOT00000047287; ENSRNOP00000045984; ENSRNOG00000001299. DR GeneID; 360773; -. DR KEGG; rno:360773; -. DR UCSC; RGD:1359142; rat. DR CTD; 23353; -. DR RGD; 1359142; Sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG104132; -. DR KO; K19347; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR Reactome; R-RNO-1221632; Meiotic synapsis. DR NextBio; 674047; -. DR Proteomes; UP000002494; Chromosome 12. DR GO; GO:0034993; C:LINC complex; IBA:GO_Central. DR GO; GO:0005634; C:nucleus; IDA:RGD. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0001503; P:ossification; IEP:RGD. DR GO; GO:0009612; P:response to mechanical stimulus; IEP:RGD. DR InterPro; IPR012919; SUN_dom. DR InterPro; IPR015880; Znf_C2H2-like. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00355; ZnF_C2H2; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000002494}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:Q5U2W0}; KW Reference proteome {ECO:0000313|Proteomes:UP000002494}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 228 248 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 260 279 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 345 365 {ECO:0000256|SAM:Coils}. FT COILED 407 441 {ECO:0000256|SAM:Coils}. FT COILED 455 475 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 757 AA; 84789 MW; 7BF2DCC436656239 CRC64; MDFSRLHTYT PPQCVPENTG YTYALSSSYS SDALDFETEH RLEPVFDSPR MSRRSLRLIT TTAAYSSGDG QTVDTHISTS RATPAKGRET RTVKQRSASK PAFSINHLSG KGLSSSTSHD SSCSLRSATV LRHPVLDESL IREQTKVDHF WGLDDDGDLK GGNKAATQGN GELAAEVASS NGYTCRDCRM LSARTDALTA HSAVHGPTSR VYSRDRTLRP RKAASGTFWW LGSGWYQFVT LISWLNVFLL TRCLRNICKV FVLLLPLLLL LGAGFSLWGQ GNFFSLLPML NWTAMQPAQR VDNPKDMHRP GPLSPSPPLK VDPMASQWPQ ESDMGQKIAS LSAQCHNHDE RLAELTVLLQ KLQIRVDQVD DGREGLSLWV KDMVGQHLQE IGSIEPPDAK TDFLTLHHDH EVRLSSLEDV LRKLTEKSEA IQKELEETKL RAGSRDEEQP LLDRVQHLEL ELNLLKSQLS DWQHLRSSCE QADARIQETV QLMFSEDQPG GSLEWLLQKL SSRFVSKDEL QVLLHDLELK LLQNITHHIT VTGQAPTSEA IVSAMSQAGI SGITEAQAHI IVNNALRLYS QDKTGMVDFA LESGGGSILS TRCSETYETK TALLSLFGVP LWYFSQSPRV VIQPDIYPGN CWAFKGSQGY LVVRLSMKIY PTTFTMEHIP KTLSPTGNIS SAPRDFAVYG LETEYQEEGQ PLGRFTYDQE GDSLQMFHTL ERPDQGFQIV ELRVLSNWGH PEYTCLYRFR VHGEPIQ // ID Q6A022_MOUSE Unreviewed; 694 AA. AC Q6A022; DT 13-SEP-2004, integrated into UniProtKB/TrEMBL. DT 13-SEP-2004, sequence version 1. DT 11-NOV-2015, entry version 58. DE SubName: Full=MKIAA0668 protein {ECO:0000313|EMBL:BAD32274.1}; DE Flags: Fragment; GN Name=mKIAA0668 {ECO:0000313|EMBL:BAD32274.1}; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090 {ECO:0000313|EMBL:BAD32274.1}; RN [1] {ECO:0000313|EMBL:BAD32274.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Fetal brain {ECO:0000313|EMBL:BAD32274.1}; RX PubMed=15368895; DOI=10.1093/dnares/11.3.205; RA Okazaki N., Kikuno R., Ohara R., Inamoto S., Koseki H., Hiraoka S., RA Saga Y., Seino S., Nishimura M., Kaisho T., Hoshino K., Kitamura H., RA Nagase T., Ohara O., Koga H.; RT "Prediction of the coding sequences of mouse homologues of KIAA gene: RT IV. The complete nucleotide sequences of 500 mouse KIAA-homologous RT cDNAs identified by screening of terminal sequences of cDNA clones RT randomly sampled from size-fractionated libraries."; RL DNA Res. 11:205-218(2004). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK172996; BAD32274.1; -; Transcribed_RNA. DR UniGene; Mm.202715; -. DR ProteinModelPortal; Q6A022; -. DR PRIDE; Q6A022; -. DR HOVERGEN; HBG056957; -. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 194 212 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 246 267 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 455 482 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:BAD32274.1}. SQ SEQUENCE 694 AA; 77841 MW; BD3B958BD03F3646 CRC64; TREEPRKSFF RRVSLPCPIM SRRSQRLTRY SQDDNDGGSS SSGASSVAGS QGTVFKDSPL RTLKRKSSNM KHLSPAPQLG PSSDSHTSYY SESVVRESYI GSPRAVSLAR SALLDDHLHS EPYWSGDLRG RRRRGTGGSE SSKANGLTAE SKASEDFFGS SSGYSSEDDL AGYTDSDQHS SGSRLRSAAS RAGSFVWTLV TFPGRLFGLL YWWIGTTWYR LTTAASLLDV FVLTRSRHFS LNLKSFLWFL LLLLLLTGLT YGAWHFYPLG LQTLQPAVVS WWAAKESRKQ PEVWESRDAS QHFQAEQRVL SRVHSLERRL EALAADFSSN WQKEAIRLER LELRQGAAGH GGGSSLSHED ALSLLEGLVS RREATLKEDL RRDTVAHIQE ELATLRAEHH QDSEDLFKKI VQASQESEAR VQQLKTEWKV ESQFPDWIRQ FLLGDRGARS GLLQRDEMHA QLQELENKIL TKMAEMQGKS AREAAASLGQ ILQKEGIVGV TEEQVHRIVK QALQRYSEDR IGMVDYALES GGASVISTRC SETYETKTAL LSLFGIPLWY HSQSPRVILQ PDVHPGNCWA FQGPQGFAVV RLSARIRPTA VTLEHVPKAL SPNSTISSAP KDFAIFGFDE DLQQEGTLLG TFAYDQDGEP IQTFYFQASK MATYQVVELR ILTNWGHPEY TCIYRFRVHG EPAH // ID Q6BLU5_DEBHA Unreviewed; 667 AA. AC Q6BLU5; DT 16-AUG-2004, integrated into UniProtKB/TrEMBL. DT 16-AUG-2004, sequence version 1. DT 14-OCT-2015, entry version 55. DE SubName: Full=DEHA2F10648p {ECO:0000313|EMBL:CAG89169.1}; GN OrderedLocusNames=DEHA2F10648g {ECO:0000313|EMBL:CAG89169.1}; OS Debaryomyces hansenii (strain ATCC 36239 / CBS 767 / JCM 1990 / NBRC OS 0083 / IGC 2968) (Yeast) (Torulaspora hansenii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; Debaryomyces. OX NCBI_TaxID=284592 {ECO:0000313|EMBL:CAG89169.1, ECO:0000313|Proteomes:UP000000599}; RN [1] {ECO:0000313|EMBL:CAG89169.1, ECO:0000313|Proteomes:UP000000599} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968 RC {ECO:0000313|Proteomes:UP000000599}; RX PubMed=15229592; DOI=10.1038/nature02579; RG Genolevures; RA Dujon B., Sherman D., Fischer G., Durrens P., Casaregola S., RA Lafontaine I., de Montigny J., Marck C., Neuveglise C., Talla E., RA Goffard N., Frangeul L., Aigle M., Anthouard V., Babour A., Barbe V., RA Barnay S., Blanchin S., Beckerich J.-M., Beyne E., Bleykasten C., RA Boisrame A., Boyer J., Cattolico L., Confanioleri F., de Daruvar A., RA Despons L., Fabre E., Fairhead C., Ferry-Dumazet H., Groppi A., RA Hantraye F., Hennequin C., Jauniaux N., Joyet P., Kachouri R., RA Kerrest A., Koszul R., Lemaire M., Lesur I., Ma L., Muller H., RA Nicaud J.-M., Nikolski M., Oztas S., Ozier-Kalogeropoulos O., RA Pellenz S., Potier S., Richard G.-F., Straub M.-L., Suleau A., RA Swennen D., Tekaia F., Wesolowski-Louvel M., Westhof E., Wirth B., RA Zeniou-Meyer M., Zivanovic Y., Bolotin-Fukuhara M., Thierry A., RA Bouchier C., Caudron B., Scarpelli C., Gaillardin C., Weissenbach J., RA Wincker P., Souciet J.-L.; RT "Genome evolution in yeasts."; RL Nature 430:35-44(2004). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR382138; CAG89169.1; -; Genomic_DNA. DR RefSeq; XP_460826.1; XM_460826.1. DR ProteinModelPortal; Q6BLU5; -. DR EnsemblFungi; CAG89169; CAG89169; DEHA2F10648g. DR GeneID; 2904340; -. DR KEGG; dha:DEHA2F10648g; -. DR InParanoid; Q6BLU5; -. DR OMA; EHESSSF; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000599; Chromosome F. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000599}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000599}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 24 {ECO:0000256|SAM:SignalP}. FT CHAIN 25 667 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004271388. FT TRANSMEM 596 613 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 667 AA; 75272 MW; 48C38130BF9C170F CRC64; MIINDNRLTF LAIHLFITFQ ITRAELLQTS QSSCLTKSVC NVDSCLLDAG NIFKPQTSLS KIDSLNYTTP SAPKSTQIST STPPSTSIST SISTVLSTLT SIISTTSAEV ISSLPENSVV SSSEDLFTEI SKSDSGIITD DLNNAHFEVT SDVNSTNSTN TTEPFIPVQN YTNKNIQLNQ SNETIDECHF LSFEEWKRQK AVDNKQINDS QPQAETIANE LVPTTSGIDN STQIPVSLDE DQGKIYKDRF NYASVGCAAT IVKTNSHAKG ASAILVENKD SYLLNQCSSS QKFVVIELCQ DILVDTVVIG NFEFFSSNFR KIRISVSDRF PVGSSGMKVL GEFEAENIRD VQSFNIENPL IWARYLKLEI LSHYGDEFYC PISLIRVYGK TMMEEFKMAE GHESFIGGEP EIKNEELVIN NSMKDISNFT GINIQNEECR VALPHLGLTE FLKDINSTAS DYCDAMYPLI NEPETTQTIE TKTTQESIYK NIMKRLSLLE SNASLSLLYI EEQSKLLSQA FTNLEKRQSS NFESLINSFN DTMHNQISYF KNAYFSIQVE ASKLFKAQEN NHQSLLEESH HKMTILGNQL KFQKRLSILN TMIIICILSY VVLTRDVYIE DHMYDHFQQP LRTSQNTSGY SNLLDKYKRK NSKRNNRSKR RKPKSTN // ID Q6C4Y0_YARLI Unreviewed; 724 AA. AC Q6C4Y0; DT 16-AUG-2004, integrated into UniProtKB/TrEMBL. DT 16-AUG-2004, sequence version 1. DT 11-NOV-2015, entry version 49. DE SubName: Full=YALI0E22825p {ECO:0000313|EMBL:CAG79879.1}; GN ORFNames=YALI0_E22825g {ECO:0000313|EMBL:CAG79879.1}; OS Yarrowia lipolytica (strain CLIB 122 / E 150) (Yeast) (Candida OS lipolytica). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia. OX NCBI_TaxID=284591 {ECO:0000313|EMBL:CAG79879.1, ECO:0000313|Proteomes:UP000001300}; RN [1] {ECO:0000313|EMBL:CAG79879.1, ECO:0000313|Proteomes:UP000001300} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CLIB 122 / E 150 {ECO:0000313|Proteomes:UP000001300}; RX PubMed=15229592; DOI=10.1038/nature02579; RG Genolevures; RA Dujon B., Sherman D., Fischer G., Durrens P., Casaregola S., RA Lafontaine I., de Montigny J., Marck C., Neuveglise C., Talla E., RA Goffard N., Frangeul L., Aigle M., Anthouard V., Babour A., Barbe V., RA Barnay S., Blanchin S., Beckerich J.-M., Beyne E., Bleykasten C., RA Boisrame A., Boyer J., Cattolico L., Confanioleri F., de Daruvar A., RA Despons L., Fabre E., Fairhead C., Ferry-Dumazet H., Groppi A., RA Hantraye F., Hennequin C., Jauniaux N., Joyet P., Kachouri R., RA Kerrest A., Koszul R., Lemaire M., Lesur I., Ma L., Muller H., RA Nicaud J.-M., Nikolski M., Oztas S., Ozier-Kalogeropoulos O., RA Pellenz S., Potier S., Richard G.-F., Straub M.-L., Suleau A., RA Swennen D., Tekaia F., Wesolowski-Louvel M., Westhof E., Wirth B., RA Zeniou-Meyer M., Zivanovic Y., Bolotin-Fukuhara M., Thierry A., RA Bouchier C., Caudron B., Scarpelli C., Gaillardin C., Weissenbach J., RA Wincker P., Souciet J.-L.; RT "Genome evolution in yeasts."; RL Nature 430:35-44(2004). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR382131; CAG79879.1; -; Genomic_DNA. DR RefSeq; XP_504282.1; XM_504282.1. DR ProteinModelPortal; Q6C4Y0; -. DR EnsemblFungi; CAG79879; CAG79879; YALI0_E22825g. DR GeneID; 2912913; -. DR KEGG; yli:YALI0E22825g; -. DR InParanoid; Q6C4Y0; -. DR OMA; SEWISIN; -. DR OrthoDB; EOG7W15C8; -. DR Proteomes; UP000001300; Chromosome E. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001300}; KW Reference proteome {ECO:0000313|Proteomes:UP000001300}. FT COILED 179 199 {ECO:0000256|SAM:Coils}. FT COILED 270 290 {ECO:0000256|SAM:Coils}. FT COILED 456 476 {ECO:0000256|SAM:Coils}. FT COILED 695 715 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 724 AA; 80941 MW; 507D751A35DCF760 CRC64; MFSNTPLRSK SRDRHAGGLK SRKSVQWQTE TEKHDPQTST LEGNRSPRQP HNSSELSLDG SKRHQDFYDY SEEQALFDLL QSDPEKVTQM LTEMDISEEE EHGLQQEDKA SLVNTIQELA LKGAEYTGDA LRKGVTPAVK ALLAAAFFYY VATHVSLPGI GNNNKIPAFE PPVEPPQDIS ELSSRLQRLE SELSRVGSDS DSSSEWISIN KVVIKELPSK VESVEKALGG IDVEARKLGK SMEKITKDTR LLLDDNSAFS DTQKKCVSEL ESLKQQLVRQ REAINAIKSN EMDTTPALAI LGGTVERIDG DLTRLAADLE KQRKLDSKHI TQLVTEAIEK AVPENRPDLF IDTKHQFEDY IASIVPEIKA EVLEIVSSSK HATNDTVDFS ALISKAVAQK VAELDLESLF VNPDELKTVL NEEIGQVKSH VDAKVGKVET EVAQVKSEVG QVKSEVDHIQ SNIRHIESEV AQVEANKTST SRTPREEGVA EKIINFASLI NGARIDTERT TALYDPWSGS NPVYWAFRRS LSTMGIGKPL VRRPWVALIG DMSGGSCWPF NGRRGQLAVK LAAKMVPTSF SLRHAVAADD MFLGSAPRFF NIWIKVDDSK LRDQINTASE AYRPYNIPWD YILVGQYEFD PAENANSWYP VPQNIRNLEI QTEETIFEFV ENWGHDRFTC VYQVGVHGVQ AISEEEEVEA EVIEETEDIT AEEDQEDQSV VKDV // ID Q6CC52_YARLI Unreviewed; 627 AA. AC Q6CC52; DT 16-AUG-2004, integrated into UniProtKB/TrEMBL. DT 16-AUG-2004, sequence version 1. DT 14-OCT-2015, entry version 55. DE SubName: Full=YALI0C12430p {ECO:0000313|EMBL:CAG82070.1}; GN ORFNames=YALI0_C12430g {ECO:0000313|EMBL:CAG82070.1}; OS Yarrowia lipolytica (strain CLIB 122 / E 150) (Yeast) (Candida OS lipolytica). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia. OX NCBI_TaxID=284591 {ECO:0000313|EMBL:CAG82070.1, ECO:0000313|Proteomes:UP000001300}; RN [1] {ECO:0000313|EMBL:CAG82070.1, ECO:0000313|Proteomes:UP000001300} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=CLIB 122 / E 150 {ECO:0000313|Proteomes:UP000001300}; RX PubMed=15229592; DOI=10.1038/nature02579; RG Genolevures; RA Dujon B., Sherman D., Fischer G., Durrens P., Casaregola S., RA Lafontaine I., de Montigny J., Marck C., Neuveglise C., Talla E., RA Goffard N., Frangeul L., Aigle M., Anthouard V., Babour A., Barbe V., RA Barnay S., Blanchin S., Beckerich J.-M., Beyne E., Bleykasten C., RA Boisrame A., Boyer J., Cattolico L., Confanioleri F., de Daruvar A., RA Despons L., Fabre E., Fairhead C., Ferry-Dumazet H., Groppi A., RA Hantraye F., Hennequin C., Jauniaux N., Joyet P., Kachouri R., RA Kerrest A., Koszul R., Lemaire M., Lesur I., Ma L., Muller H., RA Nicaud J.-M., Nikolski M., Oztas S., Ozier-Kalogeropoulos O., RA Pellenz S., Potier S., Richard G.-F., Straub M.-L., Suleau A., RA Swennen D., Tekaia F., Wesolowski-Louvel M., Westhof E., Wirth B., RA Zeniou-Meyer M., Zivanovic Y., Bolotin-Fukuhara M., Thierry A., RA Bouchier C., Caudron B., Scarpelli C., Gaillardin C., Weissenbach J., RA Wincker P., Souciet J.-L.; RT "Genome evolution in yeasts."; RL Nature 430:35-44(2004). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR382129; CAG82070.1; -; Genomic_DNA. DR RefSeq; XP_501760.1; XM_501760.1. DR EnsemblFungi; CAG82070; CAG82070; YALI0_C12430g. DR GeneID; 2909383; -. DR KEGG; yli:YALI0C12430g; -. DR InParanoid; Q6CC52; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001300; Chromosome C. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001300}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001300}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 627 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004271404. FT TRANSMEM 469 489 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 254 281 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 627 AA; 70834 MW; 8556DFDDEEA19956 CRC64; MNLWKLVLTA GLVMCESQAG QPTPSHPAPT LRSFEEWRIH NLLQSGQNLG STPGSGPGHS NGGGGGSEPE MDPFTGEIDF LFAASDDPGK TYKDRFNYAS FDCGATVVKS NKDVKGAGAI LVENKDSYLL NKCVAGSKHV IIELCQDILV DQVVVGNYEF FSSMFKDIRI SVADRYPVAS GEWRVLGDFT ADNIRDLQTF DITVPQIWAR YVKIEFLSHW GHEYYCPISV VRVHGTTMME EWKRSGGDGD IVAGEKLDES LQEETAQFEK AEETLHEEKM IQENVVEKSN PSLNHTVIEK GECTPKLSLD KVKLGLNEYC RLDEFYRKIE RNNSNNNTDT TTNNQSNNST TTPPTQESIY KTIMKRLSLL ESNATLSLQY VEEQSKVLRD LLYKIEKKQN SKIDDFLFHF NATVLQQLVM FSQQREQKLN RAILELDLQK SQTDKNMAAI SQRFAMLADD LVFQKRMSMF QGLILLVVLI FVALTRGTGP ELSRRSRLMF NTDDSEDNSD FGSFRRHLRR NFKRNISSLS PRWSLNLTPQ SPVSDDGVDE GLGLDSDSED EHIESPPPPR TLGSPLATPG PKKKKSRSLY DEPVQWAGDT PEPVDYMSPA EDDDDKKDHR RDRNKEE // ID Q6CJ02_KLULA Unreviewed; 641 AA. AC Q6CJ02; DT 16-AUG-2004, integrated into UniProtKB/TrEMBL. DT 16-AUG-2004, sequence version 1. DT 11-NOV-2015, entry version 53. DE SubName: Full=KLLA0F22539p {ECO:0000313|EMBL:CAG98795.1}; GN ORFNames=KLLA0_F22539g {ECO:0000313|EMBL:CAG98795.1}; OS Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC OS 1267 / NRRL Y-1140 / WM37) (Yeast) (Candida sphaerica). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Kluyveromyces. OX NCBI_TaxID=284590 {ECO:0000313|Proteomes:UP000000598}; RN [1] {ECO:0000313|EMBL:CAG98795.1, ECO:0000313|Proteomes:UP000000598} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / RC WM37 {ECO:0000313|Proteomes:UP000000598}; RX PubMed=15229592; DOI=10.1038/nature02579; RG Genolevures; RA Dujon B., Sherman D., Fischer G., Durrens P., Casaregola S., RA Lafontaine I., de Montigny J., Marck C., Neuveglise C., Talla E., RA Goffard N., Frangeul L., Aigle M., Anthouard V., Babour A., Barbe V., RA Barnay S., Blanchin S., Beckerich J.-M., Beyne E., Bleykasten C., RA Boisrame A., Boyer J., Cattolico L., Confanioleri F., de Daruvar A., RA Despons L., Fabre E., Fairhead C., Ferry-Dumazet H., Groppi A., RA Hantraye F., Hennequin C., Jauniaux N., Joyet P., Kachouri R., RA Kerrest A., Koszul R., Lemaire M., Lesur I., Ma L., Muller H., RA Nicaud J.-M., Nikolski M., Oztas S., Ozier-Kalogeropoulos O., RA Pellenz S., Potier S., Richard G.-F., Straub M.-L., Suleau A., RA Swennen D., Tekaia F., Wesolowski-Louvel M., Westhof E., Wirth B., RA Zeniou-Meyer M., Zivanovic Y., Bolotin-Fukuhara M., Thierry A., RA Bouchier C., Caudron B., Scarpelli C., Gaillardin C., Weissenbach J., RA Wincker P., Souciet J.-L.; RT "Genome evolution in yeasts."; RL Nature 430:35-44(2004). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR382126; CAG98795.1; -; Genomic_DNA. DR RefSeq; XP_456087.1; XM_456087.1. DR STRING; 284590.XP_456087.1; -. DR EnsemblFungi; CAG98795; CAG98795; KLLA0_F22539g. DR GeneID; 2895372; -. DR KEGG; kla:KLLA0F22539g; -. DR eggNOG; ENOG410IE9E; Eukaryota. DR eggNOG; ENOG4111CR2; LUCA. DR HOGENOM; HOG000113639; -. DR InParanoid; Q6CJ02; -. DR OMA; IYNSNQH; -. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000000598; Chromosome F. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000598}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000598}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 158 177 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 184 211 {ECO:0000256|SAM:Coils}. FT COILED 361 381 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 641 AA; 74258 MW; 195F1271F292E1E2 CRC64; MDRSRLYNKS IREAYENMVK GAGAPHLSAD HDNSKVSRYS ASIIHEDNRR EGLENSEFSD NESLNEYDDE GEENDADDDG EAENDYTQFK KSLIQDDEWL DDESDTDYTE EADQSFLLEK DNDNDSDDES RRYVCPGSEF YFEDSKTSKS GKWYFHKFWW LIIGLVCVWF LESYFSVHGS SPLTPDLNKK INLLQNQLNQ LNHERDLQKS KYQSDLDQNI KLIIQQFEKN VKRILPKNIK DFSLLHSTIE KLESKVDTLG EKVAWNNVNE TLSRLSEILP SEIPVIIESD HSSSETNSTK QQVLLIPELH RYLVELIPPL INNSFDAQSI LKNSDFKYDL NHYVKEILNN EFQYIERSQF IQELQSHIQA IKEDLSREIE SKTSQLQTPQ QLSTVVLKKM IHRIYNSNQH QLEANLNKAT FAQGAKILNH LCSKTVKGTT SPVDLLQDCS FGCSSSTYWL GDSKGCQWAI RFDEPIFLTK ISYLHGRFSH NLEVMAGTPK KITIYVKPLV PISKVENIRR WPVDSTYVEL GQFHYDLYSN AIKQDFLLPD WFIQSKALIR SMVFVVDENH GNSSYTAMRK FIVNGVTPKD LQLMSSFPSD WAQMTPEYSI SLEEQERARL SRVAQWQDQQ VPSFGEDELV D // ID Q6CUU4_KLULA Unreviewed; 566 AA. AC Q6CUU4; DT 16-AUG-2004, integrated into UniProtKB/TrEMBL. DT 16-AUG-2004, sequence version 1. DT 11-NOV-2015, entry version 62. DE SubName: Full=KLLA0C02211p {ECO:0000313|EMBL:CAH01146.1}; GN ORFNames=KLLA0_C02211g {ECO:0000313|EMBL:CAH01146.1}; OS Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC OS 1267 / NRRL Y-1140 / WM37) (Yeast) (Candida sphaerica). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Kluyveromyces. OX NCBI_TaxID=284590 {ECO:0000313|Proteomes:UP000000598}; RN [1] {ECO:0000313|EMBL:CAH01146.1, ECO:0000313|Proteomes:UP000000598} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / RC WM37 {ECO:0000313|Proteomes:UP000000598}; RX PubMed=15229592; DOI=10.1038/nature02579; RG Genolevures; RA Dujon B., Sherman D., Fischer G., Durrens P., Casaregola S., RA Lafontaine I., de Montigny J., Marck C., Neuveglise C., Talla E., RA Goffard N., Frangeul L., Aigle M., Anthouard V., Babour A., Barbe V., RA Barnay S., Blanchin S., Beckerich J.-M., Beyne E., Bleykasten C., RA Boisrame A., Boyer J., Cattolico L., Confanioleri F., de Daruvar A., RA Despons L., Fabre E., Fairhead C., Ferry-Dumazet H., Groppi A., RA Hantraye F., Hennequin C., Jauniaux N., Joyet P., Kachouri R., RA Kerrest A., Koszul R., Lemaire M., Lesur I., Ma L., Muller H., RA Nicaud J.-M., Nikolski M., Oztas S., Ozier-Kalogeropoulos O., RA Pellenz S., Potier S., Richard G.-F., Straub M.-L., Suleau A., RA Swennen D., Tekaia F., Wesolowski-Louvel M., Westhof E., Wirth B., RA Zeniou-Meyer M., Zivanovic Y., Bolotin-Fukuhara M., Thierry A., RA Bouchier C., Caudron B., Scarpelli C., Gaillardin C., Weissenbach J., RA Wincker P., Souciet J.-L.; RT "Genome evolution in yeasts."; RL Nature 430:35-44(2004). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CR382123; CAH01146.1; -; Genomic_DNA. DR RefSeq; XP_452295.1; XM_452295.1. DR ProteinModelPortal; Q6CUU4; -. DR STRING; 284590.XP_452295.1; -. DR EnsemblFungi; CAH01146; CAH01146; KLLA0_C02211g. DR GeneID; 2892530; -. DR KEGG; kla:KLLA0C02211g; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000093382; -. DR InParanoid; Q6CUU4; -. DR OMA; ESIVMAN; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000598; Chromosome C. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000598}; KW Reference proteome {ECO:0000313|Proteomes:UP000000598}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 16 {ECO:0000256|SAM:SignalP}. FT CHAIN 17 566 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004271686. FT COILED 290 310 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 566 AA; 65002 MW; 1DA7083217D8D1C2 CRC64; MRYLVFLLVL FLGVNCEGDS VLFSSATDVF ERPKPLKVTT ALATKNSNKS RPLQVSMSST IRNVPLATQG KDSSVFKSFN EWNKEKLRQN KLAQQERPKR SLFHNNNNGD VIRDELEMDF EIFNDDEPEG KIYKDKFNYA SVDCAATIIK TNSEAQGAVS ILFENKDKSL LNPCSVPNKF FVVELCEDIL IESIVMANFE FFSSTFKNVR FSVAERFPVP KNGWKVLGEF EAENIRNTQQ FTITNPMIWA RYLRVEVLSH YGEEFYCPIT LIRAHGIAMI DEFKMEVQNA GEKLEEVVSI EQKLAEEKEK CMIPSSFLTN NMSINFDLDI NHQCLASLKH MNFDEFFTGY KDNENITQAK DSSSFIPINT EESIFKNIMK RLSGLETNGT MSILYIEEQS KLLSKSFDKL EDSYSQQFEA LVKAFNETMT SNLENLNQFA LQLRESSIKI IEEQRLATDK FISTTTNKVE ELENAYQHQA RVMYLILFGL ISSIIYISLT RESYFEEEMV DDGWYTDNSS LQKLKNNIKR SVSETKGHPY TEAAHYSPIS SDSEFEDEDS IVIPAK // ID Q6M9F1_NEUCS Unreviewed; 1019 AA. AC Q6M9F1; DT 28-JUN-2011, integrated into UniProtKB/TrEMBL. DT 28-JUN-2011, sequence version 1. DT 11-NOV-2015, entry version 11. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KHE88594.1}; GN ORFNames=G21B4.210 {ECO:0000313|EMBL:CAF06006.1}, GN GE21DRAFT_6071 {ECO:0000313|EMBL:KHE88594.1}; OS Neurospora crassa. OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Neurospora. OX NCBI_TaxID=5141 {ECO:0000313|EMBL:CAF06006.1}; RN [1] {ECO:0000313|EMBL:CAF06006.1} RP NUCLEOTIDE SEQUENCE. RA Schulte U., Aign V., Hoheisel J., Brandt P., Fartmann B., Holland R., RA Nyakatura G., Mewes H.W., Mannhaupt G.; RL Submitted (JAN-2004) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:CAF06006.1} RP NUCLEOTIDE SEQUENCE. RA German Neurospora genome project; RL Submitted (JAN-2004) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EMBL:KHE88594.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=73 {ECO:0000313|EMBL:KHE88594.1}; RG DOE Joint Genome Institute; RA Baker S.E., Grigoriev I., Haridas S., LaButti K., McCluskey K.; RT "Draft genome sequence of Neurospora crassa strain FGSC 73."; RL Submitted (NOV-2014) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BX908808; CAF06006.1; -; Genomic_DNA. DR EMBL; KN389634; KHE88594.1; -; Genomic_DNA. DR ProteinModelPortal; Q6M9F1; -. DR EnsemblFungi; KHE88594; KHE88594; GE21DRAFT_6071. DR eggNOG; ENOG410J35R; Eukaryota. DR eggNOG; ENOG41128BM; LUCA. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 498 518 Helical. FT COILED 111 135 {ECO:0000256|SAM:Coils}. FT COILED 164 320 {ECO:0000256|SAM:Coils}. FT COILED 363 383 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1019 AA; 115251 MW; 22244C962E67BA7E CRC64; MPPRRITRRS AVSPSPALSD IGTPKPGKRG TLIPVKQVRN QTPTRFSLSY GSSLVAMPDR NKTAAGTDLE TAFAEIHETV RTDNIKAEAR RRELDARRGS TTPGPRRPDP IEEETEEEEE EDEEVQEEKE EDDDDNQGGY YNDEEEEEPA PDPPRRATKP VQTLNKLQDQ IEKAKLLEKQ RAEERAAEER EKAEKDAAER ERKRLEKDKK DKEEREKRDK AEREKKEQAA KAQQEAKVKA AREAQERAER EVKKRARDEE DQKQAELERA ERNARLNRER SEDARRQAEQ KHAAEAARKK EEQRQAREAS EAEMASLEEA KRQAMRPPPP PSKQLLSTPP TSRTRELVVP DTGNSYVEES DVYTDSEKMR EVLEEEVRMA QQKRLARYTP EPPEPPRIAR RPASTLSNSL QPPSHQVDQH QDLFDTEAKS MSDKQYPSFG KVSKPTAARP NQTSRPRAEQ SNTTNVETPP PPYTTAPPTF VQRLLSLIRR STWGVWKLFT FLVPVLLIGL IVLTASSYGS PDANTSIRWY GWKHWRSNVG QFIPSHPQLT DDQFNDLKDF ILEQSSSTES AVKNIQTLLP RMVHVKRGPN GDLIIQDDFW HALLDKMLKD SSVLTLDGTG DISEEHWDAL RPRLIKAGLF EKGPSDEHIL QIAEGTVSKS WERWVTKNGE KVAQVVKEHL PGDKGDGVTR DAAISRDEFV GLLKKRIAEH KEEIDGQLDS VKKGLETLID TTVKAAISNS EGSLSKSEIT TLVRNIVKKE IPRAQLEAAA KDGIMRNYHD YVETQVNHFG LGNEAGIVLS ESSPVYRLES QALPGNKHLS KLLGKPKPIS SKDKVTLEAE YMLALSAWND VGQCWCAGIS ASRGAELAVE MAKHVIPQAI VVEHVHPNAT NDPGSMPKDI EIWGYYPDAD DNKRLLAWMD ELYPGEREAD MKMVDANNKK SLSLINRKYV KIGELEYDYA KTSGSHGMFV HKLSEELLDL DAATYKVVVR AKTNHGALDH TCIYRLKLFG EELEYEGEE // ID Q6NR00_DROME Unreviewed; 2727 AA. AC Q6NR00; DT 05-JUL-2004, integrated into UniProtKB/TrEMBL. DT 05-JUL-2004, sequence version 1. DT 11-NOV-2015, entry version 98. DE SubName: Full=LP05936p {ECO:0000313|EMBL:AAQ23602.1}; GN ORFNames=CG5604 {ECO:0000313|EMBL:AAQ23602.1, GN ECO:0000313|FlyBase:FBgn0032208}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227; RN [1] {ECO:0000313|EMBL:AAQ23602.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=Berkeley {ECO:0000313|EMBL:AAQ23602.1}; RA Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J., RA Champe M., Chavez C., Dorsett V., Dresnek D., Farfan D., Frise E., RA George R., Gonzalez M., Guarin H., Kronmiller B., Li P., Liao G., RA Miranda A., Mungall C.J., Nunoo J., Pacleb J., Paragas V., Park S., RA Patel S., Phouanenavong S., Wan K., Yu C., Lewis S.E., Rubin G.M., RA Celniker S.; RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases. CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BT010284; AAQ23602.1; -; mRNA. DR ProteinModelPortal; Q6NR00; -. DR STRING; 7227.FBpp0079663; -. DR PaxDb; Q6NR00; -. DR PRIDE; Q6NR00; -. DR FlyBase; FBgn0032208; CG5604. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR OrthoDB; EOG7Z69BD; -. DR ChiTaRS; CG5604; fly. DR Bgee; Q6NR00; -. DR ExpressionAtlas; Q6NR00; differential. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0005634; C:nucleus; ISS:FlyBase. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; ISS:FlyBase. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0042787; P:protein ubiquitination involved in ubiquitin-dependent protein catabolic process; ISS:FlyBase. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 2: Evidence at transcript level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2727 AA; 302184 MW; BD060B4DB48864BD CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNHLVVAD LSSRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLSF IRDCGSQVHK DTLHSAMSVV SRLCTKVEPN TPCIQNCVES LSTLLQHEDP MVSDGALKCF ASVADRFTRK WVDPAPLAEY GLTTELLKRL QSVGGNTHSS LTAAGTQPTS SSQPAATTNS DAINENVAGT ATISNSTKVK SSDAAASPQS ISTTISLLST LCRGSPSITH DILRSQLADA LERALQGDER CVLDCMRFAD LLLLLLFEGR QALNRGSNNP NQGQLAPRPR RNNTNTDRTH RQLIDCIRSK DSEALREAIE SGGIDVNCMD DVGQTLLNWA SAFGTLEMVE YLCEKGADVN KGQRSSSLHY AACFGRPAIA KILLKFGAYP DLRDEDGKTP LDKARERLDD GHREVAAILQ SPGEWMSPDH SLLNKDGKKY TLMEPRGDPE MAPIYLKVLL PIFCRTFLGS MLGSVRRASF ALIKKIVQYA YPTVLQSLSE TSFSEDAAST SGQNGGNLLI EVIASVLDNE DDGDGHLIVL NIIEEIMCKT QEEFLDHFAR LGVFAKVQAL MDTDAEELYV QLPGTVEEPA AAQRSSTSVV VAPRPTSDDP MEDAKEILQG KPYHWREWSI CRGRDCLYVW SDSVALELSN GSNGWFRFII DGKLATMYSS GSPENGNDSS ENRGEFLEKL MRARSCVIAG VVSQPILPTA SALRLVVGNW VLQSQKTNQL QIHNTEGHQV TVLQDDLPGF IFESNRGTKH TFSAETVLGP DFASGWSTAK KKRNKSKTEG QKFQVRNLSR EIYNKYFKSA QIIPRGAVAI LTDIVKQIEL SFEEQHMAPN GNWETTLTDA LMKLSQLIHE DGVVSAYEMH SSGLVQALVA VLSVNHWETN SPRCKRNKMQ KQRVSVFKKC ILEDNVESAT NKPRTKSTAS ILIQKLVSVL ESTEKLPVYL YDSPCTGYSL QILQKRLRFR LERAECESTL FDRSGRTLKM EPLATIGQLS KYLLKMVAKQ WYDLDRSTYF YLKKIREHRT ATVFTHSFDF DEEGLLFYIG SNAKTCDWVN PAQYGLVQVT SSEGKTLPYG KLEDILSRDS ISLNCHTKDN KKAWFAIDLG VYIIPTAYTL RHARGYGRSA LRNWLLQGSK DGSTWTTLST HVDDKSLVEP GSTATWPINC ATDDSVWYRH IRIQQNGRNA SGQTHYLSLS GFEIYGRVVG VADDIGKSVK EAEAKTRRER RQIRAQLKHM TTGARVIRGV DWRWEEQDGC AEGTITGEIH NGWIDVKWDH GVRNSYRMGA EGKYDLKLAD CEYLSAFDGN QSMGSASTAA KPSEKGGNTL TSRKSSSTPS LPEATEKNQN PEGASNQTVS ADNLAWKQTV ETIAENVFAS AKTQIISNQL AMNTSSSREA RAKHKESGTN QMHKDNISGP SPLSRELEHI SDLSAINNSM PAINSSNVSD LATISENLSL TELSKENICR VLTPSYKPAE SVTASQSSSH PDVQSSSPRE NDIKNISNIE ENNKMNANNS VNKISKDLLA NLRTSNIAGC PPVTQLSTEA LEMIDKMRDG VDMIRNMSNS ILSTDTFPVP CTNVPVGGKK TPKAQALINP DNANQKQIIV TSEEFPTKSS KKPSVTLKPA QQPNAVLSIV DIKEQPISNE NVSVPSQMSI SVPNLTTTSA SEVPSTSEVA THTGLLETFA AIARRRTSQG TNIQDNQIMN AEANVNEHGD QNASGSFLGH SVTSLVKLAL SSNFHSGLLS TAQSYPSLSS NNSENIAPSN PSNTSAGQQS ASTINHTLTM SLTSTSSDSE QVSLEDFLES CRAPALLGDL DDEDDMDEDN DEEENEDEYE EVGNTLLQVM VSRNLLTFMD DEAMENRLVG VTKRKSWDDE FVLKRQFSAL IPAFDPRPGR TNVNQTSDLE ISPLGAELPK PQQSGGPETI EQPLLGLKLR GPGIGGIPEV EIDLSNTDWT IFRAVQELLQ CSQLNKLDKF RKIWEPTYTI VYREVSPEAQ ESTCLESEEF PQTPDVSSKS GASTLSPNSP MHIGFNVADN NLCSVDDVLE LLTQINGLNQ SEIDSDVKEH GVSVLSEDLF ISKKITNKLQ QQIQDPLVLA SNALPNWCEN LNQSCPFLFP FETRQLYFNC TSFGASRSIV CLQSQRDVTV ERQRIPIMSP RRDDHEFRIG RLKHERVKVP RNEDLLMWAM QVMKTHCNRK SVLEVEFLDE EGTGLGPTLE FYALVAAEIQ RSDLCMWLCD DDLGEDTENS TQSAEGNSKP VGYYVNRREH GIFPAPLPQN SEICENVLKY FWFFGVFVAK VLQDMRLVDI PLSTSFLQLL CHNKVLSRNL QKVISDRRNG DLSVVSEDSD IVETCTKLLR TDSNKSNAFG GILSLENLKE IDPTRYQFLQ EMQNLLLRKQ SIEFDDTISA EKKHELINEL KLQTQNGLEV SLEDLALTFT YLPSSSIYGY TQAELLPNGS SVNVTIDNLE AYCELLMNFI LQDGIAQQMK AFSDGFNEVF PLKKLAAFTP SEARMMICGE QFPHWSREDI ISYTEPKLGY NKDSPGFQRF VNVLLSMSGD ERKAFLQFTT GCSSLPPGGL ANLHPRLTVV RKVDAGVGSY PSVNTCVHYL KLPDYPTEEI MKERLLTATK EKGFHLN // ID Q6NT72_HUMAN Unreviewed; 442 AA. AC Q6NT72; DT 05-JUL-2004, integrated into UniProtKB/TrEMBL. DT 05-JUL-2004, sequence version 1. DT 11-NOV-2015, entry version 60. DE SubName: Full=UNC84B protein {ECO:0000313|EMBL:AAH69253.1}; DE Flags: Fragment; GN Name=UNC84B {ECO:0000313|EMBL:AAH69253.1}; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606 {ECO:0000313|EMBL:AAH69253.1}; RN [1] {ECO:0000313|EMBL:AAH69253.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Kidney {ECO:0000313|EMBL:AAH69253.1}; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RA Gerhard D.S., Wagner L., Feingold E.A., Shenmen C.M., Grouse L.H., RA Schuler G., Klein S.L., Old S., Rasooly R., Good P., Guyer M., RA Peck A.M., Derge J.G., Lipman D., Collins F.S., Jang W., Sherry S., RA Feolo M., Misquitta L., Lee E., Rotmistrovsky K., Greenhut S.F., RA Schaefer C.F., Buetow K., Bonner T.I., Haussler D., Kent J., RA Kiekhaus M., Furey T., Brent M., Prange C., Schreiber K., Shapiro N., RA Bhat N.K., Hopkins R.F., Hsie F., Driscoll T., Soares M.B., RA Casavant T.L., Scheetz T.E., Brown-stein M.J., Usdin T.B., RA Toshiyuki S., Carninci P., Piao Y., Dudekula D.B., Ko M.S., RA Kawakami K., Suzuki Y., Sugano S., Gruber C.E., Smith M.R., RA Simmons B., Moore T., Waterman R., Johnson S.L., Ruan Y., Wei C.L., RA Mathavan S., Gunaratne P.H., Wu J., Garcia A.M., Hulyk S.W., Fuh E., RA Yuan Y., Sneed A., Kowis C., Hodgson A., Muzny D.M., McPherson J., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madari A., Young A.C., Wetherby K.D., RA Granite S.J., Kwong P.N., Brinkley C.P., Pearson R.L., Bouffard G.G., RA Blakesly R.W., Green E.D., Dickson M.C., Rodriguez A.C., Grimwood J., RA Schmutz J., Myers R.M., Butterfield Y.S., Griffith M., Griffith O.L., RA Krzywinski M.I., Liao N., Morin R., Morrin R., Palmquist D., RA Petrescu A.S., Skalska U., Smailus D.E., Stott J.M., Schnerch A., RA Schein J.E., Jones S.J., Holt R.A., Baross A., Marra M.A., Clifton S., RA Makowski K.A., Bosak S., Malek J.; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC069253; AAH69253.1; -; mRNA. DR UniGene; Hs.517622; -. DR UniGene; Hs.744734; -. DR STRING; 9606.ENSP00000385616; -. DR PaxDb; Q6NT72; -. DR PRIDE; Q6NT72; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOVERGEN; HBG056957; -. DR Bgee; Q6NT72; -. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}. FT COILED 77 97 {ECO:0000256|SAM:Coils}. FT COILED 99 126 {ECO:0000256|SAM:Coils}. FT COILED 129 156 {ECO:0000256|SAM:Coils}. FT COILED 203 223 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:AAH69253.1}. SQ SEQUENCE 442 AA; 49424 MW; 927A114B0409AB79 CRC64; VMSRVHSLER RLEALAAEFS SNWQKEAMRL ERLELRQGAP GQGGGGGLSH EDTLALLEGL VSRREAALKE DFRRETAARI QEELSALRAE HQQDSEDLFK KIVRASQESE ARIQQLKSEW QSMTQESFQE SSVKELRRLE DQLAGLQQEL AALALKQSSV AEEVGLLPQQ IQAVRDDVES QFPAWISQFL ARGGGGRVGL LQREEMQAQL RELESKILTH VAEMQGKSAR EAAASLSLTL QKEGVIGVTE EQVHHIVKQA LQRYSEDRIG LADYALESGG ASVISTRCSE TYETKTALLS LFGIPLWYHS QSPRVILQPD VHPGNCWAFQ GPQGFAVVRL SARIRPTAVT LEHVPKALSP NSTISSAPKD FAIFGFDEDL QQEGTLLGKF TYDQDGEPIQ TFHFQAPTMA TYQVVELRIL TNWGHPEYTC IYRFRVHGEP AH // ID Q74Z91_ASHGO Unreviewed; 616 AA. AC Q74Z91; DT 05-JUL-2004, integrated into UniProtKB/TrEMBL. DT 05-JUL-2004, sequence version 1. DT 14-OCT-2015, entry version 60. DE SubName: Full=AGR312Wp {ECO:0000313|EMBL:AAS54802.1}; GN ORFNames=AGOS_AGR312W {ECO:0000313|EMBL:AAS54802.1}; OS Ashbya gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL OS Y-1056) (Yeast) (Eremothecium gossypii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Eremothecium. OX NCBI_TaxID=284811 {ECO:0000313|EMBL:AAS54802.1, ECO:0000313|Proteomes:UP000000591}; RN [1] {ECO:0000313|EMBL:AAS54802.1, ECO:0000313|Proteomes:UP000000591} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056 RC {ECO:0000313|Proteomes:UP000000591}; RX PubMed=15001715; DOI=10.1126/science.1095781; RA Dietrich F.S., Voegeli S., Brachat S., Lerch A., Gates K., Steiner S., RA Mohr C., Poehlmann R., Luedi P., Choi S., Wing R.A., Flavier A., RA Gaffney T.D., Philippsen P.; RT "The Ashbya gossypii genome as a tool for mapping the ancient RT Saccharomyces cerevisiae genome."; RL Science 304:304-307(2004). RN [2] {ECO:0000313|Proteomes:UP000000591} RP GENOME REANNOTATION. RC STRAIN=ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056 RC {ECO:0000313|Proteomes:UP000000591}; RX PubMed=23749448; DOI=10.1534/g3.112.002881; RA Dietrich F.S., Voegeli S., Kuo S., Philippsen P.; RT "Genomes of Ashbya fungi isolated from insects reveal four mating-type RT loci, numerous translocations, lack of transposons, and distinct gene RT duplications."; RL G3 (Bethesda) 3:1225-1239(2013). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE016820; AAS54802.1; -; Genomic_DNA. DR RefSeq; NP_986978.1; NM_212040.1. DR EnsemblFungi; AAS54802; AAS54802; AGOS_AGR312W. DR GeneID; 4623281; -. DR KEGG; ago:AGOS_AGR312W; -. DR HOGENOM; HOG000113639; -. DR InParanoid; Q74Z91; -. DR OMA; IYNSNQH; -. DR OrthoDB; EOG7KM62C; -. DR Proteomes; UP000000591; Chromosome VII. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000591}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000591}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 145 165 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 176 196 {ECO:0000256|SAM:Coils}. FT COILED 224 251 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 616 AA; 71198 MW; 409F77837142E993 CRC64; MMSEDSRQRL YNQSIQNAYN ALLSQKRGPR SSGFPSGSSA VAVQDEEPRL SDMSEFSDDD AQVDRAMIQD DEFDEEGGAE ENDDYSRFKR SLLHDNHEDY GWLDDDDTDY TDQADKSFIQ DDDDDDGSTY VYEGWSARRQ GWGRWAVLGL GALFVVWILA GWGGASPASP DLYRRVNQLQ TQLNHLTHEA ETQRKSFRSE LDSTINMVIQ RFEQNIKRLL PGYTSKLESN IAQLESEMQQ INQKLLMENV TLWQKELVIK LNEKLPEKIP IVMEDNSNML LIPELHDYLS TLISQVVRES VTSLPQFKFN INDYIKEVLN NNFQFVDKQY FLTQLHESLM ANKDEIWQEL EPRLAHLTSP DAVPQQFSSV IMKKLMHKIY NANQHQWESD LNIATFAQGS KLLNHLCSKT HHGPVGPMYL LQDCNGCTST YWNCDAAACS WAIRLVEPMY LIKLGYVHGK FSHNLQIMTA APKKINVYVK LYEGTQNAPQ NTKYWSRDSR FMFLGSWDYD IFDNRIRQDF ELPLWFIQGK YLVRSIAFEV ATNHGNNQYT SLRKFVVNAV TVQDLKLMDK FPRDWKIQVP DYSVMIDDQE RIRASRIAQL HNAGSVPSFG DDELDA // ID Q752A2_ASHGO Unreviewed; 765 AA. AC Q752A2; DT 05-JUL-2004, integrated into UniProtKB/TrEMBL. DT 05-JUL-2004, sequence version 1. DT 14-OCT-2015, entry version 66. DE SubName: Full=AFR673Cp {ECO:0000313|EMBL:AAS54045.1}; GN ORFNames=AGOS_AFR673C {ECO:0000313|EMBL:AAS54045.1}; OS Ashbya gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL OS Y-1056) (Yeast) (Eremothecium gossypii). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Eremothecium. OX NCBI_TaxID=284811 {ECO:0000313|EMBL:AAS54045.1, ECO:0000313|Proteomes:UP000000591}; RN [1] {ECO:0000313|EMBL:AAS54045.1, ECO:0000313|Proteomes:UP000000591} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056 RC {ECO:0000313|Proteomes:UP000000591}; RX PubMed=15001715; DOI=10.1126/science.1095781; RA Dietrich F.S., Voegeli S., Brachat S., Lerch A., Gates K., Steiner S., RA Mohr C., Poehlmann R., Luedi P., Choi S., Wing R.A., Flavier A., RA Gaffney T.D., Philippsen P.; RT "The Ashbya gossypii genome as a tool for mapping the ancient RT Saccharomyces cerevisiae genome."; RL Science 304:304-307(2004). RN [2] {ECO:0000313|Proteomes:UP000000591} RP GENOME REANNOTATION. RC STRAIN=ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056 RC {ECO:0000313|Proteomes:UP000000591}; RX PubMed=23749448; DOI=10.1534/g3.112.002881; RA Dietrich F.S., Voegeli S., Kuo S., Philippsen P.; RT "Genomes of Ashbya fungi isolated from insects reveal four mating-type RT loci, numerous translocations, lack of transposons, and distinct gene RT duplications."; RL G3 (Bethesda) 3:1225-1239(2013). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE016819; AAS54045.1; -; Genomic_DNA. DR RefSeq; NP_986221.1; NM_212357.1. DR ProteinModelPortal; Q752A2; -. DR EnsemblFungi; AAS54045; AAS54045; AGOS_AFR673C. DR GeneID; 4622510; -. DR KEGG; ago:AGOS_AFR673C; -. DR InParanoid; Q752A2; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000000591; Chromosome VI. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000591}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000591}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 23 {ECO:0000256|SAM:SignalP}. FT CHAIN 24 765 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004285295. FT TRANSMEM 606 623 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 765 AA; 85755 MW; 13E0E5AB3EE07B58 CRC64; MRASSICDIV LAVWLLPLSQ TVAQVVQREN DWQSIEASRA GMFGNSVQGG CDGVCEPQEG PVTAIMNMWE NVDASPIPER PEMTDIDLVP PGKQTGNAFS FTSRGSAEAG VGAPTVWSPQ LTGCGDHSRS YSQAVASEPV SAESSQLASQ APEVLEEFSL READEEGDLH NETEFLPFDE WKRIKLDEEK RASVHSETHT RTRIPVDYNR ADALGDEMEI DVGMFTSMED DEPEGKLYHE KFNYASLDCA ASIVKTNSEA QGASSILYEN KDKYLLNPCS AVNKFVVIEL CQDILVEEIE MANYEFFSST FQNVRFSVSD RFPVPKNGWK VLGEFTAVNS RDIQKFGIPN PKIWARYLRV EILSHYGNEF YCPISVVRTH GKTMMEEFKL AQSVGNSQES EVLQPAPDRI LNSTLHNASF LCTGSHHNNN GGPLSFSSNN VSVELFADDM ASQCRAALPP LRFEEFIDGF DDISCGQKHS KSMNFTGTIS NSATEESIFK NIMKRLTTLE TNATLSILYI EEQSRLLSKS FYSLEKNHAK KFDSLVSIFN ETMMHNLETL NVFAKQLKDS SMHLLEEQKL STDQFTSMTI QRLDMMERDA RFQKRMVYLI LVAVAALLVY VLLTREAYID DYMEDDGWYL DSPPLQKAKD KLMRKAARAV STPTIFKNFQ DDIAPMKIRR NPSFSSSSSI SDVSQYLIDN DDDSVFLGRP RTYSKLLAAD ENEIDIDEIM THSSDNSDVS HGMDRLLDAD GTEVSSRESS DGEPK // ID Q7PSA4_ANOGA Unreviewed; 2929 AA. AC Q7PSA4; DT 15-DEC-2003, integrated into UniProtKB/TrEMBL. DT 23-OCT-2007, sequence version 4. DT 11-NOV-2015, entry version 105. DE SubName: Full=AGAP009511-PA {ECO:0000313|EMBL:EAA05937.5}; DE Flags: Fragment; GN ORFNames=AgaP_AGAP009511 {ECO:0000313|EMBL:EAA05937.5}; OS Anopheles gambiae (African malaria mosquito). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Anophelinae; Anopheles. OX NCBI_TaxID=7165 {ECO:0000313|Proteomes:UP000007062}; RN [1] {ECO:0000313|EMBL:EAA05937.5, ECO:0000313|Proteomes:UP000007062} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PEST {ECO:0000313|EMBL:EAA05937.5, RC ECO:0000313|Proteomes:UP000007062}; RX PubMed=12364791; DOI=10.1126/science.1076181; RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R., RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R., RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., RA Lai Z., Kraft C.L., Abril J.F., Anthouard V., Arensburger P., RA Atkinson P.W., Baden H., de Berardinis V., Baldwin D., Benes V., RA Biedler J., Blass C., Bolanos R., Boscus D., Barnstead M., Cai S., RA Center A., Chaturverdi K., Christophides G.K., Chrystal M.A.M., RA Clamp M., Cravchik A., Curwen V., Dana A., Delcher A., Dew I., RA Evans C.A., Flanigan M., Grundschober-Freimoser A., Friedli L., Gu Z., RA Guan P., Guigo R., Hillenmeyer M.E., Hladun S.L., Hogan J.R., RA Hong Y.S., Hoover J., Jaillon O., Ke Z., Kodira C.D., Kokoza E., RA Koutsos A., Letunic I., Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., RA Lopez J.R., Malek J.A., McIntosh T.C., Meister S., Miller J.R., RA Mobarry C., Mongin E., Murphy S.D., O'Brochta D.A., Pfannkoch C., RA Qi R., Regier M.A., Remington K., Shao H., Sharakhova M.V., RA Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J., Thomasova D., RA Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B., Wang A.H., RA Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., Yao A., RA Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I., RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., RA Mural R.J., Myers E.W., Adams M.D., Smith H.O., Broder S., RA Gardner M.J., Fraser C.M., Birney E., Bork P., Brey P.T., Venter J.C., RA Weissenbach J., Kafatos F.C., Collins F.H., Hoffman S.L.; RT "The genome sequence of the malaria mosquito Anopheles gambiae."; RL Science 298:129-149(2002). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAA05937.5}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAAB01008839; EAA05937.5; -; Genomic_DNA. DR RefSeq; XP_310184.4; XM_310184.4. DR STRING; 7165.AGAP009511-PA; -. DR PaxDb; Q7PSA4; -. DR EnsemblMetazoa; AGAP009511-RA; AGAP009511-PA; AGAP009511. DR GeneID; 1271399; -. DR KEGG; aga:AgaP_AGAP009511; -. DR VectorBase; AGAP009511; Anopheles gambiae. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR HOGENOM; HOG000018061; -. DR InParanoid; Q7PSA4; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR PhylomeDB; Q7PSA4; -. DR Proteomes; UP000007062; Chromosome 3R. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 4: Predicted; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007062}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Reference proteome {ECO:0000313|Proteomes:UP000007062}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1358 1378 {ECO:0000256|SAM:Coils}. FT COILED 2678 2698 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EAA05937.5}. SQ SEQUENCE 2929 AA; 317242 MW; 626BEBCD3516C06E CRC64; RSNASLSLSL SLSFQMGDVD PETLLEWLSM GQGDERDMQL IALEQLCMLL LMSDNVDRCF ESCPPRTFLP ALCKIFLDEL APENVLEVTA RAITYYLDVS SECTRRIVAI DGAIRAICNR LEVADLESRT SRDLAEQCIK VLELICTREA GAVFEGGGLS CVLAFIRDSG SQIHKDTLHS AMAVVSRLCT KVEPSGANVQ TCVESLSTLL QHEDPLVADG ALKCFASVAD RFTRKGVDPA PLAEYGLVRE LLQRLSNAAG GPQISSSGGG GSAAISSSLG TNSSTHPESS PSAAQLSSSA PKSAQGAMEA GRSSQSIATT ISLLSTLCRG SPSITHDLLR SNLLEAMERA FKGDERCVLD CMRLADLILL LLFEGRQALG RVGGSQGQLA PRVRRADSST ERTHRQLIDC IRSKDTEALI ESIESGGIDV NCMDDVGQTL LNWASAFGTL EMVEFLCDKG ADVNKGQRSS SLHYAACFGR PGIAKVLLKH GANPDLRDED GKTPLDKARE RPDEGHREVA SILQSPGEWM MAATRSDVKC GDSGDETGGA GGVGEPRGDP EMAPVYLKFF LPTFCKTFQS TMLASVRRSS LGLIKKMIQY VQPEVLSKLC SSEGLQSYEQ SLGTLLVEVI ASVLDNEDDE DGHLVVLTIV QELMSKTQND FLDHFARLGV YTKVQALMGE PSFDGSDNND VIKSTSDDAK SAAAAAACSS TDASGVVTVT PGAPTVTTAS GGTASAAVAV EDAKEILHGK AYHWHDWSIC RGRDCLYVWS DSAALELSNG SNGWFRFILD GKLATMYSSG SPENGSDSTE NRGEFLEKLQ RARAAVRQGT VSQPILSAPS LARIAVGNWV LQSQKEHQLH INNSEGHQVT ILQDELPGFI FESNRGTKHT FTAETTLGPD FAAGWINTKK KKMRCKAEAQ KYQLHKLARD LYNRYFKAAQ AIPRGAVAKL SKIVHQIEIA LEEQQSTSKA ALISSTTQQI TPPSAGVSWQ EKLYNALTEL VHLLNEDGVI SAYEMYSSGL VQALVAVLSP NYWDLGMNRT KANKYQKQRL SIFKKCMYGG ELKTGKNTAA ILVQKLVAVL ESIEKLPVYM YDSPGGSYGL QILTKKLSFR LERAACEQTL FDRTGRNLKM EPLATVGHLN KYLLKMVAKQ WYDMERSSFL YLRKMKEAKP GTMQFRHRHD FDENGLIYYI GTNGRTLEWV NPAQYGLVTV TSSEGKQLPY GKLEDILSRD SVSVNCHTKD NKKSWFAIDL GIFILPTAYT LRHARGYGRS ALRNWLFQMS KDGVSWVTLL THTDDKSLAE PGSTCTWPID CPADEQQGYR HVRIHQNGRN ASGQTHYLSL SGFEIYGKVM SVCEDMDKTA AKENEAKLRK ERRQIRSQLK YITDGARVVR GVDWHWDDQD GSPPGEGTVI AEIHNGWIDV KWDHGMRNSY RMGAEGKYDL KLANVDGLMA GAYDLHSSGI SCELADGGST ASGKKKVYDK SLNVLTSRKS SSTPSLPDAT TENRSSVAST EQATSADNLS WKQAVEVITE NVLSSARSDL ATVGSGGSSN DLSSSVVTSS TAATGNNQEV SVTVHSSLSE RGNNIPDLSQ INSSTSMLVS DLATITENLS LSDGSAKQSG TATASSTGGQ QFVSNISGTG VPVLMGSSSS SSSSSSTEEN NKTNNINETN NKINLTSGSG ASSVASGSSA SSGKAGLSYL QTRLDMMGKM REGVDMLRNN TNNFLSSELL TQSNLLSSVK IAFPPIPPAN ATTGSSSSTT ATTIPGSASY GSSIFVASTS TGTNPAASSG TGAKTPTDKF DVKFNNTATA NTFKKVLNEA KQIGAEQPPA PPAPASSMGG RDATNNLKNN IVVVGSTETT VLGGANNHSH NVSHRTSSHH HHHHNHPDSA SQTLANDAQP PPPGLLETFA AIARRRTSGS GSNSNSNANN NNENNNAATH PSSSSSSSSQ QTTAANNSNF FPRGPNSVTS LVKLALSSHT GLLSTAQSYP SLFSSSSNNN AAAGQGAGNN NNANNMVGVG QVNPLNPALT MSLTSTSSDS EQVSLEDFLE QCRAPTLLGD LEDDEDIEDE NDDDENEDEY EEVGNTLLQV MVSRNLLSFM EERTFENRLP TAGKRKSWDD EFVLKRQFSA LIPAFDPRPG KTNVNQTSDL DIPAPGSNSA TDPTEQPQSS SSGRGALPEP GSGQSSTAIS SLPQPTLSLI LRGPNINGVN DVEVDLTQSD WTIFRAVQEL MLQTTMPKQD KFRKIWQPTY TIIYREASPG SSSSLLGGGG KEDFSSGEEG RATPIISMYS QRSHGSTLSP SSPVPGTPSL SGGGGAGGTG ASSGGLLSQL NSINQSLAAT PTNNDKNLMP DVESHYLSPD VFMSKKITNK LQQQIQDPLV LSSGSLPKWC EEYNQTCPFL FPFETRQLYF SCTAFGASRS IVWLQSQRDV SLERQRAPGL SPRHADQQEF RVGRLKHERV KVPRGENLLD WAQQVMKVHC NRKSVLEVEF VGEEGTGLGP TLEFYALVAA ELQRSDLGMW LCDDEEPKLI EDEIDLGEGS KPVGYYVRRS TGLFPAPLPQ DSDISEDVSG YFWFLGVFLA KVLQDNRLVD LPLSNSFLQL LSHSRSMARG ATSQTGSLLG KSGVSDDIMM SSILSEDSDR DRDLLVDSYQ SKMAMASDGA WYDGILTQEN LQEIDPIRYQ FLRELQELVQ QKQAIEQNDA LSSEEKLQQI SELKLNTKTG CVALEDLALT FTYLPSSRNY GYASADLLPN GANIDVTINN VEEYCNLTVA FCLQEGIAKQ LAAFHRGFCE VFALNKLAAF TPDEIRKMLC GEQNPEWTRE DIMTYTEPKL GYTKESPGFL RFVNVLMGMN GSERKAFLQF TTGCSSLPPG GLANLHPRLT VVRKVDAGEG SYPSVNTCVH YLKLPDYPNE QILRERLLTA TKEKGFHLN // ID Q7QJ62_ANOGA Unreviewed; 663 AA. AC Q7QJ62; DT 15-DEC-2003, integrated into UniProtKB/TrEMBL. DT 23-OCT-2007, sequence version 4. DT 11-NOV-2015, entry version 59. DE SubName: Full=AGAP007311-PA {ECO:0000313|EMBL:EAA04748.4}; DE Flags: Fragment; GN ORFNames=AgaP_AGAP007311 {ECO:0000313|EMBL:EAA04748.4}; OS Anopheles gambiae (African malaria mosquito). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; OC Culicidae; Anophelinae; Anopheles. OX NCBI_TaxID=7165 {ECO:0000313|Proteomes:UP000007062}; RN [1] {ECO:0000313|EMBL:EAA04748.4, ECO:0000313|Proteomes:UP000007062} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PEST {ECO:0000313|EMBL:EAA04748.4, RC ECO:0000313|Proteomes:UP000007062}; RX PubMed=12364791; DOI=10.1126/science.1076181; RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R., RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R., RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., RA Lai Z., Kraft C.L., Abril J.F., Anthouard V., Arensburger P., RA Atkinson P.W., Baden H., de Berardinis V., Baldwin D., Benes V., RA Biedler J., Blass C., Bolanos R., Boscus D., Barnstead M., Cai S., RA Center A., Chaturverdi K., Christophides G.K., Chrystal M.A.M., RA Clamp M., Cravchik A., Curwen V., Dana A., Delcher A., Dew I., RA Evans C.A., Flanigan M., Grundschober-Freimoser A., Friedli L., Gu Z., RA Guan P., Guigo R., Hillenmeyer M.E., Hladun S.L., Hogan J.R., RA Hong Y.S., Hoover J., Jaillon O., Ke Z., Kodira C.D., Kokoza E., RA Koutsos A., Letunic I., Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., RA Lopez J.R., Malek J.A., McIntosh T.C., Meister S., Miller J.R., RA Mobarry C., Mongin E., Murphy S.D., O'Brochta D.A., Pfannkoch C., RA Qi R., Regier M.A., Remington K., Shao H., Sharakhova M.V., RA Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J., Thomasova D., RA Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B., Wang A.H., RA Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., Yao A., RA Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I., RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., RA Mural R.J., Myers E.W., Adams M.D., Smith H.O., Broder S., RA Gardner M.J., Fraser C.M., Birney E., Bork P., Brey P.T., Venter J.C., RA Weissenbach J., Kafatos F.C., Collins F.H., Hoffman S.L.; RT "The genome sequence of the malaria mosquito Anopheles gambiae."; RL Science 298:129-149(2002). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAA04748.4}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAAB01008807; EAA04748.4; -; Genomic_DNA. DR RefSeq; XP_308510.4; XM_308510.4. DR STRING; 7165.AGAP007311-PA; -. DR PaxDb; Q7QJ62; -. DR EnsemblMetazoa; AGAP007311-RA; AGAP007311-PA; AGAP007311. DR GeneID; 1269858; -. DR KEGG; aga:AgaP_AGAP007311; -. DR VectorBase; AGAP007311; Anopheles gambiae. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000230547; -. DR InParanoid; Q7QJ62; -. DR KO; K19347; -. DR OMA; LEHEKDQ; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; Q7QJ62; -. DR Proteomes; UP000007062; Chromosome 2L. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000007062}; KW Reference proteome {ECO:0000313|Proteomes:UP000007062}. FT COILED 312 361 {ECO:0000256|SAM:Coils}. FT COILED 395 422 {ECO:0000256|SAM:Coils}. FT NON_TER 1 1 {ECO:0000313|EMBL:EAA04748.4}. SQ SEQUENCE 663 AA; 75013 MW; 8FBB10ED73E3C77E CRC64; DQTIVLPASS RATLALSSLS HILPSNMKHS ELDMESLYRY VNERNANFRE KILSASSFLS ALSLPFGWLS TSITFSWPWS WTGGKSDSTS RETIRNTLQR TLSKEEFEEL MRHIDAYIDG MMEQKFASKV DESERLRKAT EPPQREAITP EITVHVAQVI EESLKSYNYR LTDSDVDAVV ERVRQTLQAS YPQLFEEASK KADSNDTPEG GKVVLSSEYL TEIQRLVEQH ITTVHNHHYA ISGSQLEDIL ARILSSDKLS ALIDQRIVAS SVAEAQVRAA VDQRQREALV DDLRKELNDI KAHFSEQLLS SSAQWEEHLQ LVKQNHKQLE QQLKAYRLES NELYQKLLAD IDNRLNAHRE ERYEGVNKVI RENIITILGL NVKQDIADGD LRAWINGLFV ARDHLEQRLE EIQAKVNVDV REEIDRSASR LMLEIGEQIR AEMMSKASSA GGPDDPDPAS SSTSSNLTED DVKRIVRDAL IVYDADKTGR VDYALESAGG QVLSTRCTES YQANSAEFRI FGIIPIPYSS NTPRTVISPT MEPGQCWAFQ GFPGYLVIQL NTEIIVTGFT LEHISKLLVS NGSISSAPKH FTVWGLRALN DPEPVPLGSY EYLDQMGSSV QYFPVENKDW TEPLQIVELR IESNHGNIHY TCLYRFRVHG DKV // ID Q7RL00_PLAYO Unreviewed; 871 AA. AC Q7RL00; DT 15-DEC-2003, integrated into UniProtKB/TrEMBL. DT 15-DEC-2003, sequence version 1. DT 11-NOV-2015, entry version 47. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAA22233.1}; GN ORFNames=PY02748 {ECO:0000313|EMBL:EAA22233.1}; OS Plasmodium yoelii yoelii. OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Vinckeia). OX NCBI_TaxID=73239 {ECO:0000313|EMBL:EAA22233.1, ECO:0000313|Proteomes:UP000008553}; RN [1] {ECO:0000313|EMBL:EAA22233.1, ECO:0000313|Proteomes:UP000008553} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17XNL {ECO:0000313|EMBL:EAA22233.1, RC ECO:0000313|Proteomes:UP000008553}; RX PubMed=12368865; DOI=10.1038/nature01099; RA Carlton J.M., Angiuoli S.V., Suh B.B., Kooij T.W., Pertea M., RA Silva J.C., Ermolaeva M.D., Allen J.E., Selengut J.D., Koo H.L., RA Peterson J.D., Pop M., Kosack D.S., Shumway M.F., Bidwell S.L., RA Shallom S.J., van Aken S.E., Riedmuller S.B., Feldblyum T.V., RA Cho J.K., Quackenbush J., Sedegah M., Shoaibi A., Cummings L.M., RA Florens L., Yates J.R. III, Raine J.D., Sinden R.E., Harris M.A., RA Cunningham D.A., Preiser P.R., Bergman L.W., Vaidya A.B., RA van Lin L.H., Janse C.J., Waters A.P., Smith H.O., White O.R., RA Salzberg S.L., Venter J.C., Fraser C.M., Hoffman S.L., Gardner M.J., RA Carucci D.J.; RT "Genome sequence and comparative analysis of the model rodent malaria RT parasite Plasmodium yoelii yoelii."; RL Nature 419:512-519(2002). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAA22233.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AABL01000761; EAA22233.1; -; Genomic_DNA. DR RefSeq; XP_730668.1; XM_725575.1. DR ProteinModelPortal; Q7RL00; -. DR EnsemblProtists; EAA22233; EAA22233; EAA22233. DR GeneID; 3829892; -. DR KEGG; pyo:PY02748; -. DR EuPathDB; PlasmoDB:PY02748; -. DR eggNOG; ENOG410J8NG; Eukaryota. DR eggNOG; ENOG410Y04X; LUCA. DR InParanoid; Q7RL00; -. DR Proteomes; UP000008553; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008553}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000008553}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 167 191 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 279 320 {ECO:0000256|SAM:Coils}. FT COILED 329 349 {ECO:0000256|SAM:Coils}. FT COILED 487 507 {ECO:0000256|SAM:Coils}. FT COILED 580 600 {ECO:0000256|SAM:Coils}. FT COILED 618 638 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 871 AA; 102487 MW; 01C6B30FE1DEB34E CRC64; MTIHTGNNRR HAKDEKNESK SKPKGRKGNS DNEANYNIDP HENEDNSLIQ VLHSYEDFQN NKNYTKLKPC KTKETKFGRV RKTLIKIFSL SDLEVNEPNK NNFNRKKNGP NNVLLTSRMV MWNKIENNND PMYDLKIESS HNKSFVNIIA NYISMLLNDI LNDRKGIAYI AIFMIVLSIL ITCISGFITI FNNAKIDLDT WGIIPSKSSY DGVNKFMGYL KLGEEESNKK IINQNKNTRQ NSKAWKFQEL FDDVKNKINE SINLNFINKK ESGNVSETYS NIKNKQKELE NNFKRIETHL KKMESKLKTL QNDIISKTSD IDYFKNDSKK EVENIKKKLQ DNYQLFQNKF VDYLKIIDDI KIDVSEKKKT IFNEIENKVH ANQISIEEGI SSKIEHQKNY FFEKFSKLEK QMEDIEISIA NKTYSNFENN EFLKNGGDEK KNVYIYADKQ IEDIKKITDE ENKKLLTEYK KKQIETEEEN INRNNYISKK LAIIEDIRKE LDILKERTEA SKTFLDKIFP NLELKMLKNV ENKMKYYLEI YKKDIINELT ETTVISNEKK YKNMALKQEK SQKEFFKKIN IQINSQIKNI KEELNKSIDN ALHSKEFKND KDLIKKINQT NYNAIETLQE KVDELYNEFI LDYNQIDWAL ESLGAKIVYK MTSYPLNKND FIEKFLNQIV SFLPSEEIYG MVKPMGKDPS IILKPSNFPG DCFSFKGNTG KVTIHLPATI NVTSVSIQHV HENISNNVNA TPKYFSVYGV VDPNWPEHFE ESNIDYNDFK NSSLYSCLHK EYGILYPNEI LEKWIKHNKN PSVIHIGDFY FDRKKRISTY QTKHCFPFKR IIFEFTDNYG APYTCIYRLK VHGQRCIRKL K // ID Q7RSE9_PLAYO Unreviewed; 1510 AA. AC Q7RSE9; DT 15-DEC-2003, integrated into UniProtKB/TrEMBL. DT 15-DEC-2003, sequence version 1. DT 11-NOV-2015, entry version 40. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAA15670.1}; DE Flags: Fragment; GN ORFNames=PY00410 {ECO:0000313|EMBL:EAA15670.1}; OS Plasmodium yoelii yoelii. OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Vinckeia). OX NCBI_TaxID=73239 {ECO:0000313|EMBL:EAA15670.1, ECO:0000313|Proteomes:UP000008553}; RN [1] {ECO:0000313|EMBL:EAA15670.1, ECO:0000313|Proteomes:UP000008553} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=17XNL {ECO:0000313|EMBL:EAA15670.1, RC ECO:0000313|Proteomes:UP000008553}; RX PubMed=12368865; DOI=10.1038/nature01099; RA Carlton J.M., Angiuoli S.V., Suh B.B., Kooij T.W., Pertea M., RA Silva J.C., Ermolaeva M.D., Allen J.E., Selengut J.D., Koo H.L., RA Peterson J.D., Pop M., Kosack D.S., Shumway M.F., Bidwell S.L., RA Shallom S.J., van Aken S.E., Riedmuller S.B., Feldblyum T.V., RA Cho J.K., Quackenbush J., Sedegah M., Shoaibi A., Cummings L.M., RA Florens L., Yates J.R. III, Raine J.D., Sinden R.E., Harris M.A., RA Cunningham D.A., Preiser P.R., Bergman L.W., Vaidya A.B., RA van Lin L.H., Janse C.J., Waters A.P., Smith H.O., White O.R., RA Salzberg S.L., Venter J.C., Fraser C.M., Hoffman S.L., Gardner M.J., RA Carucci D.J.; RT "Genome sequence and comparative analysis of the model rodent malaria RT parasite Plasmodium yoelii yoelii."; RL Nature 419:512-519(2002). CC -!- CAUTION: The sequence shown here is derived from an CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is CC preliminary data. {ECO:0000313|EMBL:EAA15670.1}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AABL01000115; EAA15670.1; -; Genomic_DNA. DR RefSeq; XP_724105.1; XM_719012.1. DR ProteinModelPortal; Q7RSE9; -. DR STRING; 352914.XP_724105.1; -. DR EnsemblProtists; EAA15670; EAA15670; EAA15670. DR GeneID; 3789430; -. DR KEGG; pyo:PY00410; -. DR EuPathDB; PlasmoDB:PY00410; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; Q7RSE9; -. DR Proteomes; UP000008553; Unassembled WGS sequence. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000008553}; KW Reference proteome {ECO:0000313|Proteomes:UP000008553}. FT COILED 1420 1440 {ECO:0000256|SAM:Coils}. FT NON_TER 1510 1510 {ECO:0000313|EMBL:EAA15670.1}. SQ SEQUENCE 1510 AA; 180736 MW; 6C06E84E842F12B5 CRC64; MDDKKIPQNA SLNFYIYLKR SFYLKNLNMF YTNIINIFLT LFCIIDKQNK IYNFLYNVDG FNLTQFLLDI FEYILLLKNN SILSAKVIEM ENLRNQAKDN QTYQLNHHII NEEIISVLKI LLTQNYHSEF QSALISNIAL NKIMNLFINK NYDEISKKEA VHNNDHISIS EQNEQPQGND ETNQNTIINS SHKISLPYSN IYFKNILYKM KKKNITMKNV SYSSLETNNL DIEIYSNDEN YKKNVENMSE LLKVNIAIKI IKTICYSFNL IQKTHKESIP DKIHLRNVMH VLLIDLSYSI WKLNNLSQTE NIVSLLFLYF NFSFVYKYDT NEFIIDTHFK KKEESNNFNF MMLKDIQSQN HDTNTGWNIS TDYIKHIVEQ TFIHNFFADY KKYIESTKKK EQTKKKQASH CYNHNKMVLN TIYLIKKIKN NINKRKKNIK IIDIERNFLN NKFPEEIYHQ MIPFLNNRSY YLDSIQYSIL LLSYNYKSTI TSSFNLEYLS VLKKYFTMSF FLKNYQKKNI HLNKDIPIIY QYLLKKYLSY FTQPFSVNFK TKKRNTFLQQ YNTEHSEKYL ISYIQAVIRK YKKIKIFRNI NYCLRQIDAD RMLIENSQKI RIIIDFGSLD TGTKIIEHSR GIINIKSIQQ YDYDSYMLTP CDSDIWWIYS FSDFIHIEKI GLVSLEHYAS NFKVIEILGS DTYPATKWKK LGKISTNFTK SFELFNIYDH CKNYDEDNCW VKYLKFIVLS HHNIEKNYYC TLTHLQIFAS SGVDMLSDKI YSDDNINQIE SDPEKSDEQK KIKIQEQDNV ENLEVLYEDK VLKHIKKQIH SKEEDSKEGN SKELTSKDNY YKNSINPDNF HNDKPLHNNI HNSDIRHKNS KHNAHYTNYA HYLSIEKDLF DTNLLEKELT QSKLIDTDLI KKELMDTELI EDELLNYEFI EKDIKINSFN EENIFDNITQ QMQREYGENK QYIKEDTPII SNKGIKYNDK IISQGDNTKN IQRNANKYIM HPKYYNKLLK DIQNVIILKN GKNEKGLEWT NLLGINPKYS TDLKTEPFNN INIEAMKIAY HISNKIYTIC NAFKNYNIAK IFKMSLITSY KHVIINFVQN NGNITFFMNR EKYNYDDYTK EENAKKYFPY KNYFNGKRFV SPFSHALEKV LIILYNSYFI PNEKWNAHIK NKRIDIKKNN KKKLIWKKKK CTALNTLEYY QNFITVKHKH IILIKSEKIK LLNLLFFPKI ICNNDFICLY TNRQIYERLI NNYNDSITNL IRIYRMYDDV NKQKHAERTN YETFRCEQNE GSKFVLNLFL PKKLTKKKNI LHNIINNISK YNTNINNNKR NTLFKYILSR VKKEQNNIQK QNNFYISILN DTLLILYIIY RSNKYYIYTL NFLKNCIDFI FKWNKYKKSN ILMLTINNKS LTNKYYRIIL MNILNEYNIL KKKNKILANN SQIGVNPNED YNGCTMFYEI ARNVFNYDID NNTKLETFFN ANKNTNVCLY GDKLCHIPLT NNQNNNYTTN // ID Q7RYL0_NEUCR Unreviewed; 1096 AA. AC Q7RYL0; DT 15-DEC-2003, integrated into UniProtKB/TrEMBL. DT 22-JAN-2014, sequence version 2. DT 11-NOV-2015, entry version 65. DE SubName: Full=Sad1/UNC domain-containing protein {ECO:0000313|EMBL:EAA28007.2}; GN ORFNames=NCU00119 {ECO:0000313|EMBL:EAA28007.2}; OS Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM OS 1257 / FGSC 987). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Neurospora. OX NCBI_TaxID=367110 {ECO:0000313|EMBL:EAA28007.2, ECO:0000313|Proteomes:UP000001805}; RN [1] {ECO:0000313|EMBL:EAA28007.2, ECO:0000313|Proteomes:UP000001805} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987 RC {ECO:0000313|Proteomes:UP000001805}; RX PubMed=12712197; DOI=10.1038/nature01554; RA Galagan J.E., Calvo S.E., Borkovich K.A., Selker E.U., Read N.D., RA Jaffe D.B., FitzHugh W., Ma L.-J., Smirnov S., Purcell S., Rehman B., RA Elkins T., Engels R., Wang S., Nielsen C.B., Butler J., Endrizzi M., RA Qui D., Ianakiev P., Bell-Pedersen D., Nelson M.A., RA Werner-Washburne M., Selitrennikoff C.P., Kinsey J.A., Braun E.L., RA Zelter A., Schulte U., Kothe G.O., Jedd G., Mewes H.-W., Staben C., RA Marcotte E., Greenberg D., Roy A., Foley K., Naylor J., RA Stange-Thomann N., Barrett R., Gnerre S., Kamal M., Kamvysselis M., RA Mauceli E.W., Bielke C., Rudd S., Frishman D., Krystofova S., RA Rasmussen C., Metzenberg R.L., Perkins D.D., Kroken S., Cogoni C., RA Macino G., Catcheside D.E.A., Li W., Pratt R.J., Osmani S.A., RA DeSouza C.P.C., Glass N.L., Orbach M.J., Berglund J.A., Voelker R., RA Yarden O., Plamann M., Seiler S., Dunlap J.C., Radford A., Aramayo R., RA Natvig D.O., Alex L.A., Mannhaupt G., Ebbole D.J., Freitag M., RA Paulsen I., Sachs M.S., Lander E.S., Nusbaum C., Birren B.W.; RT "The genome sequence of the filamentous fungus Neurospora crassa."; RL Nature 422:859-868(2003). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM002238; EAA28007.2; -; Genomic_DNA. DR RefSeq; XP_957243.2; XM_952150.3. DR EnsemblFungi; EFNCRT00000000491; EFNCRP00000000491; EFNCRG00000000491. DR GeneID; 3873365; -. DR KEGG; ncr:NCU00119; -. DR EuPathDB; FungiDB:NCU00119; -. DR InParanoid; Q7RYL0; -. DR OrthoDB; EOG7SBNXT; -. DR Proteomes; UP000001805; Chromosome 3, Linkage Group III. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001805}; KW Reference proteome {ECO:0000313|Proteomes:UP000001805}; KW Signal {ECO:0000256|SAM:SignalP}. FT SIGNAL 1 19 {ECO:0000256|SAM:SignalP}. FT CHAIN 20 1096 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004290931. FT COILED 665 685 {ECO:0000256|SAM:Coils}. FT COILED 720 747 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1096 AA; 118608 MW; D42817916B35AB92 CRC64; MRTPPTLFLG LLGLHAVAAA LPEPASVCES RTVNYITHTL PQQCLRTAWT TPTAVTSAVA ADTTSSEVPS NETAAPAQAN ESQQQHPDQS KPSAEHTQEQ TKEEQEDEDL AASTFMSFEE WKEMMLRKSG DPANTKGGQK QPAQQRTGGE HDQNGPSSDT DNHRPGDDGE NPLNFDALSE KVSELTSSPS GDPSTDYGSD KARTDDQVVH EDGKTQYYRS KDAGKTCKER FSYSSFDAGA IVKKTSPGAK NAKAILVENK DSYMLLECHA KSKFVIVQLS DDILVDTVVL ANFEFFSSMI RQFKVSVSDR YPVKLDKWVE LGTFEARNSR DIQAFSVEHP QIYTKYIRIE FLSHYGNEYY CPVSLLRVHG TRMLDTWKEP DDRHDDEQET IEAPPVQEQL PQTPEPEQPS PQVGQPSVAS EPAPSTVTEL EEKTHQKTEP VQVVELGFTP WEPVFYRDFS LEICDLRSRT TGQSTAISPE ADNKQGRNSN TAKEQASTGS AVHETLVPKG SSAASKPQEI AKAQPASSAA SHTSVPPQVS GTTTGSPSNK GPLSRSNTAS NETAPSVSPA AKPSGSSNST AGTTSRSDSK DNGNNASASA GTGGSPVNNG SQNNKNNQSR KPASGAGHGG SPTSSAPPSP TVQESFFKTV HKRLTHLESN TSLSLQYIEQ QSRFLQDVLS KLERRQLTRV DTFLDTLNKT VLTELRNVRQ QYDQIWQSTV IALETQREQT EREVVALSGR LNVLADEVVF QKRMAILQSV LLLSCLILVI FNRTGGGGGG GNGGGGIALN SNGGTGGRPG SRGGGGGGGG WFDSPIQAVQ RRSMRPGSGW ISNMGMSMGM SSPFPFSTTV STSGVQQQVT AAATVEARSG SGEDADAVGT STVVDIATAQ QRNQQQLHPN DNHNLGQRQH QHMLQTQQHS YAYPRNNDKA LPLTPTSEYD SREGTPLVHT SPLRQTSTTI DEVLAAEDVD GNSQIYTQSS FGPEPECVPD QEESSRFSSS EFESGGLTPP RTLETYQEST EPNHNGVANV PVRSNSAEES SERIEEDDNG LMPVDSIEYQ QQQTLRPRAR PSRTHLGSET VKPLPAVPET SKFIIT // ID Q7RZ34_NEUCR Unreviewed; 1019 AA. AC Q7RZ34; DT 15-DEC-2003, integrated into UniProtKB/TrEMBL. DT 15-DEC-2003, sequence version 1. DT 11-NOV-2015, entry version 55. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EAA28281.1}; GN ORFNames=NCU04440 {ECO:0000313|EMBL:EAA28281.1}; OS Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM OS 1257 / FGSC 987). OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Neurospora. OX NCBI_TaxID=367110 {ECO:0000313|EMBL:EAA28281.1, ECO:0000313|Proteomes:UP000001805}; RN [1] {ECO:0000313|EMBL:EAA28281.1, ECO:0000313|Proteomes:UP000001805} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987 RC {ECO:0000313|Proteomes:UP000001805}; RX PubMed=12712197; DOI=10.1038/nature01554; RA Galagan J.E., Calvo S.E., Borkovich K.A., Selker E.U., Read N.D., RA Jaffe D.B., FitzHugh W., Ma L.-J., Smirnov S., Purcell S., Rehman B., RA Elkins T., Engels R., Wang S., Nielsen C.B., Butler J., Endrizzi M., RA Qui D., Ianakiev P., Bell-Pedersen D., Nelson M.A., RA Werner-Washburne M., Selitrennikoff C.P., Kinsey J.A., Braun E.L., RA Zelter A., Schulte U., Kothe G.O., Jedd G., Mewes H.-W., Staben C., RA Marcotte E., Greenberg D., Roy A., Foley K., Naylor J., RA Stange-Thomann N., Barrett R., Gnerre S., Kamal M., Kamvysselis M., RA Mauceli E.W., Bielke C., Rudd S., Frishman D., Krystofova S., RA Rasmussen C., Metzenberg R.L., Perkins D.D., Kroken S., Cogoni C., RA Macino G., Catcheside D.E.A., Li W., Pratt R.J., Osmani S.A., RA DeSouza C.P.C., Glass N.L., Orbach M.J., Berglund J.A., Voelker R., RA Yarden O., Plamann M., Seiler S., Dunlap J.C., Radford A., Aramayo R., RA Natvig D.O., Alex L.A., Mannhaupt G., Ebbole D.J., Freitag M., RA Paulsen I., Sachs M.S., Lander E.S., Nusbaum C., Birren B.W.; RT "The genome sequence of the filamentous fungus Neurospora crassa."; RL Nature 422:859-868(2003). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CM002239; EAA28281.1; -; Genomic_DNA. DR RefSeq; XP_957517.1; XM_952424.2. DR ProteinModelPortal; Q7RZ34; -. DR EnsemblFungi; EFNCRT00000005218; EFNCRP00000005214; EFNCRG00000005211. DR GeneID; 3873648; -. DR KEGG; ncr:NCU04440; -. DR EuPathDB; FungiDB:NCU04440; -. DR InParanoid; Q7RZ34; -. DR OMA; EPPRIAR; -. DR OrthoDB; EOG7P8PJ5; -. DR Proteomes; UP000001805; Chromosome 4, Linkage Group IV. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 2. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001805}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001805}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 498 518 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 111 135 {ECO:0000256|SAM:Coils}. FT COILED 164 320 {ECO:0000256|SAM:Coils}. FT COILED 363 383 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1019 AA; 115251 MW; 22244C962E67BA7E CRC64; MPPRRITRRS AVSPSPALSD IGTPKPGKRG TLIPVKQVRN QTPTRFSLSY GSSLVAMPDR NKTAAGTDLE TAFAEIHETV RTDNIKAEAR RRELDARRGS TTPGPRRPDP IEEETEEEEE EDEEVQEEKE EDDDDNQGGY YNDEEEEEPA PDPPRRATKP VQTLNKLQDQ IEKAKLLEKQ RAEERAAEER EKAEKDAAER ERKRLEKDKK DKEEREKRDK AEREKKEQAA KAQQEAKVKA AREAQERAER EVKKRARDEE DQKQAELERA ERNARLNRER SEDARRQAEQ KHAAEAARKK EEQRQAREAS EAEMASLEEA KRQAMRPPPP PSKQLLSTPP TSRTRELVVP DTGNSYVEES DVYTDSEKMR EVLEEEVRMA QQKRLARYTP EPPEPPRIAR RPASTLSNSL QPPSHQVDQH QDLFDTEAKS MSDKQYPSFG KVSKPTAARP NQTSRPRAEQ SNTTNVETPP PPYTTAPPTF VQRLLSLIRR STWGVWKLFT FLVPVLLIGL IVLTASSYGS PDANTSIRWY GWKHWRSNVG QFIPSHPQLT DDQFNDLKDF ILEQSSSTES AVKNIQTLLP RMVHVKRGPN GDLIIQDDFW HALLDKMLKD SSVLTLDGTG DISEEHWDAL RPRLIKAGLF EKGPSDEHIL QIAEGTVSKS WERWVTKNGE KVAQVVKEHL PGDKGDGVTR DAAISRDEFV GLLKKRIAEH KEEIDGQLDS VKKGLETLID TTVKAAISNS EGSLSKSEIT TLVRNIVKKE IPRAQLEAAA KDGIMRNYHD YVETQVNHFG LGNEAGIVLS ESSPVYRLES QALPGNKHLS KLLGKPKPIS SKDKVTLEAE YMLALSAWND VGQCWCAGIS ASRGAELAVE MAKHVIPQAI VVEHVHPNAT NDPGSMPKDI EIWGYYPDAD DNKRLLAWMD ELYPGEREAD MKMVDANNKK SLSLINRKYV KIGELEYDYA KTSGSHGMFV HKLSEELLDL DAATYKVVVR AKTNHGALDH TCIYRLKLFG EELEYEGEE // ID Q7XXP5_ORYSJ Unreviewed; 453 AA. AC Q7XXP5; DT 01-OCT-2003, integrated into UniProtKB/TrEMBL. DT 01-OCT-2003, sequence version 1. DT 11-NOV-2015, entry version 81. DE SubName: Full=Os05g0270200 protein {ECO:0000313|EMBL:BAF16971.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AAV43967.1}; GN OrderedLocusNames=Os05g0270200 {ECO:0000313|EMBL:BAF16971.1}; GN ORFNames=OJ1653_D06.1 {ECO:0000313|EMBL:AAV67831.1}, OsJ_17853, GN OSJNBa0037H03.16 {ECO:0000313|EMBL:AAV43967.1}; OS Oryza sativa subsp. japonica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39947 {ECO:0000313|EMBL:BAC78599.1}; RN [1] {ECO:0000313|EMBL:AAV43967.1} RP NUCLEOTIDE SEQUENCE. RA Chow T.-Y., Hsing Y.-I.C., Chen C.-S., Chen H.-H., Liu S.-M., RA Chao Y.-T., Chang S.-J., Chen H.-C., Chen S.-K., Chen T.-R., RA Chen Y.-L., Cheng C.-H., Chung C.-I., Han S.-Y., Hsiao S.-H., RA Hsiung J.-N., Hsu C.-H., Huang J.-J., Kau P.-I., Lee M.-C., Leu H.-L., RA Li Y.-F., Lin S.-J., Lin Y.-C., Wu S.-W., Yu C.-Y., Yu S.-W., RA Wu H.-P., Shaw J.-F., McCombie W.R., Spiegel L., de la Bastide M., RA Zutavern T., Muller S., Nascimento L., Balija V., Bell M., Miller B., RA Katzenberger F., Andrade M.V., Dike S., O'Shaughnessy A., Palmer L.; RT "Oryza sativa (japonica cultivar-group) chromosome 5 BAC clone RT OSJNBa0037H03, complete sequence."; RL Submitted (NOV-2004) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:AAV67831.1} RP NUCLEOTIDE SEQUENCE. RA Chow T.-Y., Hsing Y.-I.C., Chen C.-S., Chen H.-H., Liu S.-M., RA Chao Y.-T., Chang S.-J., Chen H.-C., Chen S.-K., Chen T.-R., RA Chen Y.-L., Cheng C.-H., Chung C.-I., Han S.-Y., Hsiao S.-H., RA Hsiung J.-N., Hsu C.-H., Huang J.-J., Kau P.-I., Lee M.-C., Leu H.-L., RA Li Y.-F., Lin S.-J., Lin Y.-C., Wu S.-W., Yu C.-Y., Yu S.-W., RA Wu H.-P., Shaw J.-F.; RT "Oryza sativa BAC OJ1653_D06 genomic sequence."; RL Submitted (NOV-2004) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|Proteomes:UP000000763} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=16100779; DOI=10.1038/nature03895; RG International rice genome sequencing project (IRGSP); RT "The map-based sequence of the rice genome."; RL Nature 436:793-800(2005). RN [4] {ECO:0000313|EMBL:BAF16971.1} RP NUCLEOTIDE SEQUENCE. RG International Rice Genome Sequencing Project; RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N., RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y., RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N., RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M., RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M., RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K., RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., RA Kobayashi N., Machita K., Maehara T., Masukawa M., Mizubayashi T., RA Mukai Y., Nagasaki H., Nagata Y., Naito S., Nakashima M., Nakama Y., RA Nakamichi Y., Nakamura M., Meguro A., Negishi M., Ohta I., Ohta T., RA Okamoto M., Ono N., Saji S., Sakaguchi M., Sakai K., Shibata M., RA Shimokawa T., Song J., Takazaki Y., Terasawa K., Tsugane M., Tsuji K., RA Ueda S., Waki K., Yamagata H., Yamamoto M., Yamamoto S., Yamane H., RA Yoshiki S., Yoshihara R., Yukawa K., Zhong H., Yano M., Yuan Q., RA Ouyang S., Liu J., Jones K.M., Gansberger K., Moffat K., Hill J., RA Bera J., Fadrosh D., Jin S., Johri S., Kim M., Overton L., Reardon M., RA Tsitrin T., Vuong H., Weaver B., Ciecko A., Tallon L., Jackson J., RA Pai G., Aken S.V., Utterback T., Reidmuller S., Feldblyum T., RA Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R., Ying K., RA Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J., Weng Q., RA Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X., RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., RA Samain S., Cattolico L., Pelletier E., Couloux A., Segurens B., RA Wincker P., D'Hont A., Scarpelli C., Weissenbach J., Salanoubat M., RA Quetier F., Yu Y., Kim H.R., Rambo T., Currie J., Collura K., Luo M., RA Yang T., Ammiraju J.S.S., Engler F., Soderlund C., Wing R.A., RA Palmer L.E., de la Bastide M., Spiegel L., Nascimento L., Zutavern T., RA O'Shaughnessy A., Dike S., Dedhia N., Preston R., Balija V., RA McCombie W.R., Chow T., Chen H., Chung M., Chen C., Shaw J., Wu H., RA Hsiao K., Chao Y., Chu M., Cheng C., Hour A., Lee P., Lin S., Lin Y., RA Liou J., Liu S., Hsing Y., Raghuvanshi S., Mohanty A., Bharti A.K., RA Gaur A., Gupta V., Kumar D., Ravi V., Vij S., Kapur A., Khurana P., RA Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K., Singh A., Dalal V., RA Srivastava S., Dixit A., Pal A.K., Ghazi I.A., Yadav M., Pandit A., RA Bhargava A., Sureshbabu K., Batra K., Sharma T.R., Mohapatra T., RA Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S., Keizer G., RA Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K., RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., RA Zimmer P.D., Malone G., Dellagostin O., de Oliveira A.C., Bevan M., RA Bancroft I., Minx P., Cordum H., Wilson R., Cheng Z., Jin W., RA Jiang J., Leong S.A., Iwama H., Gojobori T., Itoh T., Niimura Y., RA Fujii Y., Habara T., Sakai H., Sato Y., Wilson G., Kumar K., RA McCouch S., Juretic N., Hoen D., Wright S., Bruskiewich R., Bureau T., RA Miyao A., Hirochika H., Nishikawa T., Kadowaki K., Sugiura M., RA Burr B., Sasaki T.; RT "The map-based sequence of the rice genome."; RL Nature 436:793-800(2005). RN [5] {ECO:0000313|EMBL:BAC78599.1} RP NUCLEOTIDE SEQUENCE. RC TISSUE=Panicle {ECO:0000313|EMBL:BAC78599.1}; RX PubMed=15659629; DOI=10.1105/tpc.104.028456; RA Moriguchi K., Suzuki T., Ito Y., Yamazaki Y., Niwa Y., Kurata N.; RT "Functional isolation of novel nuclear proteins showing a variety of RT subnuclear localizations."; RL Plant Cell 17:389-403(2005). RN [6] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038; RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S., RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., RA Cong L., Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., RA Wang J., Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., RA Wang J., Wang X., Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., RA Zhang Z., Bao J., Han Y., Dong L., Ji J., Chen P., Wu S., Liu J., RA Xiao Y., Bu D., Tan J., Yang L., Ye C., Zhang J., Xu J., Zhou Y., RA Yu Y., Zhang B., Zhuang S., Wei H., Liu B., Lei M., Yu H., Li Y., RA Xu H., Wei S., He X., Fang L., Zhang Z., Zhang Y., Huang X., Su Z., RA Tong W., Li J., Tong Z., Li S., Ye J., Wang L., Fang L., Lei T., RA Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F., Xu H., RA Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q., Li W., RA Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J., Gao L., RA Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M., McDermott J., RA Samudrala R., Wang J., Wong G.K.-S., Yang H.; RT "The genomes of Oryza sativa: a history of duplications."; RL PLoS Biol. 3:266-281(2005). RN [7] {ECO:0000313|EMBL:BAF16971.1} RP NUCLEOTIDE SEQUENCE. RG IRGSP(International Rice Genome Sequencing Project); RT "Oryza sativa nipponbare(GA3) genomic DNA, chromosome 5."; RL Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases. RN [8] {ECO:0000313|EMBL:BAF16971.1} RP NUCLEOTIDE SEQUENCE. RG The Rice Annotation Project (RAP); RT "The Second Rice Annotation Project Meeting (RAP2)."; RL Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases. RN [9] {ECO:0000313|EMBL:BAF16971.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=16381971; DOI=10.1093/nar/gkj094; RA Ohyanagi H., Tanaka T., Sakai H., Shigemoto Y., Yamaguchi K., RA Habara T., Fujii Y., Antonio B.A., Nagamura Y., Imanishi T., Ikeo K., RA Itoh T., Gojobori T., Sasaki T.; RT "The Rice Annotation Project Database (RAP-DB): hub for Oryza sativa RT ssp. japonica genome information."; RL Nucleic Acids Res. 34:D741-D744(2006). RN [10] RP NUCLEOTIDE SEQUENCE. RA Wang J., Li R., Fan W., Huang Q., Zhang J., Zhou Y., Hu Y., Zi S., RA Li J., Ni P., Zheng H., Zhang Y., Zhao M., Hao Q., McDermott J., RA Samudrala R., Kristiansen K., Wong G.K.-S.; RT "Improved gene annotation of the rice (Oryza sativa) genomes."; RL Submitted (SEP-2006) to the EMBL/GenBank/DDBJ databases. RN [11] {ECO:0000313|EMBL:BAF16971.1} RP NUCLEOTIDE SEQUENCE. RG The Rice Annotation Project (RAP); RA Itoh T., Tanaka T., Barrero R.A., Yamasaki C., Fujii Y., Hilton P.B., RA Antonio B.A., Aono H., Apweiler R., Bruskiewich R., Bureau T., RA Burr F., Costa de Oliveira A., Fuks G., Habara T., Haberer G., Han B., RA Harada E., Hiraki A.T., Hirochika H., Hoen D., Hokari H., Hosokawa S., RA Hsing Y., Ikawa H., Ikeo K., Imanishi T., Ito Y., Jaiswal P., RA Kanno M., Kawahara Y., Kawamura T., Kawashima H., Khurana J.P., RA Kikuchi S., Komatsu S., Koyanagi K.O., Kubooka H., Lieberherr D., RA Lin Y.C., Lonsdale D., Matsumoto T., Matsuya A., McCombie W.R., RA Messing J., Miyao A., Mulder N., Nagamura Y., Nam J., Namiki N., RA Numa H., Nurimoto S., O'donovan C., Ohyanagi H., Okido T., Oota S., RA Osato N., Palmer L.E., Quetier F., Raghuvanshi S., Saichi N., RA Sakai H., Sakai Y., Sakata K., Sakurai T., Sato F., Sato Y., RA Schoof H., Seki M., Shibata M., Shimizu Y., Shinozaki K., Shinso Y., RA Singh N.K., Smith-White B., Takeda J., Tanino M., Tatusova T., RA Thongjuea S., Todokoro F., Tsugane M., Tyagi A.K., Vanavichit A., RA Wang A., Wing R.A., Yamaguchi K., Yamamoto M., Yamamoto N., Yu Y., RA Zhang H., Zhao Q., Higo K., Burr B., Gojobori T., Sasaki T.; RT "Curated Genome Annotation of Oryza sativa ssp. japonica and RT Comparative Genome Analysis with Arabidopsis thaliana."; RL Genome Res. 17:175-183(2007). RN [12] {ECO:0000313|EMBL:BAF16971.1} RP NUCLEOTIDE SEQUENCE. RG The Rice Annotation Project (RAP); RA Tanaka T., Antonio B.A., Kikuchi S., Matsumoto T., Nagamura Y., RA Numa H., Sakai H., Wu J., Itoh T., Sasaki T., Aono R., Fujii Y., RA Habara T., Harada E., Kanno M., Kawahara Y., Kawashima H., Kubooka H., RA Matsuya A., Nakaoka H., Saichi N., Sanbonmatsu R., Sato Y., Shinso Y., RA Suzuki M., Takeda J., Tanino M., Todokoro F., Yamaguchi K., RA Yamamoto N., Yamasaki C., Imanishi T., Okido T., Tada M., Ikeo K., RA Tateno Y., Gojobori T., Lin Y.C., Wei F.J., Hsing Y.I., Zhao Q., RA Han B., Kramer M.R., McCombie R.W., Lonsdale D., O'Donovan C.C., RA Whitfield E.J., Apweiler R., Koyanagi K.O., Khurana J.P., RA Raghuvanshi S., Singh N.K., Tyagi A.K., Haberer G., Fujisawa M., RA Hosokawa S., Ito Y., Ikawa H., Shibata M., Yamamoto M., RA Bruskiewich R.M., Hoen D.R., Bureau TE., Namiki N., Ohyanagi H., RA Sakai Y., Nobushima S., Sakata K., Barrero R.A., Sato Y., Souvorov A., RA Smith-White B., Tatusova T., An S., An G., OOta S., Fuks G., RA Messing J., Christie K.R., Lieberherr D., Kim H., Zuccolo A., RA Wing R.A., Nobuta K., Green P.J., Lu C., Meyers BC., Chaparro C., RA Piegu B., Panaud O., Echeverria M.; RT "The Rice Annotation Project Database (RAP-DB): 2008 update."; RL Nucleic Acids Res. 36:D1028-D1033(2008). RN [13] {ECO:0000313|Proteomes:UP000000763} RP GENOME REANNOTATION. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=18089549; DOI=10.1093/nar/gkm978; RG The rice annotation project (RAP); RT "The rice annotation project database (RAP-DB): 2008 update."; RL Nucleic Acids Res. 36:D1028-D1033(2008). RN [14] RP NUCLEOTIDE SEQUENCE. RA Wang J., Li R., Fan W., Huang Q., Zhang J., Zhou Y., Hu Y., Zi S., RA Li J., Ni P., Zheng H., Zhang Y., Zhao M., Hao Q., McDermott J., RA Samudrala R., Kristiansen K., Wong G.K.-S.; RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC136522; AAV43967.1; -; Genomic_DNA. DR EMBL; AC115634; AAV67831.1; -; Genomic_DNA. DR EMBL; AB110207; BAC78599.1; -; mRNA. DR EMBL; AP008211; BAF16971.1; -; Genomic_DNA. DR EMBL; CM000142; EEE63045.1; -; Genomic_DNA. DR RefSeq; NP_001055057.1; NM_001061592.1. DR UniGene; Os.6431; -. DR STRING; 39947.LOC_Os05g18770.1; -. DR GeneID; 4338258; -. DR KEGG; osa:4338258; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR KO; K19347; -. DR Proteomes; UP000000763; Chromosome 5. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000763}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000763}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 113 136 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 190 224 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 453 AA; 50342 MW; D1B73097E07B4E9B CRC64; MSVSTAAVPT ANTNGNHALS MDSHSSQDVR RRTVVVARKK ASPELLADGG FNGTSSVDKI TDKKDLSHTI RGESVLGKSK YPLEARKDAI ASAAAADRQK KSGAKQEKAK WEIALSVLMK LCLLISAVAW MGQLFWRWQN GDLSFTTLDM ESRLSKVEGF KKTTKMLQVQ LDILDKKLGN EIDKTRRDIT KQFEDKGNKL EIKMKALEGK TDKLDKSLAE LRDMGFVSKK EFDEIVEQLK KKKGLDGTVG DISLDDIRLF AKEIVEMEIE RHAADGLGMV DYALASGGGK VVKHSEAFRK AKSFMPSRNS LLEQAKKMLE PSFGQPGECF ALQGSSGYVE IKLRTGIIPE AVSLEHVDKS VAYDRSSAPK DFQVSGWYEG PEDDSDKESR VVTNLGEFSY DLEKNNAQTF QLERTADSRV INMVRLDFCS NHGNSELTCI YRFRVHGREP GSP // ID Q8GX04_ARATH Unreviewed; 660 AA. AC Q8GX04; DT 01-MAR-2003, integrated into UniProtKB/TrEMBL. DT 01-MAR-2003, sequence version 1. DT 11-NOV-2015, entry version 63. DE SubName: Full=At1g22882 {ECO:0000313|EMBL:AAO64872.1}; DE SubName: Full=Putative uncharacterized protein At1g22882 {ECO:0000313|EMBL:BAC43121.1}; GN OrderedLocusNames=At1g22882 {ECO:0000313|TAIR:AT1G22882}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702 {ECO:0000313|EMBL:BAC43121.1}; RN [1] {ECO:0000313|EMBL:BAC43121.1} RP NUCLEOTIDE SEQUENCE. RA Seki M., Iida K., Satou M., Sakurai T., Akiyama K., Ishida J., RA Nakajima M., Enju A., Kamiya A., Narusaka M., Carninci P., Kawai J., RA Hayashizaki Y., Shinozaki K.; RT "Arabidopsis thaliana full-length cDNA."; RL Submitted (NOV-2002) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:AAO64872.1} RP NUCLEOTIDE SEQUENCE. RA Cheuk R., Chen H., Kim C.J., Shinn P., Bowser L., Carninci P., RA Chan M.M., Chang C.H., Dale J.M., Hayashizaki Y., Hsuan V.W., RA Ishida J., Jones T., Kamiya A., Karlin-Neumann G., Kawai J., Lam B., RA Lee J.M., Lin J., Miranda M., Narusaka M., Nguyen M., Onodera C.S., RA Palm C.J., Quach H.L., Sakurai T., Satou M., Seki M., Southwick A., RA Tang C.C., Toriumi M., Wong C., Wu H.C., Yamada K., Yu G., Yuan S., RA Shinozaki K., Davis R.W., Theologis A., Ecker J.R.; RT "Arabidopsis ORF clones."; RL Submitted (MAR-2003) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BT005937; AAO64872.1; -; mRNA. DR EMBL; AK118518; BAC43121.1; -; mRNA. DR ProteinModelPortal; Q8GX04; -. DR STRING; 3702.AT1G22882.1; -. DR PaxDb; Q8GX04; -. DR PRIDE; Q8GX04; -. DR TAIR; AT1G22882; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR PhylomeDB; Q8GX04; -. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 21 44 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 610 630 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 642 659 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 507 527 {ECO:0000256|SAM:Coils}. FT COILED 563 608 SQ SEQUENCE 660 AA; 74185 MW; 5443117E004368D0 CRC64; MQRSCRTRRR VSVNKFNGRN SFYKVSLSLV FLLWVLLFFS TLLISHGDGA KDEPLDDSMG MADPDDGQSD EKVVPFDGPL SLASASVDVT SDLSRNDDVN LSEESEDKEQ EAEISSTVSG NDIESKDTYL LKQSEINKKD TGIDAGSKYD DFPKKSEINN TGTWNDTEGK DDNNFLKQSQ LNKTGTGNDT ESSDNEFLEQ NQMNKTVLGN GTEINVSKVD QPSRAVPLGL DEFKSRASNS RNKSLSDQVS GVIHRMEPGG KEYNYASASK GAKVLSSNKE AKGAASILSR DNDKYLRNPC STEGKFVVVE LSEETLVNTI KIANFEHYSS NLKEFELQGT LVYPTDTWVH MGNFTASNVK HEQNFTLLEP KWVRYLKLNF ISHYGSEFYC TLSLIEVYGV DAVERMLEDL ISVQDNKNAY KPREGDSEHK EKPMQQIESL EGDDGADKST HREKEKEAPP ENMLAKTEAS MAKSSNKLSE PVEEMRHHQP GSRMPGDTVL KILMQKLRSL DLNLSILERY LEELNLRYGN IFKEMDREAG VREKAIVALR LDLEGMKERQ EGMVSEAEEM KEWRKRVEAE MEKAEKEKEN IRQSLEQVSK RLEWMEKKCL TVFTVCLGFG IIAVIAVVIG MGTGLAEKTG SGAWLLLLIS STFIMFVLSL // ID Q8I5Q5_PLAF7 Unreviewed; 953 AA. AC Q8I5Q5; DT 01-MAR-2003, integrated into UniProtKB/TrEMBL. DT 01-MAR-2003, sequence version 1. DT 14-OCT-2015, entry version 52. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AAN36235.1}; GN ORFNames=PFL0730w {ECO:0000313|EMBL:AAN36235.1}; OS Plasmodium falciparum (isolate 3D7). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Laverania). OX NCBI_TaxID=36329 {ECO:0000313|EMBL:AAN36235.1, ECO:0000313|Proteomes:UP000001450}; RN [1] {ECO:0000313|EMBL:AAN36235.1, ECO:0000313|Proteomes:UP000001450} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate 3D7 {ECO:0000313|Proteomes:UP000001450}; RX PubMed=12368864; DOI=10.1038/nature01097; RA Gardner M.J., Hall N., Fung E., White O., Berriman M., Hyman R.W., RA Carlton J.M., Pain A., Nelson K.E., Bowman S., Paulsen I.T., RA James K.D., Eisen J.A., Rutherford K.M., Salzberg S.L., Craig A., RA Kyes S., Chan M.-S., Nene V., Shallom S.J., Suh B., Peterson J., RA Angiuoli S., Pertea M., Allen J., Selengut J., Haft D., Mather M.W., RA Vaidya A.B., Martin D.M.A., Fairlamb A.H., Fraunholz M.J., Roos D.S., RA Ralph S.A., McFadden G.I., Cummings L.M., Subramanian G.M., RA Mungall C., Venter J.C., Carucci D.J., Hoffman S.L., Newbold C., RA Davis R.W., Fraser C.M., Barrell B.G.; RT "Genome sequence of the human malaria parasite Plasmodium RT falciparum."; RL Nature 419:498-511(2002). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE014846; AAN36235.1; -; Genomic_DNA. DR RefSeq; XP_001350555.1; XM_001350519.1. DR ProteinModelPortal; Q8I5Q5; -. DR SMR; Q8I5Q5; 708-946. DR EnsemblProtists; PFL0730w:mRNA; PFL0730w:pep; PFL0730w. DR GeneID; 811199; -. DR KEGG; pfa:PFL0730w; -. DR EuPathDB; PlasmoDB:PF3D7_1215100; -. DR HOGENOM; HOG000281004; -. DR InParanoid; Q8I5Q5; -. DR Proteomes; UP000001450; Chromosome 12. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 2. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001450}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001450}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 162 186 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 272 303 {ECO:0000256|SAM:Coils}. FT COILED 318 338 {ECO:0000256|SAM:Coils}. FT COILED 404 442 {ECO:0000256|SAM:Coils}. FT COILED 552 572 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 953 AA; 113033 MW; 970F692DEC8CD13A CRC64; MNISNSSKNT FISDENNVTY EDDYNETDNN KDEGDDRNTL IHVLHSYEDF QKKNMNYGKV NPYKKKETQL SKIKKNLVRL LSSNDITIHK FNSEEIKGNK KKKMENIKFL TSRAAMWYQV EKNDDPDYDL KLESSKKKNF VYITMNYINI FMNDLLNDKR GMTYIATFMI VLSIIITFIS GFITLYNNNN NNINHYNNHN NHKYHNINSS NNNNNNTINS NAFFSTNEQY LKNLDEKNNW NLNMSRNNYD DINQYMNYSA LQIKRGNDQE NYKKYNEEIF EILEQLKKEI NENKNIKSSE KIQNENKKDD KLNIYEIRLN IEKKINELEN KIINNNKNSD SFKSNLIKEF ENFKNIFHDN YEKFQTQFKD YTNIVNNIKS VIHNKDTFIN NIQKTFTQNQ VDIKNNLTSH IENEKKELLQ KINELQSQVK VMEWNILKQE NLYKGMKQNI LKILGQNKLN TIDNNNNNNN NNNLYDADND SSYDDEWFGE NKNILMGPPI DMDKKKKKQE NENNKNNKNY SNHRFSHNYM PHHISKDIHS NEIETLYTQN EFKELLNVIN DIKEQINMLQ EKNINSKNYL DDTFLQMEEK ILKNAEYKIK YYLEIYKKDI LNEITESKVI YNEEKFKSLT LKHERLQADL LKNINNQIKI QSKLIKDDIS KSIHFMMEQK KGKHNYNNIN NNNNNNIISS SNGSSNNNKM IYSDHLEIIQ KKVDELYNEF ILDYNEIDWA LESLGAKIVY KMTSSPLNRN DFIEKFLNQI ASFLPSEEIY GMIKPMGKDP AIVLKPTNFP GDCFSFKGNH GKITIHLPAT IDITSISIQH VHENISNNSN ATPKYFSVYG MVDLNWPEQF EENDINYDDF KNSSLYSCLH STYGNIIQPN EILQRWLKDN KQPNLIHIGD FYFDRKKRIA TYPTKSCFPM KRIIFEFTEN YGASYTCVYR LKVHGKRCIR KFK // ID Q8IL76_PLAF7 Unreviewed; 1803 AA. AC Q8IL76; DT 01-MAR-2003, integrated into UniProtKB/TrEMBL. DT 22-SEP-2009, sequence version 2. DT 14-OCT-2015, entry version 53. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:AAN36985.2}; GN ORFNames=PF14_0372 {ECO:0000313|EMBL:AAN36985.2}; OS Plasmodium falciparum (isolate 3D7). OC Eukaryota; Alveolata; Apicomplexa; Aconoidasida; Haemosporida; OC Plasmodiidae; Plasmodium; Plasmodium (Laverania). OX NCBI_TaxID=36329 {ECO:0000313|EMBL:AAN36985.2, ECO:0000313|Proteomes:UP000001450}; RN [1] {ECO:0000313|EMBL:AAN36985.2, ECO:0000313|Proteomes:UP000001450} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Isolate 3D7 {ECO:0000313|Proteomes:UP000001450}; RX PubMed=12368864; DOI=10.1038/nature01097; RA Gardner M.J., Hall N., Fung E., White O., Berriman M., Hyman R.W., RA Carlton J.M., Pain A., Nelson K.E., Bowman S., Paulsen I.T., RA James K.D., Eisen J.A., Rutherford K.M., Salzberg S.L., Craig A., RA Kyes S., Chan M.-S., Nene V., Shallom S.J., Suh B., Peterson J., RA Angiuoli S., Pertea M., Allen J., Selengut J., Haft D., Mather M.W., RA Vaidya A.B., Martin D.M.A., Fairlamb A.H., Fraunholz M.J., Roos D.S., RA Ralph S.A., McFadden G.I., Cummings L.M., Subramanian G.M., RA Mungall C., Venter J.C., Carucci D.J., Hoffman S.L., Newbold C., RA Davis R.W., Fraser C.M., Barrell B.G.; RT "Genome sequence of the human malaria parasite Plasmodium RT falciparum."; RL Nature 419:498-511(2002). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE014187; AAN36985.2; -; Genomic_DNA. DR RefSeq; XP_001348546.2; XM_001348510.2. DR ProteinModelPortal; Q8IL76; -. DR IntAct; Q8IL76; 1. DR MINT; MINT-1596767; -. DR EnsemblProtists; PF14_0372:mRNA; PF14_0372:pep; PF14_0372. DR GeneID; 811954; -. DR KEGG; pfa:PF14_0372; -. DR EuPathDB; PlasmoDB:PF3D7_1439300; -. DR InParanoid; Q8IL76; -. DR Proteomes; UP000001450; Chromosome 14. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000001450}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000001450}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 18 {ECO:0000256|SAM:SignalP}. FT CHAIN 19 1803 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5004308507. FT TRANSMEM 1671 1690 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 1761 1784 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 248 268 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1803 AA; 220747 MW; 7672FFFCADACC526 CRC64; MIWWFLISVN FLFFLIKSFF TPKATIYRND LNTNYTNSST EKYILGENYN LTSLKLKIDF GSLDTGTKII EYSKGIINIR SIQQYDYDSY MLTPCNSDIW WIYSFSDIIH IEKIGLVSLE HYASNFKVIE ILGSDVYPAT KWKKLGKIST NFTKSFELFN IYEYCKNYDE DNCWVKYLKF HVLSHHNLEN NYYCTLTHLQ IFASSGVDML SDKIYSDENI NKIEKDLQDN YMDQNNEQLY DQEIIENLEE LYENNESLDD TLSNTSNEKK SKTKTSTNDD NDKIHGSNIY YDNIKKNDEK KKKKKEKKST RTQAKSLDTK LIDKDLMNTK QIEKELLDTK LIENEFIHNK LFDTDMIEKE LMDTELIENE LMNYELFDKD TFFKENYFND EQQRTDESNV DQQNDMYVIK NNKDSMKGDY YIKKKKKKLV TDNTKDLNKC SSYKSSKRDK FFENIKRENH MDDQHNENIY INIKNNKSTH TYKKKNNHIF HKNVYYNILI VLYYLFNQHI KKELYHFNML KNKMQSSFFM NRFYITTRYK YLNKKYINFI NFIKVLKENH EQKLSEYYDN DIYQKLYIKQ EEQKKYIYNL IMNTQNKYEA LIKLLPFSSI QFLLKRKKWI PLLYTMYNKK EFVKNPYFSI INRGERKKRK KTYTIMNDDY IPLNTFYKKN HIIILHDLIY SLFFSNILCN GDLVCLFENK ILLSKVWISR SNIKHFITTK YIFDNYYYND KGINKLKNIY LSPHFYFFRN IFISHKFQNK IYLKFPYYSY YDKKGVLKYM HNNINMNDFL YTTTDVDINE FIPYSRYKNY EYEKERVHMI EREKKKKKMY KRKPYENYMI LYVIYRLKAN NIYHFEFIKY CTNLINNWSK KEIKKNKHLT LLYNLERKYL STFYNKMKCV DIYKSYKILK KKRVFVGAHH HIFNNEYNNN DNNSSNTCRR DKLINDIYKY LQKRNKLNYI PILHNNIKKK NEIQKIKKQQ IKMRENIFNR IILFFHKFVF RKLPYIYFLN FWIYNKNDMI FQMNDCLDIL FNYIFNNIKR IIIIKKKKKK NLYIFLEKLV TSYIKQNCFL KKNFSSLFLK KRNKKHNRHN NKRGYINMNK NYIILYLHYY MEKSKLHPFH KYLNNIEMSY YIKTANNKNY MSLSDIYFFS KNKEYYLTNY ITWYITNMIF LLLNRKEKNW KVDINKEDIY KNEKYFYDNE GVDEKEKINA YRKMNMIPNY LYNNDSYNDE YFYYDDDDDN ENENDNDNIG NNNFYNNNVL YRDHYQYDND NKNSNLNYYL TNRIIINPKK NFLENVCLTI GDMENIIYSN YYIDKFEKKD SKYDKCMKNK IKENIIYDED KDSHDSSSIK ILNEIKKNEE SNNKLIHDVY DIISEYDDNV SNKYRDDDKN VENNHKNTYK RNKQTSKNIS VITGKKERNE NIKVNQKRKE QIKKTKKINF IEEFDNKIID ENLNFPNEEN CIREEKIEEK SKNTRGHALL TLVDKIKTIE TKNNYLLTKL RDIIKITNNK TKIIYHMLSN FKILQNTISL LLKYIMINEK HMKDLNMNKK NSDTFYKILK EICLEQINDK SKNMDSLKYL CKYLQNFLYD EFERKYLFEK IRPGNYPMCD DDQNILLHNK NGYHQKNTDF IIKEKKKSLF NFLYYENHCH NDIFKTPLIY YNSQVDRLQT VYLNIFNFIV NLKVVKFLIY KFKYWKNIFK RYIINGLSYN INPHTFMNYS PKDNHTTNNN MNNTNNVMNN INNNNSNYHY ENWYQIFTNN FYVIFFYILF IIFIVNNFLC FMFYKHLSNK LNAYVKTCTC HNK // ID Q8JFV5_DANRE Unreviewed; 2576 AA. AC Q8JFV5; DT 01-OCT-2002, integrated into UniProtKB/TrEMBL. DT 01-OCT-2002, sequence version 1. DT 11-NOV-2015, entry version 110. DE SubName: Full=Novel protein with HECT-domain (Ubiquitin-transferase) {ECO:0000313|EMBL:CAD32862.1}; DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSDARP00000070818}; GN Name=hectd1 {ECO:0000313|Ensembl:ENSDARP00000070818, GN ECO:0000313|ZFIN:ZDB-GENE-030616-153}; GN Synonyms=OTTDARP00000001496 {ECO:0000313|EMBL:CAD32862.1}; GN ORFNames=dZ142B24.4-001 {ECO:0000313|EMBL:CAD32862.1}; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955; RN [1] {ECO:0000313|EMBL:CAD32862.1} RP NUCLEOTIDE SEQUENCE. RA Lloyd D.; RL Submitted (DEC-2004) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|Ensembl:ENSDARP00000070818} RP IDENTIFICATION. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000070818}; RG Ensembl; RL Submitted (FEB-2012) to UniProtKB. RN [3] {ECO:0000313|Ensembl:ENSDARP00000070818, ECO:0000313|Proteomes:UP000000437} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000070818, RC ECO:0000313|Proteomes:UP000000437}; RX PubMed=23594743; DOI=10.1038/nature12111; RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., RA Muffato M., Collins J.E., Humphray S., McLaren K., Matthews L., RA McLaren S., Sealy I., Caccamo M., Churcher C., Scott C., Barrett J.C., RA Koch R., Rauch G.J., White S., Chow W., Kilian B., Quintais L.T., RA Guerra-Assuncao J.A., Zhou Y., Gu Y., Yen J., Vogel J.H., Eyre T., RA Redmond S., Banerjee R., Chi J., Fu B., Langley E., Maguire S.F., RA Laird G.K., Lloyd D., Kenyon E., Donaldson S., Sehra H., RA Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M., RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J., RA Clee C., Oliver K., Clark R., Riddle C., Eliott D., Threadgold G., RA Harden G., Ware D., Mortimer B., Kerry G., Heath P., Phillimore B., RA Tracey A., Corby N., Dunn M., Johnson C., Wood J., Clark S., Pelan S., RA Griffiths G., Smith M., Glithero R., Howden P., Barker N., Stevens C., RA Harley J., Holt K., Panagiotidis G., Lovell J., Beasley H., RA Henderson C., Gordon D., Auger K., Wright D., Collins J., Raisen C., RA Dyer L., Leung K., Robertson L., Ambridge K., Leongamornlert D., RA McGuire S., Gilderthorp R., Griffiths C., Manthravadi D., Nichol S., RA Barker G., Whitehead S., Kay M., Brown J., Murnane C., Gray E., RA Humphries M., Sycamore N., Barker D., Saunders D., Wallis J., RA Babbage A., Hammond S., Mashreghi-Mohammadi M., Barr L., Martin S., RA Wray P., Ellington A., Matthews N., Ellwood M., Woodmansey R., RA Clark G., Cooper J., Tromans A., Grafham D., Skuce C., Pandian R., RA Andrews R., Harrison E., Kimberley A., Garnett J., Fosker N., Hall R., RA Garner P., Kelly D., Bird C., Palmer S., Gehring I., Berger A., RA Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M., RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M., RA Rudolph-Geiger S., Teucke M., Osoegawa K., Zhu B., Rapp A., Widaa S., RA Langford C., Yang F., Carter N.P., Harrow J., Ning Z., Herrero J., RA Searle S.M., Enright A., Geisler R., Plasterk R.H., Lee C., RA Westerfield M., de Jong P.J., Zon L.I., Postlethwait J.H., RA Nusslein-Volhard C., Hubbard T.J., Roest Crollius H., Rogers J., RA Stemple D.L.; RT "The zebrafish reference genome sequence and its relationship to the RT human genome."; RL Nature 496:498-503(2013). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BX324186; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BX465838; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL593845; CAD32862.1; -; Genomic_DNA. DR RefSeq; NP_001002504.2; NM_001002504.2. DR UniGene; Dr.78767; -. DR STRING; 7955.ENSDARP00000070818; -. DR Ensembl; ENSDART00000076344; ENSDARP00000070818; ENSDARG00000054213. DR GeneID; 100002034; -. DR KEGG; dre:100002034; -. DR CTD; 25831; -. DR ZFIN; ZDB-GENE-030616-153; hectd1. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR GeneTree; ENSGT00530000063470; -. DR HOGENOM; HOG000018061; -. DR HOVERGEN; HBG067533; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR OrthoDB; EOG7Z69BD; -. DR TreeFam; TF323674; -. DR NextBio; 20785271; -. DR PRO; PR:Q8JFV5; -. DR Proteomes; UP000000437; Chromosome 17. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR Gene3D; 1.25.10.10; -; 2. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 3. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 3. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 1: Evidence at protein level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000437}; KW Ligase {ECO:0000256|SAAS:SAAS00133783}; KW Proteomics identification {ECO:0000213|PeptideAtlas:Q8JFV5}; KW Reference proteome {ECO:0000313|Proteomes:UP000000437}; KW Transferase {ECO:0000313|EMBL:CAD32862.1}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. FT COILED 1245 1265 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 2576 AA; 285225 MW; A6E86545D1B49BC9 CRC64; MADVDPDTLL EWLQMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFETCPP RTFLPALCKI FLDESAPDNV LEVTARAITY YLDVSAECTR RIVGVDGAIK ALCNRLVVVE LNNRTSRDLA EQCVKVLELI CTRESGAVFE AGGLNCVLSF IRDSGHLVHK DTLHSAMAVV SRLCSKMEPQ DSSLETCVES LSSLLKHEDH QVSDGALRCF ASLADRFTRR GVDPAPLAKH GLTEELLSRM AAAGGTASGP SSTCKPGRTS SGAAPSAADS KLSNQVSTIV SLLSTLCRGS PLVTHDLLRS ELPDSMESAL QGDERCVLDT MRLVDLLLVL LFEGRKALPK STAGSTGRIP GLRRLDSSGE RSHRQLIDCI RSKDTDALID AIDTGAFEVN FMDDVGQTLL NWASAFGTQE MVEFLCERGA DVNRGQRSSS LHYAACFGRP QVAKTLLRHG ANPDLRDEDG KTPLDKARER GHSEVVAILQ SPGDWMCPVN KGDDKKKKDV NKEEEEGSEP KGDPEMAPIY LKRLLPVFAQ TFQQTMLPSI RKASLALIRK MVHYSSEVLL KEVCDSDAGH NLPTVLVEIT ATVLDQEDDD DGHLLALQII RDLVDKGGDV FLDQLARLGV INKVSTLAGP TSDDENEEEA KPEKDDEPQE DAKEIQQGKP YHWRDWSIIR GRDCLYIWSD AAALELSNGS NGWFRFILDG KLATMYSSGS PEGGSDSSES RSEFLEKLQR ARSQVKPVTA SQPILSTQGP TKLTVGNWSL TCLKEGEIAI HNSDGQQATI LKEDLPGFVF ESNRGTKHSF TAETSLGSEF VTGWTGKRGR KLKSKLEKTK QKVKTMARDL YDDHFKAVES MPRGVVVTLR NIATQLESAW ELHTNRQCIE GENTWRDLMK TALENLIVVL KDENTISPYE MCSSGLVQAL FTVLSNSVEL DIKHDCKPLM ERINVFKTAF TENEDDESRP AVALIRKLIA VLESIERLPL HLYDTPGSTY NLQILTRRLR FRLERAPGET ALIDRTGRML KMEPLATVES LEQYLLKMVA KQWYDFDRSS FIFVRKLREG QNFTFRHQHD FDENGIVYWI GTNAKTAYEW VNPAAYGLVV VTSSEGRNLP YGRLEDILSR DSSALNCHTN DDKNAWFAID LGLWFVPSAY TLRHARGYGR SALRNWVFQV SKDGQNWMTL YTHVDDSSLN EPGSTATWPL DPSKDEKQGW RHIRIKQMGK NASGQTHYLS LSGLEIYGTV TGVCEDQLGK AVKEAEANLR RQRRLFRSQV MKYIVPGARV VRGIDWKWRD QDGNPAGEGT VTGEAHNGWI DVTWDAGGSN SYRMGAEGKF DLKLAPGYDP ESAPSPKPVS STVAGTPQSW SSLVKNNCPD KGGSSSTAGA SSSSRKGSSS SVCSVASSSD ISLSSTRVER RVESLFEQGV GVIGGPPGAE GQEPIVVLSS AEAGSASSTS TLTADTGSES ERKTPGPDGT RQSAESTAIS MGIVSVSSPD VSSVSESSSK DAASQRPLCS ATSARLSVSS LLAAGAPMSS SASVPNLSSR EASLMESFVR RAPNMSRTNA TNNMNLSRSS SDNNTNTLGR NVMSTATSPL MGAQSFPNLT TTGTTSTVTM STSIVTSSNN VATATTGLSV GQLLSNTLTT SLTSTSSESD TDFLDSCRAN TLLAELDDEE DLPEPDDDDD ENEDDNQEEQ EYEEVLVRSR VNLGYHVHIH REEEEYETKG GRRRTWDDDF VLKRQFSALV PAFDPRPGRT NVQQTTDLEI PPPGTPRSEV QEEVECAPSP RLALILKVAG LGTTREVELP LTNYKSTIFY YVQKLLQLSC NGAIKPDKLR RIWEPTYTIM YRELKDSDKE RESGKMGCWS VEHVEQYLGT DELPKNDLIT YMQKNADSTF LRHWKLTGSN KSIRKNRNCS QLIAAYKDFC ERGCRSSGLS SGTLSTTQSC DILSAAREQA QAKAGSGQSA CSVEDVLQLL RILFTIGGEP TSGRTLQEDV EELQFNASPE EFTSKKITTK ILQQIEEPLA LASGALPDWC EQLTSKCPFL IPFETRQLYF TCTAFGASRA IVWLQNRREA TMERSRPSTT VRRDDPGEFR VGRLKHERVK VPRGESMMEW AESVMQIHAD RKSVLEVEFQ GEEGTGLGPT LEFYALVAAE FQRTSLGIWL CDDDFPDDES RQVDLGGGLK PPGYYVQRSC GLFPAPFPQD SDELERITKL FLFLGIFLAK CIQDNRLVDL PISQPFFKLL CMGDIKSNMS KLLYQTRGES DCHFSEIQSE ASTEEGQDTY SVGSFDEDSK SEFILDPPKP KPPAWYHGIL TWEDFELVNP HRALFLKELK ALSVKRRQIL GNKSLSEDEK NTRLQDLMLK NPMGSGPPLC VEDLGLNFQF CPSSKVHGFS SVDLKPNGED EMVTMDNAEE YVELMFDFCM HTGIQKQMEA FREGFNKVFP MEKLSSFSHK EVQMILCGNQ SPSWTAEDIV NYTEPKLGYT RDSPGFLRFV RVLCGMSSDE RKAFLQFTTG CSTLPPGGLA NLHPRLTIVR KVDATDASYP SVNTCVHYLK LPEYSSEEIM RERLLAATME KGFHLN // ID Q8LJA6_ORYSJ Unreviewed; 567 AA. AC Q8LJA6; DT 01-OCT-2002, integrated into UniProtKB/TrEMBL. DT 01-OCT-2002, sequence version 1. DT 11-NOV-2015, entry version 61. DE SubName: Full=Putative uncharacterized protein P0025A05.34 {ECO:0000313|EMBL:BAD53432.1}; DE SubName: Full=Putative uncharacterized protein P0518F01.5 {ECO:0000313|EMBL:BAB91704.1}; GN Name=P0518F01.5 {ECO:0000313|EMBL:BAB91704.1}; GN Synonyms=P0025A05.34 {ECO:0000313|EMBL:BAD53432.1}; OS Oryza sativa subsp. japonica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39947 {ECO:0000313|Proteomes:UP000000763}; RN [1] {ECO:0000313|EMBL:BAB91704.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12447438; DOI=10.1038/nature01184; RA Sasaki T., Matsumoto T., Yamamoto K., Sakata K., Baba T., Katayose Y., RA Wu J., Niimura Y., Cheng Z., Nagamura Y., Antonio B.A., Kanamori H., RA Hosokawa S., Masukawa M., Arikawa K., Chiden Y., Hayashi M., RA Okamoto M., Ando T., Aoki H., Arita K., Hamada M., Harada C., RA Hijishita S., Honda M., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., RA Ikeno M., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K., RA Karasawa W., Katagiri S., Kikuta A., Kobayashi N., Kono I., RA Machita K., Maehara T., Mizuno H., Mizubayashi T., Mukai Y., RA Nagasaki H., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M., RA Namiki N., Negishi M., Ohta I., Ono N., Saji S., Sakai K., Shibata M., RA Shimokawa T., Shomura A., Song J., Takazaki Y., Terasawa K., Tsuji K., RA Waki K., Yamagata H., Yamane H., Yoshiki S., Yoshihara R., Yukawa K., RA Zhong H., Iwama H., Endo T., Ito H., Hahn J.H., Kim H.-I., Eun M.-Y., RA Yano M., Jiang J., Gojobori T.; RT "The genome sequence and structure of rice chromosome 1."; RL Nature 420:312-316(2002). RN [2] {ECO:0000313|Proteomes:UP000000763} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=16100779; DOI=10.1038/nature03895; RG International rice genome sequencing project (IRGSP); RT "The map-based sequence of the rice genome."; RL Nature 436:793-800(2005). RN [3] {ECO:0000313|Proteomes:UP000000763} RP GENOME REANNOTATION. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=18089549; DOI=10.1093/nar/gkm978; RG The rice annotation project (RAP); RT "The rice annotation project database (RAP-DB): 2008 update."; RL Nucleic Acids Res. 36:D1028-D1033(2008). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AP003278; BAB91704.1; -; Genomic_DNA. DR EMBL; AP003504; BAD53432.1; -; Genomic_DNA. DR STRING; 39947.LOC_Os01g41600.1; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR Proteomes; UP000000763; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000763}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000763}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 505 524 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 548 566 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 567 AA; 61399 MW; A84C80857C2D506B CRC64; MSKKRREGGG GGNGGCDPPA VTDALSMDGG LREVSLSVVF SVWCLLFLLR SQFLHSQTDP SDFYDDVEDG MRENYCKVMP LEAYIFPTEY NASAAAPTCQ PSLHPPDQPQ QETDHRSLEP FNNTTGGKSS AEAAALDELD EFRSRILQGK AENGRVPDGA TPAAHRLEPS GAEYNYAAAS KGAKVLAHNR EAKGAANILG GDKDRYLRNP CSADDKFVDV ELSEETLVRT IGLANLEHYS SNFRDFELYG SPSYPAPAEE WELLGRFTAD NAKHAQRFVL PDPRWTRYLR LRLATHYGSG FYCILSYLEV YGIDAVEQML QEIISGSGAD TDASAAAKAE EGGDGGTLRN DTAQVNARLD GVGGGGGSAA GRNDSAGDGA GAKNNGSRMT VAGDGKPAAA GRFHGDAVLK IMMQKMRSLE LGLSTLEDYT KALNHRYGAK LPDLHTGLSQ TTMALDRMKA DVRDLVEWKG NVKALRILDE NCCVGCRSNV EEMRSIQETM QNKELAVLSI SLFFACLALF KLACDRVLFL FTRKGAAAAE RMCGASKGWI LVLASSSFTT FLVLLYN // ID Q8LR16_ORYSJ Unreviewed; 625 AA. AC Q8LR16; DT 01-OCT-2002, integrated into UniProtKB/TrEMBL. DT 01-OCT-2002, sequence version 1. DT 11-NOV-2015, entry version 71. DE SubName: Full=Membrane protein CH1-like {ECO:0000313|EMBL:BAB92455.1}; DE SubName: Full=Uncharacterized protein; DE SubName: Full=cDNA clone:J023148N24, full insert sequence {ECO:0000313|EMBL:BAG93197.1}; DE SubName: Full=cDNA clone:J033060D18, full insert sequence {ECO:0000313|EMBL:BAH00590.1}; GN Name=P0698A10.35-1 {ECO:0000313|EMBL:BAB92455.1}; ORFNames=OsJ_04271; OS Oryza sativa subsp. japonica (Rice). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; BOP clade; OC Oryzoideae; Oryzeae; Oryzinae; Oryza. OX NCBI_TaxID=39947 {ECO:0000313|Proteomes:UP000000763}; RN [1] {ECO:0000313|EMBL:BAB92455.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12447438; DOI=10.1038/nature01184; RA Sasaki T., Matsumoto T., Yamamoto K., Sakata K., Baba T., Katayose Y., RA Wu J., Niimura Y., Cheng Z., Nagamura Y., Antonio B.A., Kanamori H., RA Hosokawa S., Masukawa M., Arikawa K., Chiden Y., Hayashi M., RA Okamoto M., Ando T., Aoki H., Arita K., Hamada M., Harada C., RA Hijishita S., Honda M., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., RA Ikeno M., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K., RA Karasawa W., Katagiri S., Kikuta A., Kobayashi N., Kono I., RA Machita K., Maehara T., Mizuno H., Mizubayashi T., Mukai Y., RA Nagasaki H., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M., RA Namiki N., Negishi M., Ohta I., Ono N., Saji S., Sakai K., Shibata M., RA Shimokawa T., Shomura A., Song J., Takazaki Y., Terasawa K., Tsuji K., RA Waki K., Yamagata H., Yamane H., Yoshiki S., Yoshihara R., Yukawa K., RA Zhong H., Iwama H., Endo T., Ito H., Hahn J.H., Kim H.-I., Eun M.-Y., RA Yano M., Jiang J., Gojobori T.; RT "The genome sequence and structure of rice chromosome 1."; RL Nature 420:312-316(2002). RN [2] {ECO:0000313|EMBL:BAG93197.1} RP NUCLEOTIDE SEQUENCE. RA Kikuchi S., Satoh K., Nagata T., Kawagashira N., Doi K., Kishimoto N., RA Yazaki J., Ishikawa M., Yamada H., Ooka H., Hotta I., Kojima K., RA Namiki T., Ohneda E., Yahagi W., Suzuki K., Li C., Ohtsuki K., RA Shishiki T., Otomo Y., Murakami K., Iida Y., Sugano S., Fujimura T., RA Suzuki Y., Tsunoda Y., Kurosaki T., Kodama T., Masuda H., RA Kobayashi M., Xie Q., Lu M., Narikawa R., Sugiyama A., Mizuno K., RA Yokomizo S., Niikura J., Ikeda R., Ishibiki J., Kawamata M., RA Yoshimura A., Miura J., Kusumegi T., Oka M., Ryu R., Ueda M., RA Matsubara K., Kawai J., Carninci P., Adachi J., Aizawa K., Arakawa T., RA Fukuda S., Hara A., Hashidume W., Hayatsu N., Imotani K., Ishii Y., RA Itoh M., Kagawa I., Kondo S., Konno H., Miyazaki A., Osato N., Ota Y., RA Saito R., Sasaki D., Sato K., Shibata K., Shinagawa A., Shiraki T., RA Yoshino M., Hayashizaki Y.; RT "Collection, Mapping, and Annotation of Over 28,000 cDNA Clones from RT japonica Rice."; RL Science 301:376-379(2003). RN [3] {ECO:0000313|EMBL:BAH00590.1} RP NUCLEOTIDE SEQUENCE. RA Adachi J., Aizawa K., Akimura T., Arakawa T., Carninci P., Doi K., RA Fujimura T., Fukuda S., Hanagaki T., Hara A., Hashizume W., RA Hayashida K., Hayashizaki Y., Hayatsu N., Hiramoto K., Hiraoka T., RA Hori F., Hotta I., Iida J., Iida Y., Ikeda R., Imamura K., Imotani K., RA Ishibiki J., Ishii Y., Ishikawa M., Itoh M., Kagawa I., Kanagawa S., RA Katoh H., Kawagashira N., Kawai J., Kawamata M., Kikuchi S., RA Kishikawa-Hirozane T., Kishimoto N., Kobayashi M., Kodama T., RA Kojima K., Kojima Y., Kondo S., Konno H., Kouda M., Koya S., RA Kurihara C., Kurosaki T., Kusumegi T., Li C., Lu M., Masuda H., RA Matsubara K., Matsuyama T., Miura J., Miyazaki A., Mizuno K., RA Murakami K., Murata M., Nagata T., Nakahama Y., Nakamura M., RA Namiki T., Narikawa R., Niikura J., Nishi K., Nomura K., Numasaki R., RA Ohneda E., Ohno M., Ohtsuki K., Oka M., Ooka H., Osato N., Ota Y., RA Otomo Y., Ryu R., Saitoh H., Sakai C., Sakai K., Sakazume N., Sano H., RA Sasaki D., Sato K., Satoh K., Shibata K., Shinagawa A., Shiraki T., RA Shishiki T., Sogabe Y., Sugano S., Sugiyama A., Suzuki K., Suzuki Y., RA Tagami M., Tagami-Takeda Y., Tagawa A., Takahashi F., RA Takaku-Akahira S., Tanaka T., Tomaru A., Toya T., Tsunoda Y., Ueda M., RA Waki K., Xie Q., Yahagi W., Yamada H., Yamamoto M., Yasunishi A., RA Yazaki J., Yokomizo S., Yoshimura A.; RT "Collection, mapping, and annotation of 28K full-length cDNA clones RT from japonica rice."; RL Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases. RN [4] {ECO:0000313|Proteomes:UP000000763} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=16100779; DOI=10.1038/nature03895; RG International rice genome sequencing project (IRGSP); RT "The map-based sequence of the rice genome."; RL Nature 436:793-800(2005). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038; RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S., RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., RA Cong L., Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., RA Wang J., Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., RA Wang J., Wang X., Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., RA Zhang Z., Bao J., Han Y., Dong L., Ji J., Chen P., Wu S., Liu J., RA Xiao Y., Bu D., Tan J., Yang L., Ye C., Zhang J., Xu J., Zhou Y., RA Yu Y., Zhang B., Zhuang S., Wei H., Liu B., Lei M., Yu H., Li Y., RA Xu H., Wei S., He X., Fang L., Zhang Z., Zhang Y., Huang X., Su Z., RA Tong W., Li J., Tong Z., Li S., Ye J., Wang L., Fang L., Lei T., RA Chen C.-S., Chen H.-C., Xu Z., Li H., Huang H., Zhang F., Xu H., RA Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q., Li W., RA Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J., Gao L., RA Zheng W., Hao B., Liu S.-M., Wang W., Yuan L., Cao M., McDermott J., RA Samudrala R., Wang J., Wong G.K.-S., Yang H.; RT "The genomes of Oryza sativa: a history of duplications."; RL PLoS Biol. 3:266-281(2005). RN [6] {ECO:0000313|Proteomes:UP000000763} RP GENOME REANNOTATION. RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763}; RX PubMed=18089549; DOI=10.1093/nar/gkm978; RG The rice annotation project (RAP); RT "The rice annotation project database (RAP-DB): 2008 update."; RL Nucleic Acids Res. 36:D1028-D1033(2008). RN [7] RP NUCLEOTIDE SEQUENCE. RA Wang J., Li R., Fan W., Huang Q., Zhang J., Zhou Y., Hu Y., Zi S., RA Li J., Ni P., Zheng H., Zhang Y., Zhao M., Hao Q., McDermott J., RA Samudrala R., Kristiansen K., Wong G.K.-S.; RT "Improved gene annotation of the rice (Oryza sativa) genomes."; RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AP003297; BAB92455.1; -; Genomic_DNA. DR EMBL; AK072910; BAG93197.1; -; mRNA. DR EMBL; AK121648; BAH00590.1; -; mRNA. DR EMBL; CM000138; EEE55756.1; -; Genomic_DNA. DR STRING; 39947.LOC_Os01g65520.1; -. DR EnsemblPlants; OS01T0876400-02; OS01T0876400-02; OS01G0876400. DR EnsemblPlants; OS01T0876400-03; OS01T0876400-03; OS01G0876400. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR OMA; YGSASYC; -. DR Proteomes; UP000000763; Chromosome 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000763}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000763}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 42 60 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 566 586 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 607 624 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 496 523 {ECO:0000256|SAM:Coils}. FT COILED 545 565 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 625 AA; 69888 MW; C42823A922CF635D CRC64; MQRSRRALLK RKAAAAAKEE EEEAGVGVAT AAAAGRRRRR RLYGFSVSLV VACWVVLLLL NPLVGHGNGQ RDEGIFADEG SSDPSFDSVE PTLSEGSVDS VVQQENGENH ALPGDSCAKP DENHVLSEET LLEKDQLCSN DEAQGDSMDA LPKDNVDQGE NLPRTDDDSV VHPEGEVESE GVPRPARLSR VVPPGLDEFK TRAIAERGKG VPSGQPGNVI HRREPSGKLY NYASAAKGAK VLEFNKEAKG ASNILDKDKD KYLRNPCSAE GKFVIIELSE ETLVDTIAIA NFEHYSSNLK EFEMLSSLNY PTDSWETLGR FTVANAKIAQ NFTFPEPKWA RYLKLNLLSH YGSEFYCTLS MLEVYGMDAV EKMLENLIPV ENKRLEPDDK MKEPVDQQTQ LKEPTEGKES SHEPLDEDEF ELEDDKLNGD SSKNGAHDQV TETRPIQAGR IPGDTVLKVL MQKVQSLDVS FSVLERYLEE LNSRYGQIFK DFDADIDTKD ALLEKIKLEL KHLERSKDDF AKEIEGILSW KLVASSQLNQ LLLDNVIIRS ELERFREKQA DLENRSFAVI FLSFVFGCLA IAKLSIGMIF NTCRLYNFEK FDRVKSGWLV LLFSSCIIAS ILIIQ // ID Q8SQT0_ENCCU Unreviewed; 264 AA. AC Q8SQT0; DT 01-JUN-2002, integrated into UniProtKB/TrEMBL. DT 01-JUN-2002, sequence version 1. DT 11-NOV-2015, entry version 60. DE SubName: Full=SPINDLE POLE BODY ASSOCIATED PROTEIN {ECO:0000313|EMBL:CAD26069.1}; GN OrderedLocusNames=ECU11_1590 {ECO:0000313|EMBL:CAD26069.1}; OS Encephalitozoon cuniculi (strain GB-M1) (Microsporidian parasite). OC Eukaryota; Fungi; Microsporidia; Unikaryonidae; Encephalitozoon. OX NCBI_TaxID=284813 {ECO:0000313|EMBL:CAD26069.1, ECO:0000313|Proteomes:UP000000819}; RN [1] {ECO:0000313|EMBL:CAD26069.1, ECO:0000313|Proteomes:UP000000819} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=GB-M1 {ECO:0000313|EMBL:CAD26069.1, RC ECO:0000313|Proteomes:UP000000819}; RX PubMed=11719806; DOI=10.1038/35106579; RA Katinka M.D., Duprat S., Cornillot E., Metenier G., Thomarat F., RA Prensier G., Barbe V., Peyretaillade E., Brottier P., Wincker P., RA Delbac F., El Alaoui H., Peyret P., Saurin W., Gouy M., RA Weissenbach J., Vivares C.P.; RT "Genome sequence and gene compaction of the eukaryote parasite RT Encephalitozoon cuniculi."; RL Nature 414:450-453(2001). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AL590450; CAD26069.1; -; Genomic_DNA. DR RefSeq; NP_586465.1; NM_001042298.1. DR EnsemblFungi; CAD26069; CAD26069; CAD26069. DR GeneID; 860119; -. DR KEGG; ecu:ECU11_1590; -. DR EuPathDB; MicrosporidiaDB:ECU11_1590; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000149206; -. DR InParanoid; Q8SQT0; -. DR OrthoDB; EOG7SR4Z2; -. DR Proteomes; UP000000819; Chromosome XI. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000819}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Reference proteome {ECO:0000313|Proteomes:UP000000819}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 58 76 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 85 105 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 264 AA; 30218 MW; 2A47081653A1B946 CRC64; MNRRDRLRVK RTPDSTLNLG VDEMVMLQTP APRARANKSV EPVETNRYEL GARNIRGYPV YIAIAISYAL FFYMAAKRPI DSMVMMNLME EISILREESA RMSRQMETMK STKEVNYAKI EEGARIRIED TSQLFLYGFL GFRKHKDPAT VFDENVGVGE CLTFKGSSCR FSVDFDKEVE ICKLGIYHPV TRDTSSAVQE FEVFSQGPDG HLLVGEFKYD PDVCGFQTFE FEGRTVKSVE FVVKSNGGNK KFTCIYKLYL FGNK // ID Q8UWL8_TAKRU Unreviewed; 314 AA. AC Q8UWL8; DT 01-MAR-2002, integrated into UniProtKB/TrEMBL. DT 01-MAR-2002, sequence version 1. DT 11-NOV-2015, entry version 45. DE SubName: Full=SUN-like 1 {ECO:0000313|EMBL:AAL32174.1}; GN Name=SUNL1 {ECO:0000313|EMBL:AAL32174.1}; OS Takifugu rubripes (Japanese pufferfish) (Fugu rubripes). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata; OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; OC Takifugu. OX NCBI_TaxID=31033 {ECO:0000313|EMBL:AAL32174.1}; RN [1] {ECO:0000313|EMBL:AAL32174.1} RP NUCLEOTIDE SEQUENCE. RX PubMed=11707075; DOI=10.1006/geno.2001.6648; RA Bagheri-Fam S., Ferraz C., Demaille J., Scherer G., Pfeifer D.; RT "Comparative genomics of the SOX9 region in human and Fugu rubripes: RT conservation of short regulatory sequence elements within large RT intergenic regions."; RL Genomics 78:73-82(2001). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AF329945; AAL32174.1; -; Genomic_DNA. DR HOGENOM; HOG000213331; -. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 81 101 Helical. {ECO:0000256|SAM:Phobius}. SQ SEQUENCE 314 AA; 35460 MW; F69048E7D2BD2AE4 CRC64; MGYYDKDGNP TICYRDSVSR VDKRRQNRAI TSDTSDRTDR DSYDIKRFLN YLKESISESS SSGSCMSSNS KDTVITYKTK WLIFSSLVVL ALMLPVISYH VDINSIERPT SYDLVPTSPV CHKCTNQSFG NVMMRIQKLQ TELHYLKEIL NYQLTDANFW TNFALESDGA KVDKKVGIQL FSKVVPAAVI GGQHPPIPGN CWSFPGSHGN LFIELSHTIT VSHVTLDHVL KSVSPNDTIP SAPRHFTVYG LQSLDDKAVH LGKFMYDLEG NPSQTFAVKV HDSIHFKYID LQIESNYGHA DYTCLYGFRV HGRI // ID Q9C9H3_ARATH Unreviewed; 459 AA. AC Q9C9H3; DT 01-JUN-2001, integrated into UniProtKB/TrEMBL. DT 01-JUN-2001, sequence version 1. DT 11-NOV-2015, entry version 74. DE SubName: Full=Putative uncharacterized protein F26A9.26 {ECO:0000313|EMBL:AAG51822.1}; GN Name=F26A9.26 {ECO:0000313|EMBL:AAG51822.1}; GN OrderedLocusNames=At1g71360 {ECO:0000313|TAIR:AT1G71360}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702; RN [1] {ECO:0000313|EMBL:AAG51822.1} RP NUCLEOTIDE SEQUENCE. RA Lin X., Kaul S., Town C.D., Benito M., Creasy T.H., Haas B.J., Wu D., RA Maiti R., Ronning C.M., Koo H., Fujii C.Y., Utterback T.R., RA Barnstead M.E., Bowman C.L., White O., Nierman W.C., Fraser C.M.; RT "Arabidopsis thaliana chromosome 1 BAC F26A9 genomic sequence."; RL Submitted (NOV-1999) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:AAG51822.1} RP NUCLEOTIDE SEQUENCE. RA Town C.D., Kaul S.; RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC016163; AAG51822.1; -; Genomic_DNA. DR ProteinModelPortal; Q9C9H3; -. DR STRING; 3702.AT1G71360.1; -. DR PaxDb; Q9C9H3; -. DR PRIDE; Q9C9H3; -. DR TAIR; AT1G71360; -. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR HOGENOM; HOG000077411; -. DR PhylomeDB; Q9C9H3; -. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 409 433 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 439 457 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 338 407 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 459 AA; 52328 MW; 019DA9E3F25DC191 CRC64; MKQSEINNNT VPGNDTETTG SKLDQLSRAV PLGLDEFKSR ASNSRDKSLS GQVTGVIHRM EPGGKEYNYA AASKGAKVLS SNKEAKGASS IICRDKDKYL RNPCSTEGKF VVIELSEETL VNTIKIANFE HYSSNLKDFE ILGTLVYPTD TWVHLGNFTA LNMKHEQNFT FADPKWVRYL KLNLLSHYGS EFYCTLSLLE VYGVDAVERM LEDLISIQDK NILKLQEGDT EQKEKKTMQA KESFESDEDK SKQKEKEQEA SPENAVVKDE VSLEKRKLPD PVEEIKHQPG SRMPGDTVLK ILMQKIRSLD VSLSVLESYL EERSLKYGMI FKEMDLEASK REKEVETMRL EVEGMKEREE NTKKEAMEMR KWRMRVETEL EKAENEKEKV KERLEQVLER LEWMEKKGVV VFTICVGFGT IAVVAVVFGM GIVRAEKQGG LAWLLLLISS TFVMFILSL // ID SUN1_ARATH Reviewed; 471 AA. AC Q9FF75; DT 29-APR-2015, integrated into UniProtKB/Swiss-Prot. DT 01-MAR-2001, sequence version 1. DT 11-NOV-2015, entry version 95. DE RecName: Full=Protein SAD1/UNC-84 domain protein 1 {ECO:0000303|PubMed:19807882}; DE Short=AtSUN1 {ECO:0000303|PubMed:19807882}; GN Name=SUN1 {ECO:0000303|PubMed:19807882}; GN OrderedLocusNames=At5g04990 {ECO:0000312|TAIR:AT5G04990}; GN ORFNames=MUG13.15 {ECO:0000312|EMBL:BAB11521.1}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702 {ECO:0000312|Proteomes:UP000006548}; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Columbia; RX PubMed=9330910; DOI=10.1093/dnares/4.3.215; RA Sato S., Kotani H., Nakamura Y., Kaneko T., Asamizu E., Fukami M., RA Miyajima N., Tabata S.; RT "Structural analysis of Arabidopsis thaliana chromosome 5. I. Sequence RT features of the 1.6 Mb regions covered by twenty physically assigned RT P1 clones."; RL DNA Res. 4:215-230(1997). RN [2] RP GENOME REANNOTATION. RC STRAIN=cv. Columbia; RG The Arabidopsis Information Resource (TAIR); RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases. RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC STRAIN=cv. Columbia; RX PubMed=14593172; DOI=10.1126/science.1088305; RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., RA Southwick A.M., Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., RA Karlin-Newmann G., Liu S.X., Lam B., Sakano H., Wu T., Yu G., RA Miranda M., Quach H.L., Tripp M., Chang C.H., Lee J.M., Toriumi M.J., RA Chan M.M., Tang C.C., Onodera C.S., Deng J.M., Akiyama K., Ansari Y., RA Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P., RA Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M., Hansen N.F., RA Hayashizaki Y., Johnson-Hopson C., Hsuan V.W., Iida K., Karnes M., RA Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J., RA Kamiya A., Meyers C., Nakajima M., Narusaka M., Seki M., Sakurai T., RA Satou M., Tamse R., Vaysberg M., Wallender E.K., Wong C., Yamamura Y., RA Yuan S., Shinozaki K., Davis R.W., Theologis A., Ecker J.R.; RT "Empirical analysis of transcriptional activity in the Arabidopsis RT genome."; RL Science 302:842-846(2003). RN [4] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-63, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=19376835; DOI=10.1104/pp.109.138677; RA Reiland S., Messerli G., Baerenfaller K., Gerrits B., Endler A., RA Grossmann J., Gruissem W., Baginsky S.; RT "Large-scale Arabidopsis phosphoproteome profiling reveals novel RT chloroplast kinase substrates and phosphorylation networks."; RL Plant Physiol. 150:889-903(2009). RN [5] RP TISSUE SPECIFICITY, SUBCELLULAR LOCATION, AND SUBUNIT. RX PubMed=19807882; DOI=10.1111/j.1365-313X.2009.04038.x; RA Graumann K., Runions J., Evans D.E.; RT "Characterization of SUN-domain proteins at the higher plant nuclear RT envelope."; RL Plant J. 61:134-144(2010). RN [6] RP FUNCTION, DISRUPTION PHENOTYPE, SUBCELLULAR LOCATION, AND TISSUE RP SPECIFICITY. RX PubMed=21294795; DOI=10.1111/j.1365-313X.2011.04523.x; RA Oda Y., Fukuda H.; RT "Dynamics of Arabidopsis SUN proteins during mitosis and their RT involvement in nuclear shaping."; RL Plant J. 66:629-641(2011). RN [7] RP FUNCTION, INTERACTION WITH WIP1; WIP2 AND WIP3, SUN DOMAIN, AND RP SUBCELLULAR LOCATION. RC STRAIN=cv. Columbia; RX PubMed=22270916; DOI=10.1083/jcb.201108098; RA Zhou X., Graumann K., Evans D.E., Meier I.; RT "Novel plant SUN-KASH bridges are involved in RanGAP anchoring and RT nuclear shape determination."; RL J. Cell Biol. 196:203-211(2012). RN [8] RP ACETYLATION [LARGE SCALE ANALYSIS] AT SER-2, CLEAVAGE OF INITIATOR RP METHIONINE [LARGE SCALE ANALYSIS], AND IDENTIFICATION BY MASS RP SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=22223895; DOI=10.1074/mcp.M111.015131; RA Bienvenut W.V., Sumpton D., Martinez A., Lilla S., Espagne C., RA Meinnel T., Giglione C.; RT "Comparative large-scale characterisation of plant vs. mammal proteins RT reveals similar and idiosyncratic N-alpha acetylation features."; RL Mol. Cell. Proteomics 11:M111.015131-M111.015131(2012). RN [9] RP FUNCTION, SUBCELLULAR LOCATION, AND INTERACTION WITH LINC1. RX PubMed=24667841; DOI=10.1371/journal.pone.0093406; RA Graumann K.; RT "Evidence for LINC1-SUN associations at the plant nuclear periphery."; RL PLoS ONE 9:E93406-E93406(2014). CC -!- FUNCTION: Component of SUN-protein-containing multivariate CC complexes also called LINC complexes which link the nucleoskeleton CC and cytoskeleton by providing versatile outer nuclear membrane CC attachment sites for cytoskeletal filaments (PubMed:24667841). CC Required for the maintenance and/or formation of polarized nuclear CC shape in root hairs (PubMed:21294795). Modulates the anchoring and CC mobility of WIP proteins and RANGAP1 in the nuclear envelope (NE) CC (PubMed:22270916). {ECO:0000250|UniProtKB:Q8BJS4, CC ECO:0000269|PubMed:21294795, ECO:0000269|PubMed:22270916, CC ECO:0000269|PubMed:24667841}. CC -!- SUBUNIT: Forms homomers (e.g. dimers, trimers and tetramers) and CC heteromers with SUN2 (PubMed:19807882). Core component of the LINC CC complex which is composed of inner nuclear membrane SUN domain- CC containing proteins coupled to outer nuclear membrane WIP CC proteins, the nucleoskeletal CRWN/LINC proteins, and, possibly, CC KAKU4. Interacts with LINC1, WIP1, WIP2 and WIP3 at the nuclear CC envelope (NE) (PubMed:22270916, PubMed:24667841). CC {ECO:0000269|PubMed:19807882, ECO:0000269|PubMed:22270916, CC ECO:0000269|PubMed:24667841}. CC -!- SUBCELLULAR LOCATION: Nucleus inner membrane CC {ECO:0000269|PubMed:19807882, ECO:0000269|PubMed:21294795, CC ECO:0000269|PubMed:24667841}; Single-pass type II membrane protein CC {ECO:0000255}. Cytoplasm, cytoskeleton, phragmoplast CC {ECO:0000269|PubMed:21294795}. Endoplasmic reticulum membrane CC {ECO:0000269|PubMed:21294795}; Single-pass type II membrane CC protein {ECO:0000255}. Nucleus envelope CC {ECO:0000269|PubMed:22270916}. Note=Dynamic localization during CC mitosis, tightly coupled with nuclear envelope (NE) dynamics. NE CC re-formation during metaphase is temporally and spatially CC coordinated with plant-specific microtubule structures such as CC phragmoplasts. During anaphase, after NE breakdown (NEBD), CC predominantly localized with the endoplasmic reticulum, in the CC outside of the segregated chromosomes and not in between CC segregated chromosomes. {ECO:0000269|PubMed:21294795}. CC -!- TISSUE SPECIFICITY: Expressed in roots, hypocotyls, cotyledons and CC leaves and inflorescences. {ECO:0000269|PubMed:19807882, CC ECO:0000269|PubMed:21294795}. CC -!- DOMAIN: The SUN domain may play a role in the nuclear anchoring CC and/or migration (By similarity). The SUN domain is required for CC interactions with WIP proteins (PubMed:22270916). CC {ECO:0000250|UniProtKB:O94901, ECO:0000269|PubMed:22270916}. CC -!- DISRUPTION PHENOTYPE: No visible phenotype. When associated with CC SUN2 disruption, abnormal nuclear shape, rounded instead of CC elongated, in some cells (e.g. mature root hairs). CC {ECO:0000269|PubMed:21294795}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AB005245; BAB11521.1; -; Genomic_DNA. DR EMBL; CP002688; AED90813.1; -; Genomic_DNA. DR EMBL; AY056433; AAL08289.1; -; mRNA. DR EMBL; AY113173; AAM47476.1; -; mRNA. DR RefSeq; NP_196118.1; NM_120581.4. DR UniGene; At.8378; -. DR ProteinModelPortal; Q9FF75; -. DR SMR; Q9FF75; 265-449. DR IntAct; Q9FF75; 2. DR STRING; 3702.AT5G04990.1; -. DR PaxDb; Q9FF75; -. DR EnsemblPlants; AT5G04990.1; AT5G04990.1; AT5G04990. DR GeneID; 830381; -. DR KEGG; ath:AT5G04990; -. DR TAIR; AT5G04990; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000237750; -. DR KO; K19347; -. DR OMA; YETEMAF; -. DR PhylomeDB; Q9FF75; -. DR PRO; PR:Q9FF75; -. DR Proteomes; UP000006548; Chromosome 5. DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-KW. DR GO; GO:0005783; C:endoplasmic reticulum; IDA:TAIR. DR GO; GO:0005789; C:endoplasmic reticulum membrane; IDA:UniProtKB. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0005635; C:nuclear envelope; IDA:UniProtKB. DR GO; GO:0005637; C:nuclear inner membrane; IEA:UniProtKB-SubCell. DR GO; GO:0009524; C:phragmoplast; IDA:UniProtKB. DR GO; GO:0043495; F:protein anchor; IDA:UniProtKB. DR GO; GO:0006997; P:nucleus organization; IMP:UniProtKB. DR GO; GO:0051291; P:protein heterooligomerization; IDA:UniProtKB. DR GO; GO:0051260; P:protein homooligomerization; IDA:UniProtKB. DR GO; GO:0090435; P:protein localization to nuclear envelope; IDA:UniProtKB. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Acetylation; Coiled coil; Complete proteome; Cytoplasm; Cytoskeleton; KW Endoplasmic reticulum; Membrane; Nucleus; Phosphoprotein; KW Reference proteome; Signal-anchor; Transmembrane; Transmembrane helix. FT INIT_MET 1 1 Removed. {ECO:0000244|PubMed:22223895}. FT CHAIN 2 471 Protein SAD1/UNC-84 domain protein 1. FT /FTId=PRO_0000432816. FT TOPO_DOM 2 105 Nuclear. {ECO:0000305}. FT TRANSMEM 106 128 Helical. {ECO:0000255}. FT TOPO_DOM 129 471 Perinuclear space. {ECO:0000305}. FT DOMAIN 288 452 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 175 225 {ECO:0000255}. FT MOTIF 88 95 Nuclear localization signal. FT {ECO:0000255|PROSITE-ProRule:PRU00768}. FT MOD_RES 2 2 N-acetylserine. FT {ECO:0000244|PubMed:22223895}. FT MOD_RES 63 63 Phosphoserine. FT {ECO:0000244|PubMed:19376835}. SQ SEQUENCE 471 AA; 51502 MW; 04EA132C5D01C5B6 CRC64; MSASTVSITA NTAAATRRTP ILAGEKKSNF DYPQSESLAN GGVGEAGGTS RDLSRGEATL DRSQGQDLGP VTRRSVSAAT GTNTTATQRR TRKVATPKSE KARWKTVVRI FAKQLGALLI IVGLIQLTRK MILKASSPSS PISSYETEMA FSGLESRIAE VDGLVKATTN SMQVQVELLD KKMEREAKVL RQEIERKASA FQSELKKIES RTESLEKSVD EVNAKPWVTK DELERIYEEL KKGNVDDSAF SEISIDELRA YARDIMEKEI EKHAADGLGR VDYALASGGA FVMEHSDPYL VGKGSSWFAT TMRRAHTNAV KMLSPSFGEP GQCFPLKGSE GYVQIRLRGP IIPEAFTLEH VAKSVAYDRS SAPKDCRVSG SLQGPESSAE TENMQLLTEF TYDLDRSNAQ TFNILESSSS GLIDTVRLDF TSNHGSDSHT CIYRFRVHGR APDPVPVVGT NLDQDSSPES E // ID SUN2_ARATH Reviewed; 455 AA. AC Q9SG79; Q8L9I5; DT 29-APR-2015, integrated into UniProtKB/Swiss-Prot. DT 01-MAY-2000, sequence version 1. DT 11-NOV-2015, entry version 96. DE RecName: Full=Protein SAD1/UNC-84 domain protein 2 {ECO:0000303|PubMed:19807882}; DE Short=AtSUN2 {ECO:0000303|PubMed:19807882}; GN Name=SUN2 {ECO:0000303|PubMed:19807882}; GN OrderedLocusNames=At3g10730 {ECO:0000312|TAIR:AT3G10730}; GN ORFNames=T7M13.19 {ECO:0000312|EMBL:AAF19576.1}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. Columbia; RX PubMed=11130713; DOI=10.1038/35048706; RA Salanoubat M., Lemcke K., Rieger M., Ansorge W., Unseld M., RA Fartmann B., Valle G., Bloecker H., Perez-Alonso M., Obermaier B., RA Delseny M., Boutry M., Grivell L.A., Mache R., Puigdomenech P., RA De Simone V., Choisne N., Artiguenave F., Robert C., Brottier P., RA Wincker P., Cattolico L., Weissenbach J., Saurin W., Quetier F., RA Schaefer M., Mueller-Auer S., Gabel C., Fuchs M., Benes V., RA Wurmbach E., Drzonek H., Erfle H., Jordan N., Bangert S., RA Wiedelmann R., Kranz H., Voss H., Holland R., Brandt P., Nyakatura G., RA Vezzi A., D'Angelo M., Pallavicini A., Toppo S., Simionati B., RA Conrad A., Hornischer K., Kauer G., Loehnert T.-H., Nordsiek G., RA Reichelt J., Scharfe M., Schoen O., Bargues M., Terol J., Climent J., RA Navarro P., Collado C., Perez-Perez A., Ottenwaelder B., Duchemin D., RA Cooke R., Laudie M., Berger-Llauro C., Purnelle B., Masuy D., RA de Haan M., Maarse A.C., Alcaraz J.-P., Cottet A., Casacuberta E., RA Monfort A., Argiriou A., Flores M., Liguori R., Vitale D., RA Mannhaupt G., Haase D., Schoof H., Rudd S., Zaccaria P., Mewes H.-W., RA Mayer K.F.X., Kaul S., Town C.D., Koo H.L., Tallon L.J., Jenkins J., RA Rooney T., Rizzo M., Walts A., Utterback T., Fujii C.Y., Shea T.P., RA Creasy T.H., Haas B., Maiti R., Wu D., Peterson J., Van Aken S., RA Pai G., Militscher J., Sellers P., Gill J.E., Feldblyum T.V., RA Preuss D., Lin X., Nierman W.C., Salzberg S.L., White O., Venter J.C., RA Fraser C.M., Kaneko T., Nakamura Y., Sato S., Kato T., Asamizu E., RA Sasamoto S., Kimura T., Idesawa K., Kawashima K., Kishida Y., RA Kiyokawa C., Kohara M., Matsumoto M., Matsuno A., Muraki A., RA Nakayama S., Nakazaki N., Shinpo S., Takeuchi C., Wada T., RA Watanabe A., Yamada M., Yasuda M., Tabata S.; RT "Sequence and analysis of chromosome 3 of the plant Arabidopsis RT thaliana."; RL Nature 408:820-822(2000). RN [2] RP GENOME REANNOTATION. RC STRAIN=cv. Columbia; RG The Arabidopsis Information Resource (TAIR); RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases. RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC STRAIN=cv. Columbia; RX PubMed=11910074; DOI=10.1126/science.1071006; RA Seki M., Narusaka M., Kamiya A., Ishida J., Satou M., Sakurai T., RA Nakajima M., Enju A., Akiyama K., Oono Y., Muramatsu M., RA Hayashizaki Y., Kawai J., Carninci P., Itoh M., Ishii Y., Arakawa T., RA Shibata K., Shinagawa A., Shinozaki K.; RT "Functional annotation of a full-length Arabidopsis cDNA collection."; RL Science 296:141-145(2002). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC STRAIN=cv. Columbia; RX PubMed=14593172; DOI=10.1126/science.1088305; RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., RA Southwick A.M., Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., RA Karlin-Newmann G., Liu S.X., Lam B., Sakano H., Wu T., Yu G., RA Miranda M., Quach H.L., Tripp M., Chang C.H., Lee J.M., Toriumi M.J., RA Chan M.M., Tang C.C., Onodera C.S., Deng J.M., Akiyama K., Ansari Y., RA Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P., RA Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M., Hansen N.F., RA Hayashizaki Y., Johnson-Hopson C., Hsuan V.W., Iida K., Karnes M., RA Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J., RA Kamiya A., Meyers C., Nakajima M., Narusaka M., Seki M., Sakurai T., RA Satou M., Tamse R., Vaysberg M., Wallender E.K., Wong C., Yamamura Y., RA Yuan S., Shinozaki K., Davis R.W., Theologis A., Ecker J.R.; RT "Empirical analysis of transcriptional activity in the Arabidopsis RT genome."; RL Science 302:842-846(2003). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 111-455. RA Brover V.V., Troukhan M.E., Alexandrov N.A., Lu Y.-P., Flavell R.B., RA Feldmann K.A.; RT "Full-length cDNA from Arabidopsis thaliana."; RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases. RN [6] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=19376835; DOI=10.1104/pp.109.138677; RA Reiland S., Messerli G., Baerenfaller K., Gerrits B., Endler A., RA Grossmann J., Gruissem W., Baginsky S.; RT "Large-scale Arabidopsis phosphoproteome profiling reveals novel RT chloroplast kinase substrates and phosphorylation networks."; RL Plant Physiol. 150:889-903(2009). RN [7] RP TISSUE SPECIFICITY, INDUCTION BY CELL PROLIFERATION, SUBCELLULAR RP LOCATION, AND SUBUNIT. RX PubMed=19807882; DOI=10.1111/j.1365-313X.2009.04038.x; RA Graumann K., Runions J., Evans D.E.; RT "Characterization of SUN-domain proteins at the higher plant nuclear RT envelope."; RL Plant J. 61:134-144(2010). RN [8] RP FUNCTION, DISRUPTION PHENOTYPE, SUBCELLULAR LOCATION, AND TISSUE RP SPECIFICITY. RX PubMed=21294795; DOI=10.1111/j.1365-313X.2011.04523.x; RA Oda Y., Fukuda H.; RT "Dynamics of Arabidopsis SUN proteins during mitosis and their RT involvement in nuclear shaping."; RL Plant J. 66:629-641(2011). RN [9] RP FUNCTION, INTERACTION WITH WIP1; WIP2 AND WIP3, SUN DOMAIN, AND RP SUBCELLULAR LOCATION. RC STRAIN=cv. Columbia; RX PubMed=22270916; DOI=10.1083/jcb.201108098; RA Zhou X., Graumann K., Evans D.E., Meier I.; RT "Novel plant SUN-KASH bridges are involved in RanGAP anchoring and RT nuclear shape determination."; RL J. Cell Biol. 196:203-211(2012). RN [10] RP ACETYLATION [LARGE SCALE ANALYSIS] AT SER-2, CLEAVAGE OF INITIATOR RP METHIONINE [LARGE SCALE ANALYSIS], AND IDENTIFICATION BY MASS RP SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=22223895; DOI=10.1074/mcp.M111.015131; RA Bienvenut W.V., Sumpton D., Martinez A., Lilla S., Espagne C., RA Meinnel T., Giglione C.; RT "Comparative large-scale characterisation of plant vs. mammal proteins RT reveals similar and idiosyncratic N-alpha acetylation features."; RL Mol. Cell. Proteomics 11:M111.015131-M111.015131(2012). RN [11] RP FUNCTION, SUBCELLULAR LOCATION, AND INTERACTION WITH LINC1. RX PubMed=24667841; DOI=10.1371/journal.pone.0093406; RA Graumann K.; RT "Evidence for LINC1-SUN associations at the plant nuclear periphery."; RL PLoS ONE 9:E93406-E93406(2014). CC -!- FUNCTION: Component of SUN-protein-containing multivariate CC complexes also called LINC complexes which link the nucleoskeleton CC and cytoskeleton by providing versatile outer nuclear membrane CC attachment sites for cytoskeletal filaments (PubMed:24667841). CC Required for the maintenance and/or formation of polarized nuclear CC shape in root hairs (PubMed:21294795). Modulates the anchoring and CC mobility of WIP proteins in the nuclear envelope (NE) CC (PubMed:22270916). {ECO:0000250|UniProtKB:Q8BJS4, CC ECO:0000269|PubMed:21294795, ECO:0000269|PubMed:22270916, CC ECO:0000269|PubMed:24667841}. CC -!- SUBUNIT: Forms homomers (e.g. dimers, trimers and tetramers) and CC heteromers with SUN1 (PubMed:19807882). Core component of the LINC CC complex which is composed of inner nuclear membrane SUN domain- CC containing proteins coupled to outer nuclear membrane WIP CC proteins, the nucleoskeletal CRWN/LINC proteins, and, possibly, CC KAKU4. Interacts with LINC1, WIP1, WIP2 and WIP3 at the nuclear CC envelope (NE) (PubMed:22270916, PubMed:24667841). CC {ECO:0000269|PubMed:19807882, ECO:0000269|PubMed:22270916, CC ECO:0000269|PubMed:24667841}. CC -!- SUBCELLULAR LOCATION: Nucleus inner membrane CC {ECO:0000269|PubMed:19807882, ECO:0000269|PubMed:21294795, CC ECO:0000269|PubMed:24667841}; Single-pass type II membrane protein CC {ECO:0000255}. Cytoplasm, cytoskeleton, phragmoplast CC {ECO:0000269|PubMed:21294795}. Endoplasmic reticulum membrane CC {ECO:0000269|PubMed:21294795}; Single-pass type II membrane CC protein {ECO:0000255}. Nucleus envelope CC {ECO:0000269|PubMed:22270916}. Note=Dynamic localization during CC mitosis, tightly coupled with nuclear envelope (NE) dynamics. NE CC re-formation during metaphase is temporally and spatially CC coordinated with plant-specific microtubule structures such as CC phragmoplasts. During anaphase, after NE breakdown (NEBD), CC predominantly localized with the endoplasmic reticulum, in the CC outside of the segregated chromosomes and not in between CC segregated chromosomes. {ECO:0000269|PubMed:21294795}. CC -!- TISSUE SPECIFICITY: Expressed in roots, hypocotyls, cotyledons and CC leaves and inflorescences. {ECO:0000269|PubMed:19807882, CC ECO:0000269|PubMed:21294795}. CC -!- INDUCTION: Up-regulated in proliferating tissues. CC {ECO:0000269|PubMed:19807882}. CC -!- DOMAIN: The SUN domain may play a role in the nuclear anchoring CC and/or migration (By similarity). The SUN domain is required for CC interactions with WIP proteins (PubMed:22270916). CC {ECO:0000250|UniProtKB:O94901, ECO:0000269|PubMed:22270916}. CC -!- DISRUPTION PHENOTYPE: No visible phenotype. When associated with CC SUN1 disruption, abnormal nuclear shape in some cells such as CC mature root hairs. {ECO:0000269|PubMed:21294795}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC -!- SEQUENCE CAUTION: CC Sequence=AAM65947.1; Type=Erroneous initiation; Note=Translation N-terminally extended.; Evidence={ECO:0000305}; CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AC011708; AAF19576.1; -; Genomic_DNA. DR EMBL; CP002686; AEE74948.1; -; Genomic_DNA. DR EMBL; AK118080; BAC42710.1; -; mRNA. DR EMBL; BT006045; AAP04030.1; -; mRNA. DR EMBL; AY088410; AAM65947.1; ALT_INIT; mRNA. DR RefSeq; NP_566380.2; NM_111910.3. DR UniGene; At.43435; -. DR ProteinModelPortal; Q9SG79; -. DR SMR; Q9SG79; 262-444. DR STRING; 3702.AT3G10730.1; -. DR PaxDb; Q9SG79; -. DR EnsemblPlants; AT3G10730.1; AT3G10730.1; AT3G10730. DR GeneID; 820242; -. DR KEGG; ath:AT3G10730; -. DR TAIR; AT3G10730; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000237750; -. DR KO; K19347; -. DR OMA; VKHSEPF; -. DR PhylomeDB; Q9SG79; -. DR PRO; PR:Q9SG79; -. DR Proteomes; UP000006548; Chromosome 3. DR Genevisible; Q9SG79; AT. DR GO; GO:0005783; C:endoplasmic reticulum; IDA:TAIR. DR GO; GO:0005789; C:endoplasmic reticulum membrane; ISS:UniProtKB. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0005635; C:nuclear envelope; IDA:UniProtKB. DR GO; GO:0005637; C:nuclear inner membrane; IEA:UniProtKB-SubCell. DR GO; GO:0009524; C:phragmoplast; IDA:UniProtKB. DR GO; GO:0005819; C:spindle; IDA:TAIR. DR GO; GO:0043495; F:protein anchor; IDA:UniProtKB. DR GO; GO:0006997; P:nucleus organization; IMP:UniProtKB. DR GO; GO:0051291; P:protein heterooligomerization; IDA:UniProtKB. DR GO; GO:0051260; P:protein homooligomerization; IDA:UniProtKB. DR GO; GO:0090435; P:protein localization to nuclear envelope; IDA:UniProtKB. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Acetylation; Coiled coil; Complete proteome; Cytoplasm; Cytoskeleton; KW Endoplasmic reticulum; Membrane; Nucleus; Phosphoprotein; KW Reference proteome; Signal-anchor; Transmembrane; Transmembrane helix. FT INIT_MET 1 1 Removed. {ECO:0000244|PubMed:22223895}. FT CHAIN 2 455 Protein SAD1/UNC-84 domain protein 2. FT /FTId=PRO_0000432817. FT TOPO_DOM 2 105 Nuclear. {ECO:0000305}. FT TRANSMEM 106 128 Helical. {ECO:0000255}. FT TOPO_DOM 129 455 Perinuclear space. {ECO:0000305}. FT DOMAIN 285 447 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 201 225 {ECO:0000255}. FT MOTIF 88 95 Nuclear localization signal. FT {ECO:0000255|PROSITE-ProRule:PRU00768}. FT COMPBIAS 78 87 Poly-Thr. {ECO:0000255}. FT MOD_RES 2 2 N-acetylserine. FT {ECO:0000244|PubMed:22223895}. FT MOD_RES 63 63 Phosphoserine. FT {ECO:0000250|UniProtKB:Q9FF75}. FT CONFLICT 302 302 R -> G (in Ref. 5; AAM65947). FT {ECO:0000305}. FT CONFLICT 398 398 D -> N (in Ref. 5; AAM65947). FT {ECO:0000305}. SQ SEQUENCE 455 AA; 49940 MW; BB99EFF2B3EBF917 CRC64; MSASTVSITA SPRTIRRTPV LSGEKKSNFD FPPSESHANA AIGESSAGTN KDLIRAEAAG ERSNTYDVGP VTRKSGSTAT GTNTTTTQRR TRKSQGNKID RGKWKTVVRV FAKQFGALLL LVGLIQLIRK LTLKDSSLSS SNFPIETEMV LSELESRISA VDGLVKTTTK MMQVQVEFLD KKMDSESRAL RQTIDSTSSV LHSELKKVES KTERLQVSVD ELNAKPLVSR EELERVYEEL KKGKVGDSDV NIDKLRAYAR DIVEKEIGKH VADGLGRVDY ALASGGAFVM GHSDPFLVGN GRNWFGTSRR RVHSKAVKML TPSFGEPGQC FPLKGSNGYV LVRLRAPIIP EAVTLEHVSK AVAYDRSSAP KDCRVSGWLG DIDMETETMP LLTEFSYDLD RSNAQTFDIA DSAHSGLVNT VRLDFNSNHG SSSHTCIYRF RVHGRELDSV SVAHA // ID Q9T0A9_ARATH Unreviewed; 466 AA. AC Q9T0A9; DT 01-MAY-2000, integrated into UniProtKB/TrEMBL. DT 01-MAY-2000, sequence version 1. DT 11-NOV-2015, entry version 84. DE SubName: Full=Putative uncharacterized protein AT4g23950 {ECO:0000313|EMBL:CAB43895.1}; GN OrderedLocusNames=At4g23950 {ECO:0000313|EMBL:CAB43895.1}; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; Gunneridae; OC Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; OC Arabidopsis. OX NCBI_TaxID=3702; RN [1] {ECO:0000313|EMBL:CAB43895.1} RP NUCLEOTIDE SEQUENCE. RA Bevan M., Zimmermann W., Grueneisen A., Wambutt R., Kalicki J., RA Wohldmann P., Smith A., Bancroft I., Mewes H.W., Lemcke K., RA Mayer K.F.X.; RL Submitted (JUN-1999) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:CAB81313.1} RP NUCLEOTIDE SEQUENCE. RA Zimmermann W., Grueneisen A., Wambutt R., Kalicki J., Wohldmann P., RA Smith A., Mewes H.W., Lemcke K., Mayer K.F.X.; RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EMBL:CAB43895.1} RP NUCLEOTIDE SEQUENCE. RA EU Arabidopsis sequencing project; RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AL078468; CAB43895.1; -; Genomic_DNA. DR EMBL; AL161560; CAB81313.1; -; Genomic_DNA. DR PIR; T08914; T08914. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT TRANSMEM 406 425 Helical. {ECO:0000256|SAM:Phobius}. FT TRANSMEM 446 465 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 296 316 SQ SEQUENCE 466 AA; 52877 MW; 673FBC405FB7E2A5 CRC64; MLIKETNLGS FNLGGIFQRL LYCVPLKKSI YFVDRIGNYT DGSVSKTLNS TSSVFPQATE KENNFCLLRK GQLQDVYEHV LVNNALLICK VVLPERRISK KTLEARDPRY VNLEDKSLKV NGSSQLVNNG TRYRLEPDGN GYNYASAMKG AKVVDHNKEA KGASNVLGKD HDKYLRNPCS VSDKYVVIEL AEETLVDTVR IANFEHYSSN PKEFSLSGSL SFPSDMWTPA GSFAAANVKQ IQSFRLPEPK WTDQIGKETE AQKKKDDVVK TINIIGDKKY EVKEKHNVLK VMMQKVKLIE MNLSLLEDSV KKMNDKQPEV SLEMKKTLVL VEKSKADIRE ITEWKGKMKL PMNLIFFEQE KELRDLELWK TLVASRVESL ARGNSALRLD VEKIVKEQAN LESKELGVLL ISLFFVVLAT IRLVSTRLWA FLGMSITDKA RSLWPDSGWV MILLSSSIMI FIHLLS // ID Q9VIK7_DROME Unreviewed; 1417 AA. AC Q9VIK7; Q7YU09; DT 01-MAY-2000, integrated into UniProtKB/TrEMBL. DT 03-MAR-2009, sequence version 4. DT 11-NOV-2015, entry version 107. DE SubName: Full=CG31678, isoform B {ECO:0000313|EMBL:AAF53910.4}; DE SubName: Full=LD18032p {ECO:0000313|EMBL:AAQ22525.1}; GN ORFNames=CG31678 {ECO:0000313|EMBL:AAF53910.4, GN ECO:0000313|FlyBase:FBgn0051678}, GN Dmel_CG31678 {ECO:0000313|EMBL:AAF53910.4}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AAF53910.4, ECO:0000313|Proteomes:UP000000803}; RN [1] {ECO:0000313|EMBL:AAF53910.4, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=10731132; DOI=10.1126/science.287.5461.2185; RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., RA Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., RA Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., RA Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; RT "The genome sequence of Drosophila melanogaster."; RL Science 287:2185-2195(2000). RN [2] {ECO:0000313|EMBL:AAF53910.4, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537568; RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A., RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A., RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R., RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J., RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C., RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.; RT "Finishing a whole-genome shotgun: release 3 of the Drosophila RT melanogaster euchromatic genome sequence."; RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002). RN [3] {ECO:0000313|EMBL:AAF53910.4, ECO:0000313|Proteomes:UP000000803} RP GENOME REANNOTATION. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083; RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., RA Lewis S.E.; RT "Annotation of the Drosophila melanogaster euchromatic genome: a RT systematic review."; RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). RN [4] {ECO:0000313|EMBL:AAF53910.4, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537573; RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., RA Ashburner M., Celniker S.E.; RT "The transposable elements of the Drosophila melanogaster euchromatin: RT a genomics perspective."; RL Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002). RN [5] {ECO:0000313|EMBL:AAF53910.4, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537574; RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., RA Karpen G.H.; RT "Heterochromatic sequences in a Drosophila whole-genome shotgun RT assembly."; RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002). RN [6] {ECO:0000313|EMBL:AAQ22525.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=Berkeley {ECO:0000313|EMBL:AAQ22525.1}; RA Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J., RA Champe M., Chavez C., Dorsett V., Dresnek D., Farfan D., Frise E., RA George R., Gonzalez M., Guarin H., Kronmiller B., Li P., Liao G., RA Miranda A., Mungall C.J., Nunoo J., Pacleb J., Paragas V., Park S., RA Patel S., Phouanenavong S., Wan K., Yu C., Lewis S.E., Rubin G.M., RA Celniker S.; RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases. RN [7] {ECO:0000313|EMBL:AAF53910.4, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022; RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., RA Ashburner M., Anxolabehere D.; RT "Combined evidence annotation of transposable elements in genome RT sequences."; RL PLoS Comput. Biol. 1:166-175(2005). RN [8] {ECO:0000313|EMBL:AAF53910.4} RP NUCLEOTIDE SEQUENCE. RA Celniker S., Carlson J., Wan K., Frise E., Hoskins R., Park S., RA Svirskas R., Rubin G.; RL Submitted (AUG-2006) to the EMBL/GenBank/DDBJ databases. RN [9] {ECO:0000313|EMBL:AAF53910.4} RP NUCLEOTIDE SEQUENCE. RG Berkeley Drosophila Genome Project; RA Celniker S., Carlson J., Wan K., Pfeiffer B., Frise E., George R., RA Hoskins R., Stapleton M., Pacleb J., Park S., Svirskas R., Smith E., RA Yu C., Rubin G.; RT "Drosophila melanogaster release 4 sequence."; RL Submitted (SEP-2006) to the EMBL/GenBank/DDBJ databases. RN [10] {ECO:0000313|EMBL:AAF53910.4, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569856; DOI=10.1126/science.1139815; RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.; RT "The Release 5.1 annotation of Drosophila melanogaster RT heterochromatin."; RL Science 316:1586-1591(2007). RN [11] {ECO:0000313|EMBL:AAF53910.4, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569867; DOI=10.1126/science.1139816; RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M., RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A., RA Dimitri P., Karpen G.H., Celniker S.E.; RT "Sequence finishing and mapping of Drosophila melanogaster RT heterochromatin."; RL Science 316:1625-1628(2007). RN [12] {ECO:0000313|EMBL:AAF53910.4} RP NUCLEOTIDE SEQUENCE. RG FlyBase; RL Submitted (JUL-2014) to the EMBL/GenBank/DDBJ databases. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE014134; AAF53910.4; -; Genomic_DNA. DR EMBL; BT010056; AAQ22525.1; -; mRNA. DR RefSeq; NP_724277.2; NM_165336.3. DR UniGene; Dm.19853; -. DR STRING; 7227.FBpp0288498; -. DR GeneID; 35327; -. DR UCSC; CG31678-RB; d. melanogaster. DR FlyBase; FBgn0051678; CG31678. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR OMA; AYNEFMP; -. DR GenomeRNAi; 35327; -. DR NextBio; 792976; -. DR Proteomes; UP000000803; Chromosome 2L. DR GO; GO:0046331; P:lateral inhibition; IMP:FlyBase. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Coiled coil {ECO:0000256|SAM:Coils}; KW Complete proteome {ECO:0000313|Proteomes:UP000000803}; KW Membrane {ECO:0000256|SAM:Phobius}; KW Proteomics identification {ECO:0000213|PeptideAtlas:Q9VIK7}; KW Reference proteome {ECO:0000313|Proteomes:UP000000803}; KW Signal {ECO:0000256|SAM:SignalP}; KW Transmembrane {ECO:0000256|SAM:Phobius}; KW Transmembrane helix {ECO:0000256|SAM:Phobius}. FT SIGNAL 1 29 {ECO:0000256|SAM:SignalP}. FT CHAIN 30 1417 {ECO:0000256|SAM:SignalP}. FT /FTId=PRO_5005371594. FT TRANSMEM 1004 1025 Helical. {ECO:0000256|SAM:Phobius}. FT COILED 348 375 {ECO:0000256|SAM:Coils}. FT COILED 940 988 {ECO:0000256|SAM:Coils}. SQ SEQUENCE 1417 AA; 154525 MW; 3DE0EB7E58644F5F CRC64; MHMRLHLVRF MFINLLLSCC FWLYDNVAAA DSQIPSRAGD SAKDAARKTH IEPEEPAAPP RPEPPPPPLH NGAGVLLPSA QEGVPIVATS SSSSFAEVPQ TGNRPISGDT SVPVKELPAL SDQQRTKSHS EYISEIDLAA DVGGASSSTA QVPNNQLNYN KRSSAGKVHR FSESQQLDQP KQLEQEEPAT GQPKAPELQE PPQLPQAEES FPEIITELPA VTITELPLDR VMNRLDSVIL DGSPATAGNH SDEEQHQQQQ QPHEDQPMQV SEADEEVPQK DEQMPKINDP GGGIQVEGMV TPEAATVGET QESSEELQPG SAAFNGTEGT ANLTNANEEV PMPVFSEWAQ KQMEAEASRE QAMELEQQVV NKSAQRKNNT GSSSGKPPTL KLRSKNYASP DCGAKIIAHN SESKHTEAVL TQSTDEYMLS TCESRIWFVV ELCEAIQAQK VDVANYELFS SSPKNFTVAV SKRFPTRDWS NVGRFAAEDK RTIQTFELHP HLFGKFVRVD ITSHYANEHF CPLSLFRVFG TSEYEAFETE IRPSDDLDDF YDDYGAQEQK AAVGSGGNIF QSASDAVMQM VKKAAEVLVK PTKALKWSEE SVLCQTPAFE AFSCINCNTT LVERINSLLS CQFQQLQALL SLSHLRSDLL NSRVCQEEFG ISLTGSEFAS KMGKEQSYFL SMLPAEHVGA MCKLIQAEQN VTDHNHTKAP SLKQHVSSPE PVQDNATATG VRQDCENSKT PTKEPLTPSL EVVVPEVSQE VPSMEDQSST SSETVSTTNS TPADVNIFNM PSEPEEVVVK VQLPPEPTLP TTLQPSDVES FTDAPSTNAL PGSSEAVANG DLGMEEGNPA NWDGIDNLLT TTVASITAGG GAAAAAAAVV NGNANIGGAG IVGAGGPASL SSVNMQQKLT NGAQSESVFI RLSNRIKALE RNMSLSGQYL EELSRRYKKQ VEELQQTLTQ QTLTVRQLED QSRRYVEQEQ LYQQHSAELA GEVRALSYQV QACILVIIIV GTCIFLMLVL GTVYYRKLRR QQQQLLKKDQ AGHPPVAAKP KLDRRKSYEQ TPNQSTPKQR RPSEEAMLIL KECGDSNMQE LDPPSRQRKI SVCYGSNNNI AANMAIANTN GGASVRNSLH RRKGAKHSWH NSLDTTETSC GEQTDKFFDV DTLKSIKQSC GKPGKKKSLQ QLKPLGLKRQ ESAPATYTPD LQAEEPATQS DFDESLMLDD DDLANFIPTS DLAYNEFMPE GPSGYQIVDT VDGKPGKEPG TKKSRRLSSP AFFKSPFSKS KNKGYSFNGV KNSHAVHEPT SWEWYRLKRS EKHQQQQQAK LVSKSLPSAS LDSSSLSEVN FPLSSSTATT QNSFRILGEA ILSSGEGRIT PNGNGNAMSG GLASSSSGSG SGGSTTSSTT KKKQRALNNL FRKAFDF // ID Q9VKG2_DROME Unreviewed; 275 AA. AC Q9VKG2; DT 01-MAY-2000, integrated into UniProtKB/TrEMBL. DT 11-JUN-2014, sequence version 2. DT 11-NOV-2015, entry version 95. DE SubName: Full=Sperm-associated antigen 4 {ECO:0000313|EMBL:AAF53110.2}; GN Name=spag4 {ECO:0000313|EMBL:AAF53110.2, GN ECO:0000313|FlyBase:FBgn0032368}; GN Synonyms=giac {ECO:0000313|FlyBase:FBgn0032368}; GN ORFNames=CG6589 {ECO:0000313|FlyBase:FBgn0032368}, GN Dmel_CG6589 {ECO:0000313|EMBL:AAF53110.2}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AAF53110.2, ECO:0000313|Proteomes:UP000000803}; RN [1] {ECO:0000313|EMBL:AAF53110.2, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=10731132; DOI=10.1126/science.287.5461.2185; RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., RA Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., RA Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., RA Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; RT "The genome sequence of Drosophila melanogaster."; RL Science 287:2185-2195(2000). RN [2] {ECO:0000313|EMBL:AAF53110.2, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537568; RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A., RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A., RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R., RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J., RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C., RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.; RT "Finishing a whole-genome shotgun: release 3 of the Drosophila RT melanogaster euchromatic genome sequence."; RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002). RN [3] {ECO:0000313|EMBL:AAF53110.2, ECO:0000313|Proteomes:UP000000803} RP GENOME REANNOTATION. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083; RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., RA Lewis S.E.; RT "Annotation of the Drosophila melanogaster euchromatic genome: a RT systematic review."; RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). RN [4] {ECO:0000313|EMBL:AAF53110.2, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537573; RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., RA Ashburner M., Celniker S.E.; RT "The transposable elements of the Drosophila melanogaster euchromatin: RT a genomics perspective."; RL Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002). RN [5] {ECO:0000313|EMBL:AAF53110.2, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537574; RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., RA Karpen G.H.; RT "Heterochromatic sequences in a Drosophila whole-genome shotgun RT assembly."; RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002). RN [6] {ECO:0000313|EMBL:AAF53110.2, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022; RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., RA Ashburner M., Anxolabehere D.; RT "Combined evidence annotation of transposable elements in genome RT sequences."; RL PLoS Comput. Biol. 1:166-175(2005). RN [7] {ECO:0000313|EMBL:AAF53110.2, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569856; DOI=10.1126/science.1139815; RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.; RT "The Release 5.1 annotation of Drosophila melanogaster RT heterochromatin."; RL Science 316:1586-1591(2007). RN [8] {ECO:0000313|EMBL:AAF53110.2, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569867; DOI=10.1126/science.1139816; RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M., RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A., RA Dimitri P., Karpen G.H., Celniker S.E.; RT "Sequence finishing and mapping of Drosophila melanogaster RT heterochromatin."; RL Science 316:1625-1628(2007). CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE014134; AAF53110.2; -; Genomic_DNA. DR RefSeq; NP_609510.2; NM_135666.3. DR ProteinModelPortal; Q9VKG2; -. DR SMR; Q9VKG2; 114-273. DR IntAct; Q9VKG2; 1. DR STRING; 7227.FBpp0079852; -. DR PaxDb; Q9VKG2; -. DR PRIDE; Q9VKG2; -. DR GeneID; 34581; -. DR KEGG; dme:Dmel_CG6589; -. DR UCSC; CG6589-RA; d. melanogaster. DR CTD; 6676; -. DR FlyBase; FBgn0032368; spag4. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; Q9VKG2; -. DR OrthoDB; EOG7VQJCX; -. DR GenomeRNAi; 34581; -. DR NextBio; 789147; -. DR Proteomes; UP000000803; Chromosome 2L. DR Bgee; Q9VKG2; -. DR ExpressionAtlas; Q9VKG2; differential. DR Genevisible; Q9VKG2; DM. DR GO; GO:0007283; P:spermatogenesis; IMP:FlyBase. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 4: Predicted; KW Complete proteome {ECO:0000313|Proteomes:UP000000803}; KW Reference proteome {ECO:0000313|Proteomes:UP000000803}. SQ SEQUENCE 275 AA; 30445 MW; B673B0F5C6822E3A CRC64; MAHNSRNNLG IMRLREDVDD ISHILRQQQI DSKVAQGSCK FNCLGGEPKG VGSGMCNNRD VSAYVDTLFK RKIGHLMDDV YNLKKQVMSA DCSSKSAQST PKPESVALAK PRINYASEEL GARIINVKAH SIDGTNIIRS LLGLDFSTNP PVNMIRTGLS PGSCFGFNGS RATVTLHLAR TIIVEAITLT HVAREMTPDL CVKSAPKNFD VYGLRSENSK RELLGQWSYD NAANKRTQSY SVRSDTFFRN LDFSFNSNHG ANSTCIYRVE VYGRL // ID Q9VL06_DROME Unreviewed; 2727 AA. AC Q9VL06; DT 01-MAY-2000, integrated into UniProtKB/TrEMBL. DT 01-MAY-2000, sequence version 1. DT 11-NOV-2015, entry version 128. DE SubName: Full=CG5604 {ECO:0000313|EMBL:AAF52899.1}; DE EC=6.3.2.19 {ECO:0000313|EMBL:AAF52899.1}; GN ORFNames=CG5604 {ECO:0000313|EMBL:AAF52899.1, GN ECO:0000313|FlyBase:FBgn0032208}, GN Dmel_CG5604 {ECO:0000313|EMBL:AAF52899.1}; OS Drosophila melanogaster (Fruit fly). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; OC Ephydroidea; Drosophilidae; Drosophila; Sophophora. OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AAF52899.1, ECO:0000313|Proteomes:UP000000803}; RN [1] {ECO:0000313|EMBL:AAF52899.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=10731132; DOI=10.1126/science.287.5461.2185; RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., RA Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., RA Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., RA Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; RT "The genome sequence of Drosophila melanogaster."; RL Science 287:2185-2195(2000). RN [2] {ECO:0000313|EMBL:AAF52899.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537568; RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A., RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A., RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R., RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J., RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C., RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.; RT "Finishing a whole-genome shotgun: release 3 of the Drosophila RT melanogaster euchromatic genome sequence."; RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002). RN [3] {ECO:0000313|EMBL:AAF52899.1, ECO:0000313|Proteomes:UP000000803} RP GENOME REANNOTATION. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083; RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., RA Lewis S.E.; RT "Annotation of the Drosophila melanogaster euchromatic genome: a RT systematic review."; RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). RN [4] {ECO:0000313|EMBL:AAF52899.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537573; RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., RA Ashburner M., Celniker S.E.; RT "The transposable elements of the Drosophila melanogaster euchromatin: RT a genomics perspective."; RL Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002). RN [5] {ECO:0000313|EMBL:AAF52899.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=12537574; RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., RA Karpen G.H.; RT "Heterochromatic sequences in a Drosophila whole-genome shotgun RT assembly."; RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002). RN [6] {ECO:0000313|EMBL:AAF52899.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022; RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., RA Ashburner M., Anxolabehere D.; RT "Combined evidence annotation of transposable elements in genome RT sequences."; RL PLoS Comput. Biol. 1:166-175(2005). RN [7] {ECO:0000313|EMBL:AAF52899.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569856; DOI=10.1126/science.1139815; RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.; RT "The Release 5.1 annotation of Drosophila melanogaster RT heterochromatin."; RL Science 316:1586-1591(2007). RN [8] {ECO:0000313|EMBL:AAF52899.1, ECO:0000313|Proteomes:UP000000803} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803}; RX PubMed=17569867; DOI=10.1126/science.1139816; RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M., RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A., RA Dimitri P., Karpen G.H., Celniker S.E.; RT "Sequence finishing and mapping of Drosophila melanogaster RT heterochromatin."; RL Science 316:1625-1628(2007). CC -!- SIMILARITY: Contains 3 ANK repeats. CC {ECO:0000256|RuleBase:RU003321}. CC -!- SIMILARITY: Contains HECT (E6AP-type E3 ubiquitin-protein ligase) CC domain. {ECO:0000256|SAAS:SAAS00133827}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AE014134; AAF52899.1; -; Genomic_DNA. DR RefSeq; NP_609369.1; NM_135525.2. DR ProteinModelPortal; Q9VL06; -. DR SMR; Q9VL06; 391-577, 1327-1389, 2478-2721. DR IntAct; Q9VL06; 5. DR MINT; MINT-927048; -. DR STRING; 7227.FBpp0079663; -. DR PaxDb; Q9VL06; -. DR PRIDE; Q9VL06; -. DR GeneID; 34378; -. DR KEGG; dme:Dmel_CG5604; -. DR UCSC; CG5604-RA; d. melanogaster. DR FlyBase; FBgn0032208; CG5604. DR eggNOG; KOG4276; Eukaryota. DR eggNOG; COG5021; LUCA. DR InParanoid; Q9VL06; -. DR KO; K12231; -. DR OMA; NRQCIEG; -. DR PhylomeDB; Q9VL06; -. DR ChiTaRS; CG5604; fly. DR GenomeRNAi; 34378; -. DR NextBio; 788201; -. DR PRO; PR:Q9VL06; -. DR Proteomes; UP000000803; Chromosome 2L. DR Bgee; Q9VL06; -. DR ExpressionAtlas; Q9VL06; differential. DR Genevisible; Q9VL06; DM. DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central. DR GO; GO:0005634; C:nucleus; ISS:FlyBase. DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW. DR GO; GO:0046872; F:metal ion binding; IEA:InterPro. DR GO; GO:0061630; F:ubiquitin protein ligase activity; IBA:GO_Central. DR GO; GO:0004842; F:ubiquitin-protein transferase activity; ISS:FlyBase. DR GO; GO:0016567; P:protein ubiquitination; IBA:GO_Central. DR GO; GO:0042787; P:protein ubiquitination involved in ubiquitin-dependent protein catabolic process; ISS:FlyBase. DR Gene3D; 1.25.10.10; -; 3. DR Gene3D; 1.25.40.20; -; 1. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR002110; Ankyrin_rpt. DR InterPro; IPR020683; Ankyrin_rpt-contain_dom. DR InterPro; IPR011989; ARM-like. DR InterPro; IPR016024; ARM-type_fold. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR000569; HECT_dom. DR InterPro; IPR010606; Mib_Herc2. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF12796; Ank_2; 1. DR Pfam; PF00632; HECT; 1. DR Pfam; PF06701; MIB_HERC2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00248; ANK; 3. DR SMART; SM00119; HECTc; 1. DR SUPFAM; SSF48371; SSF48371; 2. DR SUPFAM; SSF48403; SSF48403; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR SUPFAM; SSF56204; SSF56204; 4. DR PROSITE; PS50297; ANK_REP_REGION; 1. DR PROSITE; PS50088; ANK_REPEAT; 2. DR PROSITE; PS50237; HECT; 1. DR PROSITE; PS51416; MIB_HERC2; 1. PE 1: Evidence at protein level; KW ANK repeat {ECO:0000256|RuleBase:RU003321}; KW Complete proteome {ECO:0000313|Proteomes:UP000000803}; KW Ligase {ECO:0000256|SAAS:SAAS00133783, ECO:0000313|EMBL:AAF52899.1}; KW Proteomics identification {ECO:0000213|PeptideAtlas:Q9VL06}; KW Reference proteome {ECO:0000313|Proteomes:UP000000803}; KW Ubl conjugation pathway {ECO:0000256|SAAS:SAAS00133781}. SQ SEQUENCE 2727 AA; 302150 MW; 81D7BA87B482BAC1 CRC64; MGDVDPETLL EWLSMGQGDE RDMQLIALEQ LCMLLLMSDN VDRCFESCPP RTFLPALCKI FLDELAPENV LEVTARAITY YLDVSAECTR RIVSIDGAIK AICNHLVVAD LSSRTSRDLA EQCIKVLELI CTREAGAVFE GGGLNCVLSF IRDCGSQVHK DTLHSAMSVV SRLCTKVEPN TPCIQNCVES LSTLLQHEDP MVSDGALKCF ASVADRFTRK WVDPAPLAEY GLTTELLKRL QSVGGNTHSS LTAAGTQPTS SSQPAATTNS DAINENVAGT ATISNSTKVK SSDAAASPQS ISTTISLLST LCRGSPSITH DILRSQLADA LERALQGDER CVLDCMRFAD LLLLLLFEGR QALNRGSNNP NQGQLAPRPR RNNTNTDRTH RQLIDCIRSK DSEALREAIE SGGIDVNCMD DVGQTLLNWA SAFGTLEMVE YLCEKGADVN KGQRSSSLHY AACFGRPAIA KILLKFGAYP DLRDEDGKTP LDKARERLDD GHREVAAILQ SPGEWMSPDH SLLNKDGKKY TLMEPRGDPE MAPIYLKVLL PIFCRTFLGS MLGSVRRASL ALIKKIVQYA YPTVLQSLSE TSFSEDAAST SGQNGGNLLI EVIASVLDNE DDGDGHLIVL NIIEEIMCKT QEEFLDHFAR LGVFAKVQAL MDTDAEELYV QLPGTVEEPA AAQRSSTSVV VAPRPTSDDP MEDAKEILQG KPYHWREWSI CRGRDCLYVW SDSVALELSN GSNGWFRFII DGKLATMYSS GSPENGNDSS ENRGEFLEKL MRARSCVIAG VVSQPILPTA SALRLVVGNW VLQSQKTNQL QIHNTEGHQV TVLQDDLPGF IFESNRGTKH TFSAETVLGP DFASGWSTAK KKRNKSKTEG QKFQVRNLSR EIYNKYFKSA QIIPRGAVAI LTDIVKQIEL SFEEQHMAPN GNWETTLTDA LMKLSQLIHE DGVVSAYEMH SSGLVQALVA VLSVNHWETN SPRCKRNKMQ KQRVSVFKKC ILEDNVESAT NKPRTKSTAS ILIQKLVSVL ESTEKLPVYL YDSPCTGYSL QILQKRLRFR LERAECESTL FDRSGRTLKM EPLATIGQLS KYLLKMVAKQ WYDLDRSTYF YLKKIREHRT ATVFTHSFDF DEEGLLFYIG SNAKTCDWVN PAQYGLVQVT SSEGKTLPYG KLEDILSRDS ISLNCHTKDN KKAWFAIDLG VYIIPTAYTL RHARGYGRSA LRNWLLQGSK DGSTWTTLST HVDDKSLVEP GSTATWPINC ATDDSVWYRH IRIQQNGRNA SGQTHYLSLS GFEIYGRVVG VADDIGKSVK EAEAKTRRER RQIRAQLKHM TTGARVIRGV DWRWEEQDGC AEGTITGEIH NGWIDVKWDH GVRNSYRMGA EGKYDLKLAD CEYLSAFDGN QSMGSASTAA KPSEKGGNTL TSRKSSSTPS LPEATEKNQN PEGASNQTVS ADNLAWKQTV ETIAENVFAS AKTQIISNQL AMNTSSSREA RAKHKESGTN QMHKDNISGP SPLSRELEHI SDLSAINNSM PAINSSNVSD LATISENLSL TELSKENICR VLTPSYKPAE SVTASQSSSH PDVQSSSPRE NDIKNISNIE ENNKMNANNS VNKISKDLLA NLRTSNIAGC PPVTQLSTEA LEMIDKMRDG VDMIRNMSNS ILSTDTFPVP CTNVPVGGKK TPKAQALINP DNANQKQIIV TSEEFPTKSS KKPSVTLKPA QQPNAVLSIV DIKEQPISNE NVSVPSQMSI SVPNLTTTSA SEVPSTSEVA THTGLLETFA AIARRRTSQG TNIQDNQIMN AEANVNEHGD QNASGSFLGH SVTSLVKLAL SSNFHSGLLS TAQSYPSLSS NNSENIAPSN PSNTSAGQQS ASTINHTLTM SLTSTSSDSE QVSLEDFLES CRAPALLGDL DDEDDMDEDN DEEENEDEYE EVGNTLLQVM VSRNLLTFMD DEAMENRLVG VTKRKSWDDE FVLKRQFSAL IPAFDPRPGR TNVNQTSDLE ISPLGAELPK PQQSGGPETI EQPLLGLKLR GPGIGGIPEV EIDLSNTDWT IFRAVQELLQ CSQLNKLDKF RKIWEPTYTI VYREVSPEAQ ESTCLESEEF PQTPDVSSKS GASTLSPNSP MHIGFNVADN NLCSVDDVLE LLTQINGLNQ SEIDSDVKEH GVSVLSEDLF ISKKITNKLQ QQIQDPLVLA SNALPNWCEN LNQSCPFLFP FETRQLYFNC TSFGASRSIV CLQSQRDVTV ERQRIPIMSP RRDDHEFRIG RLKHERVKVP RNEDLLMWAM QVMKTHCNRK SVLEVEFLDE EGTGLGPTLE FYALVAAEIQ RSDLCMWLCD DDLGEDTENS TQSAEGNSKP VGYYVNRREH GIFPAPLPQN SEICENVLKY FWFFGVFVAK VLQDMRLVDI PLSTSFLQLL CHNKVLSRNL QKVISDRRNG DLSVVSEDSD IVETCTKLLR TDSNKSNAFG GILSLENLKE IDPTRYQFLQ EMQNLLLRKQ SIEFDDTISA EKKHELINEL KLQTQNGLEV SLEDLALTFT YLPSSSIYGY TQAELLPNGS SVNVTIDNLE AYCELLMNFI LQDGIAQQMK AFSDGFNEVF PLKKLAAFTP SEARMMICGE QFPHWSREDI ISYTEPKLGY NKDSPGFQRF VNVLLSMSGD ERKAFLQFTT GCSSLPPGGL ANLHPRLTVV RKVDAGVGSY PSVNTCVHYL KLPDYPTEEI MKERLLTATK EKGFHLN // ID SAD1_SCHPO Reviewed; 514 AA. AC Q09825; Q9UU40; DT 01-FEB-1996, integrated into UniProtKB/Swiss-Prot. DT 01-FEB-1996, sequence version 1. DT 11-NOV-2015, entry version 114. DE RecName: Full=Spindle pole body-associated protein sad1; GN Name=sad1; ORFNames=SPBC12D12.01, SPBC16H5.01c; OS Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Taphrinomycotina; OC Schizosaccharomycetes; Schizosaccharomycetales; OC Schizosaccharomycetaceae; Schizosaccharomyces. OX NCBI_TaxID=284812; RN [1] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA]. RX PubMed=7744953; DOI=10.1083/jcb.129.4.1033; RA Hagan I., Yanagida M.; RT "The product of the spindle formation gene sad1+ associates with the RT fission yeast spindle pole body and is essential for viability."; RL J. Cell Biol. 129:1033-1047(1995). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=972 / ATCC 24843; RX PubMed=11859360; DOI=10.1038/nature724; RA Wood V., Gwilliam R., Rajandream M.A., Lyne M.H., Lyne R., Stewart A., RA Sgouros J.G., Peat N., Hayles J., Baker S.G., Basham D., Bowman S., RA Brooks K., Brown D., Brown S., Chillingworth T., Churcher C.M., RA Collins M., Connor R., Cronin A., Davis P., Feltwell T., Fraser A., RA Gentles S., Goble A., Hamlin N., Harris D.E., Hidalgo J., Hodgson G., RA Holroyd S., Hornsby T., Howarth S., Huckle E.J., Hunt S., Jagels K., RA James K.D., Jones L., Jones M., Leather S., McDonald S., McLean J., RA Mooney P., Moule S., Mungall K.L., Murphy L.D., Niblett D., Odell C., RA Oliver K., O'Neil S., Pearson D., Quail M.A., Rabbinowitsch E., RA Rutherford K.M., Rutter S., Saunders D., Seeger K., Sharp S., RA Skelton J., Simmonds M.N., Squares R., Squares S., Stevens K., RA Taylor K., Taylor R.G., Tivey A., Walsh S.V., Warren T., Whitehead S., RA Woodward J.R., Volckaert G., Aert R., Robben J., Grymonprez B., RA Weltjens I., Vanstreels E., Rieger M., Schaefer M., Mueller-Auer S., RA Gabel C., Fuchs M., Duesterhoeft A., Fritzc C., Holzer E., Moestl D., RA Hilbert H., Borzym K., Langer I., Beck A., Lehrach H., Reinhardt R., RA Pohl T.M., Eger P., Zimmermann W., Wedler H., Wambutt R., Purnelle B., RA Goffeau A., Cadieu E., Dreano S., Gloux S., Lelaure V., Mottier S., RA Galibert F., Aves S.J., Xiang Z., Hunt C., Moore K., Hurst S.M., RA Lucas M., Rochet M., Gaillardin C., Tallada V.A., Garzon A., Thode G., RA Daga R.R., Cruzado L., Jimenez J., Sanchez M., del Rey F., Benito J., RA Dominguez A., Revuelta J.L., Moreno S., Armstrong J., Forsburg S.L., RA Cerutti L., Lowe T., McCombie W.R., Paulsen I., Potashkin J., RA Shpakovski G.V., Ussery D., Barrell B.G., Nurse P.; RT "The genome sequence of Schizosaccharomyces pombe."; RL Nature 415:871-880(2002). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] OF 1-157. RC STRAIN=ATCC 38364 / 968; RX PubMed=10759889; DOI=10.1046/j.1365-2443.2000.00317.x; RA Ding D.-Q., Tomita Y., Yamamoto A., Chikashige Y., Haraguchi T., RA Hiraoka Y.; RT "Large-scale screening of intracellular protein localization in living RT fission yeast cells by the use of a GFP-fusion genomic DNA library."; RL Genes Cells 5:169-190(2000). RN [4] RP SUBCELLULAR LOCATION. RX PubMed=11676915; DOI=10.1016/S0960-9822(01)00478-X; RA Krapp A., Schmidt S., Cano E., Simanis V.; RT "S. pombe cdc11p, together with sid4p, provides an anchor for RT septation initiation network proteins on the spindle pole body."; RL Curr. Biol. 11:1559-1568(2001). RN [5] RP INTERACTION WITH CDC31. RX PubMed=12857865; DOI=10.1091/mbc.E02-10-0661; RA Paoletti A., Bordes N., Haddad R., Schwartz C.L., Chang F., RA Bornens M.; RT "Fission yeast cdc31p is a component of the half-bridge and controls RT SPB duplication."; RL Mol. Biol. Cell 14:2793-2808(2003). RN [6] RP FUNCTION, INTERACTION WITH BQT1, AND SUBCELLULAR LOCATION. RX PubMed=16615890; DOI=10.1016/j.cell.2006.01.048; RA Chikashige Y., Tsutsumi C., Yamane M., Okamasa K., Haraguchi T., RA Hiraoka Y.; RT "Meiotic proteins bqt1 and bqt2 tether telomeres to form the bouquet RT arrangement of chromosomes."; RL Cell 125:59-69(2006). RN [7] RP SUBCELLULAR LOCATION [LARGE SCALE ANALYSIS]. RX PubMed=16823372; DOI=10.1038/nbt1222; RA Matsuyama A., Arai R., Yashiroda Y., Shirai A., Kamata A., Sekido S., RA Kobayashi Y., Hashimoto A., Hamamoto M., Hiraoka Y., Horinouchi S., RA Yoshida M.; RT "ORFeome cloning and global analysis of protein localization in the RT fission yeast Schizosaccharomyces pombe."; RL Nat. Biotechnol. 24:841-847(2006). CC -!- FUNCTION: Associates with the spindle pole body and maintains a CC functional interface between the nuclear membrane and the CC microtubule motor proteins. Involved in chromosome segregation CC during meiosis where it associates with the telomeres. CC {ECO:0000269|PubMed:16615890}. CC -!- SUBUNIT: Interacts with bqt1. The bqt1-bqt2-sad1 complex binds CC rap1. Interacts also with cdc31. {ECO:0000269|PubMed:12857865, CC ECO:0000269|PubMed:16615890}. CC -!- INTERACTION: CC Q92358:bqt1; NbExp=4; IntAct=EBI-929731, EBI-929655; CC P87245:kms1; NbExp=5; IntAct=EBI-929731, EBI-1542265; CC O74843:sif1; NbExp=3; IntAct=EBI-929731, EBI-1542307; CC O94531:ufe1; NbExp=3; IntAct=EBI-929731, EBI-1542297; CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytoskeleton, microtubule CC organizing center, spindle pole body. Nucleus membrane; Single- CC pass membrane protein. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; X85105; CAA59426.1; -; Genomic_DNA. DR EMBL; CU329671; CAA17899.1; -; Genomic_DNA. DR EMBL; AB027829; BAA87133.1; -; Genomic_DNA. DR PIR; A57280; A57280. DR RefSeq; NP_595947.2; NM_001021855.3. DR BioGrid; 276458; 45. DR IntAct; Q09825; 30. DR MINT; MINT-4695470; -. DR MaxQB; Q09825; -. DR EnsemblFungi; SPBC12D12.01.1; SPBC12D12.01.1:pep; SPBC12D12.01. DR GeneID; 2539914; -. DR KEGG; spo:SPBC12D12.01; -. DR EuPathDB; FungiDB:SPBC12D12.01; -. DR PomBase; SPBC12D12.01; sad1. DR InParanoid; Q09825; -. DR KO; K19347; -. DR OrthoDB; EOG7W15C8; -. DR PhylomeDB; Q09825; -. DR NextBio; 20801057; -. DR PRO; PR:Q09825; -. DR Proteomes; UP000002485; Chromosome II. DR GO; GO:0000780; C:condensed nuclear chromosome, centromeric region; IDA:PomBase. DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW. DR GO; GO:0005639; C:integral component of nuclear inner membrane; TAS:PomBase. DR GO; GO:0031021; C:interphase microtubule organizing center; IDA:PomBase. DR GO; GO:0034993; C:LINC complex; IPI:PomBase. DR GO; GO:0035974; C:meiotic spindle pole body; IDA:PomBase. DR GO; GO:0005874; C:microtubule; IEA:UniProtKB-KW. DR GO; GO:0044732; C:mitotic spindle pole body; IDA:PomBase. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IDA:PomBase. DR GO; GO:0005635; C:nuclear envelope; IDA:PomBase. DR GO; GO:0035861; C:site of double-strand break; IDA:PomBase. DR GO; GO:0051301; P:cell division; IEA:UniProtKB-KW. DR GO; GO:0072766; P:centromere clustering at the nuclear envelope; IMP:PomBase. DR GO; GO:0090307; P:mitotic spindle assembly; IMP:PomBase. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Cell cycle; Cell division; Complete proteome; Cytoplasm; Cytoskeleton; KW Membrane; Microtubule; Mitosis; Nucleus; Phosphoprotein; KW Reference proteome; Transmembrane; Transmembrane helix. FT CHAIN 1 514 Spindle pole body-associated protein FT sad1. FT /FTId=PRO_0000218915. FT TRANSMEM 168 188 Helical. {ECO:0000255}. FT DOMAIN 322 488 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT MOD_RES 236 236 Phosphoserine; by CDC2. {ECO:0000255}. SQ SEQUENCE 514 AA; 58077 MW; DCA4CAC1E1F98F0D CRC64; MFTNTPVGGK RERQNGAHPA WSTLGANSAQ IHQNTADLAS KMHKLRYTKI RSPPTRVSIE SITPKQRFPA PNFEQAYHSN IRYEQEESDN EEFENVVKNG HEASTNVFYE SDGDDEEFVN EEYENSIDEE SDDEGYSLNE DTTATNASFR YPMNQRSTRK SQFYSSKFKP LLWFGITLFS TLLIITLLHK GQEFYSRSFS SDNSQPSNSP VPNIPPASND TKTSLKPDII KDFTDSPSKV GGNEEFDYST GDLITKKEFD KILQQKVEQL KQSLKEEMSN YKSSVPFEVE LNDDWKFFIE STVRKYLTDP VSMPNFALLS TGAEVLPALT SKRYVRRPSA FIPRFTSYFF DSLVVRGHEP SIALTPNNAV AMCWSFQGSE GQLGISLSRP VYVTNVTIEH VQHKIAHDLS SAPKDFELWV QGMSSKMFVL LGKARYSLTE DSIQTFSFES SNYIVAEPIQ NVILKIKSNW GNPNYTCLYQ VRVHGTVPNA DEQPIPSLGE KAESTAENTG QDSS // ID SLP1_YEAST Reviewed; 587 AA. AC Q12232; D6W2L1; DT 30-MAY-2006, integrated into UniProtKB/Swiss-Prot. DT 01-NOV-1996, sequence version 1. DT 11-NOV-2015, entry version 102. DE RecName: Full=Uncharacterized protein SLP1 {ECO:0000305}; DE AltName: Full=SUN-like protein 1 {ECO:0000303|PubMed:16923827}; DE Flags: Precursor; GN Name=SLP1 {ECO:0000303|PubMed:16923827}; GN OrderedLocusNames=YOR154W {ECO:0000312|SGD:S000005680}; GN ORFNames=O3545; OS Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Saccharomyces. OX NCBI_TaxID=559292; RN [1] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA]. RC STRAIN=S288c / FY1678; RX PubMed=9046089; RX DOI=10.1002/(SICI)1097-0061(199701)13:1<73::AID-YEA52>3.0.CO;2-M; RA Bordonne R., Camasses A., Madania A., Poch O., Tarassov I.A., RA Winsor B., Martin R.P.; RT "Analysis of a 35.6 kb region on the right arm of Saccharomyces RT cerevisiae chromosome XV."; RL Yeast 13:73-83(1997). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=ATCC 204508 / S288c; RX PubMed=9169874; RA Dujon B., Albermann K., Aldea M., Alexandraki D., Ansorge W., RA Arino J., Benes V., Bohn C., Bolotin-Fukuhara M., Bordonne R., RA Boyer J., Camasses A., Casamayor A., Casas C., Cheret G., RA Cziepluch C., Daignan-Fornier B., Dang V.-D., de Haan M., Delius H., RA Durand P., Fairhead C., Feldmann H., Gaillon L., Galisson F., RA Gamo F.-J., Gancedo C., Goffeau A., Goulding S.E., Grivell L.A., RA Habbig B., Hand N.J., Hani J., Hattenhorst U., Hebling U., RA Hernando Y., Herrero E., Heumann K., Hiesel R., Hilger F., Hofmann B., RA Hollenberg C.P., Hughes B., Jauniaux J.-C., Kalogeropoulos A., RA Katsoulou C., Kordes E., Lafuente M.J., Landt O., Louis E.J., RA Maarse A.C., Madania A., Mannhaupt G., Marck C., Martin R.P., RA Mewes H.-W., Michaux G., Paces V., Parle-McDermott A.G., Pearson B.M., RA Perrin A., Pettersson B., Poch O., Pohl T.M., Poirey R., RA Portetelle D., Pujol A., Purnelle B., Ramezani Rad M., Rechmann S., RA Schwager C., Schweizer M., Sor F., Sterky F., Tarassov I.A., RA Teodoru C., Tettelin H., Thierry A., Tobiasch E., Tzermia M., RA Uhlen M., Unseld M., Valens M., Vandenbol M., Vetter I., Vlcek C., RA Voet M., Volckaert G., Voss H., Wambutt R., Wedler H., Wiemann S., RA Winsor B., Wolfe K.H., Zollner A., Zumstein E., Kleine K.; RT "The nucleotide sequence of Saccharomyces cerevisiae chromosome XV."; RL Nature 387:98-102(1997). RN [3] RP GENOME REANNOTATION. RC STRAIN=ATCC 204508 / S288c; RX PubMed=24374639; DOI=10.1534/g3.113.008995; RA Engel S.R., Dietrich F.S., Fisk D.G., Binkley G., Balakrishnan R., RA Costanzo M.C., Dwight S.S., Hitz B.C., Karra K., Nash R.S., Weng S., RA Wong E.D., Lloyd P., Skrzypek M.S., Miyasato S.R., Simison M., RA Cherry J.M.; RT "The reference genome sequence of Saccharomyces cerevisiae: Then and RT now."; RL G3 (Bethesda) 4:389-398(2014). RN [4] RP SUBCELLULAR LOCATION. RX PubMed=11935221; DOI=10.1007/s00294-001-0264-9; RA Terashima H., Fukuchi S., Nakai K., Arisawa M., Hamada K., Yabuki N., RA Kitada K.; RT "Sequence-based approach for identification of cell wall proteins in RT Saccharomyces cerevisiae."; RL Curr. Genet. 40:311-316(2002). RN [5] RP LEVEL OF PROTEIN EXPRESSION [LARGE SCALE ANALYSIS]. RX PubMed=14562106; DOI=10.1038/nature02046; RA Ghaemmaghami S., Huh W.-K., Bower K., Howson R.W., Belle A., RA Dephoure N., O'Shea E.K., Weissman J.S.; RT "Global analysis of protein expression in yeast."; RL Nature 425:737-741(2003). RN [6] RP GENE NAME. RX PubMed=16923827; DOI=10.1083/jcb.200601062; RA Jaspersen S.L., Martin A.E., Glazko G., Giddings T.H. Jr., Morgan G., RA Mushegian A., Winey M.; RT "The Sad1-UNC-84 homology domain in Mps3 interacts with Mps2 to RT connect the spindle pole body with the nuclear envelope."; RL J. Cell Biol. 174:665-675(2006). RN [7] RP FUNCTION. RX PubMed=19325107; DOI=10.1126/science.1167983; RA Jonikas M.C., Collins S.R., Denic V., Oh E., Quan E.M., Schmid V., RA Weibezahn J., Schwappach B., Walter P., Weissman J.S., Schuldiner M.; RT "Comprehensive characterization of genes required for protein folding RT in the endoplasmic reticulum."; RL Science 323:1693-1697(2009). RN [8] RP FUNCTION, SUBCELLULAR LOCATION, AND INTERACTION WITH EMP56. RX PubMed=23275891; DOI=10.1534/g3.112.004614; RA Friederichs J.M., Gardner J.M., Smoyer C.J., Whetstine C.R., Gogol M., RA Slaughter B.D., Jaspersen S.L.; RT "Genetic analysis of Mps3 SUN domain mutants in Saccharomyces RT cerevisiae reveals an interaction with the SUN-like protein Slp1."; RL G3 (Bethesda) 2:1703-1718(2012). CC -!- FUNCTION: May be involved in membrane protein folding CC (PubMed:19325107). Required for localization of MPS3 to the CC nuclear envelope (PubMed:23275891). {ECO:0000269|PubMed:19325107, CC ECO:0000269|PubMed:23275891}. CC -!- SUBUNIT: Interacts with EMP65. {ECO:0000269|PubMed:23275891}. CC -!- INTERACTION: CC P40085:YER140W; NbExp=1; IntAct=EBI-35990, EBI-22717; CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane CC {ECO:0000269|PubMed:11935221, ECO:0000269|PubMed:23275891}; CC Single-pass type I membrane protein {ECO:0000255}. CC -!- MISCELLANEOUS: Present with 3250 molecules/cell in log phase SD CC medium. {ECO:0000269|PubMed:14562106}. CC -!- SIMILARITY: Belongs to the SLP1 family. {ECO:0000305}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; U55020; AAC49640.1; -; Genomic_DNA. DR EMBL; Z75062; CAA99360.1; -; Genomic_DNA. DR EMBL; BK006948; DAA10927.1; -; Genomic_DNA. DR PIR; S67042; S67042. DR RefSeq; NP_014797.1; NM_001183573.1. DR ProteinModelPortal; Q12232; -. DR BioGrid; 34550; 53. DR DIP; DIP-3829N; -. DR IntAct; Q12232; 1. DR MINT; MINT-535324; -. DR MaxQB; Q12232; -. DR EnsemblFungi; YOR154W; YOR154W; YOR154W. DR GeneID; 854325; -. DR KEGG; sce:YOR154W; -. DR EuPathDB; FungiDB:YOR154W; -. DR SGD; S000005680; SLP1. DR GeneTree; ENSGT00390000013502; -. DR HOGENOM; HOG000093382; -. DR InParanoid; Q12232; -. DR OMA; ESIVMAN; -. DR OrthoDB; EOG7SBNXT; -. DR BioCyc; YEAST:G3O-33671-MONOMER; -. DR NextBio; 976369; -. DR PRO; PR:Q12232; -. DR Proteomes; UP000002311; Chromosome XV. DR GO; GO:0030176; C:integral component of endoplasmic reticulum membrane; IDA:SGD. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; IGI:SGD. DR Gene3D; 2.60.120.260; -; 1. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Complete proteome; Endoplasmic reticulum; Glycoprotein; Membrane; KW Reference proteome; Signal; Transmembrane; Transmembrane helix. FT SIGNAL 1 21 {ECO:0000255}. FT CHAIN 22 587 Uncharacterized protein SLP1. FT /FTId=PRO_0000237659. FT TOPO_DOM 22 541 Lumenal. {ECO:0000305|PubMed:23275891}. FT TRANSMEM 542 562 Helical. {ECO:0000255}. FT TOPO_DOM 563 587 Cytoplasmic. FT {ECO:0000305|PubMed:23275891}. FT DOMAIN 163 331 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT CARBOHYD 25 25 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 378 378 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 381 381 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 408 408 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 448 448 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 486 486 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. SQ SEQUENCE 587 AA; 67452 MW; 4A457C95124FD0B8 CRC64; MANRLLIYGL ILWVSIIGSF ALDRNKTAQN AKIGLHDTTV ITTGSTTNVQ KEHSSPLSTG SLRTHDFRQA SKVDIRQADI RENGERKEQD ALTQPATPRN PGDSSNSFLS FDEWKKVKSK EHSSGPERHL SRVREPVDPS CYKEKECIGE ELEIDLGFLT NKNEWSEREE NQKGFNEEKD IEKVYKKKFN YASLDCAATI VKSNPEAIGA TSTLIESKDK YLLNPCSAPQ QFIVIELCED ILVEEIEIAN YEFFSSTFKR FRVSVSDRIP MVKNEWTILG EFEARNSREL QKFQIHNPQI WASYLKIEIL SHYEDEFYCP ISLIKVYGKS MMDEFKIDQL KAQEDKEQSI GTNNINNLNE QNIQDRCNNI ETRLETPNTS NLSDLAGALS CTSKLIPLKF DEFFKVLNAS FCPSKQMISS SSSSAVPVIP EESIFKNIMK RLSQLETNSS LTVSYIEEQS KLLSKSFEQL EMAHEAKFSH LVTIFNETMM SNLDLLNNFA NQLKDQSLRI LEEQKLENDK FTNRHLLHLE RLEKEVSFQR RIVYASFFAF VGLISYLLIT RELYFEDFEE SKNGAIEKAD IVQQAIR // ID SLPI_SCHPO Reviewed; 659 AA. AC O59729; DT 10-FEB-2009, integrated into UniProtKB/Swiss-Prot. DT 01-AUG-1998, sequence version 1. DT 11-NOV-2015, entry version 80. DE RecName: Full=Uncharacterized protein slp1 {ECO:0000250|UniProtKB:Q12232}; DE AltName: Full=SUN-like protein 1 {ECO:0000250|UniProtKB:Q12232}; DE Flags: Precursor; GN ORFNames=SPBC3E7.09 {ECO:0000312|PomBase:SPBC3E7.09}; OS Schizosaccharomyces pombe (strain 972 / ATCC 24843) (Fission yeast). OC Eukaryota; Fungi; Dikarya; Ascomycota; Taphrinomycotina; OC Schizosaccharomycetes; Schizosaccharomycetales; OC Schizosaccharomycetaceae; Schizosaccharomyces. OX NCBI_TaxID=284812; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=972 / ATCC 24843; RX PubMed=11859360; DOI=10.1038/nature724; RA Wood V., Gwilliam R., Rajandream M.A., Lyne M.H., Lyne R., Stewart A., RA Sgouros J.G., Peat N., Hayles J., Baker S.G., Basham D., Bowman S., RA Brooks K., Brown D., Brown S., Chillingworth T., Churcher C.M., RA Collins M., Connor R., Cronin A., Davis P., Feltwell T., Fraser A., RA Gentles S., Goble A., Hamlin N., Harris D.E., Hidalgo J., Hodgson G., RA Holroyd S., Hornsby T., Howarth S., Huckle E.J., Hunt S., Jagels K., RA James K.D., Jones L., Jones M., Leather S., McDonald S., McLean J., RA Mooney P., Moule S., Mungall K.L., Murphy L.D., Niblett D., Odell C., RA Oliver K., O'Neil S., Pearson D., Quail M.A., Rabbinowitsch E., RA Rutherford K.M., Rutter S., Saunders D., Seeger K., Sharp S., RA Skelton J., Simmonds M.N., Squares R., Squares S., Stevens K., RA Taylor K., Taylor R.G., Tivey A., Walsh S.V., Warren T., Whitehead S., RA Woodward J.R., Volckaert G., Aert R., Robben J., Grymonprez B., RA Weltjens I., Vanstreels E., Rieger M., Schaefer M., Mueller-Auer S., RA Gabel C., Fuchs M., Duesterhoeft A., Fritzc C., Holzer E., Moestl D., RA Hilbert H., Borzym K., Langer I., Beck A., Lehrach H., Reinhardt R., RA Pohl T.M., Eger P., Zimmermann W., Wedler H., Wambutt R., Purnelle B., RA Goffeau A., Cadieu E., Dreano S., Gloux S., Lelaure V., Mottier S., RA Galibert F., Aves S.J., Xiang Z., Hunt C., Moore K., Hurst S.M., RA Lucas M., Rochet M., Gaillardin C., Tallada V.A., Garzon A., Thode G., RA Daga R.R., Cruzado L., Jimenez J., Sanchez M., del Rey F., Benito J., RA Dominguez A., Revuelta J.L., Moreno S., Armstrong J., Forsburg S.L., RA Cerutti L., Lowe T., McCombie W.R., Paulsen I., Potashkin J., RA Shpakovski G.V., Ussery D., Barrell B.G., Nurse P.; RT "The genome sequence of Schizosaccharomyces pombe."; RL Nature 415:871-880(2002). CC -!- FUNCTION: May be involved in membrane protein folding. CC {ECO:0000250|UniProtKB:Q12232}. CC -!- SUBUNIT: Interacts with EMP65. {ECO:0000250|UniProtKB:Q12232}. CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane CC {ECO:0000250|UniProtKB:Q12232}; Single-pass type I membrane CC protein {ECO:0000255}. CC -!- SIMILARITY: Belongs to the SLP1 family. {ECO:0000305}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; CU329671; CAA19012.1; -; Genomic_DNA. DR PIR; T40383; T40383. DR RefSeq; NP_596096.1; NM_001022012.2. DR ProteinModelPortal; O59729; -. DR MINT; MINT-4675723; -. DR MaxQB; O59729; -. DR EnsemblFungi; SPBC3E7.09.1; SPBC3E7.09.1:pep; SPBC3E7.09. DR GeneID; 2540275; -. DR KEGG; spo:SPBC3E7.09; -. DR EuPathDB; FungiDB:SPBC3E7.09; -. DR PomBase; SPBC3E7.09; -. DR InParanoid; O59729; -. DR OrthoDB; EOG7SBNXT; -. DR PhylomeDB; O59729; -. DR NextBio; 20801405; -. DR PRO; PR:O59729; -. DR Proteomes; UP000002485; Chromosome II. DR GO; GO:0005789; C:endoplasmic reticulum membrane; IEA:UniProtKB-SubCell. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0034975; P:protein folding in endoplasmic reticulum; ISO:PomBase. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 3: Inferred from homology; KW Complete proteome; Endoplasmic reticulum; Glycoprotein; Membrane; KW Reference proteome; Signal; Transmembrane; Transmembrane helix. FT SIGNAL 1 25 {ECO:0000255}. FT CHAIN 26 659 Uncharacterized protein slp1. FT /FTId=PRO_0000363387. FT TOPO_DOM 26 556 Lumenal. {ECO:0000250|UniProtKB:Q12232}. FT TRANSMEM 557 574 Helical. {ECO:0000255}. FT TOPO_DOM 575 659 Cytoplasmic. FT {ECO:0000250|UniProtKB:Q12232}. FT DOMAIN 173 335 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COMPBIAS 435 443 Poly-Ser. {ECO:0000255}. FT CARBOHYD 94 94 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 111 111 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 128 128 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 142 142 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 393 393 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 415 415 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 495 495 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. FT CARBOHYD 504 504 N-linked (GlcNAc...). FT {ECO:0000255|PROSITE-ProRule:PRU00498}. SQ SEQUENCE 659 AA; 74072 MW; B0AC716FA05263CC CRC64; MVKRRLSAFG NAFLIYFIIF RLCCCSPQTS HWCKYPALCL KSPDTHNENL VCDAYLSVIA TKSEEKEASN PTTWDFTPTN KYQEPSFHTK TSLNGSDTIS SNFLSKYEYS NGTSTSEFID SISPPLVNET STISSSKKLE QNYSVTEVID TNIITSSSVT LPISEDGSST SAAATIDSNI DEKTVAFSEE KRFNFASTDC AAAVIKTNPE AVGSSSILTE NKDKYMLNKC SAENKFVVIE LCEDIYVDTV QIANFEFFSS IFRDFKVSVS GKYPKYESSW MELGTFTALN LRTLQSFHIE NPLIWAKYLK IEFLTHYGSE FYCPVSLLRV YGKTMIEEFE EANEDFLEQK VNDGSAIKAD EIRKPQESPI FVDEEDTDVQ SKPVRKNPSV ELNSTDTLLS STVISKSLST VVIGNETGKS ESYPATSTRS FNDISPSSSS SYSTAQISTF PSNQESIYKN INKRLSTLEE RKKAFDEIVE KILTNYGKHN AKNMNFTQLL HELNSTLQLE ISKLSKSVVK PSLFALQAKL ELLSAENEYF QSQITSLYQE SSFQKRLLML QLTVLIVLTV YMAVSRLPEN LPTTRSSSNN PIEASRPPFS RDEQDISKAN DFRVSASSAV YTVGPELLQR KKRDPNTSIR SIHEREQDKI IHSRSHSVC // ID SPAG4_HUMAN Reviewed; 437 AA. AC Q9NPE6; O43648; DT 17-JAN-2003, integrated into UniProtKB/Swiss-Prot. DT 01-OCT-2000, sequence version 1. DT 11-NOV-2015, entry version 113. DE RecName: Full=Sperm-associated antigen 4 protein; DE AltName: Full=Outer dense fiber-associated protein SPAG4; DE AltName: Full=SUN domain-containing protein 4; GN Name=SPAG4; Synonyms=SUN4; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA]. RX PubMed=14614621; DOI=10.1007/s00441-003-0821-2; RA Kennedy C., Sebire K., De Kretser D.M., O'Bryan M.K.; RT "Human sperm associated antigen 4 (SPAG4) is a potential cancer RT marker."; RL Cell Tissue Res. 315:279-283(2004). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=11780052; DOI=10.1038/414865a; RA Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., RA Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., RA Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., RA Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M., Brown A.J., RA Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., RA Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., RA Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., RA Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., RA Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., RA Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., RA Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., RA Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., RA Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., RA Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., RA Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., RA Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., RA Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., RA Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., RA Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., RA Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., RA Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., RA Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., RA Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., RA Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., RA Rogers J.; RT "The DNA sequence and comparative analysis of human chromosome 20."; RL Nature 414:865-871(2001). RN [3] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 265-303. RX PubMed=9691178; RA Tarnasky H., Gill D., Murthy S., Shao X., Demetrick D.J., RA van der Hoorn F.A.; RT "A novel testis-specific gene, SPAG4, whose product interacts RT specifically with outer dense fiber protein ODF27, maps to human RT chromosome 20q11.2."; RL Cytogenet. Cell Genet. 81:65-67(1998). CC -!- FUNCTION: May assist the organization and assembly of outer dense CC fibers (ODFs), a specific structure of the sperm tail. CC -!- SUBUNIT: Homodimer. Interacts with ODF1. May associate with CC microtubules (By similarity). {ECO:0000250}. CC -!- INTERACTION: CC Q8IYM1:SEPT12; NbExp=6; IntAct=EBI-10819434, EBI-2585067; CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000305}; Multi-pass membrane CC protein {ECO:0000305}. Cytoplasm, cytoskeleton. Cytoplasm, CC cytoskeleton, flagellum axoneme {ECO:0000250}. Note=In spermatids, CC it is localized in the transient manchette and in the axoneme of CC elongating spermatids and epididymal sperm. {ECO:0000250}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC -!- CAUTION: Although transmembrane domains are strongly predicted, CC they may rather represent hydrophobic globular domains associated CC with microtubules. {ECO:0000305}. CC -!- SEQUENCE CAUTION: CC Sequence=AAC32052.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305}; CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AF262992; AAF75267.1; -; mRNA. DR EMBL; AF262993; AAF75268.1; -; Genomic_DNA. DR EMBL; AL109827; CAB87609.1; -; Genomic_DNA. DR EMBL; AF043344; AAC32052.1; ALT_SEQ; Genomic_DNA. DR CCDS; CCDS13259.1; -. DR RefSeq; NP_003107.1; NM_003116.1. DR UniGene; Hs.123159; -. DR ProteinModelPortal; Q9NPE6; -. DR SMR; Q9NPE6; 241-422. DR BioGrid; 112558; 2. DR IntAct; Q9NPE6; 1. DR STRING; 9606.ENSP00000363391; -. DR PhosphoSite; Q9NPE6; -. DR BioMuta; SPAG4; -. DR DMDM; 27805726; -. DR PaxDb; Q9NPE6; -. DR PRIDE; Q9NPE6; -. DR DNASU; 6676; -. DR Ensembl; ENST00000374273; ENSP00000363391; ENSG00000061656. DR GeneID; 6676; -. DR KEGG; hsa:6676; -. DR UCSC; uc002xdb.1; human. DR CTD; 6676; -. DR GeneCards; SPAG4; -. DR HGNC; HGNC:11214; SPAG4. DR HPA; HPA048393; -. DR HPA; HPA061789; -. DR MIM; 603038; gene. DR neXtProt; NX_Q9NPE6; -. DR PharmGKB; PA36050; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000246956; -. DR HOVERGEN; HBG079205; -. DR InParanoid; Q9NPE6; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; Q9NPE6; -. DR TreeFam; TF323915; -. DR GenomeRNAi; 6676; -. DR NextBio; 26031; -. DR PRO; PR:Q9NPE6; -. DR Proteomes; UP000005640; Chromosome 20. DR Bgee; Q9NPE6; -. DR CleanEx; HS_SPAG4; -. DR ExpressionAtlas; Q9NPE6; baseline and differential. DR Genevisible; Q9NPE6; HS. DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW. DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-SubCell. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0031514; C:motile cilium; IEA:UniProtKB-KW. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0005198; F:structural molecule activity; TAS:ProtInc. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; TAS:ProtInc. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Cell projection; Cilium; Coiled coil; Complete proteome; Cytoplasm; KW Cytoskeleton; Flagellum; Membrane; Reference proteome; Transmembrane; KW Transmembrane helix. FT CHAIN 1 437 Sperm-associated antigen 4 protein. FT /FTId=PRO_0000218916. FT TRANSMEM 135 155 Helical. {ECO:0000255}. FT TRANSMEM 166 186 Helical. {ECO:0000255}. FT DOMAIN 265 425 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 197 244 {ECO:0000255}. SQ SEQUENCE 437 AA; 48165 MW; 5D1E3BCAEEE8B702 CRC64; MRRSSRPGSA SSSRKHTPNF FSENSSMSIT SEDSKGLRSA EPGPGEPEGR RARGPSCGEP ALSAGVPGGT TWAGSSQQKP APRSHNWQTA CGAATVRGGA SEPTGSPVVS EEPLDLLPTL DLRQEMPPPR VFKSFLSLLF QGLSVLLSLA GDVLVSMYRE VCSIRFLFTA VSLLSLFLSA FWLGLLYLVS PLENEPKEML TLSEYHERVR SQGQQLQQLQ AELDKLHKEV STVRAANSER VAKLVFQRLN EDFVRKPDYA LSSVGASIDL QKTSHDYADR NTAYFWNRFS FWNYARPPTV ILEPHVFPGN CWAFEGDQGQ VVIQLPGRVQ LSDITLQHPP PSVEHTGGAN SAPRDFAVFG LQVYDETEVS LGKFTFDVEK SEIQTFHLQN DPPAAFPKVK IQILSNWGHP RFTCLYRVRA HGVRTSEGAE GSAQGPH // ID SPAG4_MOUSE Reviewed; 443 AA. AC Q9JJF2; A3KGK4; DT 17-JAN-2003, integrated into UniProtKB/Swiss-Prot. DT 27-JUL-2011, sequence version 3. DT 11-NOV-2015, entry version 105. DE RecName: Full=Sperm-associated antigen 4 protein; DE AltName: Full=Outer dense fiber-associated protein SPAG4; DE AltName: Full=SUN domain-containing protein 4; GN Name=Spag4; ORFNames=MNCb-0953, Sun4; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). RC STRAIN=C57BL/6J; TISSUE=Brain; RA Osada N., Kusuda J., Tanuma R., Ito A., Hirata M., Sugano S., RA Hashimoto K.; RT "Isolation of full-length cDNA clones from mouse brain cDNA library RT made by oligo-capping method."; RL Submitted (APR-2000) to the EMBL/GenBank/DDBJ databases. RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C57BL/6J; RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112; RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., RA She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., RA Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., RA Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., RA Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., RA Lindblad-Toh K., Eichler E.E., Ponting C.P.; RT "Lineage-specific biology revealed by a finished genome assembly of RT the mouse."; RL PLoS Biol. 7:E1000112-E1000112(2009). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.; RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases. RN [4] RP NUCLEOTIDE SEQUENCE [MRNA] OF 1-274 (ISOFORM 1). RC TISSUE=Testis; RX PubMed=11042159; DOI=10.1101/gr.145100; RA Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M., RA Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.; RT "Normalization and subtraction of cap-trapper-selected cDNAs to RT prepare full-length cDNA libraries for rapid discovery of new genes."; RL Genome Res. 10:1617-1630(2000). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 15-443 (ISOFORM 1). RC TISSUE=Testis; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [6] RP TISSUE SPECIFICITY. RX PubMed=10373309; DOI=10.1006/dbio.1999.9297; RA Shao X., Tarnasky H.A., Lee J.P., Oko R., van der Hoorn F.A.; RT "Spag4, a novel sperm protein, binds outer dense-fiber protein Odf1 RT and localizes to microtubules of manchette and axoneme."; RL Dev. Biol. 211:109-123(1999). RN [7] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Testis; RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001; RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R., RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.; RT "A tissue-specific atlas of mouse protein phosphorylation and RT expression."; RL Cell 143:1174-1189(2010). CC -!- FUNCTION: May assist the organization and assembly of outer dense CC fibers (ODFs), a specific structure of the sperm tail. CC -!- SUBUNIT: Homodimer. Interacts with ODF1. May associate with CC microtubules (By similarity). {ECO:0000250}. CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000305}; Multi-pass membrane CC protein {ECO:0000305}. Cytoplasm, cytoskeleton {ECO:0000250}. CC -!- SUBCELLULAR LOCATION: Isoform 1: Membrane {ECO:0000305}; Multi- CC pass membrane protein {ECO:0000305}. Cytoplasm, cytoskeleton CC {ECO:0000250}. Cytoplasm, cytoskeleton, flagellum axoneme CC {ECO:0000250}. Note=In spermatids, isoform 1 is localized in the CC transient manchette and in the axoneme of elongating spermatids CC and epididymal sperm. {ECO:0000250}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=1; CC IsoId=Q9JJF2-1; Sequence=Displayed; CC Note=Derived from EST data.; CC Name=2; CC IsoId=Q9JJF2-2; Sequence=VSP_005958, VSP_005959; CC -!- TISSUE SPECIFICITY: Isoform 1 is testis specific and is CC exclusively expressed in spermatids. CC {ECO:0000269|PubMed:10373309}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC -!- CAUTION: Although transmembrane domains are strongly predicted, CC they may rather represent hydrophobic globular domains associated CC with microtubules. {ECO:0000305}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AB041554; BAA95039.1; -; mRNA. DR EMBL; AL833786; CAM46026.1; -; Genomic_DNA. DR EMBL; CH466551; EDL06167.1; -; Genomic_DNA. DR EMBL; BB610287; -; NOT_ANNOTATED_CDS; mRNA. DR EMBL; BU937622; -; NOT_ANNOTATED_CDS; mRNA. DR EMBL; CA466584; -; NOT_ANNOTATED_CDS; mRNA. DR CCDS; CCDS50777.1; -. [Q9JJF2-1] DR RefSeq; NP_631890.3; NM_139151.4. [Q9JJF2-1] DR UniGene; Mm.330713; -. DR ProteinModelPortal; Q9JJF2; -. DR SMR; Q9JJF2; 243-424. DR STRING; 10090.ENSMUSP00000036484; -. DR PaxDb; Q9JJF2; -. DR PRIDE; Q9JJF2; -. DR Ensembl; ENSMUST00000038860; ENSMUSP00000036484; ENSMUSG00000038180. [Q9JJF2-1] DR GeneID; 245865; -. DR KEGG; mmu:245865; -. DR UCSC; uc008nmc.1; mouse. [Q9JJF2-1] DR CTD; 6676; -. DR MGI; MGI:2444120; Spag4. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000246956; -. DR HOVERGEN; HBG079205; -. DR InParanoid; Q9JJF2; -. DR OMA; KHTPNFY; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 386968; -. DR PRO; PR:Q9JJF2; -. DR Proteomes; UP000000589; Chromosome 2. DR Bgee; Q9JJF2; -. DR CleanEx; MM_SPAG4; -. DR ExpressionAtlas; Q9JJF2; baseline and differential. DR Genevisible; Q9JJF2; MM. DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW. DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-SubCell. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0031514; C:motile cilium; IEA:UniProtKB-KW. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Alternative splicing; Cell projection; Cilium; Coiled coil; KW Complete proteome; Cytoplasm; Cytoskeleton; Flagellum; Membrane; KW Reference proteome; Transmembrane; Transmembrane helix. FT CHAIN 1 443 Sperm-associated antigen 4 protein. FT /FTId=PRO_0000218917. FT TRANSMEM 137 157 Helical. {ECO:0000255}. FT TRANSMEM 168 188 Helical. {ECO:0000255}. FT DOMAIN 267 427 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 203 244 {ECO:0000255}. FT VAR_SEQ 98 103 VRGGAS -> NRLDLL (in isoform 2). FT {ECO:0000303|Ref.1}. FT /FTId=VSP_005958. FT VAR_SEQ 104 443 Missing (in isoform 2). FT {ECO:0000303|Ref.1}. FT /FTId=VSP_005959. FT CONFLICT 81 81 K -> E (in Ref. 1; BAA95039). FT {ECO:0000305}. FT CONFLICT 259 259 P -> T (in Ref. 1; BAA95039). FT {ECO:0000305}. SQ SEQUENCE 443 AA; 48582 MW; F5B8BD2F57E5F200 CRC64; MRRSPRSGSA ASSHNHTPNF YSENSNSSHS ATSGDSNGRR SAGPELGEPE GRRARGSSCG EPALSPGMPG GDTWAGSSRP KLAPRSHNGQ TACGAATVRG GASEPSGSSV VLEEQLNLLP ILDLRQEMPT PRVSKSFLSL LFQVLSMVLS LAVDGLVCVC REICSIRFLF TAVSLLSIFL AALWWGLLYL IPPLENEPTE MLTLSQYHHR VHSQGQQLQQ LQAELNKLHK EVSSVRAAHS ERVAKLVFQR LNEDFVRKPD YALSSVGASI DLEKTSSDYE DQNTAYFWNR LSFWNYARPP SVILEPDVFP GNCWAFEGDK GQVVIRLPGH VQLSDITLQH PPPTVAHTGG ASSAPRDFAV YGLQADDETE VFLGKFIFDV QKSEIQTFHL QNDPPSAFPK VKIQILSNWG HPRFTCLYRV RAHGVRTSEW ADDNATGVTG GPH // ID SPAG4_RAT Reviewed; 444 AA. AC O55034; DT 17-JAN-2003, integrated into UniProtKB/Swiss-Prot. DT 01-MAY-2000, sequence version 2. DT 11-NOV-2015, entry version 92. DE RecName: Full=Sperm-associated antigen 4 protein; DE AltName: Full=Outer dense fiber-associated protein SPAG4; DE AltName: Full=SUN domain-containing protein 4; GN Name=Spag4; Synonyms=Sun4; OS Rattus norvegicus (Rat). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Rattus. OX NCBI_TaxID=10116; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA], CHARACTERIZATION, DEVELOPMENTAL STAGE, RP HOMODIMERIZATION, AND INTERACTION WITH ODF1. RC TISSUE=Testis; RX PubMed=10373309; DOI=10.1006/dbio.1999.9297; RA Shao X., Tarnasky H.A., Lee J.P., Oko R., van der Hoorn F.A.; RT "Spag4, a novel sperm protein, binds outer dense-fiber protein Odf1 RT and localizes to microtubules of manchette and axoneme."; RL Dev. Biol. 211:109-123(1999). CC -!- FUNCTION: May assist the organization and assembly of outer dense CC fibers (ODFs), a specific structure of the sperm tail. CC -!- SUBUNIT: Homodimer. Interacts with ODF1. May associate with CC microtubules. {ECO:0000269|PubMed:10373309}. CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000305}; Multi-pass membrane CC protein {ECO:0000305}. Cytoplasm, cytoskeleton. Cytoplasm, CC cytoskeleton, flagellum axoneme. Note=In spermatids, it is CC localized in the transient manchette and in the axoneme of CC elongating spermatids and epididymal sperm. CC -!- TISSUE SPECIFICITY: Testis specific. Exclusively expressed in CC spermatids. CC -!- DEVELOPMENTAL STAGE: Exclusively expressed in spermatids. Not CC present in mature sperm. Localized with both the manchette and the CC axoneme in step 10-11 spermatids. Detected on manchette CC microtubules of step 12 spermatids. Associates with the tail in CC steps 18-19 spermatids and epididymal sperm. CC {ECO:0000269|PubMed:10373309}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC -!- CAUTION: Although transmembrane domains are strongly predicted, CC they may rather represent hydrophobic globular domains associated CC with microtubules. {ECO:0000305}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AF043345; AAC32053.2; -; mRNA. DR RefSeq; NP_113980.1; NM_031792.1. DR UniGene; Rn.28620; -. DR STRING; 10116.ENSRNOP00000058827; -. DR PaxDb; O55034; -. DR PRIDE; O55034; -. DR GeneID; 83623; -. DR KEGG; rno:83623; -. DR UCSC; RGD:620151; rat. DR CTD; 6676; -. DR RGD; 620151; Spag4. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000246956; -. DR HOVERGEN; HBG079205; -. DR InParanoid; O55034; -. DR PhylomeDB; O55034; -. DR NextBio; 616191; -. DR PRO; PR:O55034; -. DR Proteomes; UP000002494; Unplaced. DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0005874; C:microtubule; IDA:RGD. DR GO; GO:0031514; C:motile cilium; IEA:UniProtKB-KW. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0042802; F:identical protein binding; IDA:RGD. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR027776; SPAG4/SUN4. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF16; PTHR12911:SF16; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Cell projection; Cilium; Coiled coil; Complete proteome; Cytoplasm; KW Cytoskeleton; Flagellum; Membrane; Reference proteome; Transmembrane; KW Transmembrane helix. FT CHAIN 1 444 Sperm-associated antigen 4 protein. FT /FTId=PRO_0000218918. FT TRANSMEM 137 159 Helical. {ECO:0000255}. FT TRANSMEM 166 188 Helical. {ECO:0000255}. FT DOMAIN 267 428 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 204 241 {ECO:0000255}. SQ SEQUENCE 444 AA; 48693 MW; DA05493E8F87C46B CRC64; MRRNPRPGSA ASSHNHTPNF YSENSNSSHS ATSGDSNGRR SAGPELGEPD GRMARGSSCG EPALSSGVPG GDTWAGSSRP KLAPRSHNGQ TACGAATVRG GASEPSGSPA VLEEQLNLLP ILDLRQEMPP PPVSKSFLSL FFQVLSVFLS LVADGLVCVY REICSIRFLF TAVSLLSIFL AALWWGLLYL IPPLENEPKE MLTLSQYHHR VHSQGQQLQQ LQAELSKLHK EVTSVRAAHS ERVAKLVFQR LNEDFVRKPD YALSSVGASI DLEKTSSDYE DRNTAYFWNR LSFWNYARPP SVILEPDVFP GNCWAFEGEQ GQVVIRLPGH VQLSDITLQH PPPTVAHTGG ASSAPRDFAV FGLQADDDET EVFLGKFIFE VQKSEIQTFH LQNDPPSAFP KVKIQILSNW GHPRFTCLYR VRAHGVRISE SAEDNAMGVT GGPH // ID SUN1_CAEEL Reviewed; 473 AA. AC Q20924; DT 02-FEB-2004, integrated into UniProtKB/Swiss-Prot. DT 01-NOV-1996, sequence version 1. DT 11-NOV-2015, entry version 97. DE RecName: Full=Sun domain-containing protein 1; GN Name=sun-1; ORFNames=F57B1.2; OS Caenorhabditis elegans. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6239; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Bristol N2; RX PubMed=9851916; DOI=10.1126/science.282.5396.2012; RG The C. elegans sequencing consortium; RT "Genome sequence of the nematode C. elegans: a platform for RT investigating biology."; RL Science 282:2012-2018(1998). RN [2] RP FUNCTION, AND SUBCELLULAR LOCATION. RX PubMed=14697201; DOI=10.1016/S0092-8674(03)00985-1; RA Malone C.J., Misner L., Le Bot N., Tsai M.-C., Campbell J.M., RA Ahringer J., White J.G.; RT "The C. elegans hook protein, ZYG-12, mediates the essential RT attachment between the centrosome and nucleus."; RL Cell 115:825-836(2003). CC -!- FUNCTION: Involved in centrosome attachment to the nucleus. CC Required for zyg-12 localization to the nuclear envelope. CC {ECO:0000269|PubMed:14697201}. CC -!- SUBCELLULAR LOCATION: Nucleus membrane CC {ECO:0000305|PubMed:14697201}; Single-pass membrane protein CC {ECO:0000305|PubMed:14697201}. CC -!- DOMAIN: The SUN domain may play a role in the nuclear anchoring. CC {ECO:0000250}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; Z78064; CAB01511.1; -; Genomic_DNA. DR PIR; T22830; T22830. DR RefSeq; NP_506281.1; NM_073880.4. DR UniGene; Cel.7684; -. DR ProteinModelPortal; Q20924; -. DR SMR; Q20924; 270-443. DR BioGrid; 44820; 1. DR DIP; DIP-52612N; -. DR MINT; MINT-6669576; -. DR STRING; 6239.F57B1.2; -. DR PaxDb; Q20924; -. DR PRIDE; Q20924; -. DR EnsemblMetazoa; F57B1.2; F57B1.2; WBGene00006311. DR GeneID; 179802; -. DR KEGG; cel:CELE_F57B1.2; -. DR UCSC; F57B1.2; c. elegans. DR CTD; 179802; -. DR WormBase; F57B1.2; CE11288; WBGene00006311; sun-1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000021176; -. DR InParanoid; Q20924; -. DR OMA; VPNHAPK; -. DR OrthoDB; EOG7BZVX6; -. DR NextBio; 906920; -. DR PRO; PR:Q20924; -. DR Proteomes; UP000001940; Chromosome V. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR GO; GO:0005635; C:nuclear envelope; IDA:WormBase. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IMP:WormBase. DR GO; GO:0009792; P:embryo development ending in birth or egg hatching; IMP:WormBase. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0008104; P:protein localization; IMP:WormBase. DR GO; GO:0010824; P:regulation of centrosome duplication; IGI:WormBase. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF2; PTHR12911:SF2; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 3: Inferred from homology; KW Coiled coil; Complete proteome; Membrane; Nucleus; Reference proteome; KW Transmembrane; Transmembrane helix. FT CHAIN 1 473 Sun domain-containing protein 1. FT /FTId=PRO_0000218921. FT TRANSMEM 262 282 Helical. {ECO:0000255}. FT DOMAIN 279 443 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 163 191 {ECO:0000255}. FT COILED 204 235 {ECO:0000255}. SQ SEQUENCE 473 AA; 54132 MW; 99BBFCB48695C7FB CRC64; MALRHTISPQ FSNRHSPPVT RSVSRTGVHQ PLDTSTPVTR RDSQPGTITG TIQRFHESAD DSEIDLNSSK FIYKEHFSYK EITSMKKEMW YDWLEYRIRM VRRRFVPTWA QFKRTLMAVV LFAMLYKYAR DCLFDGTHHN SEGSYADKDA NWASEKQKFH QTISNLRAEF SAHDKQLDFK TDHLEKLLEN VLEHSKGWKE SAIEELKQIK LWQAEISDAL QQMKKEIDDA KSTKIIHSTP EKAPETAPTA SLPPSSQLQP MHITRRALLG VNVANSLIGA SIDHSCSSRP VSAKDGFFYD FMSYFGTFQE GYALLDRDVL SPGEAWCTYD KRATLTVKLA RFVIPKSVSY QHVRWSGIVP NHAPKLYDVV ACTDSCCTKW QPLVANCEYK ERDGSYDEQE QFCSVPTIQN HSPINHVQFR FRENHGDMPK TCAYLIRVYG EPVDPPKETQ PMTDNGTESK LESAIVNSVS ETA // ID SUN1_DICDI Reviewed; 905 AA. AC Q558Z2; DT 19-JAN-2010, integrated into UniProtKB/Swiss-Prot. DT 24-MAY-2005, sequence version 1. DT 11-NOV-2015, entry version 61. DE RecName: Full=Sun domain-containing protein 1; GN Name=sun1; ORFNames=DDB_G0272869; OS Dictyostelium discoideum (Slime mold). OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. OX NCBI_TaxID=44689; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AX4; RX PubMed=12097910; DOI=10.1038/nature00847; RA Gloeckner G., Eichinger L., Szafranski K., Pachebat J.A., RA Bankier A.T., Dear P.H., Lehmann R., Baumgart C., Parra G., RA Abril J.F., Guigo R., Kumpf K., Tunggal B., Cox E.C., Quail M.A., RA Platzer M., Rosenthal A., Noegel A.A.; RT "Sequence and analysis of chromosome 2 of Dictyostelium discoideum."; RL Nature 418:79-85(2002). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AX4; RX PubMed=15875012; DOI=10.1038/nature03481; RA Eichinger L., Pachebat J.A., Gloeckner G., Rajandream M.A., RA Sucgang R., Berriman M., Song J., Olsen R., Szafranski K., Xu Q., RA Tunggal B., Kummerfeld S., Madera M., Konfortov B.A., Rivero F., RA Bankier A.T., Lehmann R., Hamlin N., Davies R., Gaudet P., Fey P., RA Pilcher K., Chen G., Saunders D., Sodergren E.J., Davis P., RA Kerhornou A., Nie X., Hall N., Anjard C., Hemphill L., Bason N., RA Farbrother P., Desany B., Just E., Morio T., Rost R., Churcher C.M., RA Cooper J., Haydock S., van Driessche N., Cronin A., Goodhead I., RA Muzny D.M., Mourier T., Pain A., Lu M., Harper D., Lindsay R., RA Hauser H., James K.D., Quiles M., Madan Babu M., Saito T., RA Buchrieser C., Wardroper A., Felder M., Thangavelu M., Johnson D., RA Knights A., Loulseged H., Mungall K.L., Oliver K., Price C., RA Quail M.A., Urushihara H., Hernandez J., Rabbinowitsch E., Steffen D., RA Sanders M., Ma J., Kohara Y., Sharp S., Simmonds M.N., Spiegler S., RA Tivey A., Sugano S., White B., Walker D., Woodward J.R., Winckler T., RA Tanaka Y., Shaulsky G., Schleicher M., Weinstock G.M., Rosenthal A., RA Cox E.C., Chisholm R.L., Gibbs R.A., Loomis W.F., Platzer M., RA Kay R.R., Williams J.G., Dear P.H., Noegel A.A., Barrell B.G., RA Kuspa A.; RT "The genome of the social amoeba Dictyostelium discoideum."; RL Nature 435:43-57(2005). RN [3] RP FUNCTION, SUBCELLULAR LOCATION, TOPOLOGY, AND SUBUNIT. RX PubMed=18266910; DOI=10.1111/j.1600-0854.2008.00721.x; RA Xiong H., Rivero F., Euteneuer U., Mondal S., Mana-Capelli S., RA Larochelle D., Vogel A., Gassen B., Noegel A.A.; RT "Dictyostelium Sun-1 connects the centrosome to chromatin and ensures RT genome stability."; RL Traffic 9:708-724(2008). RN [4] RP SUBCELLULAR LOCATION, FUNCTION, AND DISRUPTION PHENOTYPE. RX PubMed=19632001; DOI=10.1016/j.ejcb.2009.06.003; RA Schulz I., Baumann O., Samereier M., Zoglmeier C., Graef R.; RT "Dictyostelium Sun1 is a dynamic membrane protein of both nuclear RT membranes and required for centrosomal association with clustered RT centromeres."; RL Eur. J. Cell Biol. 88:621-638(2009). CC -!- FUNCTION: May have an important role in defining the spacing of CC the nuclear envelope lumen. Essential for centrosome attachment to CC the nucleus, maintenance of correct ploidy, proper mitosis, CC association of the centromere cluster with the centrosome and the CC maintenance of genome stability. Requires direct chromatin binding CC for inner nuclear membrane targeting. CC {ECO:0000269|PubMed:18266910, ECO:0000269|PubMed:19632001}. CC -!- SUBUNIT: Homodimer and homooligomer. CC {ECO:0000269|PubMed:18266910}. CC -!- SUBCELLULAR LOCATION: Nucleus membrane CC {ECO:0000269|PubMed:18266910, ECO:0000269|PubMed:19632001}; CC Single-pass membrane protein {ECO:0000269|PubMed:18266910, CC ECO:0000269|PubMed:19632001}; Nucleoplasmic side CC {ECO:0000269|PubMed:18266910, ECO:0000269|PubMed:19632001}. CC -!- DISRUPTION PHENOTYPE: Shows significant reduced growth. CC {ECO:0000269|PubMed:19632001}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAFI02000008; EAL71072.1; -; Genomic_DNA. DR RefSeq; XP_644924.1; XM_639832.1. DR STRING; 44689.DDB0219949; -. DR PaxDb; Q558Z2; -. DR PRIDE; Q558Z2; -. DR EnsemblProtists; DDB0219949; DDB0219949; DDB_G0272869. DR EnsemblProtists; EAL71072; EAL71072; EBG00001261718. DR GeneID; 8618603; -. DR KEGG; ddi:DDB_G0272869; -. DR dictyBase; DDB_G0272869; sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR InParanoid; Q558Z2; -. DR KO; K19347; -. DR OMA; VYSADKI; -. DR PRO; PR:Q558Z2; -. DR Proteomes; UP000002195; Chromosome 2. DR Proteomes; UP000002195; Unassembled WGS sequence. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:InterPro. DR GO; GO:0034993; C:LINC complex; IEA:InterPro. DR GO; GO:0005635; C:nuclear envelope; IDA:dictyBase. DR GO; GO:0005637; C:nuclear inner membrane; IDA:dictyBase. DR GO; GO:0005640; C:nuclear outer membrane; IDA:dictyBase. DR GO; GO:0005654; C:nucleoplasm; IDA:dictyBase. DR GO; GO:0003682; F:chromatin binding; IDA:dictyBase. DR GO; GO:0003677; F:DNA binding; IDA:dictyBase. DR GO; GO:0042803; F:protein homodimerization activity; IDA:dictyBase. DR GO; GO:0051301; P:cell division; IEA:UniProtKB-KW. DR GO; GO:0034508; P:centromere complex assembly; IMP:dictyBase. DR GO; GO:0051642; P:centrosome localization; IMP:dictyBase. DR GO; GO:0051297; P:centrosome organization; IMP:dictyBase. DR GO; GO:0000070; P:mitotic sister chromatid segregation; IMP:dictyBase. DR GO; GO:0006997; P:nucleus organization; IMP:dictyBase. DR InterPro; IPR018539; SUN1. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF8; PTHR12911:SF8; 3. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Cell cycle; Cell division; Coiled coil; Complete proteome; Membrane; KW Mitosis; Nucleus; Reference proteome; Transmembrane; KW Transmembrane helix. FT CHAIN 1 905 Sun domain-containing protein 1. FT /FTId=PRO_0000390618. FT TOPO_DOM 1 290 Nuclear. {ECO:0000255}. FT TRANSMEM 291 311 Helical. {ECO:0000255}. FT TOPO_DOM 312 905 Perinuclear space. {ECO:0000255}. FT DOMAIN 662 860 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 170 221 {ECO:0000255}. FT COILED 359 456 {ECO:0000255}. FT COILED 504 609 {ECO:0000255}. FT COILED 864 901 {ECO:0000255}. FT COMPBIAS 117 120 Poly-Asp. FT COMPBIAS 177 213 Poly-Gln. FT COMPBIAS 215 221 Poly-Asn. FT COMPBIAS 224 227 Poly-Asn. FT COMPBIAS 281 284 Poly-Asn. FT COMPBIAS 567 572 Poly-Ser. FT COMPBIAS 814 819 Poly-Thr. SQ SEQUENCE 905 AA; 104743 MW; 208FA7368F3280D2 CRC64; MSGDYKPNYQ SSPSRKRLPL QSKDQASIYK YQTPSTLNLY NNTVNNNSSN NSNNHLLHNS NPNSSYLYDS SKQYSNQINI RNNSNSNSNT NNITSKKASS SYSINNKVDH NSHNNNDDDD IEDDVDINYS TNNASSNILH NRFSNSNKDD SYIDYSTDEN PKILKQPQPL YNHLNNQIQQ QQQQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ QQQRNNNNNS NSSNNNNTST TIKRNNQQID NNSNKNIISK FIGDPWKNFY YGSNKSLWPF ERNNNSNNSS NNNNKVNFKQ AIWIFIFSVL FIGCLLGLFS TNFYGIHIYF PSFSTTKTNS PFNSTNNNIQ FSNLITKEQL YPIIDEYFKK NEILKSYNKL FEKIENDIKY LSEREQYKDI INEIKEELKL VKLSNMDEDR VNQLISKMIN HYNNNENNKQ ELKELLSKSI EELTKLKSDS KEQLIQISTE SMNQLGQLKS ESINQLGQVK SESIDKFQST LKSLSKEEQS KIEREFNHQF NQLNKDADQL LSQHSLKIEK LREEINENQQ SSLLKLTQEY KQLEERLKEF SSKLQQSISS SSMDQFESWK LVFIKDIEER INKESSKLTN QYIQLTQQFT KIQSFIKDNP SIDSLTNTIE SLEGIKLLIE DILEVYSADK IAKVDYALGL AGASIEYNAL HYRVSETYPP IKGSGSGSGS GGANGNSLGL YYYNLATNWI FPQPKPNPPE TILDPMVNTG SCWGFYTGNG TIVIRLAKKI AITEVTMEHI SSNISHHIDS APKEFQVFGL INSSDIGQSL GVFTYDTTIN RHLQTFKVNK IQSTTTTTTN QDQNDDDNIQ EFSHVALRIL SNHGYRYTCI YRFRVHGYQI PHPEQEQIQI IQEEQSFKQE EINQQQIEQI EQIEQIEKQQ QSDEL // ID SUN1_HUMAN Reviewed; 812 AA. AC O94901; A5PL20; B3KMV7; B4DZF7; B7WNY4; B7WP53; E9PDU4; E9PF23; AC F8WD13; Q96CZ7; Q9HA14; Q9UH98; DT 02-AUG-2002, integrated into UniProtKB/Swiss-Prot. DT 11-JUL-2003, sequence version 3. DT 11-NOV-2015, entry version 146. DE RecName: Full=SUN domain-containing protein 1; DE AltName: Full=Protein unc-84 homolog A; DE AltName: Full=Sad1/unc-84 protein-like 1; GN Name=SUN1; Synonyms=KIAA0810, UNC84A; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), AND VARIANT RP TYR-118. RC TISSUE=Brain; RX PubMed=9872452; DOI=10.1093/dnares/5.5.277; RA Nagase T., Ishikawa K., Suyama M., Kikuno R., Miyajima N., Tanaka A., RA Kotani H., Nomura N., Ohara O.; RT "Prediction of the coding sequences of unidentified human genes. XI. RT The complete sequences of 100 new cDNA clones from brain which code RT for large proteins in vitro."; RL DNA Res. 5:277-286(1998). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 3; 5; 6 AND 7), AND RP VARIANT TYR-118. RC TISSUE=Mammary gland, Substantia nigra, and Testis; RX PubMed=14702039; DOI=10.1038/ng1285; RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., RA Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., RA Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., RA Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., RA Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., RA Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., RA Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., RA Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., RA Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., RA Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., RA Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., RA Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., RA Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., RA Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.; RT "Complete sequencing and characterization of 21,243 full-length human RT cDNAs."; RL Nat. Genet. 36:40-45(2004). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4). RC TISSUE=Cerebellum; RX PubMed=17974005; DOI=10.1186/1471-2164-8-399; RA Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., RA Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., RA Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., RA Ottenwaelder B., Poustka A., Wiemann S., Schupp I.; RT "The full-ORF clone resource of the German cDNA consortium."; RL BMC Genomics 8:399-399(2007). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12853948; DOI=10.1038/nature01782; RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A., Pepin K.H., RA Wagner-McPherson C., Layman D., Maas J., Jaeger S., Walker R., RA Wylie K., Sekhon M., Becker M.C., O'Laughlin M.D., Schaller M.E., RA Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., RA Sun H., Edwards J., Bradshaw-Cordum H., Ali J., Andrews S., Isak A., RA Vanbrunt A., Nguyen C., Du F., Lamar B., Courtney L., Kalicki J., RA Ozersky P., Bielicki L., Scott K., Holmes A., Harkins R., Harris A., RA Strong C.M., Hou S., Tomlinson C., Dauphin-Kohlberg S., RA Kozlowicz-Reilly A., Leonard S., Rohlfing T., Rock S.M., RA Tin-Wollam A.-M., Abbott A., Minx P., Maupin R., Strowmatt C., RA Latreille P., Miller N., Johnson D., Murray J., Woessner J.P., RA Wendl M.C., Yang S.-P., Schultz B.R., Wallis J.W., Spieth J., RA Bieri T.A., Nelson J.O., Berkowicz N., Wohldmann P.E., Cook L.L., RA Hickenbotham M.T., Eldred J., Williams D., Bedell J.A., Mardis E.R., RA Clifton S.W., Chissoe S.L., Marra M.A., Raymond C., Haugen E., RA Gillett W., Zhou Y., James R., Phelps K., Iadanoto S., Bubb K., RA Simms E., Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., RA Baertsch R.A., Brent M.R., Keibler E., Flicek P., Bork P., Suyama M., RA Bailey J.A., Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., RA Eddy S.R., McPherson J.D., Olson M.V., Eichler E.E., Green E.D., RA Waterston R.H., Wilson R.K.; RT "The DNA sequence of human chromosome 7."; RL Nature 424:157-164(2003). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 8), AND VARIANT RP TYR-118. RC TISSUE=Ovary; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [6] RP NUCLEOTIDE SEQUENCE [MRNA] OF 389-812. RX PubMed=10375507; RA Malone C.J., Fixsen W.D., Horvitz H.R., Han M.; RT "UNC-84 localizes to the nuclear envelope and is required for nuclear RT migration and anchoring during C. elegans development."; RL Development 126:3171-3181(1999). RN [7] RP IDENTIFICATION BY MASS SPECTROMETRY, AND SUBCELLULAR LOCATION. RX PubMed=12958361; DOI=10.1126/science.1088176; RA Schirmer E.C., Florens L., Guan T., Yates J.R. III, Gerace L.; RT "Nuclear membrane proteins with potential disease links found by RT subtractive proteomics."; RL Science 301:1380-1382(2003). RN [8] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-138, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Cervix carcinoma; RX PubMed=17081983; DOI=10.1016/j.cell.2006.09.026; RA Olsen J.V., Blagoev B., Gnad F., Macek B., Kumar C., Mortensen P., RA Mann M.; RT "Global, in vivo, and site-specific phosphorylation dynamics in RT signaling networks."; RL Cell 127:635-648(2006). RN [9] RP FUNCTION, AND INTERACTION WITH MPS3. RX PubMed=18039933; DOI=10.1083/jcb.200706040; RA Bupp J.M., Martin A.E., Stensrud E.S., Jaspersen S.L.; RT "Telomere anchoring at the nuclear periphery requires the budding RT yeast Sad1-UNC-84 domain protein Mps3."; RL J. Cell Biol. 179:845-854(2007). RN [10] RP SUBCELLULAR LOCATION, SUBUNIT, AND ASSOCIATION WITH THE CENTROSOME. RX PubMed=17132086; DOI=10.1089/dna.2006.25.554; RA Wang Q., Du X., Cai Z., Greene M.I.; RT "Characterization of the structures involved in localization of the RT SUN proteins to the nuclear envelope and the centrosome."; RL DNA Cell Biol. 25:554-562(2006). RN [11] RP SUBCELLULAR LOCATION. RX PubMed=16445915; DOI=10.1016/j.febslet.2006.01.039; RA Hasan S., Guttinger S., Muhlhausser P., Anderegg F., Burgler S., RA Kutay U.; RT "Nuclear envelope localization of human UNC84A does not require RT nuclear lamins."; RL FEBS Lett. 580:1263-1268(2006). RN [12] RP INTERACTION WITH NAT10. RX PubMed=17631499; DOI=10.1074/jbc.M703098200; RA Chi Y.-H., Haller K., Peloponese J.-M. Jr., Jeang K.-T.; RT "Histone acetyltransferase hALP and nuclear membrane protein hsSUN1 RT function in de-condensation of mitotic chromosomes."; RL J. Biol. Chem. 282:27447-27458(2007). RN [13] RP SUBCELLULAR LOCATION, TOPOLOGY, SUBUNIT, AND INTERACTION WITH SUN2. RX PubMed=18845190; DOI=10.1016/j.bbamcr.2008.09.001; RA Lu W., Gotzmann J., Sironi L., Jaeger V.M., Schneider M., Luke Y., RA Uhlen M., Szigyarto C.A., Brachner A., Ellenberg J., Foisner R., RA Noegel A.A., Karakesisoglou I.; RT "Sun1 forms immobile macromolecular assemblies at the nuclear RT envelope."; RL Biochim. Biophys. Acta 1783:2415-2426(2008). RN [14] RP INTERACTION WITH SYNE1; SYNE2 AND SYNE3, AND FUNCTION OF THE LINC RP COMPLEXES. RX PubMed=18396275; DOI=10.1016/j.yexcr.2008.02.022; RA Stewart-Hutchinson P.J., Hale C.M., Wirtz D., Hodzic D.; RT "Structural requirements for the assembly of LINC complexes and their RT function in cellular mechanical stiffness."; RL Exp. Cell Res. 314:1892-1905(2008). RN [15] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Cervix carcinoma; RX PubMed=18669648; DOI=10.1073/pnas.0805139105; RA Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., RA Elledge S.J., Gygi S.P.; RT "A quantitative atlas of mitotic phosphorylation."; RL Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008). RN [16] RP SUBCELLULAR LOCATION, INTERACTION WITH EMD; LMNA AND SYNE2, DOMAIN, RP AND ASSOCIATION WITH THE NUCLEOSKELETON. RX PubMed=19933576; DOI=10.1074/jbc.M109.071910; RA Haque F., Mazzeo D., Patel J.T., Smallwood D.T., Ellis J.A., RA Shanahan C.M., Shackleton S.; RT "Mammalian SUN protein interaction networks at the inner nuclear RT membrane and their role in laminopathy disease processes."; RL J. Biol. Chem. 285:3487-3498(2010). RN [17] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-100 AND SER-138, RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-333 (ISOFORM 9), AND RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Cervix carcinoma; RX PubMed=20068231; DOI=10.1126/scisignal.2000475; RA Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L., RA Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., RA Mann M.; RT "Quantitative phosphoproteomics reveals widespread full RT phosphorylation site occupancy during mitosis."; RL Sci. Signal. 3:RA3-RA3(2010). RN [18] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-138, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Liver; RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., RA Wang L., Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human RT liver phosphoproteome."; RL J. Proteomics 96:253-262(2014). RN [19] RP VARIANTS VAL-203 AND VAL-614. RX PubMed=24375709; DOI=10.1002/humu.22504; RA Li P., Meinke P., Huong Le T.T., Wehnert M., Noegel A.A.; RT "Contribution of SUN1 mutations to the pathomechanism in muscular RT dystrophies."; RL Hum. Mutat. 35:452-461(2014). CC -!- FUNCTION: Component of SUN-protein-containing multivariate CC complexes also called LINC complexes which link the nucleoskeleton CC and cytoskeleton by providing versatile outer nuclear membrane CC attachment sites for cytoskeletal filaments. Required for CC interkinetic nuclear migration (INM) and essential for CC nucleokinesis and centrosome-nucleus coupling during radial CC neuronal migration in the cerebral cortex and during glial CC migration. Anchors chromosome movement in the prophase of meiosis CC and is involved in selective gene expression of coding and non- CC coding RNAs needed for gametogenesis. Required for telomere CC attachment to nuclear envelope and gametogenesis. Helps to define CC the distribution of nuclear pore complexes (NPCs) (By similarity). CC Required for efficient localization of SYNE4 in the nuclear CC envelope (By similarity). {ECO:0000250}. CC -!- SUBUNIT: Dimers and tetramers (By similarity). Core component of CC the LINC complex which is composed of inner nuclear membrane SUN CC domain-containing proteins coupled to outer nuclear membrane KASH CC domain-containing nesprins. SUN domain-containing proteins CC interact with A-type lamins of the nuclear lamina, while at the CC other end of the complex, nesprins interact with unique CC cytoskeletal components. May interact with SYNE1, SYNE2 and SYNE3. CC May interact with SYNE4 (By similarity). Interacts with A-type CC lamin with a strong preference for unprocessed A-type lamin CC compared with the mature protein. Interaction with lamins B1 and C CC is hardly detectable. Interacts with TSNAX (By similarity). CC Interacts with EMD and NAT10. Associates with the nuclear pore CC complex (NPC) (By similarity). Interacts with CCDC155 (via the CC last 22 AA); this interaction mediates CCDC155 telomere CC localization. Interacts with CCDC79/TERB1; promoting the CC accumulation of the LINC complex complexes at the telomere-nuclear CC envelope attachment sites (By similarity). {ECO:0000250}. CC -!- INTERACTION: CC Q8NF91-1:SYNE1; NbExp=2; IntAct=EBI-2796904, EBI-6170938; CC Q8WXH0-1:SYNE2; NbExp=2; IntAct=EBI-2796904, EBI-6170976; CC -!- SUBCELLULAR LOCATION: Nucleus inner membrane CC {ECO:0000269|PubMed:12958361, ECO:0000269|PubMed:16445915, CC ECO:0000269|PubMed:17132086, ECO:0000269|PubMed:18845190, CC ECO:0000269|PubMed:19933576}; Single-pass type II membrane protein CC {ECO:0000269|PubMed:12958361, ECO:0000269|PubMed:16445915, CC ECO:0000269|PubMed:17132086, ECO:0000269|PubMed:18845190, CC ECO:0000269|PubMed:19933576}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=9; CC Name=1; CC IsoId=O94901-1; Sequence=Displayed; CC Name=2; CC IsoId=O94901-2; Sequence=VSP_007743, VSP_007744; CC Note=No experimental confirmation available.; CC Name=3; CC IsoId=O94901-3; Sequence=VSP_007745, VSP_007746; CC Note=No experimental confirmation available.; CC Name=4; CC IsoId=O94901-4; Sequence=VSP_007741, VSP_007742; CC Name=5; CC IsoId=O94901-5; Sequence=VSP_037867, VSP_037868; CC Name=6; CC IsoId=O94901-6; Sequence=VSP_045815, VSP_045816; CC Note=No experimental confirmation available.; CC Name=7; CC IsoId=O94901-7; Sequence=VSP_046108, VSP_007743, VSP_007744; CC Note=No experimental confirmation available.; CC Name=8; CC IsoId=O94901-8; Sequence=VSP_046269; CC Note=No experimental confirmation available.; CC Name=9; CC IsoId=O94901-9; Sequence=VSP_047139, VSP_047140; CC Note=No experimental confirmation available. Contains a CC phosphoserine at position 333. {ECO:0000244|PubMed:20068231}; CC -!- DOMAIN: The SUN domain may play a role in the nuclear anchoring CC and/or migration. {ECO:0000269|PubMed:19933576}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC -!- SEQUENCE CAUTION: CC Sequence=BAA34530.1; Type=Erroneous initiation; Evidence={ECO:0000305}; CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AB018353; BAA34530.1; ALT_INIT; mRNA. DR EMBL; AK022469; BAB14046.1; -; mRNA. DR EMBL; AK022816; BAG51119.1; -; mRNA. DR EMBL; AK302896; BAG64069.1; -; mRNA. DR EMBL; AK309120; -; NOT_ANNOTATED_CDS; mRNA. DR EMBL; BX538211; CAD98070.1; -; mRNA. DR EMBL; AC073957; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AC099731; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; BC013613; AAH13613.1; -; mRNA. DR EMBL; BC142707; AAI42708.1; -; mRNA. DR EMBL; AF202724; AAF15888.1; -; mRNA. DR CCDS; CCDS43533.1; -. [O94901-5] DR CCDS; CCDS47525.1; -. [O94901-8] DR CCDS; CCDS55078.1; -. [O94901-7] DR CCDS; CCDS55079.1; -. [O94901-2] DR CCDS; CCDS55080.1; -. [O94901-6] DR RefSeq; NP_001124437.1; NM_001130965.2. [O94901-8] DR RefSeq; NP_001165415.1; NM_001171944.1. [O94901-6] DR RefSeq; NP_001165416.1; NM_001171945.1. [O94901-7] DR RefSeq; NP_001165417.1; NM_001171946.1. [O94901-2] DR RefSeq; NP_079430.3; NM_025154.5. [O94901-5] DR UniGene; Hs.438072; -. DR ProteinModelPortal; O94901; -. DR SMR; O94901; 616-810. DR BioGrid; 116935; 21. DR IntAct; O94901; 4. DR STRING; 9606.ENSP00000384015; -. DR PhosphoSite; O94901; -. DR BioMuta; SUN1; -. DR MaxQB; O94901; -. DR PaxDb; O94901; -. DR PRIDE; O94901; -. DR DNASU; 23353; -. DR Ensembl; ENST00000389574; ENSP00000374225; ENSG00000164828. [O94901-5] DR Ensembl; ENST00000401592; ENSP00000384015; ENSG00000164828. [O94901-8] DR Ensembl; ENST00000403868; ENSP00000383947; ENSG00000164828. [O94901-2] DR Ensembl; ENST00000425407; ENSP00000392309; ENSG00000164828. [O94901-5] DR Ensembl; ENST00000452783; ENSP00000413439; ENSG00000164828. [O94901-6] DR Ensembl; ENST00000457378; ENSP00000395952; ENSG00000164828. [O94901-7] DR GeneID; 23353; -. DR KEGG; hsa:23353; -. DR UCSC; uc003sje.1; human. [O94901-3] DR UCSC; uc003sjf.3; human. [O94901-5] DR UCSC; uc003sjk.3; human. [O94901-1] DR UCSC; uc021zyl.1; human. [O94901-2] DR CTD; 23353; -. DR GeneCards; SUN1; -. DR H-InvDB; HIX0078880; -. DR HGNC; HGNC:18587; SUN1. DR HPA; HPA008346; -. DR HPA; HPA008461; -. DR MIM; 607723; gene. DR neXtProt; NX_O94901; -. DR PharmGKB; PA165618311; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG104132; -. DR InParanoid; O94901; -. DR KO; K19347; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; O94901; -. DR TreeFam; TF323915; -. DR Reactome; R-HSA-1221632; Meiotic synapsis. DR ChiTaRS; SUN1; human. DR GeneWiki; UNC84A; -. DR GenomeRNAi; 23353; -. DR NextBio; 27033591; -. DR PRO; PR:O94901; -. DR Proteomes; UP000005640; Chromosome 7. DR Bgee; O94901; -. DR CleanEx; HS_UNC84A; -. DR ExpressionAtlas; O94901; baseline and differential. DR Genevisible; O94901; HS. DR GO; GO:0002080; C:acrosomal membrane; IEA:Ensembl. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IEA:Ensembl. DR GO; GO:0043231; C:intracellular membrane-bounded organelle; IDA:HPA. DR GO; GO:0034993; C:LINC complex; IDA:UniProtKB. DR GO; GO:0005635; C:nuclear envelope; IDA:UniProtKB. DR GO; GO:0031965; C:nuclear membrane; IDA:HPA. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IDA:UniProtKB. DR GO; GO:0006998; P:nuclear envelope organization; IGI:MGI. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IDA:UniProtKB. DR GO; GO:0007129; P:synapsis; IEA:Ensembl. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Alternative splicing; Coiled coil; Complete proteome; Membrane; KW Nucleus; Phosphoprotein; Polymorphism; Reference proteome; KW Signal-anchor; Transmembrane; Transmembrane helix. FT CHAIN 1 812 SUN domain-containing protein 1. FT /FTId=PRO_0000218911. FT TOPO_DOM 1 315 Nuclear. {ECO:0000269|PubMed:18845190}. FT TRANSMEM 316 335 Helical. FT TOPO_DOM 336 812 Perinuclear space. FT {ECO:0000269|PubMed:18845190}. FT DOMAIN 649 811 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT REGION 1 138 LMNA-binding. FT REGION 209 302 SYNE2-binding. FT REGION 223 302 EMD-binding. FT COILED 393 430 {ECO:0000255}. FT COILED 455 493 {ECO:0000255}. FT COILED 501 523 {ECO:0000255}. FT MOD_RES 100 100 Phosphoserine. FT {ECO:0000244|PubMed:20068231}. FT MOD_RES 138 138 Phosphoserine. FT {ECO:0000244|PubMed:17081983, FT ECO:0000244|PubMed:20068231, FT ECO:0000244|PubMed:24275569}. FT VAR_SEQ 1 50 Missing (in isoform 5). FT {ECO:0000303|PubMed:14702039}. FT /FTId=VSP_037867. FT VAR_SEQ 1 1 M -> MGRISPGSPGLPRTVWFEVVNM (in isoform FT 7). {ECO:0000303|PubMed:14702039}. FT /FTId=VSP_046108. FT VAR_SEQ 109 109 R -> V (in isoform 4). FT {ECO:0000303|PubMed:17974005}. FT /FTId=VSP_007741. FT VAR_SEQ 110 812 Missing (in isoform 4). FT {ECO:0000303|PubMed:17974005}. FT /FTId=VSP_007742. FT VAR_SEQ 151 279 Missing (in isoform 6). FT {ECO:0000303|PubMed:14702039}. FT /FTId=VSP_045815. FT VAR_SEQ 219 219 K -> KCGASFYVNRILWLARYTASSFSSFLVQLFQVVLMK FT LSYESENYKLKTHESKDCESESYKSKSHESKAHASYYGRMN FT VREVLREDGHLSVNGEAL (in isoform 9). FT {ECO:0000305}. FT /FTId=VSP_047139. FT VAR_SEQ 220 280 CDDCKGKRHLDAHPGRAGTLWHIWACAGYFLLQILRRIGAV FT GQAVSRTAWSALWLAVVAPG -> W (in isoform 5). FT {ECO:0000303|PubMed:14702039}. FT /FTId=VSP_037868. FT VAR_SEQ 221 341 DDCKGKRHLDAHPGRAGTLWHIWACAGYFLLQILRRIGAVG FT QAVSRTAWSALWLAVVAPGKAASGVFWWLGIGWYQFVTLIS FT WLNVFLLTRCLRNICKFLVLLIPLFLLLAGLSLRGQGNF FT -> GASFYVNRILWLARYTASSFSSFLVQLFQVVLMKLSYE FT SENYKLKTHESKDCESESYKSKSHESKAHASYYGRMNVREV FT LREDGHLSVNGEALCKYGFVFLWASVVELVPHAVMLGTSSR FT E (in isoform 3). FT {ECO:0000303|PubMed:14702039}. FT /FTId=VSP_007745. FT VAR_SEQ 221 257 DDCKGKRHLDAHPGRAGTLWHIWACAGYFLLQILRRI -> FT KSQSFKTQKKVCFPNLIFPFCKSQCLHYLSWRLKIIP (in FT isoform 2 and isoform 7). FT {ECO:0000303|PubMed:14702039, FT ECO:0000303|PubMed:15489334}. FT /FTId=VSP_007743. FT VAR_SEQ 221 247 Missing (in isoform 8). FT {ECO:0000303|PubMed:15489334}. FT /FTId=VSP_046269. FT VAR_SEQ 232 232 H -> HTAAHSQSPRL (in isoform 9). FT {ECO:0000305}. FT /FTId=VSP_047140. FT VAR_SEQ 258 812 Missing (in isoform 2 and isoform 7). FT {ECO:0000303|PubMed:14702039, FT ECO:0000303|PubMed:15489334}. FT /FTId=VSP_007744. FT VAR_SEQ 331 331 Missing (in isoform 6). FT {ECO:0000303|PubMed:14702039}. FT /FTId=VSP_045816. FT VAR_SEQ 342 812 Missing (in isoform 3). FT {ECO:0000303|PubMed:14702039}. FT /FTId=VSP_007746. FT VARIANT 118 118 H -> Y (in dbSNP:rs6461378). FT {ECO:0000269|PubMed:14702039, FT ECO:0000269|PubMed:15489334, FT ECO:0000269|PubMed:9872452}. FT /FTId=VAR_059828. FT VARIANT 203 203 A -> V (in dbSNP:rs144929525). FT {ECO:0000269|PubMed:24375709}. FT /FTId=VAR_071065. FT VARIANT 614 614 A -> V. {ECO:0000269|PubMed:24375709}. FT /FTId=VAR_071066. FT CONFLICT 15 15 V -> A (in Ref. 3; CAD98070). FT {ECO:0000305}. FT CONFLICT 78 78 S -> G (in Ref. 3; CAD98070). FT {ECO:0000305}. FT CONFLICT 174 174 A -> V (in Ref. 1; BAA34530). FT {ECO:0000305}. FT CONFLICT 204 204 A -> P (in Ref. 5; AAH13613). FT {ECO:0000305}. FT CONFLICT 445 445 P -> L (in Ref. 2; BAG51119). FT {ECO:0000305}. FT CONFLICT 503 503 E -> K (in Ref. 2; BAG64069). FT {ECO:0000305}. FT CONFLICT 520 520 R -> Q (in Ref. 2; BAG51119). FT {ECO:0000305}. SQ SEQUENCE 812 AA; 90064 MW; B958E95510B6F15F CRC64; MDFSRLHMYS PPQCVPENTG YTYALSSSYS SDALDFETEH KLDPVFDSPR MSRRSLRLAT TACTLGDGEA VGADSGTSSA VSLKNRAART TKQRRSTNKS AFSINHVSRQ VTSSGVSHGG TVSLQDAVTR RPPVLDESWI REQTTVDHFW GLDDDGDLKG GNKAAIQGNG DVGAAAATAH NGFSCSNCSM LSERKDVLTA HPAAPGPVSR VYSRDRNQKC DDCKGKRHLD AHPGRAGTLW HIWACAGYFL LQILRRIGAV GQAVSRTAWS ALWLAVVAPG KAASGVFWWL GIGWYQFVTL ISWLNVFLLT RCLRNICKFL VLLIPLFLLL AGLSLRGQGN FFSFLPVLNW ASMHRTQRVD DPQDVFKPTT SRLKQPLQGD SEAFPWHWMS GVEQQVASLS GQCHHHGENL RELTTLLQKL QARVDQMEGG AAGPSASVRD AVGQPPRETD FMAFHQEHEV RMSHLEDILG KLREKSEAIQ KELEQTKQKT ISAVGEQLLP TVEHLQLELD QLKSELSSWR HVKTGCETVD AVQERVDVQV REMVKLLFSE DQQGGSLEQL LQRFSSQFVS KGDLQTMLRD LQLQILRNVT HHVSVTKQLP TSEAVVSAVS EAGASGITEA QARAIVNSAL KLYSQDKTGM VDFALESGGG SILSTRCSET YETKTALMSL FGIPLWYFSQ SPRVVIQPDI YPGNCWAFKG SQGYLVVRLS MMIHPAAFTL EHIPKTLSPT GNISSAPKDF AVYGLENEYQ EEGQLLGQFT YDQDGESLQM FQALKRPDDT AFQIVELRIF SNWGHPEYTC LYRFRVHGEP VK // ID SUN1_MOUSE Reviewed; 913 AA. AC Q9D666; Q3TIW3; Q3TV96; Q6B4H0; Q80SU8; Q8BZ99; Q99P23; DT 02-AUG-2002, integrated into UniProtKB/Swiss-Prot. DT 02-FEB-2004, sequence version 2. DT 11-NOV-2015, entry version 127. DE RecName: Full=SUN domain-containing protein 1; DE AltName: Full=Protein unc-84 homolog A; DE AltName: Full=Sad1/unc-84 protein-like 1; GN Name=Sun1; Synonyms=Unc84a; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 4), SUBUNIT, SUBCELLULAR LOCATION, RP ASSOCIATION WITH THE CENTROSOME, AND TISSUE SPECIFICITY. RX PubMed=17132086; DOI=10.1089/dna.2006.25.554; RA Wang Q., Du X., Cai Z., Greene M.I.; RT "Characterization of the structures involved in localization of the RT SUN proteins to the nuclear envelope and the centrosome."; RL DNA Cell Biol. 25:554-562(2006). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1; 2 AND 3). RC STRAIN=C57BL/6J; TISSUE=Cerebellum, Embryo, Placenta, and Skin; RX PubMed=16141072; DOI=10.1126/science.1112014; RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E., RA Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., RA Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., RA Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., RA di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., RA Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., RA Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., RA Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., RA Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., RA Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., RA Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., RA Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., RA Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., RA Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., RA Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., RA Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., RA Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., RA Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., RA Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., RA Hayashizaki Y.; RT "The transcriptional landscape of the mammalian genome."; RL Science 309:1559-1563(2005). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). RC STRAIN=C57BL/6J, FVB/N, and FVB/N-3; TISSUE=Brain, and Mammary tumor; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [4] RP NUCLEOTIDE SEQUENCE [MRNA] OF 605-913, SUBCELLULAR LOCATION, TISSUE RP SPECIFICITY, AND INTERACTION WITH TSNAX. RC TISSUE=Testis; RX PubMed=12036294; DOI=10.1006/geno.2002.6779; RA Bray J.D., Chennathukuzhi V.M., Hecht N.B.; RT "Identification and characterization of cDNAs encoding four novel RT proteins that interact with translin associated factor-X."; RL Genomics 79:799-808(2002). RN [5] RP IDENTIFICATION BY MASS SPECTROMETRY, AND SUBCELLULAR LOCATION. RX PubMed=11593002; DOI=10.1073/pnas.211201898; RA Dreger M., Bengtsson L., Schoeneberg T., Otto H., Hucho F.; RT "Nuclear envelope proteomics: novel integral membrane proteins of the RT inner nuclear membrane."; RL Proc. Natl. Acad. Sci. U.S.A. 98:11943-11948(2001). RN [6] RP FUNCTION OF THE LINC COMPLEX, INTERACTION WITH LAMINS AND SYNE2, RP SUBCELLULAR LOCATION, TOPOLOGY, AND TISSUE SPECIFICITY. RX PubMed=16380439; DOI=10.1083/jcb.200509124; RA Crisp M., Liu Q., Roux K., Rattner J.B., Shanahan C., Burke B., RA Stahl P.D., Hodzic D.; RT "Coupling of the nucleus and cytoplasm: role of the LINC complex."; RL J. Cell Biol. 172:41-53(2006). RN [7] RP SUBCELLULAR LOCATION, TOPOLOGY, AND INTERACTION WITH LMNA; SYNE1 AND RP SYNE2. RX PubMed=16648470; DOI=10.1128/MCB.26.10.3738-3751.2006; RA Haque F., Lloyd D.J., Smallwood D.T., Dent C.L., Shanahan C.M., RA Fry A.M., Trembath R.C., Shackleton S.; RT "SUN1 interacts with nuclear lamin A and cytoplasmic nesprins to RT provide a physical connection between the nuclear lamina and the RT cytoskeleton."; RL Mol. Cell. Biol. 26:3738-3751(2006). RN [8] RP FUNCTION. RX PubMed=17543860; DOI=10.1016/j.devcel.2007.03.018; RA Ding X., Xu R., Yu J., Xu T., Zhuang Y., Han M.; RT "SUN1 is required for telomere attachment to nuclear envelope and RT gametogenesis in mice."; RL Dev. Cell 12:863-872(2007). RN [9] RP SUBCELLULAR LOCATION, AND ASSOCIATION WITH THE NUCLEAR PORE COMPLEX. RX PubMed=17724119; DOI=10.1083/jcb.200704108; RA Liu Q., Pante N., Misteli T., Elsagga M., Crisp M., Hodzic D., RA Burke B., Roux K.J.; RT "Functional association of Sun1 with nuclear pore complexes."; RL J. Cell Biol. 178:785-798(2007). RN [10] RP SUBCELLULAR LOCATION, TOPOLOGY, AND SUBUNIT. RX PubMed=18845190; DOI=10.1016/j.bbamcr.2008.09.001; RA Lu W., Gotzmann J., Sironi L., Jaeger V.M., Schneider M., Luke Y., RA Uhlen M., Szigyarto C.A., Brachner A., Ellenberg J., Foisner R., RA Noegel A.A., Karakesisoglou I.; RT "Sun1 forms immobile macromolecular assemblies at the nuclear RT envelope."; RL Biochim. Biophys. Acta 1783:2415-2426(2008). RN [11] RP FUNCTION. RX PubMed=19211677; DOI=10.1242/dev.029868; RA Chi Y.H., Cheng L.I., Myers T., Ward J.M., Williams E., Su Q., RA Faucette L., Wang J.Y., Jeang K.T.; RT "Requirement for Sun1 in the expression of meiotic reproductive genes RT and piRNA."; RL Development 136:965-973(2009). RN [12] RP SUBCELLULAR LOCATION, AND ASSOCIATION WITH TELOMERES. RX PubMed=19841137; DOI=10.1083/jcb.200808016; RA Adelfalk C., Janschek J., Revenkova E., Blei C., Liebe B., Gob E., RA Alsheimer M., Benavente R., de Boer E., Novak I., Hoog C., RA Scherthan H., Jessberger R.; RT "Cohesin SMC1beta protects telomeres in meiocytes."; RL J. Cell Biol. 187:185-199(2009). RN [13] RP SUBCELLULAR LOCATION, AND INTERACTION WITH LMNA AND SYN2. RX PubMed=19843581; DOI=10.1242/jcs.057075; RA Ostlund C., Folker E.S., Choi J.C., Gomes E.R., Gundersen G.G., RA Worman H.J.; RT "Dynamics and molecular interactions of linker of nucleoskeleton and RT cytoskeleton (LINC) complex proteins."; RL J. Cell Sci. 122:4099-4108(2009). RN [14] RP FUNCTION, SUBCELLULAR LOCATION, AND INTERACTION WITH SYNE2. RX PubMed=19874786; DOI=10.1016/j.neuron.2009.08.018; RA Zhang X., Lei K., Yuan X., Wu X., Zhuang Y., Xu T., Xu R., Han M.; RT "SUN1/2 and Syne/Nesprin-1/2 complexes connect centrosome to the RT nucleus during neurogenesis and neuronal migration in mice."; RL Neuron 64:173-187(2009). RN [15] RP FUNCTION. RX PubMed=19509342; DOI=10.1073/pnas.0812037106; RA Lei K., Zhang X., Ding X., Guo X., Chen M., Zhu B., Xu T., Zhuang Y., RA Xu R., Han M.; RT "SUN1 and SUN2 play critical but partially redundant roles in RT anchoring nuclei in skeletal muscle cells in mice."; RL Proc. Natl. Acad. Sci. U.S.A. 106:10207-10212(2009). RN [16] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-66, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Testis; RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001; RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R., RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.; RT "A tissue-specific atlas of mouse protein phosphorylation and RT expression."; RL Cell 143:1174-1189(2010). RN [17] RP INTERACTION WITH EMD. RX PubMed=19933576; DOI=10.1074/jbc.M109.071910; RA Haque F., Mazzeo D., Patel J.T., Smallwood D.T., Ellis J.A., RA Shanahan C.M., Shackleton S.; RT "Mammalian SUN protein interaction networks at the inner nuclear RT membrane and their role in laminopathy disease processes."; RL J. Biol. Chem. 285:3487-3498(2010). RN [18] RP INTERACTION WITH CCDC155. RX PubMed=22826121; DOI=10.1083/jcb.201204085; RA Morimoto A., Shibuya H., Zhu X., Kim J., Ishiguro K., Han M., RA Watanabe Y.; RT "A conserved KASH domain protein associates with telomeres, SUN1, and RT dynactin during mammalian meiosis."; RL J. Cell Biol. 198:165-172(2012). RN [19] RP FUNCTION, DISRUPTION PHENOTYPE, SUBCELLULAR LOCATION, AND TISSUE RP SPECIFICITY. RX PubMed=23348741; DOI=10.1172/JCI66911; RA Horn H.F., Brownstein Z., Lenz D.R., Shivatzki S., Dror A.A., RA Dagan-Rosenfeld O., Friedman L.M., Roux K.J., Kozlov S., Jeang K.T., RA Frydman M., Burke B., Stewart C.L., Avraham K.B.; RT "The LINC complex is essential for hearing."; RL J. Clin. Invest. 123:740-750(2013). RN [20] RP INTERACTION WITH CCDC79. RX PubMed=24413433; DOI=10.1038/ncb2896; RA Shibuya H., Ishiguro K.I., Watanabe Y.; RT "The TRF1-binding protein TERB1 promotes chromosome movement and RT telomere rigidity in meiosis."; RL Nat. Cell Biol. 16:145-156(2014). CC -!- FUNCTION: Component of SUN-protein-containing multivariate CC complexes also called LINC complexes which link the nucleoskeleton CC and cytoskeleton by providing versatile outer nuclear membrane CC attachment sites for cytoskeletal filaments. Required for CC interkinetic nuclear migration (INM) and essential for CC nucleokinesis and centrosome-nucleus coupling during radial CC neuronal migration in the cerebral cortex and during glial CC migration. Anchors chromosome movement in the prophase of meiosis CC and is involved in selective gene expression of coding and non- CC coding RNAs needed for gametogenesis. Required for telomere CC attachment to nuclear envelope and gametogenesis. Helps to define CC the distribution of nuclear pore complexes (NPCs). Required for CC efficient localization of SYNE4 in the nuclear envelope. CC {ECO:0000269|PubMed:16380439, ECO:0000269|PubMed:17543860, CC ECO:0000269|PubMed:19211677, ECO:0000269|PubMed:19509342, CC ECO:0000269|PubMed:19874786, ECO:0000269|PubMed:23348741}. CC -!- SUBUNIT: Dimers and tetramers. Core component of the LINC complex CC which is composed of inner nuclear membrane SUN domain-containing CC proteins coupled to outer nuclear membrane KASH domain-containing CC nesprins. SUN domain-containing proteins interact with A-type CC lamins of the nuclear lamina, while at the other end of the CC complex, nesprins interact with unique cytoskeletal components. CC Interacts with SYNE2. Interact with SYNE1 and SYNE3 (By CC similarity). Interacts with A-type lamin with a strong preference CC for unprocessed A-type lamin compared with the mature protein. CC Interaction with lamins B1 and C is hardly detectable. Interacts CC with NAT10 (By similarity). Interacts with EMD and TSNAX. CC Associates with the nuclear pore complex (NPC). Interacts with CC CCDC155 (via the last 22 amino acids); this interaction mediates CC CCDC155 telomere localization (By similarity). Interacts with CC CCDC79/TERB1; promoting the accumulation of the LINC complex CC complexes at the telomere-nuclear envelope attachment sites. CC {ECO:0000250, ECO:0000269|PubMed:12036294, CC ECO:0000269|PubMed:16380439, ECO:0000269|PubMed:16648470, CC ECO:0000269|PubMed:17132086, ECO:0000269|PubMed:18845190, CC ECO:0000269|PubMed:19843581, ECO:0000269|PubMed:19874786, CC ECO:0000269|PubMed:19933576, ECO:0000269|PubMed:22826121, CC ECO:0000269|PubMed:24413433}. CC -!- INTERACTION: CC P50402:EMD (xeno); NbExp=4; IntAct=EBI-6752574, EBI-489887; CC Q8WXH0-4:SYNE2 (xeno); NbExp=2; IntAct=EBI-6752574, EBI-6838657; CC -!- SUBCELLULAR LOCATION: Nucleus inner membrane CC {ECO:0000269|PubMed:11593002, ECO:0000269|PubMed:12036294, CC ECO:0000269|PubMed:16380439, ECO:0000269|PubMed:16648470, CC ECO:0000269|PubMed:17132086, ECO:0000269|PubMed:17724119, CC ECO:0000269|PubMed:18845190, ECO:0000269|PubMed:19841137, CC ECO:0000269|PubMed:19843581, ECO:0000269|PubMed:19874786, CC ECO:0000269|PubMed:23348741}; Single-pass type II membrane protein CC {ECO:0000269|PubMed:11593002, ECO:0000269|PubMed:12036294, CC ECO:0000269|PubMed:16380439, ECO:0000269|PubMed:16648470, CC ECO:0000269|PubMed:17132086, ECO:0000269|PubMed:17724119, CC ECO:0000269|PubMed:18845190, ECO:0000269|PubMed:19841137, CC ECO:0000269|PubMed:19843581, ECO:0000269|PubMed:19874786, CC ECO:0000269|PubMed:23348741}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=4; CC Name=1; CC IsoId=Q9D666-1; Sequence=Displayed; CC Name=2; CC IsoId=Q9D666-2; Sequence=VSP_009346; CC Note=No experimental confirmation available.; CC Name=3; CC IsoId=Q9D666-3; Sequence=VSP_009347; CC Note=No experimental confirmation available.; CC Name=4; CC IsoId=Q9D666-4; Sequence=VSP_039552; CC -!- TISSUE SPECIFICITY: Widely expressed. Expressed in cochlear outer CC hair cells (at protein level). {ECO:0000269|PubMed:12036294, CC ECO:0000269|PubMed:16380439, ECO:0000269|PubMed:17132086, CC ECO:0000269|PubMed:23348741}. CC -!- DOMAIN: The SUN domain may play a role in the nuclear anchoring CC and/or migration. CC -!- DISRUPTION PHENOTYPE: Mutant mice are viable, but display hearing CC loss at all frequencies. {ECO:0000269|PubMed:23348741}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC -!- SEQUENCE CAUTION: CC Sequence=AAH30330.1; Type=Erroneous initiation; Note=Translation N-terminally shortened.; Evidence={ECO:0000305}; CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AY682989; AAT90501.1; -; mRNA. DR EMBL; AK014585; BAB29445.1; -; mRNA. DR EMBL; AK036187; BAC29339.1; -; mRNA. DR EMBL; AK160281; BAE35723.1; -; mRNA. DR EMBL; AK167686; BAE39733.1; -; mRNA. DR EMBL; BC030330; AAH30330.1; ALT_INIT; mRNA. DR EMBL; BC047928; AAH47928.1; -; mRNA. DR EMBL; BC048156; AAH48156.1; -; mRNA. DR EMBL; AF343752; AAK13526.1; -; mRNA. DR CCDS; CCDS19804.1; -. [Q9D666-1] DR CCDS; CCDS57395.1; -. [Q9D666-3] DR CCDS; CCDS57396.1; -. [Q9D666-4] DR CCDS; CCDS57397.1; -. [Q9D666-2] DR RefSeq; NP_001243044.1; NM_001256115.1. [Q9D666-3] DR RefSeq; NP_001243045.1; NM_001256116.1. [Q9D666-4] DR RefSeq; NP_001243046.1; NM_001256117.1. [Q9D666-2] DR RefSeq; NP_001243047.1; NM_001256118.1. DR RefSeq; NP_077771.1; NM_024451.2. [Q9D666-1] DR RefSeq; XP_006504823.1; XM_006504760.1. [Q9D666-1] DR RefSeq; XP_006504824.1; XM_006504761.2. [Q9D666-1] DR RefSeq; XP_011239291.1; XM_011240989.1. [Q9D666-1] DR UniGene; Mm.210845; -. DR ProteinModelPortal; Q9D666; -. DR SMR; Q9D666; 718-911. DR BioGrid; 218484; 5. DR DIP; DIP-60732N; -. DR IntAct; Q9D666; 4. DR MINT; MINT-4139090; -. DR STRING; 10090.ENSMUSP00000056655; -. DR PhosphoSite; Q9D666; -. DR MaxQB; Q9D666; -. DR PaxDb; Q9D666; -. DR PRIDE; Q9D666; -. DR Ensembl; ENSMUST00000058716; ENSMUSP00000056655; ENSMUSG00000036817. [Q9D666-1] DR Ensembl; ENSMUST00000078690; ENSMUSP00000077756; ENSMUSG00000036817. [Q9D666-4] DR Ensembl; ENSMUST00000110883; ENSMUSP00000106507; ENSMUSG00000036817. [Q9D666-2] DR Ensembl; ENSMUST00000110884; ENSMUSP00000106508; ENSMUSG00000036817. [Q9D666-3] DR GeneID; 77053; -. DR KEGG; mmu:77053; -. DR UCSC; uc009agb.2; mouse. [Q9D666-1] DR UCSC; uc009agc.2; mouse. [Q9D666-3] DR UCSC; uc009agd.2; mouse. [Q9D666-2] DR UCSC; uc009age.2; mouse. [Q9D666-4] DR CTD; 23353; -. DR MGI; MGI:1924303; Sun1. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOVERGEN; HBG104132; -. DR InParanoid; Q9D666; -. DR KO; K19347; -. DR OMA; MKLNYES; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; Q9D666; -. DR TreeFam; TF323915; -. DR Reactome; R-MMU-1221632; Meiotic synapsis. DR Reactome; R-MMU-1221633; Meiotic Synapsis. DR NextBio; 346388; -. DR PRO; PR:Q9D666; -. DR Proteomes; UP000000589; Chromosome 5. DR Bgee; Q9D666; -. DR CleanEx; MM_UNC84A; -. DR ExpressionAtlas; Q9D666; baseline and differential. DR Genevisible; Q9D666; MM. DR GO; GO:0002080; C:acrosomal membrane; IDA:MGI. DR GO; GO:0005737; C:cytoplasm; IDA:MGI. DR GO; GO:0016021; C:integral component of membrane; IDA:MGI. DR GO; GO:0005639; C:integral component of nuclear inner membrane; IDA:MGI. DR GO; GO:0043231; C:intracellular membrane-bounded organelle; ISO:MGI. DR GO; GO:0034993; C:LINC complex; ISO:MGI. DR GO; GO:0005635; C:nuclear envelope; IDA:MGI. DR GO; GO:0031965; C:nuclear membrane; ISO:MGI. DR GO; GO:0005634; C:nucleus; IDA:MGI. DR GO; GO:0005521; F:lamin binding; IDA:UniProtKB. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; ISO:MGI. DR GO; GO:0006998; P:nuclear envelope organization; ISO:MGI. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; ISO:MGI. DR GO; GO:0007129; P:synapsis; IMP:MGI. DR InterPro; IPR012919; SUN_dom. DR InterPro; IPR015880; Znf_C2H2-like. DR Pfam; PF07738; Sad1_UNC; 1. DR SMART; SM00355; ZnF_C2H2; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Alternative splicing; Coiled coil; Complete proteome; Membrane; KW Nucleus; Phosphoprotein; Reference proteome; Signal-anchor; KW Transmembrane; Transmembrane helix. FT CHAIN 1 913 SUN domain-containing protein 1. FT /FTId=PRO_0000218912. FT TOPO_DOM 1 415 Nuclear. FT TRANSMEM 416 436 Helical. FT TOPO_DOM 437 913 Perinuclear space. FT DOMAIN 751 912 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT REGION 1 139 LMNA-binding. {ECO:0000250}. FT REGION 209 399 SYNE2-binding. {ECO:0000250}. FT REGION 310 399 EMD-binding. {ECO:0000250}. FT COILED 491 533 {ECO:0000255}. FT COILED 563 638 {ECO:0000255}. FT MOD_RES 66 66 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 139 139 Phosphoserine. FT {ECO:0000250|UniProtKB:O94901}. FT VAR_SEQ 221 285 RGVSFYLDRTLWLAKSTSSSFASFIVQLFQVVLMKLNFETY FT KLKGYESRAYESQSYETKSHESEA -> P (in isoform FT 4). {ECO:0000303|PubMed:17132086}. FT /FTId=VSP_039552. FT VAR_SEQ 222 344 Missing (in isoform 2). FT {ECO:0000303|PubMed:16141072}. FT /FTId=VSP_009346. FT VAR_SEQ 308 344 Missing (in isoform 3). FT {ECO:0000303|PubMed:16141072}. FT /FTId=VSP_009347. FT CONFLICT 6 6 L -> V (in Ref. 1; AAT90501). FT {ECO:0000305}. FT CONFLICT 108 108 L -> P (in Ref. 2; BAE39733). FT {ECO:0000305}. FT CONFLICT 439 439 F -> L (in Ref. 1; AAT90501). FT {ECO:0000305}. FT CONFLICT 479 479 H -> N (in Ref. 2; BAE35723). FT {ECO:0000305}. FT CONFLICT 505 505 D -> G (in Ref. 2; BAE39733). FT {ECO:0000305}. FT CONFLICT 593 593 E -> A (in Ref. 2; BAE39733). FT {ECO:0000305}. FT CONFLICT 704 704 S -> F (in Ref. 2; BAE39733). FT {ECO:0000305}. FT CONFLICT 856 857 QP -> AA (in Ref. 4; AAK13526). FT {ECO:0000305}. SQ SEQUENCE 913 AA; 101976 MW; B9872C8F2E044964 CRC64; MDFSRLHTYT PPQCVPENTG YTYALSSSYS SDALDFETEH KLEPVFDSPR MSRRSLRLVT TASYSSGDSQ AIDSHISTSR ATPAKGRETR TVKQRRSASK PAFSINHLSG KGLSSSTSHD SSCSLRSATV LRHPVLDESL IREQTKVDHF WGLDDDGDLK GGNKAATQGN GELAAEVASS NGYTCRDCRM LSARTDALTA HSAIHGTTSR VYSRDRTLKP RGVSFYLDRT LWLAKSTSSS FASFIVQLFQ VVLMKLNFET YKLKGYESRA YESQSYETKS HESEAHLGHC GRMTAGELSR VDGESLCDDC KGKKHLEIHT ATHSQLPQPH RVAGAMGRLC IYTGDLLVQA LRRTRAAGWS VAEAVWSVLW LAVSAPGKAA SGTFWWLGSG WYQFVTLISW LNVFLLTRCL RNICKVFVLL LPLLLLLGAG VSLWGQGNFF SLLPVLNWTA MQPTQRVDDS KGMHRPGPLP PSPPPKVDHK ASQWPQESDM GQKVASLSAQ CHNHDERLAE LTVLLQKLQI RVDQVDDGRE GLSLWVKNVV GQHLQEMGTI EPPDAKTDFM TFHHDHEVRL SNLEDVLRKL TEKSEAIQKE LEETKLKAGS RDEEQPLLDR VQHLELELNL LKSQLSDWQH LKTSCEQAGA RIQETVQLMF SEDQQGGSLE WLLEKLSSRF VSKDELQVLL HDLELKLLQN ITHHITVTGQ APTSEAIVSA VNQAGISGIT EAQAHIIVNN ALKLYSQDKT GMVDFALESG GGSILSTRCS ETYETKTALL SLFGVPLWYF SQSPRVVIQP DIYPGNCWAF KGSQGYLVVR LSMKIYPTTF TMEHIPKTLS PTGNISSAPK DFAVYGLETE YQEEGQPLGR FTYDQEGDSL QMFHTLERPD QAFQIVELRV LSNWGHPEYT CLYRFRVHGE PIQ // ID SUN2_DICDI Reviewed; 1278 AA. AC Q54MI3; DT 24-NOV-2009, integrated into UniProtKB/Swiss-Prot. DT 24-MAY-2005, sequence version 1. DT 11-NOV-2015, entry version 50. DE RecName: Full=SUN domain-containing protein 2; DE Flags: Precursor; GN Name=sun2; ORFNames=DDB_G0285925; OS Dictyostelium discoideum (Slime mold). OC Eukaryota; Amoebozoa; Mycetozoa; Dictyosteliida; Dictyostelium. OX NCBI_TaxID=44689; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=AX4; RX PubMed=15875012; DOI=10.1038/nature03481; RA Eichinger L., Pachebat J.A., Gloeckner G., Rajandream M.A., RA Sucgang R., Berriman M., Song J., Olsen R., Szafranski K., Xu Q., RA Tunggal B., Kummerfeld S., Madera M., Konfortov B.A., Rivero F., RA Bankier A.T., Lehmann R., Hamlin N., Davies R., Gaudet P., Fey P., RA Pilcher K., Chen G., Saunders D., Sodergren E.J., Davis P., RA Kerhornou A., Nie X., Hall N., Anjard C., Hemphill L., Bason N., RA Farbrother P., Desany B., Just E., Morio T., Rost R., Churcher C.M., RA Cooper J., Haydock S., van Driessche N., Cronin A., Goodhead I., RA Muzny D.M., Mourier T., Pain A., Lu M., Harper D., Lindsay R., RA Hauser H., James K.D., Quiles M., Madan Babu M., Saito T., RA Buchrieser C., Wardroper A., Felder M., Thangavelu M., Johnson D., RA Knights A., Loulseged H., Mungall K.L., Oliver K., Price C., RA Quail M.A., Urushihara H., Hernandez J., Rabbinowitsch E., Steffen D., RA Sanders M., Ma J., Kohara Y., Sharp S., Simmonds M.N., Spiegler S., RA Tivey A., Sugano S., White B., Walker D., Woodward J.R., Winckler T., RA Tanaka Y., Shaulsky G., Schleicher M., Weinstock G.M., Rosenthal A., RA Cox E.C., Chisholm R.L., Gibbs R.A., Loomis W.F., Platzer M., RA Kay R.R., Williams J.G., Dear P.H., Noegel A.A., Barrell B.G., RA Kuspa A.; RT "The genome of the social amoeba Dictyostelium discoideum."; RL Nature 435:43-57(2005). CC -!- SUBCELLULAR LOCATION: Nucleus membrane {ECO:0000305}; Single-pass CC membrane protein {ECO:0000305}. CC -!- DOMAIN: The SUN domain may play a role in the nuclear anchoring. CC {ECO:0000250}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AAFI02000082; EAL64497.1; -; Genomic_DNA. DR RefSeq; XP_638007.1; XM_632915.1. DR STRING; 44689.DDB0304575; -. DR PaxDb; Q54MI3; -. DR EnsemblProtists; DDB0304575; DDB0304575; DDB_G0285925. DR EnsemblProtists; EAL64497; EAL64497; EBG00001267198. DR GeneID; 8625358; -. DR KEGG; ddi:DDB_G0285925; -. DR dictyBase; DDB_G0285925; sunB. DR eggNOG; KOG1396; Eukaryota. DR eggNOG; ENOG41116S0; LUCA. DR InParanoid; Q54MI3; -. DR OMA; SHYGDQL; -. DR PRO; PR:Q54MI3; -. DR Proteomes; UP000002195; Chromosome 4. DR Proteomes; UP000002195; Unassembled WGS sequence. DR GO; GO:0005737; C:cytoplasm; IDA:dictyBase. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0031965; C:nuclear membrane; IEA:UniProtKB-SubCell. DR GO; GO:0005634; C:nucleus; IDA:dictyBase. DR GO; GO:0005773; C:vacuole; IDA:dictyBase. DR GO; GO:0031154; P:culmination involved in sorocarp development; IMP:dictyBase. DR GO; GO:0000281; P:mitotic cytokinesis; IMP:dictyBase. DR InterPro; IPR008979; Galactose-bd-like. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR SUPFAM; SSF49785; SSF49785; 1. DR PROSITE; PS51469; SUN; 1. PE 3: Inferred from homology; KW Coiled coil; Complete proteome; Glycoprotein; Membrane; Nucleus; KW Reference proteome; Signal; Transmembrane; Transmembrane helix. FT SIGNAL 1 25 {ECO:0000255}. FT CHAIN 26 1278 SUN domain-containing protein 2. FT /FTId=PRO_0000389262. FT TRANSMEM 914 934 Helical. {ECO:0000255}. FT DOMAIN 501 658 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 177 349 {ECO:0000255}. FT COILED 464 509 {ECO:0000255}. FT COILED 883 914 {ECO:0000255}. FT COMPBIAS 26 31 Poly-Gln. FT COMPBIAS 76 110 Poly-Asn. FT COMPBIAS 160 164 Poly-Thr. FT COMPBIAS 302 319 Poly-Gln. FT COMPBIAS 325 342 Poly-Gln. FT COMPBIAS 345 348 Poly-Gln. FT COMPBIAS 416 421 Poly-Asn. FT COMPBIAS 466 507 Poly-Gln. FT COMPBIAS 703 710 Poly-Ser. FT COMPBIAS 752 780 Poly-Asn. FT COMPBIAS 797 807 Poly-Ser. FT COMPBIAS 935 940 Poly-Ser. FT COMPBIAS 972 979 Poly-Gly. FT COMPBIAS 1014 1021 Poly-Asn. FT COMPBIAS 1024 1037 Poly-Asn. FT COMPBIAS 1097 1139 Poly-Asn. FT COMPBIAS 1142 1154 Poly-Asn. FT COMPBIAS 1223 1241 Poly-Asn. FT COMPBIAS 1253 1256 Poly-Ser. FT CARBOHYD 47 47 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 56 56 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 166 166 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 293 293 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 356 356 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 370 370 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 386 386 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 391 391 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 405 405 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 558 558 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 669 669 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 832 832 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 941 941 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 969 969 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 983 983 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 994 994 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 999 999 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1005 1005 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1028 1028 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1036 1036 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1041 1041 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1049 1049 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1107 1107 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1138 1138 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1147 1147 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1214 1214 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1227 1227 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1241 1241 N-linked (GlcNAc...). {ECO:0000255}. FT CARBOHYD 1267 1267 N-linked (GlcNAc...). {ECO:0000255}. SQ SEQUENCE 1278 AA; 146049 MW; 2F1D31084CDF0FDE CRC64; MIINKKYILF ILLLLFITSC TIVFSQQQQQ QTEQSEQTEQ AINNDVNNSI NDIFENDSSK QLRQPQQQHT IVDDGNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN PIDNKDILGL KKLALLKQFE EQKSKSENDI NNDIVILNLE NDNPNQIIET TTTTLNNSND NKNNIIDDNQ DEKLNENIKE DKNEIKNEIK NENQEKDKGI IDVEKDENQP NIEEKGKEKQ NLLEKGIENE NQNENQIQIE KEKEIEIEIE KEKEKENKEL IEESKTEKDN QQKENKENTN EINVTVVEEP EQPQPQQQNQ QEQQEQQQQE HKEEQQQQEQ QQQEQQTQQE QQTHQQQQEH QETQKNSSEE TKTQSPIQVN TTDVNNEIEL KNEGDNNSQL NDSSIPITSP LTNDNDTLKT TKEDSNNNNK NEVINNQTPL IDEKNHQHNY EGNNRNGDDV SIISNIPKTN KAPETQQQQQ QQQQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ QQQQQQQHVV LTPNDLPDKF NYASSECGAN VLQTNKEAWE VSSILASSRD RYLLNECNKS QWFVVELCEE IGVQIIELAN FEFFSSMFKD FIVLGSNRYP AQSWHYLGQF TAENSRKQQY FVLKEKAWYK YLKVKILSHY GDQLYCPISS FKVYGSTMVD DLKNQVDINI SELEKFQRDL SSIPYPMEIG SDTSYSTTTS TTSSSSTSSS YPSSKTKSSN SEYPSWERIQ SFSEKLRKNV EQQLIQPPSV LNTNDNNNNN NNNNNNNNNN NNNNNNNNNN EEQFIYYETN GNGGPPSTST STSSSSSQNH QARTPQSVFK TLADKIKAIE FNQSIGNKFM EKLERYYSEE IKNLKFDVSE FLNDIIKLGN SLDEKLKDHR KYDDNKFKET SKEIKILKEK IEKLEEQKSA DRNFYLVVTL VSLLIGLLLK PLFTSSSSSS NKSYPNSMPN SPTYLNSGSN NYNNNGIINS SGGSGGGGGN LQNSSFIGIN GQLNFSDDNI SAFLNSSCSN FGLNNNNNNN NGINNNNNNS NSNSNNNSIN NGSININSNN SLQQRIHHNK YIHQRRNSSP LVGVQLESFF SPNAIPPTIP IVPQDDNNIN YNYNNNNNTN NINNNYNYNN NNNNNNNNNN NNNNNNNNNS DNYNNNNNSN NNVNSPSSPT PSSIILSPKF ITSIPKNINY YNNGGSGSHL KNRFSRQASE SVLSQNHYQI NHQNHSLNGV TTNINNNNSN SNGNSNGNSN NMTNGLPPVS MPSSSSHDNL LLHRGNNQSK KYKRRSHL // ID SUN2_HUMAN Reviewed; 717 AA. AC Q9UH99; B0QY62; O75156; Q2NKN8; Q2T9F7; Q504T5; Q6B4H1; Q7Z3E3; DT 02-AUG-2002, integrated into UniProtKB/Swiss-Prot. DT 25-MAR-2003, sequence version 3. DT 11-NOV-2015, entry version 143. DE RecName: Full=SUN domain-containing protein 2; DE AltName: Full=Protein unc-84 homolog B; DE AltName: Full=Rab5-interacting protein; DE Short=Rab5IP; DE AltName: Full=Sad1/unc-84 protein-like 2; GN Name=SUN2; Synonyms=FRIGG, KIAA0668, RAB5IP, UNC84B; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), SUBCELLULAR LOCATION, TISSUE RP SPECIFICITY, AND INTERACTION WITH RAB5A. RC TISSUE=B-cell; RX PubMed=10818110; DOI=10.1074/jbc.M909600199; RA Hoffenberg S., Liu X., Nikolova L., Hall H.S., Dai W., Baughn R.E., RA Dickey B.F., Barbieri M.A., Aballay A., Stahl P.D., Knoll B.J.; RT "A novel membrane-anchored Rab5 interacting protein required for RT homotypic endosome fusion."; RL J. Biol. Chem. 275:24661-24669(2000). RN [2] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), SUBCELLULAR LOCATION, SUBUNIT, RP AND ASSOCIATION WITH THE CENTROSOME. RX PubMed=17132086; DOI=10.1089/dna.2006.25.554; RA Wang Q., Du X., Cai Z., Greene M.I.; RT "Characterization of the structures involved in localization of the RT SUN proteins to the nuclear envelope and the centrosome."; RL DNA Cell Biol. 25:554-562(2006). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). RC TISSUE=Brain; RX PubMed=9734811; DOI=10.1093/dnares/5.3.169; RA Ishikawa K., Nagase T., Suyama M., Miyajima N., Tanaka A., Kotani H., RA Nomura N., Ohara O.; RT "Prediction of the coding sequences of unidentified human genes. X. RT The complete sequences of 100 new cDNA clones from brain which can RT code for large proteins in vitro."; RL DNA Res. 5:169-176(1998). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). RX PubMed=15461802; DOI=10.1186/gb-2004-5-10-r84; RA Collins J.E., Wright C.L., Edwards C.A., Davis M.P., Grinham J.A., RA Cole C.G., Goward M.E., Aguado B., Mallya M., Mokrab Y., Huckle E.J., RA Beare D.M., Dunham I.; RT "A genome annotation-driven approach to cloning the human ORFeome."; RL Genome Biol. 5:R84.1-R84.11(2004). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). RC TISSUE=Fetal kidney; RX PubMed=17974005; DOI=10.1186/1471-2164-8-399; RA Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., RA Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., RA Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., RA Ottenwaelder B., Poustka A., Wiemann S., Schupp I.; RT "The full-ORF clone resource of the German cDNA consortium."; RL BMC Genomics 8:399-399(2007). RN [6] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=10591208; DOI=10.1038/990031; RA Dunham I., Hunt A.R., Collins J.E., Bruskiewich R., Beare D.M., RA Clamp M., Smink L.J., Ainscough R., Almeida J.P., Babbage A.K., RA Bagguley C., Bailey J., Barlow K.F., Bates K.N., Beasley O.P., RA Bird C.P., Blakey S.E., Bridgeman A.M., Buck D., Burgess J., RA Burrill W.D., Burton J., Carder C., Carter N.P., Chen Y., Clark G., RA Clegg S.M., Cobley V.E., Cole C.G., Collier R.E., Connor R., RA Conroy D., Corby N.R., Coville G.J., Cox A.V., Davis J., Dawson E., RA Dhami P.D., Dockree C., Dodsworth S.J., Durbin R.M., Ellington A.G., RA Evans K.L., Fey J.M., Fleming K., French L., Garner A.A., RA Gilbert J.G.R., Goward M.E., Grafham D.V., Griffiths M.N.D., Hall C., RA Hall R.E., Hall-Tamlyn G., Heathcott R.W., Ho S., Holmes S., RA Hunt S.E., Jones M.C., Kershaw J., Kimberley A.M., King A., RA Laird G.K., Langford C.F., Leversha M.A., Lloyd C., Lloyd D.M., RA Martyn I.D., Mashreghi-Mohammadi M., Matthews L.H., Mccann O.T., RA Mcclay J., Mclaren S., McMurray A.A., Milne S.A., Mortimore B.J., RA Odell C.N., Pavitt R., Pearce A.V., Pearson D., Phillimore B.J.C.T., RA Phillips S.H., Plumb R.W., Ramsay H., Ramsey Y., Rogers L., Ross M.T., RA Scott C.E., Sehra H.K., Skuce C.D., Smalley S., Smith M.L., RA Soderlund C., Spragon L., Steward C.A., Sulston J.E., Swann R.M., RA Vaudin M., Wall M., Wallis J.M., Whiteley M.N., Willey D.L., RA Williams L., Williams S.A., Williamson H., Wilmer T.E., Wilming L., RA Wright C.L., Hubbard T., Bentley D.R., Beck S., Rogers J., Shimizu N., RA Minoshima S., Kawasaki K., Sasaki T., Asakawa S., Kudoh J., RA Shintani A., Shibuya K., Yoshizaki Y., Aoki N., Mitsuyama S., RA Roe B.A., Chen F., Chu L., Crabtree J., Deschamps S., Do A., Do T., RA Dorman A., Fang F., Fu Y., Hu P., Hua A., Kenton S., Lai H., Lao H.I., RA Lewis J., Lewis S., Lin S.-P., Loh P., Malaj E., Nguyen T., Pan H., RA Phan S., Qi S., Qian Y., Ray L., Ren Q., Shaull S., Sloan D., Song L., RA Wang Q., Wang Y., Wang Z., White J., Willingham D., Wu H., Yao Z., RA Zhan M., Zhang G., Chissoe S., Murray J., Miller N., Minx P., RA Fulton R., Johnson D., Bemis G., Bentley D., Bradshaw H., Bourne S., RA Cordes M., Du Z., Fulton L., Goela D., Graves T., Hawkins J., RA Hinds K., Kemp K., Latreille P., Layman D., Ozersky P., Rohlfing T., RA Scheet P., Walker C., Wamsley A., Wohldmann P., Pepin K., Nelson J., RA Korf I., Bedell J.A., Hillier L.W., Mardis E., Waterston R., RA Wilson R., Emanuel B.S., Shaikh T., Kurahashi H., Saitta S., RA Budarf M.L., McDermid H.E., Johnson A., Wong A.C.C., Morrow B.E., RA Edelmann L., Kim U.J., Shizuya H., Simon M.I., Dumanski J.P., RA Peyrard M., Kedra D., Seroussi E., Fransson I., Tapia I., Bruder C.E., RA O'Brien K.P., Wilkinson P., Bodenteich A., Hartman K., Hu X., RA Khan A.S., Lane L., Tilahun Y., Wright H.; RT "The DNA sequence of human chromosome 22."; RL Nature 402:489-495(1999). RN [7] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1; 2 AND 3), AND RP VARIANTS ARG-89 AND SER-671. RC TISSUE=Brain, and Pancreas; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [8] RP NUCLEOTIDE SEQUENCE [MRNA] OF 278-717 (ISOFORM 1). RX PubMed=10375507; RA Malone C.J., Fixsen W.D., Horvitz H.R., Han M.; RT "UNC-84 localizes to the nuclear envelope and is required for nuclear RT migration and anchoring during C. elegans development."; RL Development 126:3171-3181(1999). RN [9] RP TISSUE SPECIFICITY. RX PubMed=12393179; DOI=10.1016/S0925-4439(02)00171-0; RA Sun G., Yuen Chan S., Yuan Y., Wang Chan K., Qiu G., Sun K., RA Ping Leung M.; RT "Isolation of differentially expressed genes in human heart tissues."; RL Biochim. Biophys. Acta 1588:241-246(2002). RN [10] RP PHOSPHORYLATION AT SER-12; SER-54 AND SER-116. RX PubMed=12239280; DOI=10.1074/mcp.M200010-MCP200; RA Gronborg M., Kristiansen T.Z., Stensballe A., Andersen J.S., Ohara O., RA Mann M., Jensen O.N., Pandey A.; RT "A mass spectrometry-based proteomic approach for identification of RT serine/threonine-phosphorylated proteins by enrichment with phospho- RT specific antibodies: identification of a novel protein, Frigg, as a RT protein kinase A substrate."; RL Mol. Cell. Proteomics 1:517-527(2002). RN [11] RP IDENTIFICATION BY MASS SPECTROMETRY, AND SUBCELLULAR LOCATION. RX PubMed=12958361; DOI=10.1126/science.1088176; RA Schirmer E.C., Florens L., Guan T., Yates J.R. III, Gerace L.; RT "Nuclear membrane proteins with potential disease links found by RT subtractive proteomics."; RL Science 301:1380-1382(2003). RN [12] RP SUBCELLULAR LOCATION, AND TOPOLOGY. RX PubMed=15082709; DOI=10.1074/jbc.M313157200; RA Hodzic D.M., Yeater D.B., Bengtsson L., Otto H., Stahl P.D.; RT "Sun2 is a novel mammalian inner nuclear membrane protein."; RL J. Biol. Chem. 279:25805-25812(2004). RN [13] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-54, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Cervix carcinoma; RX PubMed=17081983; DOI=10.1016/j.cell.2006.09.026; RA Olsen J.V., Blagoev B., Gnad F., Macek B., Kumar C., Mortensen P., RA Mann M.; RT "Global, in vivo, and site-specific phosphorylation dynamics in RT signaling networks."; RL Cell 127:635-648(2006). RN [14] RP SUBCELLULAR LOCATION. RX PubMed=17724119; DOI=10.1083/jcb.200704108; RA Liu Q., Pante N., Misteli T., Elsagga M., Crisp M., Hodzic D., RA Burke B., Roux K.J.; RT "Functional association of Sun1 with nuclear pore complexes."; RL J. Cell Biol. 178:785-798(2007). RN [15] RP INTERACTION WITH SUN1. RX PubMed=18845190; DOI=10.1016/j.bbamcr.2008.09.001; RA Lu W., Gotzmann J., Sironi L., Jaeger V.M., Schneider M., Luke Y., RA Uhlen M., Szigyarto C.A., Brachner A., Ellenberg J., Foisner R., RA Noegel A.A., Karakesisoglou I.; RT "Sun1 forms immobile macromolecular assemblies at the nuclear RT envelope."; RL Biochim. Biophys. Acta 1783:2415-2426(2008). RN [16] RP INTERACTION WITH SYNE1; SYNE2 AND SYNE3, AND FUNCTION OF THE LINC RP COMPLEXES. RX PubMed=18396275; DOI=10.1016/j.yexcr.2008.02.022; RA Stewart-Hutchinson P.J., Hale C.M., Wirtz D., Hodzic D.; RT "Structural requirements for the assembly of LINC complexes and their RT function in cellular mechanical stiffness."; RL Exp. Cell Res. 314:1892-1905(2008). RN [17] RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-636. RC TISSUE=Liver; RX PubMed=19159218; DOI=10.1021/pr8008012; RA Chen R., Jiang X., Sun D., Han G., Wang F., Ye M., Wang L., Zou H.; RT "Glycoproteomics analysis of human liver tissue by combination of RT multiple enzyme digestion and hydrazide chemistry."; RL J. Proteome Res. 8:651-661(2009). RN [18] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-38, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Leukemic T-cell; RX PubMed=19690332; DOI=10.1126/scisignal.2000007; RA Mayya V., Lundgren D.H., Hwang S.-I., Rezaul K., Wu L., Eng J.K., RA Rodionov V., Han D.K.; RT "Quantitative phosphoproteomic analysis of T cell receptor signaling RT reveals system-wide modulation of protein-protein interactions."; RL Sci. Signal. 2:RA46-RA46(2009). RN [19] RP INTERACTION WITH TMEM43. RX PubMed=21391237; DOI=10.1002/ana.22338; RA Liang W.C., Mitsuhashi H., Keduka E., Nonaka I., Noguchi S., RA Nishino I., Hayashi Y.K.; RT "TMEM43 mutations in Emery-Dreifuss muscular dystrophy-related RT myopathy."; RL Ann. Neurol. 69:1005-1013(2011). RN [20] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=21269460; DOI=10.1186/1752-0509-5-17; RA Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., RA Buerckstuemmer T., Bennett K.L., Superti-Furga G., Colinge J.; RT "Initial characterization of the human central proteome."; RL BMC Syst. Biol. 5:17-17(2011). RN [21] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-54, AND IDENTIFICATION RP BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Liver; RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., RA Wang L., Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human RT liver phosphoproteome."; RL J. Proteomics 96:253-262(2014). CC -!- FUNCTION: Component of SUN-protein-containing multivariate CC complexes also called LINC complexes which link the nucleoskeleton CC and cytoskeleton by providing versatile outer nuclear membrane CC attachment sites for cytoskeletal filaments. Specifically, SYNE2 CC and SUN2 assemble in arrays of transmembrane actin-associated CC nuclear (TAN) lines which are bound to F-actin cables and couple CC the nucleus to retrograde actin flow during actin-dependent CC nuclear movement. Required for interkinetic nuclear migration CC (INM) and essential for nucleokinesis and centrosome-nucleus CC coupling during radial neuronal migration in the cerebral cortex CC and during glial migration. Anchors chromosome movement in the CC prophase of meiosis and is involved in selective gene expression CC of coding and non-coding RNAs needed for gametogenesis. Required CC for telomere attachment to nuclear envelope and gametogenesis. May CC also function on endocytic vesicles as a receptor for RAB5-GDP and CC participate in the activation of RAB5. CC {ECO:0000269|PubMed:18396275}. CC -!- SUBUNIT: Core component of the LINC complex which is composed of CC inner nuclear membrane SUN domain-containing proteins coupled to CC outer nuclear membrane KASH domain-containing nesprins. SUN CC domain-containing proteins interact with A-type lamins of the CC nuclear lamina, while at the other end of the complex, nesprins CC interact with unique cytoskeletal components. Interacts with CC SYNE1, SYNE2 and SYNE3. Interacts with A-type lamin. Interaction CC with lamins B1 and C is hardly detectable (By similarity). CC Interacts with EMD and RAB5A. Interacts with TMEM43. {ECO:0000250, CC ECO:0000269|PubMed:10818110, ECO:0000269|PubMed:17132086, CC ECO:0000269|PubMed:18396275, ECO:0000269|PubMed:18845190, CC ECO:0000269|PubMed:21391237}. CC -!- INTERACTION: CC Self; NbExp=2; IntAct=EBI-1044964, EBI-1044964; CC P53618:COPB1; NbExp=3; IntAct=EBI-1044964, EBI-359063; CC P50402:EMD; NbExp=3; IntAct=EBI-1044964, EBI-489887; CC P52292:KPNA2; NbExp=3; IntAct=EBI-1044964, EBI-349938; CC P02545:LMNA; NbExp=4; IntAct=EBI-1044964, EBI-351935; CC Q8IXM6:NRM; NbExp=3; IntAct=EBI-1044964, EBI-10262547; CC Q8NF91:SYNE1; NbExp=3; IntAct=EBI-1044964, EBI-928867; CC Q8NF91-1:SYNE1; NbExp=2; IntAct=EBI-1044964, EBI-6170938; CC Q8WXH0:SYNE2; NbExp=7; IntAct=EBI-1044964, EBI-2372294; CC Q8WXH0-1:SYNE2; NbExp=2; IntAct=EBI-1044964, EBI-6170976; CC -!- SUBCELLULAR LOCATION: Nucleus inner membrane; Single-pass type II CC membrane protein. Endosome membrane {ECO:0000305}; Single-pass CC type II membrane protein {ECO:0000305}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=3; CC Name=1; CC IsoId=Q9UH99-1; Sequence=Displayed; CC Name=2; CC IsoId=Q9UH99-2; Sequence=VSP_045882; CC Note=No experimental confirmation available.; CC Name=3; CC IsoId=Q9UH99-3; Sequence=VSP_053702; CC Note=No experimental confirmation available.; CC -!- TISSUE SPECIFICITY: Widely expressed. Highly expressed in heart, CC lung and muscle. Weakly expressed in fetal heart. Slightly CC overexpressed in some heart tissues form patients with congenital CC heart defects. {ECO:0000269|PubMed:10818110, CC ECO:0000269|PubMed:12393179}. CC -!- DOMAIN: The SUN domain may play a role in the nuclear anchoring CC and/or migration. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC -!- CAUTION: It is uncertain whether Met-1 or Met-50 is the initiator. CC {ECO:0000305}. CC -!- SEQUENCE CAUTION: CC Sequence=BAA31643.1; Type=Erroneous initiation; Note=Translation N-terminally shortened.; Evidence={ECO:0000305}; CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AB014568; BAA31643.1; ALT_INIT; mRNA. DR EMBL; AY682988; AAT90500.1; -; mRNA. DR EMBL; CR456474; CAG30360.1; -; mRNA. DR EMBL; BX537962; CAD97926.1; -; mRNA. DR EMBL; AL008583; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL021806; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; AL021707; CAI21604.1; -; Genomic_DNA. DR EMBL; AL021707; CAQ07913.1; -; Genomic_DNA. DR EMBL; BC030684; AAH30684.2; -; mRNA. DR EMBL; BC094797; AAH94797.1; -; mRNA. DR EMBL; BC111549; AAI11550.1; -; mRNA. DR EMBL; BC111717; AAI11718.1; -; mRNA. DR EMBL; AF202723; AAF15887.1; -; mRNA. DR CCDS; CCDS13978.1; -. [Q9UH99-1] DR CCDS; CCDS56231.1; -. [Q9UH99-2] DR PIR; T00371; T00371. DR RefSeq; NP_001186508.1; NM_001199579.1. [Q9UH99-2] DR RefSeq; NP_001186509.1; NM_001199580.1. [Q9UH99-1] DR RefSeq; NP_056189.1; NM_015374.2. [Q9UH99-1] DR UniGene; Hs.517622; -. DR UniGene; Hs.744734; -. DR PDB; 3UNP; X-ray; 2.39 A; A=520-717. DR PDB; 4DXR; X-ray; 2.32 A; A=522-717. DR PDB; 4DXS; X-ray; 2.71 A; A=522-717. DR PDB; 4DXT; X-ray; 2.22 A; A=522-717. DR PDB; 4FI9; X-ray; 3.05 A; A=523-717. DR PDBsum; 3UNP; -. DR PDBsum; 4DXR; -. DR PDBsum; 4DXS; -. DR PDBsum; 4DXT; -. DR PDBsum; 4FI9; -. DR ProteinModelPortal; Q9UH99; -. DR SMR; Q9UH99; 522-717. DR BioGrid; 117312; 40. DR IntAct; Q9UH99; 38. DR MINT; MINT-3080157; -. DR STRING; 9606.ENSP00000385616; -. DR PhosphoSite; Q9UH99; -. DR BioMuta; SUN2; -. DR DMDM; 29337242; -. DR MaxQB; Q9UH99; -. DR PaxDb; Q9UH99; -. DR PRIDE; Q9UH99; -. DR DNASU; 25777; -. DR Ensembl; ENST00000405018; ENSP00000385616; ENSG00000100242. [Q9UH99-2] DR Ensembl; ENST00000405510; ENSP00000385740; ENSG00000100242. [Q9UH99-1] DR Ensembl; ENST00000406622; ENSP00000383992; ENSG00000100242. [Q9UH99-1] DR GeneID; 25777; -. DR KEGG; hsa:25777; -. DR UCSC; uc003awh.2; human. [Q9UH99-1] DR UCSC; uc010gxq.2; human. DR CTD; 25777; -. DR GeneCards; SUN2; -. DR H-InvDB; HIX0159176; -. DR HGNC; HGNC:14210; SUN2. DR HPA; HPA001209; -. DR MIM; 613569; gene. DR neXtProt; NX_Q9UH99; -. DR PharmGKB; PA165378369; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG056957; -. DR InParanoid; Q9UH99; -. DR KO; K19347; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; Q9UH99; -. DR TreeFam; TF323915; -. DR Reactome; R-HSA-1221632; Meiotic synapsis. DR ChiTaRS; SUN2; human. DR GeneWiki; UNC84B; -. DR GenomeRNAi; 25777; -. DR NextBio; 46920; -. DR PRO; PR:Q9UH99; -. DR Proteomes; UP000005640; Chromosome 22. DR Bgee; Q9UH99; -. DR CleanEx; HS_UNC84B; -. DR ExpressionAtlas; Q9UH99; baseline and differential. DR Genevisible; Q9UH99; HS. DR GO; GO:0000794; C:condensed nuclear chromosome; IEA:Ensembl. DR GO; GO:0010008; C:endosome membrane; IEA:UniProtKB-SubCell. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0034993; C:LINC complex; IDA:UniProtKB. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IDA:UniProtKB. DR GO; GO:0005637; C:nuclear inner membrane; IEA:UniProtKB-SubCell. DR GO; GO:0031965; C:nuclear membrane; IDA:HPA. DR GO; GO:0042802; F:identical protein binding; IPI:IntAct. DR GO; GO:0005521; F:lamin binding; IDA:UniProtKB. DR GO; GO:0008017; F:microtubule binding; TAS:UniProtKB. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; ISS:UniProtKB. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IDA:UniProtKB. DR GO; GO:0007052; P:mitotic spindle organization; TAS:UniProtKB. DR GO; GO:0006998; P:nuclear envelope organization; IGI:MGI. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; IDA:UniProtKB. DR GO; GO:0007097; P:nuclear migration; TAS:UniProtKB. DR GO; GO:0031022; P:nuclear migration along microfilament; ISS:UniProtKB. DR GO; GO:0030335; P:positive regulation of cell migration; ISS:UniProtKB. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW 3D-structure; Alternative splicing; Coiled coil; Complete proteome; KW Endosome; Glycoprotein; Membrane; Nucleus; Phosphoprotein; KW Polymorphism; Reference proteome; Signal-anchor; Transmembrane; KW Transmembrane helix. FT CHAIN 1 717 SUN domain-containing protein 2. FT /FTId=PRO_0000218913. FT TOPO_DOM 1 212 Nuclear. {ECO:0000269|PubMed:15082709}. FT TRANSMEM 213 233 Helical. FT TOPO_DOM 234 717 Perinuclear space. FT {ECO:0000269|PubMed:15082709}. FT DOMAIN 555 716 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT REGION 1 139 LMNA-binding. {ECO:0000250}. FT COILED 273 296 {ECO:0000255}. FT COILED 348 440 {ECO:0000255}. FT COILED 475 506 {ECO:0000255}. FT COMPBIAS 2 164 Ser-rich. FT COMPBIAS 100 105 Poly-Arg. FT COMPBIAS 316 322 Poly-Gly. FT COMPBIAS 468 471 Poly-Gly. FT MOD_RES 12 12 Phosphoserine. FT {ECO:0000269|PubMed:12239280}. FT MOD_RES 38 38 Phosphoserine. FT {ECO:0000244|PubMed:19690332}. FT MOD_RES 54 54 Phosphoserine. FT {ECO:0000244|PubMed:17081983, FT ECO:0000244|PubMed:24275569, FT ECO:0000269|PubMed:12239280}. FT MOD_RES 107 107 Phosphothreonine. FT {ECO:0000250|UniProtKB:Q8BJS4}. FT MOD_RES 110 110 Phosphoserine. FT {ECO:0000250|UniProtKB:Q8BJS4}. FT MOD_RES 113 113 Phosphoserine. FT {ECO:0000250|UniProtKB:Q8BJS4}. FT MOD_RES 116 116 Phosphoserine. FT {ECO:0000269|PubMed:12239280}. FT MOD_RES 136 136 Phosphoserine. FT {ECO:0000250|UniProtKB:Q8BJS4}. FT CARBOHYD 636 636 N-linked (GlcNAc...). FT {ECO:0000269|PubMed:19159218}. FT VAR_SEQ 141 141 V -> VEDSEGRGSKVTETEPVSSFPA (in isoform FT 2). {ECO:0000303|PubMed:15489334}. FT /FTId=VSP_045882. FT VAR_SEQ 683 717 TMATYQVVELRILTNWGHPEYTCIYRFRVHGEPAH -> SS FT FPLCPWRLLPILGVCIYVAYHGGLGSWER (in isoform FT 3). {ECO:0000303|PubMed:15489334}. FT /FTId=VSP_053702. FT VARIANT 33 33 T -> A (in dbSNP:rs2072799). FT /FTId=VAR_052282. FT VARIANT 89 89 L -> R (in dbSNP:rs35496634). FT {ECO:0000269|PubMed:15489334}. FT /FTId=VAR_052283. FT VARIANT 348 348 R -> C (in dbSNP:rs138708). FT /FTId=VAR_052284. FT VARIANT 671 671 G -> S (in dbSNP:rs2072797). FT {ECO:0000269|PubMed:15489334}. FT /FTId=VAR_024624. FT CONFLICT 644 644 K -> R (in Ref. 5; CAD97926). FT {ECO:0000305}. FT HELIX 525 540 {ECO:0000244|PDB:4DXT}. FT TURN 541 544 {ECO:0000244|PDB:4DXT}. FT HELIX 552 554 {ECO:0000244|PDB:4DXT}. FT STRAND 557 563 {ECO:0000244|PDB:4DXT}. FT HELIX 571 574 {ECO:0000244|PDB:4DXT}. FT TURN 575 577 {ECO:0000244|PDB:4DXT}. FT STRAND 579 584 {ECO:0000244|PDB:4DXR}. FT HELIX 588 592 {ECO:0000244|PDB:4DXT}. FT STRAND 601 605 {ECO:0000244|PDB:4DXT}. FT STRAND 609 627 {ECO:0000244|PDB:4DXT}. FT HELIX 631 633 {ECO:0000244|PDB:4DXT}. FT HELIX 635 637 {ECO:0000244|PDB:4DXT}. FT STRAND 645 655 {ECO:0000244|PDB:4DXT}. FT STRAND 660 666 {ECO:0000244|PDB:4DXT}. FT STRAND 669 671 {ECO:0000244|PDB:3UNP}. FT STRAND 673 678 {ECO:0000244|PDB:4DXT}. FT STRAND 687 694 {ECO:0000244|PDB:4DXT}. FT STRAND 697 699 {ECO:0000244|PDB:4DXT}. FT STRAND 701 706 {ECO:0000244|PDB:4DXT}. FT STRAND 708 714 {ECO:0000244|PDB:4DXT}. SQ SEQUENCE 717 AA; 80311 MW; CCF43C118E935E84 CRC64; MSRRSQRLTR YSQGDDDGSS SSGGSSVAGS QSTLFKDSPL RTLKRKSSNM KRLSPAPQLG PSSDAHTSYY SESLVHESWF PPRSSLEELH GDANWGEDLR VRRRRGTGGS ESSRASGLVG RKATEDFLGS SSGYSSEDDY VGYSDVDQQS SSSRLRSAVS RAGSLLWMVA TSPGRLFRLL YWWAGTTWYR LTTAASLLDV FVLTRRFSSL KTFLWFLLPL LLLTCLTYGA WYFYPYGLQT FHPALVSWWA AKDSRRPDEG WEARDSSPHF QAEQRVMSRV HSLERRLEAL AAEFSSNWQK EAMRLERLEL RQGAPGQGGG GGLSHEDTLA LLEGLVSRRE AALKEDFRRE TAARIQEELS ALRAEHQQDS EDLFKKIVRA SQESEARIQQ LKSEWQSMTQ ESFQESSVKE LRRLEDQLAG LQQELAALAL KQSSVAEEVG LLPQQIQAVR DDVESQFPAW ISQFLARGGG GRVGLLQREE MQAQLRELES KILTHVAEMQ GKSAREAAAS LSLTLQKEGV IGVTEEQVHH IVKQALQRYS EDRIGLADYA LESGGASVIS TRCSETYETK TALLSLFGIP LWYHSQSPRV ILQPDVHPGN CWAFQGPQGF AVVRLSARIR PTAVTLEHVP KALSPNSTIS SAPKDFAIFG FDEDLQQEGT LLGKFTYDQD GEPIQTFHFQ APTMATYQVV ELRILTNWGH PEYTCIYRFR VHGEPAH // ID SUN2_MOUSE Reviewed; 731 AA. AC Q8BJS4; Q3TBU0; Q3U160; Q6B4H2; DT 02-FEB-2004, integrated into UniProtKB/Swiss-Prot. DT 10-AUG-2010, sequence version 3. DT 11-NOV-2015, entry version 97. DE RecName: Full=SUN domain-containing protein 2; DE AltName: Full=Protein unc-84 homolog B; DE AltName: Full=Sad1/unc-84 protein-like 2; GN Name=Sun2; Synonyms=Unc84b; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 2), SUBCELLULAR LOCATION, SUBUNIT, RP AND ASSOCIATION WITH THE CENTROSOME. RX PubMed=17132086; DOI=10.1089/dna.2006.25.554; RA Wang Q., Du X., Cai Z., Greene M.I.; RT "Characterization of the structures involved in localization of the RT SUN proteins to the nuclear envelope and the centrosome."; RL DNA Cell Biol. 25:554-562(2006). RN [2] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 3). RA Oshima A., Takahashi-Fujii A., Tanase T., Imose N., Takeuchi K., RA Arita M., Musashino K., Yuuki H., Hara H., Sugiyama T., Irie R., RA Otsuki T., Sato H., Ota T., Wakamatsu A., Ishii S., Yamamoto J., RA Isono Y., Kawai-Hio Y., Saito K., Nishikawa T., Kimura K., RA Yamashita H., Matsuo K., Nakamura Y., Sekine M., Kikuchi H., Kanda K., RA Wagatsuma M., Murakawa K., Kanehori K., Sugiyama A., Kawakami B., RA Suzuki Y., Sugano S., Nagahari K., Masuho Y., Nagai K., Isogai T.; RT "NEDO cDNA sequencing project."; RL Submitted (JUL-2003) to the EMBL/GenBank/DDBJ databases. RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2). RC STRAIN=C57BL/6J, and NOD; RC TISSUE=Aorta, Dendritic cell, Spleen, and Vein; RX PubMed=16141072; DOI=10.1126/science.1112014; RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E., RA Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., RA Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., RA Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., RA di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., RA Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., RA Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., RA Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., RA Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., RA Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., RA Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., RA Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., RA Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., RA Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., RA Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., RA Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., RA Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., RA Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., RA Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., RA Hayashizaki Y.; RT "The transcriptional landscape of the mammalian genome."; RL Science 309:1559-1563(2005). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. RN [5] RP TISSUE SPECIFICITY. RX PubMed=12393179; DOI=10.1016/S0925-4439(02)00171-0; RA Sun G., Yuen Chan S., Yuan Y., Wang Chan K., Qiu G., Sun K., RA Ping Leung M.; RT "Isolation of differentially expressed genes in human heart tissues."; RL Biochim. Biophys. Acta 1588:241-246(2002). RN [6] RP FUNCTION OF THE LINC COMPLEX, AND INTERACTION WITH LAMINS AND SYNE2. RX PubMed=16380439; DOI=10.1083/jcb.200509124; RA Crisp M., Liu Q., Roux K., Rattner J.B., Shanahan C., Burke B., RA Stahl P.D., Hodzic D.; RT "Coupling of the nucleus and cytoplasm: role of the LINC complex."; RL J. Cell Biol. 172:41-53(2006). RN [7] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Liver; RX PubMed=17242355; DOI=10.1073/pnas.0609836104; RA Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.; RT "Large-scale phosphorylation analysis of mouse liver."; RL Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007). RN [8] RP SUBCELLULAR LOCATION, FUNCTION, AND INTERACTION WITH LMNA AND SYN2. RX PubMed=19843581; DOI=10.1242/jcs.057075; RA Ostlund C., Folker E.S., Choi J.C., Gomes E.R., Gundersen G.G., RA Worman H.J.; RT "Dynamics and molecular interactions of linker of nucleoskeleton and RT cytoskeleton (LINC) complex proteins."; RL J. Cell Sci. 122:4099-4108(2009). RN [9] RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-650. RC TISSUE=Myoblast; RX PubMed=19656770; DOI=10.1074/mcp.M900195-MCP200; RA Gundry R.L., Raginski K., Tarasova Y., Tchernyshyov I., RA Bausch-Fluck D., Elliott S.T., Boheler K.R., Van Eyk J.E., RA Wollscheid B.; RT "The mouse C2C12 myoblast cell surface N-linked glycoproteome: RT identification, glycosite occupancy, and membrane orientation."; RL Mol. Cell. Proteomics 8:2555-2569(2009). RN [10] RP FUNCTION, SUBCELLULAR LOCATION, AND INTERACTION WITH SYNE2. RX PubMed=19874786; DOI=10.1016/j.neuron.2009.08.018; RA Zhang X., Lei K., Yuan X., Wu X., Zhuang Y., Xu T., Xu R., Han M.; RT "SUN1/2 and Syne/Nesprin-1/2 complexes connect centrosome to the RT nucleus during neurogenesis and neuronal migration in mice."; RL Neuron 64:173-187(2009). RN [11] RP FUNCTION. RX PubMed=19509342; DOI=10.1073/pnas.0812037106; RA Lei K., Zhang X., Ding X., Guo X., Chen M., Zhu B., Xu T., Zhuang Y., RA Xu R., Han M.; RT "SUN1 and SUN2 play critical but partially redundant roles in RT anchoring nuclei in skeletal muscle cells in mice."; RL Proc. Natl. Acad. Sci. U.S.A. 106:10207-10212(2009). RN [12] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-12; SER-39; SER-55; RP THR-117; SER-120; SER-123 AND SER-147, AND IDENTIFICATION BY MASS RP SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Brain, Brown adipose tissue, Heart, Kidney, Liver, Lung, RC Spleen, and Testis; RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001; RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R., RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.; RT "A tissue-specific atlas of mouse protein phosphorylation and RT expression."; RL Cell 143:1174-1189(2010). RN [13] RP SUBCELLULAR LOCATION, INTERACTION WITH EMD AND LMNA, DOMAIN, AND RP ASSOCIATION WITH THE NUCLEOSKELETON. RX PubMed=19933576; DOI=10.1074/jbc.M109.071910; RA Haque F., Mazzeo D., Patel J.T., Smallwood D.T., Ellis J.A., RA Shanahan C.M., Shackleton S.; RT "Mammalian SUN protein interaction networks at the inner nuclear RT membrane and their role in laminopathy disease processes."; RL J. Biol. Chem. 285:3487-3498(2010). RN [14] RP FUNCTION. RX PubMed=20724637; DOI=10.1126/science.1189072; RA Luxton G.W., Gomes E.R., Folker E.S., Vintinner E., Gundersen G.G.; RT "Linear arrays of nuclear envelope proteins harness retrograde actin RT flow for nuclear movement."; RL Science 329:956-959(2010). CC -!- FUNCTION: Component of SUN-protein-containing multivariate CC complexes also called LINC complexes which link the nucleoskeleton CC and cytoskeleton by providing versatile outer nuclear membrane CC attachment sites for cytoskeletal filaments. Specifically, Syne2 CC and Sun2 assemble in arrays of transmembrane actin-associated CC nuclear (TAN) lines which are bound to F-actin cables and couple CC the nucleus to retrograde actin flow during actin-dependent CC nuclear movement. Required for interkinetic nuclear migration CC (INM) and essential for nucleokinesis and centrosome-nucleus CC coupling during radial neuronal migration in the cerebral cortex CC and during glial migration. Anchors chromosome movement in the CC prophase of meiosis and is involved in selective gene expression CC of coding and non-coding RNAs needed for gametogenesis. Required CC for telomere attachment to nuclear envelope and gametogenesis. May CC also function on endocytic vesicles as a receptor for Rab5-GDP and CC participate in the activation of Rab5. CC {ECO:0000269|PubMed:16380439, ECO:0000269|PubMed:19509342, CC ECO:0000269|PubMed:19843581, ECO:0000269|PubMed:19874786, CC ECO:0000269|PubMed:20724637}. CC -!- SUBUNIT: Core component of the LINC complex which is composed of CC inner nuclear membrane SUN domain-containing proteins coupled to CC outer nuclear membrane KASH domain-containing nesprins. SUN CC domain-containing proteins interact with A-type lamins of the CC nuclear lamina, while at the other end of the complex, nesprins CC interact with unique cytoskeletal components. Interacts with A- CC type lamin. Interaction with lamins B1 and C is hardly detectable. CC Interacts with SYNE1, SYNE2 and SYNE3. Interacts with EMD. CC Interacts with RAB5A. Interacts with TMEM43 (By similarity). CC {ECO:0000250}. CC -!- SUBCELLULAR LOCATION: Nucleus inner membrane; Single-pass type II CC membrane protein. Endosome membrane {ECO:0000250}; Single-pass CC type II membrane protein {ECO:0000250}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=3; CC Name=1; CC IsoId=Q8BJS4-1; Sequence=Displayed; CC Note=No experimental confirmation available.; CC Name=2; CC IsoId=Q8BJS4-2; Sequence=VSP_039554; CC Name=3; CC IsoId=Q8BJS4-3; Sequence=VSP_039553; CC Note=No experimental confirmation available.; CC -!- TISSUE SPECIFICITY: Highly expressed in heart, placenta and CC muscle. {ECO:0000269|PubMed:12393179}. CC -!- DOMAIN: The SUN domain may play a role in the nuclear anchoring CC and/or migration. {ECO:0000269|PubMed:19933576}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AY682987; AAT90499.1; -; mRNA. DR EMBL; AK128958; BAC87662.1; -; mRNA. DR EMBL; AK080116; BAC37829.1; -; mRNA. DR EMBL; AK156246; BAE33640.1; -; mRNA. DR EMBL; AK171058; BAE42217.1; -; mRNA. DR EMBL; CH466550; EDL04635.1; -; Genomic_DNA. DR CCDS; CCDS27649.1; -. [Q8BJS4-3] DR CCDS; CCDS56992.1; -. [Q8BJS4-2] DR CCDS; CCDS56993.1; -. [Q8BJS4-1] DR RefSeq; NP_001192274.1; NM_001205345.1. [Q8BJS4-1] DR RefSeq; NP_001192275.1; NM_001205346.1. [Q8BJS4-2] DR RefSeq; NP_919323.2; NM_194342.3. [Q8BJS4-3] DR UniGene; Mm.202715; -. DR ProteinModelPortal; Q8BJS4; -. DR SMR; Q8BJS4; 536-731. DR BioGrid; 230179; 3. DR IntAct; Q8BJS4; 1. DR STRING; 10090.ENSMUSP00000047864; -. DR PhosphoSite; Q8BJS4; -. DR MaxQB; Q8BJS4; -. DR PaxDb; Q8BJS4; -. DR PRIDE; Q8BJS4; -. DR Ensembl; ENSMUST00000046259; ENSMUSP00000047864; ENSMUSG00000042524. [Q8BJS4-1] DR Ensembl; ENSMUST00000089311; ENSMUSP00000086724; ENSMUSG00000042524. [Q8BJS4-3] DR Ensembl; ENSMUST00000100439; ENSMUSP00000098006; ENSMUSG00000042524. [Q8BJS4-2] DR GeneID; 223697; -. DR KEGG; mmu:223697; -. DR UCSC; uc007wuh.3; mouse. [Q8BJS4-1] DR UCSC; uc007wui.3; mouse. [Q8BJS4-2] DR UCSC; uc011zwc.2; mouse. [Q8BJS4-3] DR CTD; 25777; -. DR MGI; MGI:2443011; Sun2. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000253025; -. DR HOVERGEN; HBG056957; -. DR InParanoid; Q8BJS4; -. DR KO; K19347; -. DR OMA; EHQQDSE; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; Q8BJS4; -. DR TreeFam; TF323915; -. DR Reactome; R-MMU-1221632; Meiotic synapsis. DR Reactome; R-MMU-1221633; Meiotic Synapsis. DR NextBio; 376830; -. DR PRO; PR:Q8BJS4; -. DR Proteomes; UP000000589; Chromosome 15. DR Bgee; Q8BJS4; -. DR CleanEx; MM_UNC84B; -. DR ExpressionAtlas; Q8BJS4; baseline and differential. DR Genevisible; Q8BJS4; MM. DR GO; GO:0000794; C:condensed nuclear chromosome; IDA:MGI. DR GO; GO:0010008; C:endosome membrane; IEA:UniProtKB-SubCell. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0034993; C:LINC complex; ISO:MGI. DR GO; GO:0000784; C:nuclear chromosome, telomeric region; IDA:MGI. DR GO; GO:0005635; C:nuclear envelope; IDA:MGI. DR GO; GO:0005637; C:nuclear inner membrane; IDA:UniProtKB. DR GO; GO:0031965; C:nuclear membrane; ISO:MGI. DR GO; GO:0042802; F:identical protein binding; ISO:MGI. DR GO; GO:0005521; F:lamin binding; ISO:MGI. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0051642; P:centrosome localization; IMP:UniProtKB. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; ISO:MGI. DR GO; GO:0006998; P:nuclear envelope organization; ISO:MGI. DR GO; GO:0090292; P:nuclear matrix anchoring at nuclear membrane; ISO:MGI. DR GO; GO:0031022; P:nuclear migration along microfilament; IMP:UniProtKB. DR GO; GO:0030335; P:positive regulation of cell migration; IMP:UniProtKB. DR InterPro; IPR030272; SUN2. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF22; PTHR12911:SF22; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Alternative splicing; Coiled coil; Complete proteome; Endosome; KW Glycoprotein; Membrane; Nucleus; Phosphoprotein; Reference proteome; KW Signal-anchor; Transmembrane; Transmembrane helix. FT CHAIN 1 731 SUN domain-containing protein 2. FT /FTId=PRO_0000218914. FT TOPO_DOM 1 226 Nuclear. {ECO:0000250}. FT TRANSMEM 227 247 Helical. {ECO:0000255}. FT TOPO_DOM 248 731 Perinuclear space. FT DOMAIN 569 730 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT REGION 1 128 LMNA-binding. FT COILED 396 452 {ECO:0000255}. FT COILED 486 519 {ECO:0000255}. FT MOD_RES 12 12 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 39 39 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 55 55 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 117 117 Phosphothreonine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 120 120 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 123 123 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT MOD_RES 147 147 Phosphoserine. FT {ECO:0000244|PubMed:21183079}. FT CARBOHYD 650 650 N-linked (GlcNAc...). FT {ECO:0000269|PubMed:19656770}. FT VAR_SEQ 154 185 Missing (in isoform 3). FT {ECO:0000303|Ref.2}. FT /FTId=VSP_039553. FT VAR_SEQ 217 218 Missing (in isoform 2). FT {ECO:0000303|PubMed:16141072, FT ECO:0000303|PubMed:17132086}. FT /FTId=VSP_039554. FT CONFLICT 106 106 S -> G (in Ref. 3; BAE42217). FT {ECO:0000305}. FT CONFLICT 412 412 M -> V (in Ref. 2; BAC87662). FT {ECO:0000305}. FT CONFLICT 451 451 D -> Y (in Ref. 2; BAC87662). FT {ECO:0000305}. FT CONFLICT 579 579 E -> K (in Ref. 3; BAE42217). FT {ECO:0000305}. SQ SEQUENCE 731 AA; 81605 MW; 67830D40C33E3DBA CRC64; MSRRSQRLTR YSQDDNDGGS SSSGASSVAG SQGTVFKDSP LRTLKRKSSN MKHLSPAPQL GPSSDSHTSY YSESVVRESY IGSPRAVSLA RSALLDDHLH SEPYWSGDLR GRRRRGTGGS ESSKANGLTA ESKASEDFFG SSSGYSSEDD LAGYTDSDQH SSGSRLRSAA SRAGSFVWTL VTFPGRLFGL LYWWIGTTWY RLTTAASLLD VFVLTRSRHF SLNLKSFLWF LLLLLLLTGL TYGAWHFYPL GLQTLQPAVV SWWAAKESRK QPEVWESRDA SQHFQAEQRV LSRVHSLERR LEALAADFSS NWQKEAIRLE RLELRQGAAG HGGGSSLSHE DALSLLEGLV SRREATLKED LRRDTVAHIQ EELATLRAEH HQDSEDLFKK IVQASQESEA RVQQLKTEWK SMTQEAFQES SVKELGRLEA QLASLRQELA ALTLKQNSVA DEVGLLPQKI QAARADVESQ FPDWIRQFLL GDRGARSGLL QRDEMHAQLQ ELENKILTKM AEMQGKSARE AAASLGQILQ KEGIVGVTEE QVHRIVKQAL QRYSEDRIGM VDYALESGGA SVISTRCSET YETKTALLSL FGIPLWYHSQ SPRVILQPDV HPGNCWAFQG PQGFAVVRLS ARIRPTAVTL EHVPKALSPN STISSAPKDF AIFGFDEDLQ QEGTLLGTFA YDQDGEPIQT FYFQASKMAT YQVVELRILT NWGHPEYTCI YRFRVHGEPA H // ID SUN3_BOVIN Reviewed; 360 AA. AC Q0II64; DT 04-DEC-2007, integrated into UniProtKB/Swiss-Prot. DT 04-DEC-2007, sequence version 2. DT 11-NOV-2015, entry version 46. DE RecName: Full=SUN domain-containing protein 3; DE AltName: Full=Sad1/unc-84 domain-containing protein 1; GN Name=SUN3; Synonyms=SUNC1; OS Bos taurus (Bovine). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; OC Pecora; Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC STRAIN=Crossbred X Angus; TISSUE=Liver; RG NIH - Mammalian Gene Collection (MGC) project; RL Submitted (AUG-2006) to the EMBL/GenBank/DDBJ databases. CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000305}; Single-pass membrane CC protein {ECO:0000305}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC -!- SEQUENCE CAUTION: CC Sequence=AAI22786.1; Type=Erroneous termination; Positions=249; Note=Translated as Gln.; Evidence={ECO:0000305}; CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC122785; AAI22786.1; ALT_SEQ; mRNA. DR RefSeq; NP_001069552.2; NM_001076084.2. DR UniGene; Bt.64065; -. DR STRING; 9913.ENSBTAP00000010721; -. DR PaxDb; Q0II64; -. DR PRIDE; Q0II64; -. DR GeneID; 537130; -. DR KEGG; bta:537130; -. DR CTD; 256979; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR HOGENOM; HOG000007503; -. DR HOVERGEN; HBG108520; -. DR InParanoid; Q0II64; -. DR NextBio; 20877071; -. DR Proteomes; UP000009136; Unplaced. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil; Complete proteome; Membrane; Reference proteome; KW Transmembrane; Transmembrane helix. FT CHAIN 1 360 SUN domain-containing protein 3. FT /FTId=PRO_0000312219. FT TRANSMEM 48 67 Helical. {ECO:0000255}. FT DOMAIN 196 357 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 103 142 {ECO:0000255}. FT COMPBIAS 56 61 Poly-Leu. SQ SEQUENCE 360 AA; 40619 MW; BC79CF7E40FAA8E3 CRC64; MSGRPNSRGS SRLFRAPSED ASSGSSGSAV LPQEENPNAS GLTRSWKAVM GMVFILTLLL LGFINHMKLK EKAFPQKSRQ IYAVIAEYGS RLYNYQARLR MPKEQLELLK KESQTLENNF REILFLIEQI DVLKALLRDM QDGLHNYSWN ADIDPAEGWN HTEVIDEEMS NLVNYILKKL REDQVQMADY ALKSAGASVV EAGTSESYKN NKAKLYWHGI GFLNYEMPPD IILQPDVHPG KCWAFPGSQG HALIKLARKI IPTAVTMEHI SEKVSPSGNI SSAPKEFSVY GVLKQCEGEE IFLGQFVYNK TGTTVQTFAL QHEVPEFLLC VKLKILSNWG HPNYTCLYRF RVHGTPKDDS // ID SUN3_HUMAN Reviewed; 357 AA. AC Q8TAQ9; A4D2F3; B4DXK1; D3DVM3; E7EWC8; Q4F965; Q7Z4U8; DT 04-DEC-2007, integrated into UniProtKB/Swiss-Prot. DT 04-DEC-2007, sequence version 4. DT 11-NOV-2015, entry version 109. DE RecName: Full=SUN domain-containing protein 3; DE AltName: Full=Sad1/unc-84 domain-containing protein 1; GN Name=SUN3; Synonyms=SUNC1; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). RA Guo J.H., She X.Y., Dai F.Y., Yu L.; RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases. RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3). RC TISSUE=Testis; RX PubMed=14702039; DOI=10.1038/ng1285; RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., RA Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., RA Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., RA Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., RA Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., RA Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., RA Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., RA Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., RA Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., RA Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., RA Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., RA Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., RA Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., RA Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.; RT "Complete sequencing and characterization of 21,243 full-length human RT cDNAs."; RL Nat. Genet. 36:40-45(2004). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). RA Li H., Nong W., Zhou G., Ke R., Shen C., Zhong G., Zheng Z., Liang M., RA Li M., Lin L., Yang S.; RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases. RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=12853948; DOI=10.1038/nature01782; RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A., Pepin K.H., RA Wagner-McPherson C., Layman D., Maas J., Jaeger S., Walker R., RA Wylie K., Sekhon M., Becker M.C., O'Laughlin M.D., Schaller M.E., RA Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., RA Sun H., Edwards J., Bradshaw-Cordum H., Ali J., Andrews S., Isak A., RA Vanbrunt A., Nguyen C., Du F., Lamar B., Courtney L., Kalicki J., RA Ozersky P., Bielicki L., Scott K., Holmes A., Harkins R., Harris A., RA Strong C.M., Hou S., Tomlinson C., Dauphin-Kohlberg S., RA Kozlowicz-Reilly A., Leonard S., Rohlfing T., Rock S.M., RA Tin-Wollam A.-M., Abbott A., Minx P., Maupin R., Strowmatt C., RA Latreille P., Miller N., Johnson D., Murray J., Woessner J.P., RA Wendl M.C., Yang S.-P., Schultz B.R., Wallis J.W., Spieth J., RA Bieri T.A., Nelson J.O., Berkowicz N., Wohldmann P.E., Cook L.L., RA Hickenbotham M.T., Eldred J., Williams D., Bedell J.A., Mardis E.R., RA Clifton S.W., Chissoe S.L., Marra M.A., Raymond C., Haugen E., RA Gillett W., Zhou Y., James R., Phelps K., Iadanoto S., Bubb K., RA Simms E., Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., RA Baertsch R.A., Brent M.R., Keibler E., Flicek P., Bork P., Suyama M., RA Bailey J.A., Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., RA Eddy S.R., McPherson J.D., Olson M.V., Eichler E.E., Green E.D., RA Waterston R.H., Wilson R.K.; RT "The DNA sequence of human chromosome 7."; RL Nature 424:157-164(2003). RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., RA Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., RA Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., RA Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R., RA Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., RA Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., RA Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., RA Venter J.C.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. RN [6] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), AND VARIANT RP VAL-127. RC TISSUE=Brain; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000305}; Single-pass membrane CC protein {ECO:0000305}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=3; CC Name=1; CC IsoId=Q8TAQ9-1; Sequence=Displayed; CC Name=2; CC IsoId=Q8TAQ9-2; Sequence=VSP_029748, VSP_029749; CC Name=3; CC IsoId=Q8TAQ9-3; Sequence=VSP_055624; CC Note=No experimental confirmation available.; CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC -!- SEQUENCE CAUTION: CC Sequence=AAP97300.1; Type=Frameshift; Positions=233, 242; Evidence={ECO:0000305}; CC Sequence=EAL23808.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305}; CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AF429967; AAP97300.1; ALT_FRAME; mRNA. DR EMBL; AK302011; BAG63413.1; -; mRNA. DR EMBL; DQ099386; AAZ13762.1; -; mRNA. DR EMBL; AC069279; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CH236958; EAL23808.1; ALT_SEQ; Genomic_DNA. DR EMBL; CH471128; EAW60999.1; -; Genomic_DNA. DR EMBL; CH471128; EAW61000.1; -; Genomic_DNA. DR EMBL; BC026189; AAH26189.3; -; mRNA. DR CCDS; CCDS34636.1; -. [Q8TAQ9-1] DR CCDS; CCDS64647.1; -. [Q8TAQ9-3] DR RefSeq; NP_001025190.1; NM_001030019.1. [Q8TAQ9-1] DR RefSeq; NP_001271279.1; NM_001284350.1. [Q8TAQ9-3] DR RefSeq; NP_689995.3; NM_152782.3. [Q8TAQ9-1] DR UniGene; Hs.406741; -. DR ProteinModelPortal; Q8TAQ9; -. DR SMR; Q8TAQ9; 164-354. DR STRING; 9606.ENSP00000297325; -. DR PhosphoSite; Q8TAQ9; -. DR BioMuta; SUN3; -. DR DMDM; 162416243; -. DR PaxDb; Q8TAQ9; -. DR PRIDE; Q8TAQ9; -. DR Ensembl; ENST00000297325; ENSP00000297325; ENSG00000164744. [Q8TAQ9-1] DR Ensembl; ENST00000395572; ENSP00000378939; ENSG00000164744. [Q8TAQ9-1] DR Ensembl; ENST00000412142; ENSP00000410204; ENSG00000164744. [Q8TAQ9-3] DR Ensembl; ENST00000438771; ENSP00000409077; ENSG00000164744. [Q8TAQ9-2] DR GeneID; 256979; -. DR KEGG; hsa:256979; -. DR UCSC; uc003tof.3; human. [Q8TAQ9-1] DR UCSC; uc010kyq.3; human. [Q8TAQ9-2] DR UCSC; uc011kcf.2; human. DR CTD; 256979; -. DR GeneCards; SUN3; -. DR HGNC; HGNC:22429; SUN3. DR HPA; HPA008344; -. DR neXtProt; NX_Q8TAQ9; -. DR PharmGKB; PA165618375; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000007503; -. DR HOVERGEN; HBG108520; -. DR InParanoid; Q8TAQ9; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; Q8TAQ9; -. DR TreeFam; TF323915; -. DR GenomeRNAi; 256979; -. DR NextBio; 35475989; -. DR PRO; PR:Q8TAQ9; -. DR Proteomes; UP000005640; Chromosome 7. DR Bgee; Q8TAQ9; -. DR CleanEx; HS_SUNC1; -. DR ExpressionAtlas; Q8TAQ9; baseline and differential. DR Genevisible; Q8TAQ9; HS. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0034993; C:LINC complex; IEA:Ensembl. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Alternative splicing; Coiled coil; Complete proteome; Membrane; KW Polymorphism; Reference proteome; Transmembrane; Transmembrane helix. FT CHAIN 1 357 SUN domain-containing protein 3. FT /FTId=PRO_0000312220. FT TRANSMEM 48 64 Helical. {ECO:0000255}. FT DOMAIN 193 354 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 98 146 {ECO:0000255}. FT VAR_SEQ 1 100 Missing (in isoform 2). FT {ECO:0000303|Ref.3}. FT /FTId=VSP_029748. FT VAR_SEQ 2 61 SGKTKARRAAMFFRRCSEDASGSASGNALLSEDENPDANGV FT TRSWKIILSTMLTLTFLLV -> EDYSKYNAYTDFSSCRLE FT CSGAILAHCNLHLLGSSISPASASRVAGTT (in FT isoform 3). FT {ECO:0000303|PubMed:14702039}. FT /FTId=VSP_055624. FT VAR_SEQ 287 287 Y -> YVQRYMCRFVIQA (in isoform 2). FT {ECO:0000303|Ref.3}. FT /FTId=VSP_029749. FT VARIANT 127 127 I -> V (in dbSNP:rs17852360). FT {ECO:0000269|PubMed:15489334}. FT /FTId=VAR_037458. FT VARIANT 177 177 L -> V (in dbSNP:rs7797657). FT /FTId=VAR_037459. FT CONFLICT 231 232 Missing (in Ref. 1; AAP97300). FT {ECO:0000305}. FT CONFLICT 292 292 K -> R (in Ref. 2; BAG63413). FT {ECO:0000305}. SQ SEQUENCE 357 AA; 40503 MW; 5E63D57F1806753B CRC64; MSGKTKARRA AMFFRRCSED ASGSASGNAL LSEDENPDAN GVTRSWKIIL STMLTLTFLL VGLLNHQWLK ETDVPQKSRQ LYAIIAEYGS RLYKYQARLR MPKEQLELLK KESQNLENNF RQILFLIEQI DVLKALLRDM KDGMDNNHNW NTHGDPVEDP DHTEEVSNLV NYVLKKLRED QVEMADYALK SAGASIIEAG TSESYKNNKA KLYWHGIGFL NHEMPPDIIL QPDVYPGKCW AFPGSQGHTL IKLATKIIPT AVTMEHISEK VSPSGNISSA PKEFSVYGIT KKCEGEEIFL GQFIYNKTGT TVQTFELQHA VSEYLLCVKL NIFSNWGHPK YTCLYRFRVH GTPGKHI // ID SUN3_MACFA Reviewed; 261 AA. AC Q95LV7; DT 04-DEC-2007, integrated into UniProtKB/Swiss-Prot. DT 01-DEC-2001, sequence version 1. DT 11-NOV-2015, entry version 35. DE RecName: Full=SUN domain-containing protein 3; DE AltName: Full=Sad1/unc-84 domain-containing protein 1; GN Name=SUN3; Synonyms=SUNC1; ORFNames=QtsA-17495; OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. OX NCBI_TaxID=9541; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Testis; RX PubMed=12498619; DOI=10.1186/1471-2164-3-36; RA Osada N., Hida M., Kusuda J., Tanuma R., Hirata M., Suto Y., Hirai M., RA Terao K., Sugano S., Hashimoto K.; RT "Cynomolgus monkey testicular cDNAs for discovery of novel human genes RT in the human genome sequence."; RL BMC Genomics 3:36-36(2002). CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AB071084; BAB64478.1; -; mRNA. DR UniGene; Mfa.6592; -. DR HOVERGEN; HBG108520; -. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil. FT CHAIN 1 261 SUN domain-containing protein 3. FT /FTId=PRO_0000312221. FT DOMAIN 97 258 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 3 29 {ECO:0000255}. SQ SEQUENCE 261 AA; 29588 MW; 0AD69537A1C896A0 CRC64; MPKEQLELLK KESQTLENNF HKILLLIEQI DVLKALLRDM KDGTDNNHSW NTHGDPVEDP DHTEVLDEEM SNLVNYVLKK LREDQVQMAD YALKSAGASI IEAGTSESYK NNKAKLYWHG ISFLNHEMPP DIILQPDVYP GNCWAFPGSQ GHTLIKLATK IIPTAVTMEH ISEKVSPSGN ISSAPKEFSV YGITKKCEGE EIFLGQFIYN KTGTTVQTFE LQHAVSEYLL CVKLNIFSNW GHPKYTCLYR FRVHGTPGKH I // ID SUN3_MOUSE Reviewed; 320 AA. AC Q5SS91; Q8BHY0; DT 04-DEC-2007, integrated into UniProtKB/Swiss-Prot. DT 21-DEC-2004, sequence version 1. DT 11-NOV-2015, entry version 85. DE RecName: Full=SUN domain-containing protein 3; DE AltName: Full=Sad1/unc-84 domain-containing protein 1; GN Name=Sun3; Synonyms=Sunc1; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2). RC STRAIN=C57BL/6J; TISSUE=Kidney, and Testis; RX PubMed=16141072; DOI=10.1126/science.1112014; RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E., RA Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., RA Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., RA Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., RA di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., RA Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., RA Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., RA Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., RA Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., RA Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., RA Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., RA Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., RA Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., RA Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., RA Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., RA Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., RA Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., RA Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., RA Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., RA Hayashizaki Y.; RT "The transcriptional landscape of the mammalian genome."; RL Science 309:1559-1563(2005). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=C57BL/6J; RX PubMed=19468303; DOI=10.1371/journal.pbio.1000112; RA Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., RA She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., RA Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., RA Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., RA Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., RA Lindblad-Toh K., Eichler E.E., Ponting C.P.; RT "Lineage-specific biology revealed by a finished genome assembly of RT the mouse."; RL PLoS Biol. 7:E1000112-E1000112(2009). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [4] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Testis; RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001; RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R., RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.; RT "A tissue-specific atlas of mouse protein phosphorylation and RT expression."; RL Cell 143:1174-1189(2010). CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000305}; Single-pass membrane CC protein {ECO:0000305}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=1; CC IsoId=Q5SS91-1; Sequence=Displayed; CC Name=2; CC IsoId=Q5SS91-2; Sequence=VSP_029750; CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AK052771; BAC35140.1; -; mRNA. DR EMBL; AK132922; BAE21423.1; -; mRNA. DR EMBL; AL669837; CAI24620.1; -; Genomic_DNA. DR EMBL; AL669837; CAI24621.1; -; Genomic_DNA. DR EMBL; BC109334; AAI09335.1; -; mRNA. DR CCDS; CCDS24430.1; -. [Q5SS91-2] DR CCDS; CCDS70140.1; -. [Q5SS91-1] DR RefSeq; NP_001277448.1; NM_001290519.1. [Q5SS91-1] DR RefSeq; NP_001277449.1; NM_001290520.1. [Q5SS91-2] DR RefSeq; NP_808244.1; NM_177576.3. [Q5SS91-2] DR UniGene; Mm.79210; -. DR ProteinModelPortal; Q5SS91; -. DR SMR; Q5SS91; 124-317. DR STRING; 10090.ENSMUSP00000099973; -. DR PhosphoSite; Q5SS91; -. DR PaxDb; Q5SS91; -. DR PRIDE; Q5SS91; -. DR Ensembl; ENSMUST00000043377; ENSMUSP00000045199; ENSMUSG00000040985. [Q5SS91-1] DR Ensembl; ENSMUST00000102909; ENSMUSP00000099973; ENSMUSG00000040985. [Q5SS91-2] DR GeneID; 194974; -. DR KEGG; mmu:194974; -. DR UCSC; uc007hzt.1; mouse. [Q5SS91-1] DR CTD; 256979; -. DR MGI; MGI:3041199; Sun3. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000007503; -. DR HOVERGEN; HBG108520; -. DR InParanoid; Q5SS91; -. DR OMA; CVKLNIF; -. DR OrthoDB; EOG7J446H; -. DR PhylomeDB; Q5SS91; -. DR TreeFam; TF323915; -. DR NextBio; 371685; -. DR PRO; PR:Q5SS91; -. DR Proteomes; UP000000589; Chromosome 11. DR Bgee; Q5SS91; -. DR CleanEx; MM_SUNC1; -. DR ExpressionAtlas; Q5SS91; baseline and differential. DR Genevisible; Q5SS91; MM. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0034993; C:LINC complex; IDA:MGI. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR InterPro; IPR030274; SUN3. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF24; PTHR12911:SF24; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Alternative splicing; Coiled coil; Complete proteome; Membrane; KW Reference proteome; Transmembrane; Transmembrane helix. FT CHAIN 1 320 SUN domain-containing protein 3. FT /FTId=PRO_0000312222. FT TRANSMEM 7 29 Helical. {ECO:0000255}. FT DOMAIN 156 317 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 63 102 {ECO:0000255}. FT VAR_SEQ 1 60 Missing (in isoform 2). FT {ECO:0000303|PubMed:15489334, FT ECO:0000303|PubMed:16141072}. FT /FTId=VSP_029750. SQ SEQUENCE 320 AA; 36758 MW; 8DEC1E6B0D0BC1FB CRC64; MLTRSWKIIL STVFISTFLL VGLLNHQWLK ETEFPQKPRQ LYTVIAEYGS RLYNYQARLR MPKEQQELLK KESQTLENNF REILFLIEQI DVLKALLKDM KDGVHNHSLP VHRDAVQDQA TTDVLDEEMS NLVHYVLKKF RGDQIQLADY ALKSAGASVI EAGTSESYKN NKAKLYWHGI GFLNYEMPPD MILQPDVHPG KCWAFPGSQG HILIKLARKI IPTAVTMEHI SEKVSPSGNI SSAPKEFSVY GVMKKCEGEE IFLGQFIYNK MEATIQTFEL QNEASESLLC VKLQILSNWG HPKYTCLYRF RVHGIPSDYT // ID SUN5_HUMAN Reviewed; 379 AA. AC Q8TC36; A6NJ82; Q5T9R0; DT 17-JAN-2003, integrated into UniProtKB/Swiss-Prot. DT 01-JUN-2002, sequence version 1. DT 11-NOV-2015, entry version 102. DE RecName: Full=SUN domain-containing protein 5; DE AltName: Full=Sad1 and UNC84 domain-containing protein 5; DE AltName: Full=Sperm-associated antigen 4-like protein; DE AltName: Full=Testis and spermatogenesis-related gene 4 protein; GN Name=SUN5; Synonyms=SPAG4L, TSARG4; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA], AND TISSUE SPECIFICITY. RX PubMed=12621555; RA Xing X.W., Li L.Y., Fu J.J., Zhu W.B., Liu G., Liu S.F., Lu G.X.; RT "Cloning of cDNA of TSARG4, a human spermatogenesis related gene."; RL Sheng Wu Hua Xue Yu Sheng Wu Wu Li Xue Bao 35:283-288(2003). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=11780052; DOI=10.1038/414865a; RA Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., RA Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., RA Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., RA Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M., Brown A.J., RA Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., RA Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., RA Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., RA Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., RA Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., RA Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., RA Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., RA Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., RA Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., RA Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., RA Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., RA Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., RA Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., RA Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., RA Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., RA Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., RA Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., RA Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., RA Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., RA Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., RA Rogers J.; RT "The DNA sequence and comparative analysis of human chromosome 20."; RL Nature 414:865-871(2001). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., RA Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., RA Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., RA Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R., RA Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., RA Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., RA Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., RA Venter J.C.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Testis; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). CC -!- SUBCELLULAR LOCATION: Nucleus inner membrane {ECO:0000250}; CC Single-pass membrane protein {ECO:0000250}. Note=Restricted to the CC apical nuclear region of round spermatids that face the acrosomic CC vesicle. {ECO:0000250}. CC -!- TISSUE SPECIFICITY: Widely expressed. CC {ECO:0000269|PubMed:12621555}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AF401350; AAM90665.1; -; mRNA. DR EMBL; AL139826; CAI16148.1; -; Genomic_DNA. DR EMBL; AL121756; CAI16148.1; JOINED; Genomic_DNA. DR EMBL; AL121756; CAI22528.1; -; Genomic_DNA. DR EMBL; AL139826; CAI22528.1; JOINED; Genomic_DNA. DR EMBL; CH471077; EAW76343.1; -; Genomic_DNA. DR EMBL; BC026118; AAH26118.1; -; mRNA. DR EMBL; BC029528; AAH29528.1; -; mRNA. DR CCDS; CCDS13209.1; -. DR RefSeq; NP_542406.2; NM_080675.3. DR UniGene; Hs.375186; -. DR ProteinModelPortal; Q8TC36; -. DR SMR; Q8TC36; 174-364. DR BioGrid; 126678; 2. DR STRING; 9606.ENSP00000348496; -. DR PhosphoSite; Q8TC36; -. DR BioMuta; SUN5; -. DR DMDM; 27805720; -. DR PaxDb; Q8TC36; -. DR PRIDE; Q8TC36; -. DR Ensembl; ENST00000356173; ENSP00000348496; ENSG00000167098. DR GeneID; 140732; -. DR KEGG; hsa:140732; -. DR UCSC; uc002wyi.3; human. DR CTD; 140732; -. DR GeneCards; SUN5; -. DR HGNC; HGNC:16252; SUN5. DR HPA; HPA048529; -. DR MIM; 613942; gene. DR neXtProt; NX_Q8TC36; -. DR PharmGKB; PA38095; -. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000007503; -. DR HOVERGEN; HBG055206; -. DR InParanoid; Q8TC36; -. DR OMA; GNPRFTC; -. DR PhylomeDB; Q8TC36; -. DR TreeFam; TF323915; -. DR GeneWiki; SPAG4L; -. DR GenomeRNAi; 140732; -. DR NextBio; 84315; -. DR PRO; PR:Q8TC36; -. DR Proteomes; UP000005640; Chromosome 20. DR Bgee; Q8TC36; -. DR CleanEx; HS_SPAG4L; -. DR ExpressionAtlas; Q8TC36; baseline and differential. DR Genevisible; Q8TC36; HS. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:UniProtKB-SubCell. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; IEP:UniProtKB. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 2: Evidence at transcript level; KW Coiled coil; Complete proteome; Membrane; Nucleus; Polymorphism; KW Reference proteome; Transmembrane; Transmembrane helix. FT CHAIN 1 379 SUN domain-containing protein 5. FT /FTId=PRO_0000218919. FT TOPO_DOM 1 105 Nuclear. {ECO:0000255}. FT TRANSMEM 106 122 Helical. {ECO:0000255}. FT TOPO_DOM 123 379 Perinuclear space. {ECO:0000255}. FT DOMAIN 205 364 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 141 182 {ECO:0000255}. FT VARIANT 16 16 E -> K (in dbSNP:rs3746387). FT /FTId=VAR_015147. FT VARIANT 39 39 E -> D (in dbSNP:rs1133358). FT /FTId=VAR_026677. FT VARIANT 120 120 I -> V (in dbSNP:rs35216976). FT /FTId=VAR_052285. FT VARIANT 174 174 A -> T (in dbSNP:rs17123951). FT /FTId=VAR_026678. SQ SEQUENCE 379 AA; 43081 MW; 0FAE87B1CC1DBCDF CRC64; MPRSSRSPGD PGALLEDVAH NPRPRRIAQR GRNTSRMAED TSPNMNDNIL LPVRNNDQAL GLTQCMLGCV SWFTCFACSL RTQAQQVLFN TCRCKLLCQK LMEKTGILLL CAFGFWMFSI HLPSKMKVWQ DDSINGPLQS LRLYQEKVRH HSGEIQDLRG SMNQLIAKLQ EMEAMSDEQK MAQKIMKMIH GDYIEKPDFA LKSIGASIDF EHTSVTYNHE KAHSYWNWIQ LWNYAQPPDV ILEPNVTPGN CWAFEGDRGQ VTIQLAQKVY LSNLTLQHIP KTISLSGSLD TAPKDFVIYG MEGSPKEEVF LGAFQFQPEN IIQMFPLQNQ PARAFSAVKV KISSNWGNPG FTCLYRVRVH GSVAPPREQP HQNPYPKRD // ID SUN5_MOUSE Reviewed; 373 AA. AC Q9DA32; D2DR64; Q5DT38; DT 17-JAN-2003, integrated into UniProtKB/Swiss-Prot. DT 29-MAY-2013, sequence version 2. DT 11-NOV-2015, entry version 86. DE RecName: Full=SUN domain-containing protein 5; DE AltName: Full=Sperm-associated antigen 4-like protein; GN Name=Sun5; Synonyms=Spag4l; OS Mus musculus (Mouse). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; OC Muroidea; Muridae; Murinae; Mus; Mus. OX NCBI_TaxID=10090; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 2), AND TISSUE SPECIFICITY. RC STRAIN=C57BL/6J; TISSUE=Testis; RX PubMed=15552040; RA Xing X.W., Li L.Y., Liu G., Lu G.X.; RT "Cloning of cDNA of SRG4, a mouse spermatogenesis related gene and RT expression in mouse different developing stages."; RL Yi Chuan Xue Bao 31:1066-1071(2004). RN [2] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), SUBCELLULAR LOCATION, AND RP ALTERNATIVE SPLICING. RX PubMed=21159740; DOI=10.1093/molehr/gaq099; RA Frohnert C., Schweizer S., Hoyer-Fender S.; RT "SPAG4L/SPAG4L-2 are testis-specific SUN domain proteins restricted to RT the apical nuclear envelope of round spermatids facing the acrosome."; RL Mol. Hum. Reprod. 17:207-218(2011). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). RC STRAIN=C57BL/6J; TISSUE=Testis; RX PubMed=16141072; DOI=10.1126/science.1112014; RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E., RA Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., RA Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., RA Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., RA di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., RA Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., RA Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., RA Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., RA Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., RA Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., RA Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., RA Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., RA Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., RA Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., RA Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., RA Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., RA Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., RA Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., RA Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., RA Hayashizaki Y.; RT "The transcriptional landscape of the mammalian genome."; RL Science 309:1559-1563(2005). RN [4] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Testis; RX PubMed=21183079; DOI=10.1016/j.cell.2010.12.001; RA Huttlin E.L., Jedrychowski M.P., Elias J.E., Goswami T., Rad R., RA Beausoleil S.A., Villen J., Haas W., Sowa M.E., Gygi S.P.; RT "A tissue-specific atlas of mouse protein phosphorylation and RT expression."; RL Cell 143:1174-1189(2010). CC -!- SUBCELLULAR LOCATION: Nucleus inner membrane CC {ECO:0000269|PubMed:21159740}; Single-pass membrane protein CC {ECO:0000269|PubMed:21159740}. Note=Isoform 1 is restricted to the CC apical nuclear region of round spermatids that face the acrosomic CC vesicle. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=1; Synonyms=SPAG4L-2; CC IsoId=Q9DA32-1; Sequence=Displayed; CC Name=2; CC IsoId=Q9DA32-2; Sequence=VSP_046528; CC -!- TISSUE SPECIFICITY: Testis-specific, abundantly expressed in CC spermatocytes and round spermatids. {ECO:0000269|PubMed:15552040}. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AY307077; AAP69223.1; -; mRNA. DR EMBL; FJ667498; ACV74249.1; -; mRNA. DR EMBL; AK006225; BAB24468.1; -; mRNA. DR RefSeq; NP_001291977.1; NM_001305048.1. DR RefSeq; NP_083875.1; NM_029599.2. DR UniGene; Mm.33629; -. DR ProteinModelPortal; Q9DA32; -. DR SMR; Q9DA32; 172-359. DR PhosphoSite; Q9DA32; -. DR MaxQB; Q9DA32; -. DR PRIDE; Q9DA32; -. DR DNASU; 76407; -. DR GeneID; 76407; -. DR KEGG; mmu:76407; -. DR UCSC; uc008nim.2; mouse. [Q9DA32-2] DR UCSC; uc012cgr.2; mouse. [Q9DA32-1] DR CTD; 140732; -. DR MGI; MGI:1923657; Sun5. DR HOGENOM; HOG000007503; -. DR HOVERGEN; HBG055206; -. DR InParanoid; Q9DA32; -. DR OrthoDB; EOG7J446H; -. DR TreeFam; TF323915; -. DR NextBio; 345089; -. DR PRO; PR:Q9DA32; -. DR Proteomes; UP000000589; Unplaced. DR Bgee; Q9DA32; -. DR CleanEx; MM_SPAG4L; -. DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW. DR GO; GO:0005635; C:nuclear envelope; IBA:GO_Central. DR GO; GO:0005637; C:nuclear inner membrane; IEA:UniProtKB-SubCell. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0090286; P:cytoskeletal anchoring at nuclear membrane; IBA:GO_Central. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007283; P:spermatogenesis; ISS:UniProtKB. DR InterPro; IPR030273; SUN5. DR InterPro; IPR012919; SUN_dom. DR PANTHER; PTHR12911:SF19; PTHR12911:SF19; 1. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Alternative splicing; Coiled coil; Complete proteome; Membrane; KW Nucleus; Reference proteome; Transmembrane; Transmembrane helix. FT CHAIN 1 373 SUN domain-containing protein 5. FT /FTId=PRO_0000218920. FT TOPO_DOM 1 103 Nuclear. {ECO:0000255}. FT TRANSMEM 104 120 Helical. {ECO:0000255}. FT TOPO_DOM 121 373 Perinuclear space. {ECO:0000255}. FT DOMAIN 204 362 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT COILED 136 180 {ECO:0000255}. FT VAR_SEQ 44 69 Missing (in isoform 2). FT {ECO:0000303|PubMed:15552040, FT ECO:0000303|PubMed:16141072}. FT /FTId=VSP_046528. FT CONFLICT 287 287 P -> L (in Ref. 1; AAP69223 and 3; FT BAB24468). {ECO:0000305}. FT CONFLICT 294 294 I -> F (in Ref. 1; AAP69223 and 3; FT BAB24468). {ECO:0000305}. FT CONFLICT 319 322 VIQM -> IIQT (in Ref. 1; AAP69223 and 3; FT BAB24468). {ECO:0000305}. SQ SEQUENCE 373 AA; 42653 MW; 5082E36162B5C012 CRC64; MPRTRNIGAL CTLPEDTTHS GRPRRGVQRS YISRMAEPAP ANMNDPLLLP LRMNTPGLSL VQILLGYMSW LTYLACFLRT QTQQVFLNTC RCKLFCQKVM EKMGLLVLCV FGFWMFSMHL PSKVEVWQDD SINGPLQSLR MYQEKVRHHT GEIQDLRGSM NQLIAKLQKM EAISDEQKMA QKIMKMIQGD YIEKPDFALK SIGASIDFEH TSATYNHDKA RSYWNWIRLW NYAQPPDVIL EPNVTPGNCW AFASDRGQVT IRLAQKVYLS NITLQHIPKT ISLSGSPDTA PKDIVIYGLE SLPREEVFLG AFQFQPENVI QMFQLQNLPP RSFAAVKVKI SSNWGNPRFT CMYRVRVHGS VTPPKDSHLE PLS // ID UNC84_CAEEL Reviewed; 1111 AA. AC Q20745; Q9U475; Q9U476; DT 28-MAR-2003, integrated into UniProtKB/Swiss-Prot. DT 01-OCT-2001, sequence version 2. DT 11-NOV-2015, entry version 121. DE RecName: Full=Nuclear migration and anchoring protein unc-84; DE AltName: Full=Uncoordinated protein 84; GN Name=unc-84; ORFNames=F54B11.3; OS Caenorhabditis elegans. OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. OX NCBI_TaxID=6239; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS A AND B), FUNCTION, RP CHARACTERIZATION, AND MUTAGENESIS OF PRO-91; ASP-932; ARG-984; SER-988 RP AND GLY-1002. RC STRAIN=Bristol N2; TISSUE=Embryo; RX PubMed=10375507; RA Malone C.J., Fixsen W.D., Horvitz H.R., Han M.; RT "UNC-84 localizes to the nuclear envelope and is required for nuclear RT migration and anchoring during C. elegans development."; RL Development 126:3171-3181(1999). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Bristol N2; RX PubMed=9851916; DOI=10.1126/science.282.5396.2012; RG The C. elegans sequencing consortium; RT "Genome sequence of the nematode C. elegans: a platform for RT investigating biology."; RL Science 282:2012-2018(1998). RN [3] RP INTERACTION WITH UNC-83. RX PubMed=11748140; RA Starr D.A., Hermann G.J., Malone C.J., Fixsen W., Priess J.R., RA Horvitz H.R., Han M.; RT "unc-83 encodes a novel component of the nuclear envelope and is RT essential for proper nuclear migration."; RL Development 128:5039-5050(2001). RN [4] RP SUBCELLULAR LOCATION. RX PubMed=11870211; RA Gruenbaum Y., Lee K.K., Liu J., Cohen M., Wilson K.L.; RT "The expression, lamin-dependent localization and RNAi depletion RT phenotype for emerin in C. elegans."; RL J. Cell Sci. 115:923-929(2002). RN [5] RP INTERACTION WITH ANC-1. RX PubMed=12169658; DOI=10.1126/science.1075119; RA Starr D.A., Han M.; RT "Role of ANC-1 in tethering nuclei to the actin cytoskeleton."; RL Science 298:406-409(2002). RN [6] RP SUBCELLULAR LOCATION, TISSUE SPECIFICITY, AND DEVELOPMENTAL STAGE. RX PubMed=11907270; DOI=10.1091/mbc.01-06-0294; RA Lee K.K., Starr D.A., Cohen M., Liu J., Han M., Wilson K.L., RA Gruenbaum Y.; RT "Lamin-dependent localization of UNC-84, a protein required for RT nuclear migration in Caenorhabditis elegans."; RL Mol. Biol. Cell 13:892-901(2002). RN [7] RP SUBCELLULAR LOCATION, TOPOLOGY, AND INTERACTION WITH UNC-83. RX PubMed=16481402; DOI=10.1091/mbc.E05-09-0894; RA McGee M.D., Rillo R., Anderson A.S., Starr D.A.; RT "UNC-83 IS a KASH protein required for nuclear migration and is RT recruited to the outer nuclear membrane by a physical interaction with RT the SUN protein UNC-84."; RL Mol. Biol. Cell 17:1790-1801(2006). CC -!- FUNCTION: Involved in nuclear migration and anchoring. Not CC required for centrosome attachment to the nucleus. Probably CC anchors the structural protein anc-1 to the nucleus, creating a CC bridge across the nuclear envelope between the cytoskeleton and CC the nucleus. Probably involved in nuclear migration via its CC interaction with unc-83. Recruits both unc-83 and anc-1 to the CC nuclear envelope. Together these proteins may function to bridge CC the two membranes of the nuclear envelope, connecting the nuclear CC matrix to the cytoskeleton. {ECO:0000269|PubMed:10375507}. CC -!- SUBUNIT: Interacts with unc-83 via its unc-84 domain. Interacts CC indirectly with anc-1. Probably interacts with Lamin via its N- CC terminal domain. {ECO:0000269|PubMed:11748140, CC ECO:0000269|PubMed:12169658, ECO:0000269|PubMed:16481402}. CC -!- SUBCELLULAR LOCATION: Nucleus inner membrane; Single-pass type II CC membrane protein. Cytoplasm, cytoskeleton {ECO:0000305}. CC Note=Associated with nuclei during interphase, prophase, CC prometaphase, metaphase and early anaphase. Released from nuclear CC membrane in the same time that the nuclear envelope disassembly, CC during late anaphase, and begins to reaccumulate in early CC telophase. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=a; CC IsoId=Q20745-1; Sequence=Displayed; CC Name=b; CC IsoId=Q20745-2; Sequence=VSP_007081, VSP_007082; CC -!- TISSUE SPECIFICITY: Ubiquitous. {ECO:0000269|PubMed:11907270}. CC -!- DEVELOPMENTAL STAGE: Expressed in all cells of embryos from the CC 26-cell stage. Then, it is ubiquitously expressed throughout the CC development. {ECO:0000269|PubMed:11907270}. CC -!- DOMAIN: The SUN domain probably plays a role in the nuclear CC anchoring and/or migration. Required for the localization at the CC nuclear membrane of unc-83 and anc-1. CC -!- SIMILARITY: Contains 1 SUN domain. {ECO:0000255|PROSITE- CC ProRule:PRU00802}. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AF200706; AAF15883.1; -; mRNA. DR EMBL; AF200707; AAF15884.1; -; mRNA. DR EMBL; Z70208; CAA94142.2; -; Genomic_DNA. DR EMBL; Z70208; CAC42306.1; -; Genomic_DNA. DR PIR; T22608; T22608. DR RefSeq; NP_001024707.1; NM_001029536.2. [Q20745-1] DR RefSeq; NP_001024708.1; NM_001029537.3. [Q20745-2] DR UniGene; Cel.17328; -. DR ProteinModelPortal; Q20745; -. DR SMR; Q20745; 921-1107. DR BioGrid; 46381; 3. DR IntAct; Q20745; 5. DR MINT; MINT-1069689; -. DR STRING; 6239.F54B11.3a; -. DR PaxDb; Q20745; -. DR PRIDE; Q20745; -. DR EnsemblMetazoa; F54B11.3a; F54B11.3a; WBGene00006816. [Q20745-1] DR GeneID; 181480; -. DR KEGG; cel:CELE_F54B11.3; -. DR UCSC; F54B11.3b.2; c. elegans. [Q20745-1] DR CTD; 181480; -. DR WormBase; F54B11.3a; CE28236; WBGene00006816; unc-84. DR WormBase; F54B11.3b; CE27761; WBGene00006816; unc-84. DR eggNOG; KOG2687; Eukaryota. DR eggNOG; ENOG410YM6S; LUCA. DR GeneTree; ENSGT00390000011587; -. DR HOGENOM; HOG000018377; -. DR InParanoid; Q20745; -. DR KO; K19347; -. DR OMA; WKSEFAS; -. DR OrthoDB; EOG7J446H; -. DR NextBio; 914118; -. DR PRO; PR:Q20745; -. DR Proteomes; UP000001940; Chromosome X. DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW. DR GO; GO:0005856; C:cytoskeleton; IEA:UniProtKB-SubCell. DR GO; GO:0016021; C:integral component of membrane; IDA:WormBase. DR GO; GO:0005635; C:nuclear envelope; IDA:WormBase. DR GO; GO:0005637; C:nuclear inner membrane; IDA:WormBase. DR GO; GO:0005521; F:lamin binding; IPI:WormBase. DR GO; GO:0043495; F:protein anchor; IBA:GO_Central. DR GO; GO:0040011; P:locomotion; IMP:WormBase. DR GO; GO:0007399; P:nervous system development; IMP:WormBase. DR GO; GO:0006998; P:nuclear envelope organization; IBA:GO_Central. DR GO; GO:0007097; P:nuclear migration; IMP:WormBase. DR GO; GO:0030473; P:nuclear migration along microtubule; IMP:WormBase. DR GO; GO:0018991; P:oviposition; IMP:WormBase. DR GO; GO:0009791; P:post-embryonic development; IMP:WormBase. DR GO; GO:0030334; P:regulation of cell migration; TAS:WormBase. DR GO; GO:0040025; P:vulval development; IMP:WormBase. DR InterPro; IPR012919; SUN_dom. DR Pfam; PF07738; Sad1_UNC; 1. DR PROSITE; PS51469; SUN; 1. PE 1: Evidence at protein level; KW Alternative splicing; Complete proteome; Cytoplasm; Cytoskeleton; KW Developmental protein; Membrane; Nucleus; Reference proteome; KW Signal-anchor; Transmembrane; Transmembrane helix. FT CHAIN 1 1111 Nuclear migration and anchoring protein FT unc-84. FT /FTId=PRO_0000218910. FT TOPO_DOM 1 509 Nuclear. {ECO:0000269|PubMed:16481402}. FT TRANSMEM 510 530 Helical. FT TOPO_DOM 531 1111 Perinuclear space. FT {ECO:0000269|PubMed:16481402}. FT DOMAIN 945 1109 SUN. {ECO:0000255|PROSITE- FT ProRule:PRU00802}. FT VAR_SEQ 877 879 LRA -> VTN (in isoform b). FT {ECO:0000303|PubMed:10375507}. FT /FTId=VSP_007081. FT VAR_SEQ 880 1111 Missing (in isoform b). FT {ECO:0000303|PubMed:10375507}. FT /FTId=VSP_007082. FT MUTAGEN 91 91 P->S: In e1411; defects in nuclear FT migration and anchoring. FT {ECO:0000269|PubMed:10375507}. FT MUTAGEN 932 932 D->N: In n323; defects in nuclear FT migration and anchoring. FT {ECO:0000269|PubMed:10375507}. FT MUTAGEN 984 984 R->K: In n371; defects in nuclear FT migration and anchoring. FT {ECO:0000269|PubMed:10375507}. FT MUTAGEN 988 988 S->F: In sa61; defects in nuclear FT migration and anchoring. FT {ECO:0000269|PubMed:10375507}. FT MUTAGEN 1002 1002 G->D: In n321 and n399; defects in FT nuclear migration and anchoring. FT {ECO:0000269|PubMed:10375507}. SQ SEQUENCE 1111 AA; 125861 MW; 6A07438E2BDC8BA6 CRC64; MAPATEADNN FDTHEWKSEF ASTRSGRNSP NIFAKVRRKL LLTPPVRNAR SPRLTEEELD ALTGDLPYAT NYTYAYSKIY DPSLPDHWEV PNLGGTTSGS LSEQEHWSAA SLSRQLLYIL RFPVYLVLHV ITYILEAFYH VIKITSFTIW DYLLYLVKLA KTRYYAYQDH RRRTALIRNR QEPFSTKAAR SIRRFFEILV YVVLTPYRML TRSNNGVEQY QYRSIKDQLE NERASRMTTR SQTLERSRKF DGLSKSPARR AAPAFVKTST ITRITAKVFS SSPFGEGTSE NITPTVVTTR TVKQRSVTPR FRQTRATREA ITRALDTPEL EIDTPLSTYG LRSRGLSHLN TPEPTFDIGH AAATSTPLFP QETYNYQYEE ATGNKIKTAF TWLGYLILFP FFAARHVWYT FYDYGKSAYM KLTNYQQAPM ETIHVRDINE PAPSSSDVHD AVGVSWRIRI ADFLSSFVAT IVEAHQVVFA MFKGGIVETV SYFGGLFAGL TDKKSSKFSW CQILGLLLAL LFAIFLLGFL TSDNTAIRVK EITKDKNASK KSEGSLPAVP IWISAANHVK HYTWMVKEFV VDIAFDTYNY GKSTIGRLGT TPRYAWDLIA SGCGAVGNGL KSVLSSSFRF IDFCAGKLFY YGSDGFLSAN KSIGTFFNGC YETLYNGCTA IVGHTKSFIY NASNAVYNFF STIFAGLLNF STSSQNSILS LLKSFGTGIT NIFYNFIYAP IAGVFNFAGD NYMYFFNEVA AVFGKVYNSV VSVLKTVINW ILFLIAYPFS LCTRAWIRIS QYAPEDVVQV IPIPQAITPT PDVERIVEEP LRKVTDVEDE ELVIIPAPAP KPIPVPAPTP APVIIHQTNV VETVDKDAII KEVTEKLRAE LSAQFQQELS AKFEQNYNTI IEQLKMENTN IQYDKNHLEA IIRQMIYEYD TDKTGKVDYA LESSGGAVVS TRCSETYKSY TRLEKFWDIP IYYFHYSPRV VIQRNSKSLF PGECWCFKES RGYIAVELSH FIDVSSISYE HIGSEVAPEG NRSSAPKGVL VWAYKQIDDL NSRVLIGDYT YDLDGPPLQF FLAKHKPDFP VKFVELEVTS NYGAPFTCLY RLRVHGKVVQ V //