#=GF ID THP2
#=GF AC PF09432.15
#=GF DE Tho complex subunit THP2
#=GF AU Mistry J;0000-0003-2479-5322
#=GF AU Wood V;0000-0001-6330-7526
#=GF SE manual
#=GF GA 28.00 28.00;
#=GF TC 30.70 88.80;
#=GF NC 26.00 22.50;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch -Z 81514348 --cpu 4 -E 1000 HMM pfamseq
#=GF TP Family
#=GF RN [1]
#=GF RM 11060033
#=GF RT A protein complex containing Tho2, Hpr1, Mft1 and a novel
#=GF RT protein, Thp2, connects transcription elongation with mitotic
#=GF RT recombination in Saccharomyces cerevisiae.
#=GF RA Chavez S, Beilharz T, Rondon AG, Erdjument-Bromage H, Tempst P,
#=GF RA Svejstrup JQ, Lithgow T, Aguilera A;
#=GF RL EMBO J. 2000;19:5824-5834.
#=GF RN [2]
#=GF RM 12093753
#=GF RT The yeast THO complex and mRNA export factors link RNA
#=GF RT metabolism with transcription and genome instability.
#=GF RA Jimeno S, Rondon AG, Luna R, Aguilera A;
#=GF RL EMBO J. 2002;21:3526-3535.
#=GF DR INTERPRO; IPR018557;
#=GF DR SO; 0100021; polypeptide_conserved_region;
#=GF CC The THO complex plays a role in coupling transcription
#=GF CC elongation to mRNA export. It is composed of subunits THP2,
#=GF CC HPR1, THO2 and MFT1 [1].
#=GF SQ 26
#=GS A0A1G4JI36_9SACH/121-251 AC A0A1G4JI36.1
#=GS Q6FQZ2_CANGA/122-250 AC Q6FQZ2.1
#=GS G0VDT0_NAUCC/118-246 AC G0VDT0.1
#=GS G0W885_NAUDC/117-245 AC G0W885.1
#=GS G8BNX6_TETPH/133-261 AC G8BNX6.1
#=GS C5E1Y6_LACTC/120-243 AC C5E1Y6.1
#=GS G8ZYI7_TORDE/112-240 AC G8ZYI7.1
#=GS I6NCT5_ERECY/120-250 AC I6NCT5.1
#=GS Q6CXV8_KLULA/119-249 AC Q6CXV8.2
#=GS A0A7H9B3I6_ZYGMR/116-244 AC A0A7H9B3I6.1
#=GS C5DTV5_ZYGRC/116-244 AC C5DTV5.1
#=GS H2AVH8_KAZAF/115-243 AC H2AVH8.1
#=GS Q75D17_EREGS/119-247 AC Q75D17.2
#=GS A7TEX7_VANPO/115-244 AC A7TEX7.1
#=GS W0T571_KLUMD/122-252 AC W0T571.1
#=GS A0A0C7N7V1_9SACH/121-248 AC A0A0C7N7V1.1
#=GS THP2_YEAST/115-243 AC O13539.1
#=GS THP2_YEAST/115-243 DR PDB; 7LUV B; 115-227;
#=GS THP2_YEAST/115-243 DR PDB; 7APX C; 115-235;
#=GS THP2_YEAST/115-243 DR PDB; 7V2Y E; 115-184;
#=GS THP2_YEAST/115-243 DR PDB; 7V2W J; 115-240;
#=GS THP2_YEAST/115-243 DR PDB; 7AQO J; 115-238;
#=GS THP2_YEAST/115-243 DR PDB; 7AQO C; 115-235;
#=GS I2GZR0_TETBL/114-242 AC I2GZR0.1
#=GS A0A1X7R5N4_9SACH/118-247 AC A0A1X7R5N4.1
#=GS A0A1G4MJ07_LACFM/120-248 AC A0A1G4MJ07.1
#=GS A0A7H9HKJ4_9SACH/112-240 AC A0A7H9HKJ4.1
#=GS J4U4S3_SACK1/115-243 AC J4U4S3.1
#=GS A0A1G4IRN9_9SACH/121-245 AC A0A1G4IRN9.1
#=GS J8Q711_SACAR/115-243 AC J8Q711.1
#=GS J7RYT2_KAZNA/124-252 AC J7RYT2.1
#=GS A0A4C2E9V6_9SACH/116-244 AC A0A4C2E9V6.1
A0A1G4JI36_9SACH/121-251 f-EHQSLLGRLSASLDLSESANERSV-R.PAKDDDATKNVFHQLLKQYSAt.nsSQDELTKLRDQLMELINDQKLEKAQYSLENQHTLKEVFSQLAHQVTEWKEQFQSLEDIMFGNGPRSMLSLFHEVDKMKP--ll
Q6FQZ2_CANGA/122-250 .LTYINLLTKLSVNLAKQIEFADHSVSE.FLLEDWKPPHELQSILEKFVD....MEEDPEVLNDQLNKYMDNIKMERAKYSLENKYSLQEQLKTLESELSRWRDAWVNIESLMFGDSPNSMKGMLQNIESMKKEL..
G0VDT0_NAUCC/118-246 .LEYVNLLERLSVDLAKQVEISDPSVSK.FVLNDWNPPKGVQAILDKFAD....PSADAALLKMELVHYLDDIKMSRAKYSLENKYSLQDKVVNLNTELNRWRKELDDIEMMMFGDGATSIKKMLANVESLRSKI..
G0W885_NAUDC/117-245 .LEYINLLQRLSVDLVRQIEISDPNVSK.INVDGWNPPKKIQVLLDKFGE....PDADTRELKIQVQRYLDDIKMSRAKYSLENKYSLQEKLSEVTKAVNQWRAEWDNIEMMLFGDGSNSMKNMLANVESIKSKL..
G8BNX6_TETPH/133-261 .MIYTNLLGRLSVGLIQQVQVSNTENSE.IMINDYPPPEEIVSILEKFNT....ETTETDDLRGQLDDYLQKIKMDRAKYTLENEYLLKDSLLTLSKEVNYWRKEYDNLEMLMFSDGPNTIMKMMKNVDSLRLKV..
C5E1Y6_LACTC/120-243 .LEHINLIGRLSSVLTEMLP--------.SELDDSTQNLELAQILEAYNTd.skGSEDTDCLKEKLLDWIDSIKMEKARYSLENQHILRDSLKALTMEVTRWRENYESIEGMMFGENANSISQMLHKVQRLRPQL..
G8ZYI7_TORDE/112-240 .LRYVNLLERLSVDLVKEIEIADPTVTE.FVVNKWNPPKGIFEILDELAD....PATDVVAVRSRLNGYLDRIKMERAKYTIENKHSLQGTLRDLNKEVSNWRKEWDSIENVMFGDGSHSMKKMLQNIDSLKSKL..
I6NCT5_ERECY/120-250 .LKHLKLLNALAVDMCYPLVNQEDTEN-.IAVNKEHYPRELAPVLEEYDAy.gaDIEDIRNLRSKLMQYFENIKSSRAKYLLENKYLLADSLKELTKLVAAWSQKWEHLENILFGDSPASLRKLLQTMETVKASL..
Q6CXV8_KLULA/119-249 .MRHLSLVANLSDDLVHKLESSDESNK-.VLVNKNPLPAVLKETVKQYEEi.gdEQQRIENIRAKLFQYLDEIKAGRAKYALENKYILNSTLQQITKEVSEWSQRWTHIENSLFGDSPTSLKKLVQKAENIKEL-l.
A0A7H9B3I6_ZYGMR/116-244 .LEYVNLLERLSVDLAKQIEIADPKVSE.FIVDNWNPPKGIYAILETLGD....PTVDPKDIATRIRGYLDQIKMERAKYTIQNKYSLQETLHDLTKEVNSWRKECDSMENLMFGDSSNSMKKMLQNVDSLKFR-l.
C5DTV5_ZYGRC/116-244 .LDYVNLLQRLSVDLAKQIEISDPEVSE.FVVDNWSPPDGMQSILEQLAN....PDKDSTHLQSQLDQYLDQIKMERAKYTIENKYSLQETLNEVNKEVNYWRRNWNAIENLMFGDSAHSIKKMLQSIDLLRAKL..
H2AVH8_KAZAF/115-243 h-EYVNLLERLSVDLGKQVDISDSNVTE.LVVDDWTPPSELISLLEQYNE....SSSDIELQDSKVDRYLDQLKLLRAKYAMENNYLLKSTLNDLNDEVNYWRREYENIESMMFGNGPNSMKKMLHNVEVLKVK-a.
Q75D17_EREGS/119-247 .LQHLQLVNQLSVELAYPLGRRGSEH--.VTVNREGPPPELVAALAAYDAg..pDAPAAAELRAELLRYLDDIKATRARYLLENKYLLADSLRQLTRDVSSWSQKWESLEGTLFGDAPSSLRSLLRSVDTTKATI..
A7TEX7_VANPO/115-244 .LVYINLLERLASDLFIQVEDAQFKDNEvIMVDEVAAPVEVQDVLKKYIT....ESSETSVLRDELDKYLNEIKMERAELTIKNKFSLQPTLNELSKEVNYWRKEWDNMEMLMFGDGPNSMKRMMKNIESLRAK-a.
W0T571_KLUMD/122-252 .MRHLSLISNLSDDLVVKLESHDDSNL-.VVTNKDPLPPVLKNTIRKYEEl.gpEQQHIEDIRAALFQYLDDIKAGRAKYALENKYILNTSLQEITKEVSEWSQRWTNIENTLFGDSPNSLKKLIQKADEIKEL-l.
A0A0C7N7V1_9SACH/121-248 .MEHINLVGRLSSKLCDPALASRS----.-AAGDRDDAFDLKQILEAYNSlgdeDVEDTANLKLQLLRWTDSMKMENARFTLENQHILRDKLKSITAEVTQWKKNYESVENTMFGEDPHSIMQTIQRIQKMKPQL..
THP2_YEAST/115-243 .LRYINLLKRLSVDLAKQVEVSDPSVTV.YEMDKWVPSEKLQGILEQYCA....PDTDIRGVDAQIKNYLDQIKMARAKFGLENKYSLKERLSTLTKELNHWRKEWDDIEMLMFGDDAHSMKKMIQKIDSLKSEI..
#=GR THP2_YEAST/115-243 SS .HHHHHTTTTT------------SS--E.EESSS----HHHHHHHHTTCS....SS--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHXXXX..
I2GZR0_TETBL/114-242 q-KYINMVNRLSVDLAKQIETADIRKDK.YIVDNWLPPKEIEEILQEFTD....DDSEAVRLRARLEQYLDQLKMERVKYTLENRYTIEDKLILANKEVNRWRIEWDKLETLMFGRGPNSLKNMLQKNEQLAEKL..
A0A1X7R5N4_9SACH/118-247 .LEYINLLNRLSVELVKQVDISDPDISE.FVFDNWKPPAELQKIIDNYYG...dENKNFTSLNGDLQDYFNSIKLSRAKYTLENRYVLQRHLTELNKEANYWRGELDNIELLLFGEGPHSIRKVLQNVEVLKNKL..
A0A1G4MJ07_LACFM/120-248 .LEHGNLLGRLSSNLGNQMK-QDIDTS-.ISVEVS-KRDTLREIMAKYDSi.dgNDDQPEILRQELLDYIDSMKMEKARYSLVNQYMLSDSYKQLTKEVTQWKHQYESLEGIMFGDNPNSIRSMVYKIESLKEKL..
A0A7H9HKJ4_9SACH/112-240 .LTYVNLLERLSVDLVKEIEIADPSVTE.FVVDKWNPPKGLQPILENLAD....CNTDPEIATARLDGYLDQIKMERAKYTIENRHSLQGILRDLNKEVNDWRKEWDSIENWMFGDSEHSMKKMLQNIDSLKSKL..
J4U4S3_SACK1/115-243 .LKYINLLKRLSVDLAKQVEVSDPSVTV.YELDNWVPSEKLQGILEEYCA....PETDIRGVDAQIKNYLGQIKMARAKFGLENKYSLKEGLSTLTKELNHWRKEWDDIEMLMFGDDAHSMKKMIQKIDSLKSEI..
A0A1G4IRN9_9SACH/121-245 .LEHTNLIGRLSSNLCDAASAF------.SQVEKED-YFDLESWLQSYKSe.daSVETREILKSRLVSWITSIKMEKARYLVENQHILRDMLKSLTSDVAQWRTNYESIESMLFGDAHNSIAKTLLDITNLRQEL..
J8Q711_SACAR/115-243 .LRYVNLLQRLSVDLAKQIEVSDSSVTV.YEVDNWGPSEKLQGILEQYCA....PDTDIRGVDSQIKNYLDQIKMARAKFGLENKHSLKERLSTLTKELNHWRKEWDDIETLMFGDDEHSMKKMIQKIDSLKSEL..
J7RYT2_KAZNA/124-252 .LEYLNLLGTYAVDLARQIEISDPSVSH.FDIDDWKPPRKLLEILDKFQS....EDCEPIKIRDELQSYLDNIKLSRAKFTLENKHILQDKLGVLSKEVSYWRKEWDNIENMMFGEGSDSMRSMLQTVDSLRSKI..
A0A4C2E9V6_9SACH/116-244 .LDYVNLLQRLSVDLAKQIEISEPEVSE.FVVNNWNPPHDMQLILEQLAD....PKKDSAQLQSQLDQHLDQIKMERAKYTIENKYSLQETLNEINKEVNYWRRNWNAIENLMFGDSSHSIKKMLQSIDLLRTKL..
#=GC SS_cons .HHHHHTTTTT------------SS--E.EESSS----HHHHHHHHTTCS....SS--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHXXXX..
#=GC seq_cons .LcYlNLLpRLSVDLs+QlEhuDssso..hhl-casPPccLpsIL-pass....sss-sptL+upLppYLDpIKMpRAKYoLENKY.Lp-pLppLoKEVspWRccW-sIEshMFGDussShKKMLQsl-sLKscl..
//