Difference between revisions of "IS Families/IS256 family"
[[Image:Fig. IS256.1.png|thumb|center|720x720px|'''Fig. IS256.1.''' Phylogenetic tree of prokaryotic Mutator-like transposases. Each p-MULT clade is colored according to figure 1. p-MULT 1 and p-MULT 2 transposases are encoded by ISs of the IS''256'' and IS''H6'' families, respectively. p-MULT 3 transposases are encoded by both the Tn''GBS'' family and the IS''Lre2'' family. p-MULT 4 transposases, encoded by both transposons and ISs, form three different lineages: IS''Azba1'', IS''Mich2'', and IS''Kra4''. Transposons of the IS''Azba1'' group encoding a pRiA4_Orf3-like protein are indicated by blue dots. ISs of the IS''Mich2'' group with a predicted −1 frameshift in the transposase gene are indicated by pink dots. TE names are indicated at the extremity of the tree branches. TEs with a predicted σ<sub>A</sub> promoter at a distance of 13–17 bp from the IR-genome junctions in more than 20% of their insertion sites are indicated by small black dots.|alt=]]
[[Image:Fig. IS256.2.png|thumb|center|600x600px|'''Fig. IS256.2.''' Alignment of the protein domains encompassing the catalytic DDE residues in p-MULT. Transposase sequences were aligned by the [https://mafft.cbrc.jp/alignment/software/ MAFFT] alignment software and visualized using [https://www.jalview.org/ Jalview]. The alignment was filtered for redundancy to subsequently retain a subset of transposases for each p-MULT family representative of their diversity. Only regions surrounding the predicted DDE residues and the C/D(2)H motif were kept in the alignment. Numbers given in parentheses correspond to the distance in aa residues between the different motifs. Transposases accession numbers are indicated on the left.|alt=]]
[[Image:Fig. IS256.4.png|thumb|center|600x600px|'''Fig. IS256.4.''' '''IS''256'' and IS''1249'' Weblogo showing the IS ends.''' Left (IRL) and right (IRR) inverted terminal repeats are shown in WebLogo format (Crooks et al., 2004). |alt=]]
[[Image:Fig. IS256.5.png|thumb|center|600x600px|'''Fig. IS256.5.''' '''IS''H6'' and IS''Lre2'' Weblogo showing the IRs.''' Left (IRL) and right (IRR) inverted terminal repeats are shown in WebLogo format (Crooks et al., 2004). |alt=]]
[[Image:Fig. IS256.6.png|thumb|center|600x600px|'''Fig. IS256.6.''' '''IS''Kra1'' grp IS''Azba1'', IS''Kra1'' grp IS''Kra1'', and IS''Kra1'' grp IS''Mich2'' Weblogo showing the IRs.''' Left (IRL) and right (IRR) inverted terminal repeats are shown in WebLogo format (Crooks et al., 2004).|alt=]]
Revision as of 19:03, 13 July 2020
The IS256 cluster
IS256 was first identified in 1987 as part of the gentamicin resistance transposon Tn4001[1][2] from Staphylococcus aureus[2]. It was observed that tandem duplication of IS256 contiguous with Tn4001 resulted in an increase in the level of resistance to gentamicin, tobramycin and kanamycin (Gm, Tm, and Km), implying the presence of strong IS256-associated promoters. Other examples of IS256-mediated increased gene expression have also been observed[3][4]. IS256 is widely distributed in staphylococci and enterococci[5][6], where it is part of a variety of composite transposons[7][8][9][10].
Recently, a study of ICEs (integrative and conjugative elements) identified examples from group B Streptococcus [TnGBS[11]] and Mycoplasma[12] which include a DDE-type Tpase rather than the more common phage integrase-like gene. A cascade PSI-BLAST approach not only revealed two new IS families (ISLre2 and ISKra4) but also established a distant relationship with the IS256 and ISH6 families[13] (Fig. IS256.1 and Fig. IS256.2).
Analysis of the N-terminal Tpase region[13] also identified two shared domains (N1 and N2). N2 corresponds to a potential HTH domain in the region of the IS256 Tpase which recognizes the terminal IRs[14] (Fig. IS256.3).
The cluster can be divided into five clades containing nine groups based on the branching of the Tpase phylogenetic tree: two types of closely related TnGBS, TnGBS1 and TnGBS2, and ISLre2 (MULT3); the Mycoplasma ICE; IS256 (MULT1); ISH6 (MULT2); ISAzba1, ISMich2, ISKra4 (MULT4)[13] (Fig. IS256.1, Fig. IS256.2 and Fig. IS256.3).
There is a distant relationship with the Tpase of the eukaryotic Mutator TE and, like MuDR from Zea mays, many generate 8-9 bp target repeats on insertion. They have therefore been called MULE (for Mutator-Like Elements). Like MuDR/Foldback, members of these groups carry a largely α-helical insertion domain between the second D and E catalytic residues. This includes a conserved C/D(2)H signature present in both the eukaryotic and the prokaryotic elements[13][15].
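As a reading aid, the clade and group structure just described can be written down as a small lookup table. This is only an illustrative sketch: the names `CLADES` and `clade_of` and the key label "Mycoplasma ICE" are choices made here and are not ISfinder identifiers.

```python
# Clade -> groups, transcribed from the classification paragraph above.
# Names and key labels are illustrative, not ISfinder identifiers.
CLADES = {
    "MULT1": ["IS256"],
    "MULT2": ["ISH6"],
    "MULT3": ["TnGBS1", "TnGBS2", "ISLre2"],
    "MULT4": ["ISAzba1", "ISMich2", "ISKra4"],
    "Mycoplasma ICE": ["Mycoplasma ICE"],
}


def clade_of(group):
    """Return the clade label for a group name, or None if it is not listed."""
    for clade, groups in CLADES.items():
        if group in groups:
            return clade
    return None


# Count clades and groups so the totals can be compared with the text.
n_clades = len(CLADES)
n_groups = sum(len(groups) for groups in CLADES.values())
print(n_clades, "clades,", n_groups, "groups")
print("ISLre2 belongs to", clade_of("ISLre2"))
```

The counts give a quick consistency check against the "five clades containing nine groups" stated in the text.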
IS256
The IS256 family can be subdivided into 3 groups: IS256, IS1249, and ISC1250.
IS256 group
The classical IS256 group has a large number of members in both bacteria and archaea. They are between 1200 and 1500 bp long with IR of 20-30 bp (Fig. IS256.4) and generate DR of between 8 and 9 bp. They carry a single long orf with a potential DDE motif, with a spacing of 112 residues between the second D and the E residue (Fig. IS256.2), together with a correctly placed K/R residue. This spacing is due to an insertion domain[16][17]. The catalytic residues have been validated by mutagenesis[18]. It was shown several years ago that the Tpases of IS256 family elements share some similarities with that of the eukaryotic Mutator element[19], a relationship which has been explored recently in more detail[20].
Members of this family transpose using an excised circular dsDNA transposon intermediate [e.g. [18][21]]. They are also found as part of composite transposons such as Tn4001 flanked on either side by IS256[22][2][23][24]. For IS256 itself, the sequences of circle junctions showed that the left IS end preferentially attacked the right end[21], a result that was independently demonstrated by the effect of small deletions in the left and right ends on circle formation[14].
IS1249 group
There are more than 30 members, confined at present to the Actinobacteria and the Firmicutes. They are about 1300 bp in length with IR of about 26 bp (Fig. IS256.4) and generally generate DR of 8 bp (with variations of between 0 and 10).
ISC1250 group
At present, there are only 3 members of this group in ISfinder. All are found in the archaeon Sulfolobus solfataricus.
ISH6
This group (MULT2) was originally observed uniquely in archaea[25]. There are 11 members of about 1450 bp with highly conserved IR of 24-27 bp (Fig. IS256.5), DR of 8 bp and a single Tpase orf encoding a protein of 450 aa.
ISLre2
There are 48 entries for ISLre2 family members in ISfinder. They are restricted at present to the bacteria. They are between 1500 and 2000 bp long, with IR from 15 to 29 bp (Fig. IS256.5), and generate 9 bp DR. Together with the related TnGBS ICE, they show strong target specificity and insert 13-17 bp upstream of σA promoters[11][13] in an oriented fashion, with the right end (RE) proximal. PCR analysis has detected a transposon circle junction, as with the related ICE, suggesting that transposition may occur via a Donor Primed Transposon Replication process.
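The 13-17 bp target window can be expressed as a simple distance test. The sketch below is hypothetical: the function name, the 0-based forward-strand coordinates and the choice of a single "promoter position" as reference point are assumptions made for illustration, since the text does not define which promoter coordinate the distance is measured from.

```python
# Window reported in the text: insertions lie 13-17 bp upstream of a sigma-A promoter.
WINDOW_BP = (13, 17)


def in_reported_window(insertion_pos, promoter_pos, strand="+"):
    """Return True if the insertion lies 13-17 bp upstream of the promoter.

    Both positions are 0-based coordinates on the forward strand (an
    assumption of this sketch). "Upstream" is taken relative to the strand
    on which the promoter reads.
    """
    if strand == "+":
        distance = promoter_pos - insertion_pos
    elif strand == "-":
        distance = insertion_pos - promoter_pos
    else:
        raise ValueError("strand must be '+' or '-'")
    return WINDOW_BP[0] <= distance <= WINDOW_BP[1]


# Made-up coordinates, used only to show the call.
print(in_reported_window(85, 100))
print(in_reported_window(88, 100))
```

A test of this kind could be applied to annotated insertion sites to estimate how often a given element lands in the window, provided that the reference coordinate is defined consistently.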
ISKra4
This newly emerging family includes 83 members and is divided into three related groups: ISAzba1, ISMich2 and ISKra4.
ISAzba1
There are presently 28 members of this group. They encode a Tpase of between 450 and 480 aa, are 1400 to 2900 bp long with IR of about 20 bp (Fig. IS256.6) and no DR. Five (ISAfe13, ISCot1, ISEc51, ISKpn19, ISSysp7) carry an orf in addition to the Tpase, and this specifies a protein related to serine-recombinases or resolvases. Four of these also include a third orf annotated as hypothetical protein. The fifth, ISAfe13, carries the Tpase, a resolvase, and an alternative orf annotated as ORF-3-like from plasmid pRiA4b. Other proteins found in this family are annotated as being hypothetical or putative TnpR resolvases, although no direct evidence for resolvase function is available. Eight other members simply encode the Tpase and the ORF-3-like protein. ISCep1 includes the ORF-3-like protein and a third orf annotated as phage integrase or XerC/D.
ISMich2
This includes 24 members which are presently limited to the cyanobacteria. Twenty-two have a Tpase orf distributed between two reading phases, while in the remaining 2 the Tpase forms a unique continuous orf. However, all show a potential but atypical frameshift motif, TTTTTT, which could be involved in either PRF (Programmed -1 Ribosomal Frameshifting) or PTR (Programmed Transcriptional Realignment) recoding. Further experimental analysis would be necessary to confirm or refute this. Members are between 1250 and 1400 bp long with a Tpase of 360 aa, IR of between 18 and 39 bp (Fig. IS256.6) and 8 bp DR. Three members (ISCysp26; ISMic1; ISMich2) carry a passenger gene annotated as hypothetical protein.
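A candidate frameshift motif of this kind can be located with a simple pattern scan. The sketch below is a minimal illustration and not the procedure used to annotate these elements: the function name `find_t_runs` and the toy sequence are made up, and a run of T residues on its own says nothing about whether recoding actually occurs.

```python
import re


def find_t_runs(seq, min_len=6):
    """Return (start, length) for each maximal run of T at least min_len long.

    Positions are 0-based. The input is upper-cased first, so lower-case
    sequence is accepted.
    """
    pattern = re.compile("T{%d,}" % min_len)
    return [(m.start(), len(m.group())) for m in pattern.finditer(seq.upper())]


# Toy sequence built for illustration only; it is not taken from any IS.
toy = "ACG" + "T" * 6 + "GCA"
print(find_t_runs(toy))
```

Hits would then need to be checked against the two reading phases of the Tpase orf, since only a run positioned in the overlap between them could plausibly take part in recoding.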
ISKra4
This small group of elements ranges in size from 1400 to 3700 bp owing to the presence in some members of various passenger genes. They have IR of 18 to 31 bp (Fig. IS256.6) and generate DR of 9 bp. Three carry passenger genes: ISLdr1, a hypothetical protein and a reverse transcriptase; ISSri1, a transcriptional regulator; and ISTn1, a hypothetical protein. Six members may express their Tpases by frameshifting (5 include a 7A motif and 1 includes a 5TC motif).
Bibliography
- ↑ [PubMed:6323927]
- ↑ 2.0 2.1 2.2
- ↑ [PubMed:31474962]
- ↑ [PubMed:9371438]
- ↑ [PubMed:7899522]
- ↑ [PubMed:1334269]
- ↑ [PubMed:8654967]
- ↑ [PubMed:7625803]
- ↑ [PubMed:8031032]
- ↑ [PubMed:8723445]
- ↑ 11.0 11.1 [PubMed:19183283]
- ↑ [PubMed:23888872]
- ↑ 13.0 13.1 13.2 13.3 13.4 Guérillot R, Siguier P, Gourbeyre E, Chandler M, Glaser P. The diversity of prokaryotic DDE transposases of the mutator superfamily, insertion specificity, and association with conjugation machineries. Genome Biol Evol: 2014 Feb, 6(2);260-72 [PubMed:24418649]
- ↑ 14.0 14.1 Hennig S, Ziebuhr W. Characterization of the transposase encoded by IS256, the prototype of a major family of bacterial insertion sequence elements. J Bacteriol: 2010 Aug, 192(16);4153-63 [PubMed:20543074]
- ↑ [PubMed:21518873]
- ↑ [PubMed:20067338]
- ↑ [PubMed:23217365]
- ↑ 18.0 18.1 Loessner I, Dietrich K, Dittrich D, Hacker J, Ziebuhr W. Transposase-dependent formation of circular IS256 derivatives in Staphylococcus epidermidis and Staphylococcus aureus. J Bacteriol: 2002 Sep, 184(17);4709-14 [PubMed:12169594]
- ↑ [PubMed:8041625]
- ↑ [PubMed:19018586]
- ↑ 21.0 21.1 Prudhomme M, Turlan C, Claverys JP, Chandler M. Diversity of Tn4001 transposition products: the flanking IS256 elements can form tandem dimers and IS circles. J Bacteriol: 2002 Jan, 184(2);433-43 [PubMed:11751820]
- ↑ [PubMed:6323927]
- ↑ [PubMed:2553542]
- ↑ [PubMed:2544565]
- ↑ [PubMed:17347521]