Trinucleotide Repeat Disorder

Watchlist
Retrieved
2021-01-18
Source
Trials
Genes
Drugs

Trinucleotide repeat disorders, also known as microsatellite expansion diseases, are a set of over 50 genetic disorders caused by trinucleotide repeat expansion, a kind of mutation in which repeats of three nucleotides (trinucleotide repeats) increase in copy numbers until they cross a threshold above which they become unstable. Depending on its location, the unstable trinucleotide repeat may cause defects in a protein encoded by a gene; change the regulation of gene expression; produce a toxic RNA, or lead to chromosome instability. In general, the larger the expansion the faster the onset of disease, and the more severe the disease becomes.

Trinucleotide repeats are a subset of a larger class of unstable microsatellite repeats that occur throughout all genomes.

The first trinucleotide repeat disease to be identified was fragile X syndrome, which has since been mapped to the long arm of the X chromosome. Patients carry from 230 to 4000 CGG repeats in the gene that causes fragile X syndrome, while unaffected individuals have up to 50 repeats and carriers of the disease have 60 to 230 repeats. The chromosomal instability resulting from this trinucleotide expansion presents clinically as intellectual disability, distinctive facial features, and macroorchidism in males. The second DNA-triplet repeat disease, fragile X-E syndrome, was also identified on the X chromosome, but was found to be the result of an expanded CCG repeat. The discovery that trinucleotide repeats could expand during intergenerational transmission and could cause disease was the first evidence that not all disease-causing mutations are stably transmitted from parent to offspring.

There are several known categories of trinucleotide repeat disorder. Category I includes Huntington's disease (HD) and the spinocerebellar ataxias. These are caused by a CAG repeat expansion in protein-coding portions, or exons, of specific genes. Category II expansions are also found in exons, and tend to be more phenotypically diverse with heterogeneous expansions that are generally small in magnitude. Category III includes fragile X syndrome, myotonic dystrophy, two of the spinocerebellar ataxias, juvenile myoclonic epilepsy, and Friedreich's ataxia. These diseases are characterized by typically much larger repeat expansions than the first two groups, and the repeats are located in introns rather than exons.

Types

Some of the problems in trinucleotide repeat syndromes result from causing alterations in the coding region of the gene, while others are caused by altered gene regulation. In over half of these disorders, the repeated trinucleotide, or codon, is CAG. In a coding region, CAG codes for glutamine (Q), so CAG repeats result in a polyglutamine tract. These diseases are commonly referred to as polyglutamine (or polyQ) diseases. The repeated codons in the remaining disorders do not code for glutamine, and these are classified as non-polyglutamine diseases.

Polyglutamine (PolyQ) diseases

Type Gene Normal PolyQ repeats Pathogenic PolyQ repeats
DRPLA (Dentatorubropallidoluysian atrophy) ATN1 or DRPLA 6 - 35 49 - 88
HD (Huntington's disease) HTT 6 - 35 36 - 250
SBMA (Spinal and bulbar muscular atrophy) AR 4 - 34 35 - 72
SCA1 (Spinocerebellar ataxia Type 1) ATXN1 6 - 35 49 - 88
SCA2 (Spinocerebellar ataxia Type 2) ATXN2 14 - 32 33 - 77
SCA3 (Spinocerebellar ataxia Type 3 or Machado-Joseph disease) ATXN3 12 - 40 55 - 86
SCA6 (Spinocerebellar ataxia Type 6) CACNA1A 4 - 18 21 - 30
SCA7 (Spinocerebellar ataxia Type 7) ATXN7 7 - 17 38 - 120
SCA12 (Spinocerebellar ataxia Type 12) PPP2R2B 7 - 41 43 - 51
SCA17 (Spinocerebellar ataxia Type 17) TBP 25 - 42 47 - 63

Non-polyglutamine diseases

Type Gene Codon Normal Pathogenic Mechanism
FRAXA (Fragile X syndrome) FMR1 CGG (5' UTR) 6 - 53 230+ abnormal methylation
FXTAS (Fragile X-associated tremor/ataxia syndrome) FMR1 CGG (5' UTR) 6 - 53 55-200 increased expression, and a novel polyglycine product
FRAXE (Fragile XE mental retardation) AFF2 CCG (5' UTR) 6 - 35 200+ abnormal methylation
Baratela-Scott syndrome XYLT1 GGC (5' UTR) 6 - 35 200+ abnormal methylation
FRDA (Friedreich's ataxia) FXN GAA (Intron) 7 - 34 100+ impaired transcription
DM1 (Myotonic dystrophy Type 1) DMPK CTG (3' UTR) 5 - 34 50+ RNA-based; unbalanced DMPK/ZNF9 expression levels
SCA8 (Spinocerebellar ataxia Type 8) SCA8 CTG (RNA) 16 - 37 110 - 250 ? RNA

Symptoms

A common symptom of polyQ diseases is the progressive degeneration of nerve cells, usually affecting people later in life. Although these diseases share the same repeated codon (CAG) and some symptoms, the repeats are found in different, unrelated genes. In all cases, the expanded CAG repeats are translated into an uninterrupted sequence of glutamine residues, forming a polyQ tract, and the accumulation of polyQ proteins damages key cellular functions such as the ubiquitin-proteasome system. However different polyQ-containing proteins damage different subsets of neurons, leading to different symptoms. As of 2017, ten neurological and neuromuscular disorders were known to be caused by an increased number of CAG repeats.

The non-PolyQ diseases do not share any specific symptoms and are unlike the PolyQ diseases. In some of these diseases, such as Fragile X syndrome, the pathology is caused by lack of the normal function of the protein encoded by the affected gene. In others, such as Monotonic Dystrophy Type 1, the pathology is caused by a change in protein expression or function mediated through changes in the messenger RNA produced by the expression of the affected gene. In yet others, the pathology is caused by toxic assemblies of RNA in the nuclei of cells.

Genetics

Classification of the trinucleotide repeat, and resulting disease status, depends on the number of CAG repeats in Huntington's disease
Repeat count Classification Disease status
<28 Normal Unaffected
28–35 Intermediate Unaffected
36–40 Reduced-penetrance May be affected
>40 Full-penetrance Affected

Trinucleotide repeat disorders generally show genetic anticipation: their severity increases with each successive generation that inherits them. This is likely explained by the addition of CAG repeats in the affected gene as the gene is transmitted from parent to child. For example, Huntington's disease occurs when there are more than 35 CAG repeats on the gene coding for the protein HTT. A parent with 35 repeats would be considered normal and would not exhibit any symptoms of the disease. However, that parent's offspring would be at an increased risk of developing Huntington's compared to the general population, as it would take only the addition of one more CAG codon to cause the production of mHTT (mutant HTT), the protein responsible for disease.

Huntington's very rarely occurs spontaneously; it is almost always the result of inheriting the defective gene from an affected parent. However, sporadic cases of Huntington's in individuals who have no history of the disease in their families do occur. Among these sporadic cases, there is a higher frequency of individuals with a parent who already has a significant number of CAG repeats in their HTT gene, especially those whose repeats approach the number (36) required for the disease to manifest. Each successive generation in a Huntington's-affected family may add additional CAG repeats, and the higher the number of repeats, the more severe the disease and the earlier its onset. As a result, families that have suffered from Huntington's for many generations show an earlier age of disease onset and faster disease progression.

Non-trinucleotide expansions

The majority of diseases caused by expansions of simple DNA repeats involve trinucleotide repeats, but tetra-, penta- and dodecanucleotide repeat expansions are also known that cause disease. For any specific hereditary disorder, only one repeat expands in a particular gene.

Mechanism

Triplet expansion is caused by slippage during DNA replication or during DNA repair synthesis. Because the tandem repeats have identical sequence to one another, base pairing between two DNA strands can take place at multiple points along the sequence. This may lead to the formation of 'loop out' structures during DNA replication or DNA repair synthesis. This may lead to repeated copying of the repeated sequence, expanding the number of repeats. Additional mechanisms involving hybrid RNA:DNA intermediates have been proposed.

See also

  • C9orf72
  • RAN translation