Chromosome-level genome assembly reveals the secrets of this plant-animal hybrid microbe
In the murky waters of ponds and rivers worldwide, a microscopic marvel blurs the line between plant and animal. Euglena gracilis, a single-celled alga, swims using a whip-like flagellum like a protozoan but harnesses sunlight for energy like a plant. For decades, scientists have been fascinated by its biological contradictions and its industrial potential—it produces paramylon, a unique carbohydrate with immune-boosting properties, and thrives in extreme conditions where other microorganisms perish. Yet despite its significance, Euglena guarded a formidable secret: a massive, complex genome that resisted all decoding attempts—until now 1 7 .
At ~2.37 billion base pairs, nearly 80% the size of the human genome
Can produce up to 90% of its dry weight as this valuable β-glucan polymer
Euglena's genome posed a monumental challenge for three key reasons:
Over 58% consists of repetitive sequences—jumping genes, viral remnants, and duplicated segments—that act like a 10,000-piece puzzle where most pieces look identical 1 4 .
Contains genes from both red and green algae—evidence of multiple failed endosymbiotic events 4 .
Without a complete genome, efforts to engineer strains for higher paramylon yields were "flying blind" 7 .
A 2019 draft genome (1.43 Gb) had a contig N50 of just 955 bp, making gene prediction and metabolic analysis nearly impossible 4 .
In 2024, a research team combined cutting-edge technologies in a multi-step "genome assembly pipeline" to achieve the first chromosome-level blueprint of Euglena gracilis strain Z 1 3 5 .
Metric | 2024 Assembly | 2019 Draft |
---|---|---|
Size | 2.37 Gb | 1.43 Gb |
Chromosomes | 46 (99.83% anchored) | Not anchored |
Contig N50 | 619 Kb | 955 bp |
BUSCO Completeness | 80.39% | ~20% |
Category | Count | Examples |
---|---|---|
Protein-Coding Genes | 39,362 | Photosystems, paramylon synthases |
tRNAs | 4,882 | All 20 amino acids represented |
miRNAs | 188 | Regulatory non-coding RNAs |
CRISPR tools can now target paramylon pathways for strain optimization 7 .
Genes from red and green algae coexist—proof of multiple endosymbiotic events 4 .
First genome-scale metabolic model being built .
"Euglena's genome is a Rosetta Stone for understanding endosymbiosis—and a launchpad for tomorrow's bioeconomy."
The chromosome-level genome of Euglena gracilis isn't just a technical triumph—it's a master key unlocking biology's deepest questions. How do cells reconcile plant and animal traits? What evolutionary forces shape genomes after endosymbiosis? And how can we harness nature's versatility for sustainability? As engineers rewire Euglena for carbon capture or biomedicine, this humble microbe stands poised to revolutionize industries, proving that some of life's smallest creatures hold the grandest blueprints 1 7 .
Repeat Type | Genome Coverage |
---|---|
LTRs | 32.81% |
LINEs | 1.49% |
DNA Elements | 4.60% |
Unclassified | 32.73% |