On Productive Ambiguity

Mar 1, 2026
biology, genetic-code, ambiguity, design, information

On stop codons, pyrrolysine, and the case for designing systems that tolerate ambiguity rather than eliminate it.


The genetic code has three stop codons: UAA, UAG, UGA. They terminate translation. A ribosome encounters one, and the protein ends. This is one of the most conserved features across all life — so fundamental it barely gets questioned. Every codon means one thing.

In November 2025, Dipti Nayak's lab at UC Berkeley published work in PNAS showing that the methanogenic archaeon Methanosarcina acetivorans violates this rule. In this organism, UAG sometimes means stop and sometimes means pyrrolysine — the 22nd genetically encoded amino acid, used in enzymes that break down methylamines. The same codon, in the same genome, interpreted two ways.

The researchers looked for a deterministic signal. Some sequence motif, some structural feature, some contextual rule that would predict which interpretation a given UAG receives. They didn't find one. The interpretation appears to depend on pyrrolysine availability: when the amino acid is abundant, UAG is more likely read as "insert pyrrolysine." When it's scarce, UAG is more likely read as "stop." Between 200 and 300 genes in the genome contain UAG, meaning many proteins exist in two forms simultaneously — full-length and truncated — in ratios that shift with cellular conditions.

This should be catastrophic. Random pools of mixed-length proteins. Nonfunctional truncations competing with functional copies. Objectively, ambiguity in the genetic code should be deleterious.

It isn't.


The same month, Kivenson and Banfield published in Science the other solution. They identified multiple archaeal lineages that have fully recoded UAG. In these organisms, every UAG means pyrrolysine. No ambiguity. The result is a 62-sense-codon genetic code with only two stop codons — an alternative genetic code, the first ever found in archaea.

This code arose independently in multiple lineages. Over 1,800 archaeal proteins were found to contain pyrrolysine, two orders of magnitude more than previously known. The recoding is complete and deterministic. UAG means one thing, always.


Two solutions to the same problem. M. acetivorans lives with ambiguity — the same signal interpreted differently depending on context. The fully recoded archaea eliminated ambiguity — the signal means one thing. Both are viable. Both have persisted. Both support methylamine metabolism in environments where that matters.

The temptation is to rank them. The fully recoded version looks cleaner. Deterministic. Engineered. The ambiguous version looks like an evolutionary compromise, a transitional state on the way to something better.

But there's no evidence it's transitional. The ambiguous code appears to be stable. It may even be adaptive — the ability to modulate protein length ratios in response to pyrrolysine availability gives M. acetivorans a regulatory mechanism that the fully recoded archaea lack. Ambiguity as a feature, not a residual defect.

Nayak put it directly: "Biological systems are more ambiguous than we give them credit to be and that ambiguity is actually a feature — it's not a bug."


There's a medical dimension worth noting.

Approximately 11% of all disease-causing mutations in humans are nonsense mutations — premature stop codons that truncate essential proteins. Cystic fibrosis, Duchenne muscular dystrophy, some cancers. The protein starts building, hits a premature UAG or UAA or UGA, and terminates before it should. The result is nonfunctional.

The most advanced pharmaceutical approach was ataluren (marketed as Translarna), a small molecule designed to make ribosomes occasionally read through premature stop codons — insert an amino acid instead of terminating. It received conditional EU approval in 2014 for nonsense-mutation Duchenne muscular dystrophy. In March 2025, the European Commission withdrew that authorization after a decade of studies failed to confirm clinical efficacy.

The strategy was essentially to engineer ambiguity into a deterministic system. Make stop codons selectively leaky. The problem: human translation machinery is optimized for deterministic stop codon recognition. Introducing controlled ambiguity into a system designed against it proved unreliable. Too much readthrough and you lose translational fidelity everywhere. Too little and the therapeutic effect vanishes.

The archaea suggest the problem might be framed backwards. M. acetivorans doesn't introduce ambiguity into a deterministic system. It maintains ambiguity as a baseline condition and tolerates its consequences at the system level — through reduced UAG frequency in the genome, through balancing pyrrolysine supply with demand, through whatever unknown mechanisms buffer the proteome against mixed-length protein pools. The tolerance is architectural, not pharmacological.


A design principle surfaces: design for ambiguity tolerance rather than ambiguity elimination.

The fully recoded archaea took the opposite path — they eliminated ambiguity entirely, and that works too. But the interesting case is M. acetivorans, because it demonstrates that a system can function with genuinely ambiguous signal interpretation, provided the architecture accommodates it. The ambiguity isn't a problem to solve. It's a condition the system is built around.

This is a different orientation than most engineered information systems adopt. The default engineering instinct is disambiguation: make every signal mean one thing, enforce deterministic interpretation, treat ambiguity as noise to be eliminated. That instinct has obvious value. But it may also foreclose design spaces that biology has found productive.

There are systems where ambiguity isn't eliminable — where the same signal genuinely means different things in different contexts, where interpretation depends on conditions that can't be fully specified in advance. For those systems, the question isn't how to force deterministic meaning. It's how to build tolerance for interpretive variation into the architecture itself.

The archaea have been running this experiment for a long time. Both solutions persist. Neither is converging on the other.


What would it mean to design an information-processing system that treats ambiguous interpretation not as a failure mode but as an operating condition?