Skip to content

The Voynich’s Two Voices – A Field Guide to A and B

Statistical fingerprints in the Voynich Manuscript split the text into two dialect families, Currier A and Currier B. We map where each sits in the book, which hands wrote them, and what that distribution implies.

Voynich side-by-side pages of Scribe 1 (folio 33r) and Scribe 2 (folio 32v)

Two pages apart, the script behaves like two different systems. In the Herbal section of the Voynich Manuscript, you can turn a leaf and watch the statistics flip; word-final ‘dy’ suddenly floods the lines, while the ‘chol’ and ‘chor’ combinations that were everywhere a page earlier almost vanish.

The handwriting changes with it. That is the problem this piece follows.

For the broader context of the Voynich Manuscript, see our main investigation: The Voynich Paradox – When Evidence Points Both Ways.

The Core Concept

A single fifteenth-century codex, a book made from folded sheets of parchment, shows at least two statistically distinct modes of writing.

Researchers have labelled these ‘Voynich A’ and ‘Voynich B’. The split is not just a stylistic quirk. It tracks with different scribal hands and it cuts across the book’s thematic sections. In plain terms, one physical manuscript hosts two textual personalities. Our task is to show where that divide sits in the object, what defines it in numbers, who wrote which parts, and which production model best fits the record.

Before we start, a few quick definitions in plain English:

  • Codicology: The study of the physical book, from parchment and sewing to bindings and stains.
  • Palaeography: The study of handwriting, used to identify and compare scribes.
  • Quire: A group of parchment leaves folded and nested together before binding. A typical quire might have eight leaves, forming 16 pages.
  • Bifolium: A single large sheet folded once to make two leaves (four pages). Quires are constructed from multiple bifolia.
  • EVA (Extensible Voynich Alphabet): The standard way of transcribing Voynich glyphs into Latin letters so computers and humans can count things.
  • Bigram: A two-character sequence, such as ‘dy’ at the end of a word.

The Cryptographer’s Ghost

In 1976, at a conference on medieval manuscripts, a US Navy cryptographer named Prescott Currier announced a finding that fundamentally altered the study of the Voynich Manuscript. For decades, the codex had been treated as one continuous text in a single invented script. In his declassified NSA presentation, Currier stood up and said there were two.

His claim was not based on the subjective feel of the writing. It rested on statistics. When he counted letter groups across the herbal pages, the patterns split in half. On pages he labelled as A, certain clusters such as ‘chol’ and ‘chor’ turn up with high frequency. On B pages, the end of words is where the action sits. The bigram ‘dy’ at the word-final position appears over and over. The gap was too sharp to dismiss as chance.

Currier gave them names: Language A and Language B.

The terminology stuck. His point was clear. The codex is not one stream of gibberish dressed up as words. It is two statistically distinct systems occupying the same vellum. This discovery carried weight because of who he was. Currier had spent his career breaking codes during the Cold War, trained to spot the fingerprints of language in hostile ciphers.

Two words act as clear flags. On A pages, the most frequent token is ‘daiin’. On B pages, the champion is ‘chedy’. Currier and later analysts pointed out something sharper still. The word ‘chedy’ does not appear in the A corpus at all, while ‘daiin’ is present in both, an observation detailed in later computational analyses. That asymmetric relationship is a clue we will return to.

For Currier, the implication was obvious. A crude hoax could have been built from random squiggles. But this level of structured difference inside a single book pushed the odds against that explanation. In his words, the dialect split made the possibility of a one-man hoax ‘much less likely’.

'There are in fact two statistically distinguishable ‘languages’'.

Prescott Currier, NSA Symposium (1976)

A Map of the Fracture

The physical book matters because the split does not track cleanly with its subject matter. The manuscript is a codex of 102 parchment folios, radiocarbon dated to the early fifteenth century, with evidence of missing leaves and later binding work.

Drawings divide the book into broad themes: Herbal, Astronomical, Balneological (depicting bathing scenes), Cosmological, Pharmaceutical, and a dense final section of recipe-like paragraphs. You might expect a linguistic mode to stick to a theme. It often does not.

The Herbal section is the main example. It holds long stretches of Voynich A, then patches of Voynich B, sometimes on facing pages. The Balneological section is different. It is consistently written in Voynich B throughout. The Astronomical pages lean towards A, while the Recipes lean to B. The Pharmaceutical pages are mixed, a distribution mapped in detail on sites like voynich.nu.

The stranger feature appears when you stop reading in the current page order and start tracing the book by its physical units.

In the fifteenth century, a book was made of quires. Folded vellum sheets sewn together into gatherings. A scribe would typically copy one quire after another until the book was finished. That rhythm makes Currier’s finding stand out even more.

The split between Language A and Language B does not follow these themes neatly. In Quire 4, for example, a bifolium written in Voynich A by one scribe is wrapped around another written in Voynich B by a second scribe. This method of assembly is, according to manuscript expert Dr Lisa Fagin Davis, ‘utterly atypical’ for the period. Parallel copying in that period tended to allocate whole quires to different scribes, which kept text blocks intact even if people worked at the same time.

There is physical support for later disturbance.

A large water stain runs across a path that makes little sense in the present folio order and appears to predate the modern folio numbers. If the quires were disbound to dry and then reassembled without original page marks, bifolia could easily have been shuffled in a way that hid the original sequence. That does not explain the dialect split by itself, but it does explain why the split looks chaotic in places.

Dialect and Scribe Distribution Across Manuscript Sections

Manuscript Section Primary Dialect Scribes Present
Herbal Mixed (A & B) Scribes 1, 2, 4, 5
Astronomical Voynich A Scribe 4
Balneological Voynich B Scribe 2
Cosmological Voynich B Scribe 3
Pharmaceutical Mixed (A & B) Scribes 1, 2
Recipes Voynich B Scribe 3

Note: The 'Mixed' designation reflects the anomalous interleaving of bifolia written by different scribes in different dialects within a single thematic section.

The Hands Behind the Split

The statistical divide Currier found is matched by a physical one – the handwriting.

Currier himself first noted two distinct styles. On the pages he labelled Language A, the script was neat and widely spaced. On the pages he called Language B, it was tighter, more slanted, and less consistent. To his eye, two different people were writing. He called them Scribe 1 and Scribe 2.

More recent and detailed palaeographic analysis by Dr Lisa Fagin Davis has confirmed and expanded this picture. Applying the principles of studying historical handwriting to the script’s letterforms and habits, she identified five distinct scribes across the codex.

The key point is where those five land on the A/B divide. Scribes 1 and 4 write in Voynich A. Scribes 2, 3, and 5 write in Voynich B. No single hand has been shown to write both dialects. The visual differences are not minor. Scribe 1 has neat, even lines that sit flat on the ruling. Scribe 2 presses text upwards in a cramped slope. The clustering of hands by dialect is the strongest link we have between the statistics and the physical object.

This was not a random division of labour. In medieval scriptoria, multiple scribes often shared out portions of a book. What we see here is stranger… a group of scribes copying within one project, yet split down the middle into two incompatible systems of writing.

The Contradiction Log

Several tensions in the record drive the rest of the debate. Each one forces a choice about how the book was made.

  • One book, two systems: A physically coherent codex, with consistent vellum and pigments, shows two distinct statistical systems in its text. Both are internally consistent, yet they share a large amount of common vocabulary.
  • Sharp split, soft edges: The A/B contrast is clear in many places, yet a handful of folios sit between the poles on standard measures. These intermediate pages look more like a transition or a continuum than a hard wall.
  • Topic does not rescue it: If dialects mapped to subject, the Herbal section would not mix A and B so intimately. It does. The Balneological section, being uniformly B, shows that content can correlate locally, but it is not a general rule across the book.
  • Five hands, two dialects: Several people wrote this book. They cluster into two dialect families, not five individual styles. That implies shared rules and coordination, not just personal habit.
  • Vocabulary asymmetry: Voynich B seems to contain almost all of A’s common words plus sets of forms that A never uses. ‘chedy’ is the poster child for a B-only high-frequency word. If A and B were two unrelated languages, we would expect sharper, more symmetrical divergence.

The Central Paradox

The manuscript presents a core contradiction: it is a single, physically coherent object with uniform vellum, inks, and artistic style. Yet, its text is fractured into two distinct statistical systems, produced by at least five collaborating scribes.

Weighing the Working Theories

By now, the pattern is clear. Two statistical ‘languages’, five scribes, interleaved bifolia, and a few pages that sit between A and B. Three explanations are usually put forward. Each solves part of the puzzle and strains against another part.

Theory A: Multiple Authors

  • Claim: Different scribes used different norms of Voynichese, producing the A and B dialects.
  • For: The handwriting evidence is hard to argue with. Five distinct hands have been identified, and they cluster by dialect. Scribes 1 and 4 write in A; Scribes 2, 3, and 5 write in B. No hand crosses over. That is the cleanest line in the whole manuscript. It fits what we know of fifteenth-century workshops, where several scribes often split a job. If you start from the page, not from the statistics, this is the model that matches what you can actually see.
  • Against: The language data do not split so neatly. Voynich B looks like a superset of A. Words common in A also occur in B, but B carries additional forms that A never uses. That asymmetric relationship is hard to square with two unrelated systems used independently by separate people. It implies a shared rulebook or a sequence in which one system expanded on the other. If five people produced only two tightly bounded styles, where did that shared constraint come from, and who enforced it?
  • Verdict on A: Best fit for the visible hands and the practical reality of a workshop, but it needs a coordination mechanism to explain the vocabulary asymmetry and the discipline across multiple scribes.

Theory B: Evolving Method

  • Claim: One author, or a single guiding intelligence, changed the way the text was generated partway through. In simple terms, the system evolved from A into B.
  • For: The vocabulary asymmetry points this way. If B extends A, an evolution is exactly what you would expect. The ‘continuum’ pages that sit between pure A and pure B also make sense as a transition. B pages are typically more verbose, which could be the by-product of new rules or tokens added to the generation method. If you treat Voynichese as a cipher system rather than an unknown natural language, an incremental change in rules neatly explains those statistics.
  • Against: The handwriting does not follow. Scribe 1’s neat, horizontal hand writes A. Scribe 2’s cramped, slanted hand writes B. If this were a single-person changing method, we would still expect the same hand to show up on both sides of the transition. It does not. The bifolium-by-bifolium distribution also suggests parallel work by different people rather than one writer moving forward in time with a new system. In short, the palaeography breaks this theory’s spine.
  • Verdict on B: Best fit for the statistics, but it collides with the fact of multiple, stable hands tied to one dialect each.

Theory C: Content-Specific Language

  • Claim: The dialects were chosen by topic or function. The scribes switched mode when the subject changed.
  • For: Some sections are internally consistent. The Balneological pages are all B. Several recipe pages are B as well. Information theory studies have shown that certain keywords cluster by section, which shows some coupling between words and pictures. If you only sampled those areas, you might believe the topic drives the choice of dialect.
  • Against: The Herbal section breaks the model. It is one topic split across both dialects, sometimes on facing pages, sometimes interleaved by bifolium. Parts of the Pharmaceutical section show the same problem. If subject matter drove language choice, Herbal should not be mixed. It is. That makes the topic a weak predictor at best.
  • Verdict on C: Explains local pockets of uniformity, but fails on the biggest section of the book.

Cross-Check: How Theories Align with Hard Evidence

Hard Evidence Theory A: Multiple Authors Theory B: Evolving Method Theory C: Content-Specific
Five scribes in two families Fits Does not fit Agnostic
Vocabulary asymmetry (B contains A) Needs coordination Fits No leverage
Interleaved bifolia in one section Fits (with rebinding) Struggles Contradicted
'Continuum' or intermediate pages Tolerable Predicts them Not addressed
Uniform Balneological 'B' block Accommodates Accommodates Fits

Where this leaves us

If you force a single winner, Multiple Authors remains the most directly supported because it matches the hands on the page and the way the bifolia are arranged. It does not, on its own, explain why B looks like an expansion of A, or why several scribes keep to just two tightly bounded systems with no crossover. To bridge that gap, you need a coordinating element.

The least bad synthesis looks like this. A designer or lead scribe defined a method that produced A. The workshop copied bifolia in parallel. At some point the method was extended, producing B, and some scribes worked only with that extension. Rebinding and shuffling of unnumbered sheets then interleaved A and B in places where they never sat together in the original working piles. That hybrid model keeps the palaeography honest, gives the statistics a plausible cause, and uses known codicological events to explain the present layout.

It still leaves a blunt question on the table. If there was a rulebook, where is it? Until someone finds marginal notes, a colophon, or a bilingual crib, we are reading the fingerprints of a system without seeing the system itself.

Analysis of Working Theories

Theory Evidence For Evidence Against
A: Multiple Authors The palaeographic evidence is direct: at least five distinct scribal hands are present, and they cluster perfectly with the two dialects. This aligns with common 15th-century workshop practices. It struggles to explain the systematic linguistic relationship where Voynich B's vocabulary appears to be a superset of A's. This implies coordination, not independent work.
B: Evolving Method This model neatly explains the linguistic evidence, such as the vocabulary asymmetry and the existence of 'intermediate' pages, as natural results of a system changing over time. It is directly contradicted by the physical evidence of multiple, stylistically distinct scribal hands. One person's handwriting is unlikely to change so dramatically and consistently.
C: Content-Specific Language It finds support in some sections that are linguistically homogenous. The Balneological section, for example, is written entirely and consistently in Voynich B. The theory fails completely in the manuscript's largest section, the Herbal, which is thematically uniform but contains a chaotic mix of both A and B dialects.

The Missing Pieces

For all the precision in separating Voynich A from Voynich B, there are holes in the record that no amount of statistical analysis can close. This section collects the evidentiary gaps that block a clean conclusion.

No Colophon or Work Notes: There is no scribal note to explain who worked on the book or why the script shifted from one mode to another. Medieval manuscripts often carry a colophon at the end, where a scribe would record their name, date, or patron. The Voynich has none.

No Bilingual Key: With Egyptian hieroglyphs, the Rosetta Stone gave scholars a parallel Greek text to work from. No such text exists for the Voynich. Neither Voynich A nor B can yet be tied to any known language.

Uncertain Original Page Order: The book has been rebound. The current page order may not be the original, and that matters when trying to track the A-to-B transition.

Missing Folios: At least fourteen folios are missing, some from likely transition points between sections. They may have contained key evidence, but we cannot recover what is gone.

No Forensic Differences: Ink and parchment analysis shows the same production across A and B. The pigments, handwriting tools, and vellum batches all point to one workshop. The schism lies in the writing itself, not in its materials.

The Text is Unreadable: The sharpest problem remains: we cannot read a single word. Without knowing if A and B encode different languages, or if they are two cipher modes for the same underlying text, every theory rests on patterns, not meaning.

Why the A/B Split Remains Unsolved

Starting Point

What is the origin of the two distinct dialects, Voynich A and Voynich B?

Avenue 1: Textual Analysis

Attempt to translate the dialects or find a linguistic relationship between them.

Blockage

Dead End: No Bilingual Key. The text remains unreadable. Without a translation, we cannot determine if A and B encode different languages, are cipher variants, or serve different functions.

Avenue 2: Physical Analysis

Reconstruct the original page order to understand how and when the transition from A to B occurred.

Blockage

Dead End: Uncertain Page Order & Missing Folios. Historical rebinding has shuffled the bifolia, and at least 14 folios are missing from critical transition points. The original sequence is lost.

Conclusion

All investigative paths are blocked by fundamental gaps in the evidence. The 'why' behind the A/B split cannot be resolved with the current record.

Sources

Sources include: Prescott Currier’s declassified NSA presentation on the A/B split (1976); the extensive collation data, language pages, and transcription resources curated by Rene Zandbergen; palaeographic work by Dr Lisa Fagin Davis identifying five distinct hands and analysing the manuscript’s construction; the Yale Beinecke Library’s digital collection and catalogue entry for MS 408; and published computational analyses of word co-occurrence and vocabulary asymmetry.

What we still do not know

  • What the text encodes: natural language, a cipher with rule changes, or a generative system with no plaintext. No translation exists.
  • Whether a clean A→B transition point exists within a single writing unit. All observed switches so far occur at folio or bifolium level.
  • How much of the split survives across different transcription schemes when run on identical page sets.
  • Whether the present interleaving is mostly a rebinding artefact or whether some alternation reflects the original working order.
  • Who trained whom: a master-apprentice pattern would fit the vocabulary asymmetry but has no documentary support.
  • The role of the minor hands: whether Scribes 3, 4, and 5 are purer forms of A or B, transitional writers, or late additions to the project.

Comments (0)

Leave a Reply

Your email address will not be published. Required fields are marked *

Back To Top