Dead Reckoning

In 1976, a Micronesian navigator named Mau Piailug sailed the Hōkūleʻa, a traditional Polynesian voyaging canoe, from Hawaiʻi to Tahiti. No compass. No sextant. No GPS. He read the ocean. The angle of the swells told him where land was. The color of the clouds told him what was underneath them. The flight paths of birds told him how far he was from shore and in what direction. He held a mental model of the Pacific Ocean that was precise enough to cross 2,500 miles of open water and make landfall.

Mau grew up on Satawal, a half-mile-wide coral atoll in what is now the Federated States of Micronesia, where traditional navigation was still taught and practiced. The knowledge wasn’t mystical. It was empirical. Accumulated across generations of navigators who paid attention to what the ocean was actually doing and built a working model of the system from direct experience. But by the 1970s, long-distance voyaging knowledge had become dangerously thin across much of the Pacific. In Hawaiʻi, the tradition the Hōkūleʻa was built to recover had nearly disappeared. Mau was one of the few living masters who could teach it back into practice.

The voyage wasn’t a stunt. It was a proof of concept and a rescue mission. Western navigation had instruments. The instruments worked. Why would anyone learn to read the ocean when you could read a dial? The answer, the one Mau spent the rest of his life demonstrating, was that the instruments and the practice are not the same thing, and when you lose the practice you lose the ability to know when the instrument is wrong.

A compass will tell you which way is north. It will not tell you that the current has been pushing you southwest for six hours and your position is not where your heading says it should be. A GPS will give you a coordinate. It will not tell you that the coordinate doesn’t account for the reef that formed since the chart was last updated. The instrument is a tool. The practice is knowing what the tool doesn’t show you.

The navigators who relied solely on instruments were perfectly fine right up until they weren’t. The ones who combined instruments with wayfinding practice could catch the errors the instruments couldn’t surface on their own. That’s not romanticism. That’s the operational case for maintaining the craft alongside the technology that’s supposed to replace it.

We are in the instrument-dependency phase of our relationship with AI. And the wayfinding tradition is thinning with every graduating class that learns to prompt but never learns to evaluate.

These essays didn’t start as a plan. They started as frustration. Thirty years of building, watching, and working with technology, followed by a career transition that left me with time and a compulsion to say something. The problems I kept writing about weren’t industry-specific. They were structural.

The craft knowledge required to evaluate AI output is being eliminated at the same time AI output is being trusted with higher-stakes decisions. The people who know what good work looks like are being treated as obstacles to deployment rather than as essential infrastructure. The economic incentives are accelerating this. The systems themselves have failure modes that are invisible until someone has learned to recognize them.

That’s the problem. The practice is learning to read the water.

There is an emerging discipline that doesn’t have a name yet. It’s not prompt engineering, though it involves prompting. It’s not AI safety research, though it overlaps. It’s not data science, though it borrows tools. It’s the practice of working with probabilistic systems closely enough, critically enough, and long enough to develop genuine judgment about what they’re doing and where they break.

The people doing this work are scattered across research labs, creative studios, and personal projects. What they share is that they’ve moved past the two default postures most people adopt toward AI: uncritical enthusiasm and reflexive rejection. They use the systems. They don’t trust the systems. They’ve developed heuristics for when the output is reliable and when it’s not, and those heuristics came from practice, not from reading a manual.

Every significant tool transition produces this population. When photography emerged, the people who did interesting work with it weren’t the painters who dismissed it and they weren’t the technicians who merely operated it. They were the practitioners who understood both the medium’s properties and its limitations well enough to make deliberate choices about when to trust the apparatus and when to override it. That dual literacy was the craft. The camera was just the instrument.

The same structure is forming around AI. The people getting genuinely good results aren’t writing longer prompts. They’re building mental models of the system’s behavior through repeated observation. They’ve learned where the attractor basins are. They know which kinds of requests produce reliable output and which kinds produce confident garbage. They’ve been burned enough times to develop a reflexive skepticism that doesn’t prevent them from using the tools but does prevent them from trusting the tools’ self-assessment.

That skepticism is the craft. Everything else is technique.

One practice that demonstrates this well: running the same question through multiple models and paying attention to where they converge and where they diverge. Convergence doesn’t mean they’re right. It means the training data overlaps. Divergence doesn’t mean one is wrong. It means the models learned different things, and the disagreement is where the interesting information lives.

A draft goes through adversarial review across multiple frontier models. Not for copyediting. For stress-testing. Each model brings different training biases, different blind spots, different strengths. When three models agree on a factual claim, you have a little more confidence. When they disagree, you know where to dig. When one model flags a logical gap the others missed, that flag is usually worth more than the original draft.

This is not a novel technique. Researchers do something similar with ensemble methods. Journalists do it with multiple sources. Intelligence analysts do it with competing hypotheses. The principle is ancient: triangulation produces better positioning than any single bearing.

What’s new is that the instruments are cheap enough and fast enough that a single practitioner can run a meaningful cross-validation in an afternoon. The infrastructure for doing careful work exists. The culture for doing it hasn’t caught up. Most people are still using one model, reading the output, and accepting it at face value, which is the navigational equivalent of checking your compass once and never looking at the water.

I want to be careful here, because the last thing this conversation needs is another person claiming they’ve figured it all out. I haven’t. I’m a practitioner sharing observations from practice. Some will turn out to be wrong. The appropriate confidence level is: this seems to be what’s happening based on what I’ve seen, and here’s what I’ve seen. Not more than that.

That willingness to sit with uncertainty rather than collapsing it prematurely into certainty is itself part of the discipline. LLMs produce confident, well-structured, declarative output. They don’t hedge well. The gravitational pull of the training data is toward authority, toward clean answers, toward the resolved and settled. The practitioner’s job is to supply the uncertainty the system won’t.

That’s the human job in this partnership, and it’s the job that’s hardest to see because it looks like nothing. It looks like pausing before you accept the answer. It looks like running the same question a second time to see if you get the same response. It looks like noticing that the output feels too clean and investigating why. It doesn’t make for a good demo. But it’s the difference between navigation and drift.

There’s a structural irony running through this entire series. The people best positioned to develop this discipline—deep domain knowledge, technical literacy, enough experience to know when something doesn’t smell right—are the same population being displaced by the technology itself. The engineers who spent thirty years automating other people’s jobs. The skilled practitioners whose craft knowledge is being declared unnecessary.

The market is telling them to leave the room. And the room is about to lose the only people in it who know how to read the water.

This is the Luddite problem restated in operational terms. The lacemakers weren’t anti-technology. They were the quality control. When you eliminate the people who know what good work looks like, you don’t just lose those workers. You lose the ability to distinguish good output from bad output. And in a world where the machines produce confident, fluent, structurally sound bad output, that distinction is everything.

There’s something running underneath all of this that deserves direct statement: the people making deployment decisions are often not the people who understand the failure modes.

The executives, the policymakers, the venture capitalists on the podcast talking about “all lawful use”—these are not stupid people. That’s not the problem. The problem is that many of them understand these systems at the level of vendor demos, product decks, and podcast abstractions. The people who understand them at the level of attractor basins, probability distributions, and domain-specific evaluation are usually somewhere else. They are using the tools. They are finding the reefs. They are not steering the ship.

I don’t have a clean institutional answer for that. The arguments made across this series—cooperation over control, representation as mechanism, the economics of displacement—all stand. But they don’t add up to a policy platform. They add up to a picture of the problem that is more honest than the one most people are working from, and an argument that the practice of staying engaged and continuing to develop genuine understanding is not optional. It’s the infrastructure.

Mau Piailug didn’t sail to Tahiti to prove that instruments were bad. He sailed to prove that the practice was real, that it worked, and that it would die if nobody kept doing it. He spent the next thirty years teaching. Not because traditional navigation was going to replace GPS. He taught because the knowledge had value beyond the technology it preceded. Because knowing how to read the water is what tells you the compass is lying.

He couldn’t teach what he knew from a textbook. The knowledge was embodied. It came from time on the water, from developing the perceptual vocabulary that lets you distinguish one kind of swell pattern from another in real time, at night, in conditions that all look the same to an untrained eye. You had to do the sailing.

This discipline, the one forming around AI, works the same way. You can read about attractor basins. You can understand, intellectually, that these systems are probabilistic and that their confidence is not evidence of their accuracy. But the actual craft—the ability to feel when an output is wrong before you can articulate why, the mental model of where the system is likely to break on this particular kind of question—that comes from practice. From hours with the systems. From getting burned and paying attention to what burned you.

I can’t give you that in an essay. Nobody can give you that in a course. The ten-session prompt engineering bootcamps are teaching people how to read the compass. They are not teaching them how to notice the current. That’s fine as a start. But it’s not wayfinding.

Here’s where I’ll end this. Not with an answer. With a position.

We are in the early years of a transition that will restructure how knowledge work, creative work, and institutional decisions get made. The systems driving it are more capable and more dangerous than the public conversation acknowledges, but the danger I’m talking about is not the dramatic sci-fi scenario. It is the quiet, structural, invisible failure modes that only practitioners encounter. The people who develop genuine literacy about these systems—not just technical literacy, but the craft judgment that comes from sustained critical practice—are going to be the essential infrastructure of whatever comes next. Not because they’ll control the systems. Because they’ll be the ones who can tell you when the systems are wrong.

And we are currently, systematically, economically, telling those people to leave the room.

That’s the thread running through all ten of these essays. The lacemakers knew the lace was getting worse. The wayfinders knew the practice was dying. The people who understand these systems know that the outputs need human judgment applied by humans who understand what they’re judging.

None of those people were listened to in time.

The practice is: stay in the room. Learn the water. Don’t trust the compass on its own. Teach anyone who will listen. And when the instrument fails and someone asks how this happened, be the person who can show them where the current was pulling all along.

Bryan C. is a technology executive and writer based in Phoenix, AZ.