Three-panel infographic. Left panel labeled REALITY, an intricate forest with dense leaves, bark, roots, and insects. Center labeled FINITE CAPACITY, a narrow vertical aperture between two arrows. Right panel labeled WHAT YOU SEE, a sparse version of the same forest with simplified tree shapes, a bird, a sun, and a face icon.
May 17, 2026 · 4 min read

Compression Engines

Six Essays on Compression · I · Why everything we see is already a compression

By Sunny Harris, MD

Before any message is compressed, the world is.

Stand in a forest for one minute and try to take it in. Every leaf has veins. Every vein has cells. The bark has scales. The light changes as the wind moves. The wind itself is made of pressure waves you cannot count. Insects you cannot name are doing things you cannot see. The smell is hundreds of molecules. The ground beneath your feet is a fungal network deeper than your body. There is no scale at which the detail stops.

You cannot take it in. Not because you are insufficient, but because reality is not bounded the way you are. The world has, for practical purposes, inexhaustible detail. You have a few pounds of brain. You have finite sensors, finite attention, finite memory, finite time. The math does not work.

This holds for any bounded agent, not just humans: an organism, a clinician, a hospital, a model, a machine. If the world is larger than the system representing it, representation must be selective.

So the brain throws almost all of it away. It keeps edges, motion, the few colors and frequencies that mattered to our ancestors, faces, a tiny window of focus. The forest you "see" is already a compressed file; the work was done before you noticed. Perception is a codec, not a window. Rules for what to keep, rules for what to put back, running so fast you mistake it for raw input. The retina alone compresses on the order of a hundred million photoreceptors into about a million ganglion cells before anything leaves the eye, in ways the efficient-coding tradition has been formalizing since the 1960s (Barlow, 1961; Olshausen & Field, 1996).

A note on the word compression, since I use it broadly in this series. I mean any lossy transformation from a larger reality into a smaller representation, one you can store, transmit, reason over, or act on. A handoff, a memory, a chart note, an equation, an AI summary: all compressions in this sense. The file-size sense is one instance of the broader pattern, not the whole of it.

The narrower claim is this. None of those systems can begin without first deciding what to preserve, what to discard, and what form the remainder must take. Compression is deletion plus structure. The discarding is not the work; the work is choosing what to discard and what shape the remainder takes, so the receiver can rebuild what matters.

The family is wider than the word usually suggests. Selection keeps some variables and ignores others. Attention allocates bandwidth to a small part of the world. Abstraction replaces many particulars with a handle. Categorization maps continuous variation into usable types. Modeling preserves some relations while discarding others. Prediction and inference use a smaller model to stand in for missing reality. These differ in mechanism. They share one constraint: a bounded system must reduce more reality than it can hold into something it can use.

The same is true of memory. You cannot store a year of your life; you store fragments. A few scenes, a few feelings, a story that lets you reconstruct the rest plausibly (Schacter, Addis & Buckner, 2007). Your autobiography is a compressed file too, decompressed on demand into something that feels continuous.

The same is true of language. You cannot say everything about the patient, the case, the day; you say enough that the next person can reconstruct the rest. The same is true of science. A map cannot be the territory; no map could be. A model cannot be the system in full. An equation is a compression of behavior across infinite cases into a finite expression you can hold in your head.

This is why a clinical handoff works at all. When a physician compresses a patient into nine words for the next doctor on shift, they are not being terse. They are compressing because they have to. The patient was never available in full to them either. Both ends of the handoff are already running compression. The message between them is a third compression, tuned to the priors of the receiver. Communication is what happens when two compression engines agree on a shorthand for what they share. We talk about messages as if they cross between full minds. They don't. They cross between two already-compressed substrates, with the message itself compressed again to fit the bandwidth between them.

We tend to treat this as a defect, if only I could see the whole picture, hold it all at once. But the picture cannot be held. Anything that holds requires a holder, and any holder is finite. Compression is the work itself, not a defect in the work. There is no view from nowhere; there are only compressions, each made by some finite thing trying to fit some piece of the world into the room it has.

We tend to call ourselves thinking creatures. It might be more accurate to call ourselves compression engines that experience their own compression. The forest you remember is a thumbnail. The friend you love is more than any model you hold of them, but you still meet them through one, a compression built from years of contact. The self you call "I" is a story compact enough to survive in a few pounds of tissue.

There is loss in any compression. The alternative is no representation at all, not "no compression." The choice is never compression or no compression. The choice is which compression, and what gets thrown away to make the rest fit.


Six Essays on Compression · Preface · I · II · III · IV · V · VI · Coda · Postscript