Monday, November 24, 2025

AI: POCT and Shannon Information Theory (with equations)

 Think of this not as “faster lab” but as two different communication protocols between the latent disease state and the treatment decision.

I’ll use simple Shannon-ish language (states, channels, noise, erasures, information rate) and then layer in modern decision-theoretic ideas (value of information, sequential design).


1. A simple information-theoretic model of the encounter

Let:

  • D = true disease state (one of 3–4 plausible diagnoses)

  • S = signs/symptoms you see on Monday (history, exam)

  • T = lab test results

  • A = action/therapy chosen

  • U(A, D) = clinical utility (outcome/payoff) of choosing A when the true state is D

In Shannon terms, you start with prior uncertainty H(D).
After you observe symptoms S, uncertainty drops to H(D|S).
After you observe lab results T, it drops to H(D|S, T).




From modern decision theory, the value of the lab is not just the entropy reduction; it’s the increase in expected utility:

[
\text{VOI} \approx \mathbb{E}[U(A^(S,T),D)] - \mathbb{E}[U(A^(S),D)]
]

where (A^*(\cdot)) is your optimal decision rule given what you know.

Both the central lab and POCT can have identical analytic performance, i.e. same mutual information I(D;T|S). The test itself doesn’t change. What changes is the channel architecture in time:

  • Scenario 1 (central lab):
    ( D \to S \to \text{Visit}1 \to \text{Order} \to T{next_day} \to A_{phone} )

  • Scenario 2 (POCT):
    ( D \to S \to \text{Visit}1 \to T{same_visit} \to (S_{extra}, T_{extra}) \to A_{in_person} )

Same test, different protocol.

The non-trivial advantages of POCT come from how that protocol reshapes:

  • Where the noise is

  • How many times you can use the channel

  • How much information you can extract from the patient–doctor interaction conditional on the test result

  • How much value is lost to delay, dropout, and evolving disease


2. Adaptive dialogue: POCT enables sequential experimental design

In the “next-day lab” world, information flows one way:

  1. At 3 p.m. Monday you see S and choose a test set.

  2. At noon Tuesday you get T, alone in your office.

  3. You compress your entire decision process into a brief phone call or portal message.

Crucially, T does not feed back into further observation of the patient at the moment of interpretation. You can’t say, “Hmm, with this troponin, let me look again at your chest, or ask that one extra question.” The lab result is decoupled from the rich data stream of the encounter.

With POCT:

  1. 3 p.m.: you see S, form a differential, order a test.

  2. 3:45 p.m.: you now have T while the patient is still in front of you.

  3. You can adaptively acquire S₂: targeted questions, focused exam, maybe a second test based on T₁.



From an information-theory point of view, you’ve turned a single shot channel into a sequential, feedback-driven experiment. The mutual information you care about is not just I(D;T), but:

[
I\big(D; S, T_1, S_2(T_1), T_2(T_1,S_2), \dots\big)
]

That quantity can be strictly larger than what you get from the same first test alone, because:

  • The first result guides where you look next.

  • You spend your remaining “observation budget” in regions of the state space that are maximally discriminative (classic sequential design / active learning).

So POCT doesn’t merely move T earlier. It changes the structure of the observation process from:

“single batch of tests → one-off decision”

to

“test → conditional interrogation of the patient → maybe another test → decision”

Which is exactly what modern information theory calls a feedback channel rather than a memoryless one.


3. Reducing “erasures”: patients as an unreliable return channel

In Shannon’s world, an erasure is when a symbol simply never arrives: you know something was supposed to come, but it’s blank.

In the central lab scenario, several key links are erasure-prone:

  • The lab results come back, but:

    • you’re busy,

    • they’re filed into an inbox and not acted upon promptly, or

    • a call is attempted but the patient doesn’t answer, or ignores the voicemail, or the number is wrong.

  • Even if the call is made, the instruction transmission (take this antibiotic, go to the ED if X) is noisy and may not be encoded clearly or decoded correctly.

In effect, the entire decision and its rationale may be lost in the noise of real life.

POCT greatly reduces the probability of these erasures:

  • The physician and patient are co-located at the time of interpretation.

  • The probability that the diagnostic and therapeutic “message” is both sent and acknowledged is far higher.

From an information-theoretic perspective, the difference isn’t just Δ(time). It’s:

  • Central lab workflow ≈ channel with delay + nontrivial erasure probability between result and implemented action.

  • POCT workflow ≈ low-delay, low-erasure channel, because the test, interpretation, and commitment to action happen in the same tightly controlled session.

If you integrate that over millions of encounters, the expected information actually used to shape treatment is higher with POCT, even if the nominal test characteristics are identical.


4. Protecting the signal from cognitive and contextual noise

Human memory and attention are very noisy channels.

In the next-day model, when you read the result at noon, you are reconstructing:

  • Your memory of the patient’s story,

  • The subtle visual and affective cues,

  • Your sense of how sick they “felt” to you at the time.

You’re doing a rough lossy decompression of the clinical encounter from a few lines of notes, then combining that compressed representation with the lab value.

That’s a high-distortion reconstruction step. Important bits of S may have been discarded, and your recollection is now mixed with everything else you’ve seen that morning.

With POCT:

  • You’re viewing T in the full, high-fidelity context of the live patient.

  • If something in the result seems “off,” you can immediately test your mental model against reality (“That CRP seems high; let me double-check your abdomen”).

  • You can probe ambiguity in real time (“These results suggest condition X; does that fit with your experience of symptom Y?”).



So in Shannon language, POCT raises the effective signal-to-noise ratio in the composite channel:

[
D \to (S, T) \to \text{Physician’s internal state} \to A
]

Because the mapping ( (S,T) \mapsto A ) is executed while S is still a high-resolution, low-noise signal in your working memory, not a degraded trace.


5. Time-sensitive value of information: the disease is a dynamic source

Classical Shannon treats the source as static for a given message block. Clinically, D(t) evolves.

If the disease has meaningful dynamics over 24 hours–48 hours (sepsis, acute coronary syndrome, early meningitis, even some outpatient infections), then:

  • The same entropy reduction H(D|S,T) has different clinical value depending on when it occurs.

  • Modern VOI frameworks make this explicit by discounting information by delay: a test that arrives after a critical transition (e.g., infarction completed, infection disseminated) is less valuable than one that arrives before.

POCT shifts information earlier on the disease trajectory. It allows you to:

  • Move the decision boundary upstream of irreversible changes.

  • Initiate early therapy that alters the future state trajectory (D(t)) before the system passes a “point of no return.”

You can think of it as increasing the information rate relative to the disease’s characteristic time scale:

  • Central lab: maybe one useful decision cycle per 24 hours.

  • POCT: decision cycle compressed into minutes, so you can even re-sample if needed (e.g., repeat troponin, repeat lactate).

From a control-theory viewpoint, POCT improves both the latency and bandwidth of the feedback loop controlling a dynamic system (the patient’s physiology). Same number of bits; deployed early, they exert more control.


6. Better coding to the patient: common knowledge and adherence

There’s another “channel” here: physician → patient.

  • In the next-day model, you call Tuesday and say, “Your test was positive; start this treatment.” The patient is at work, half-listening, perhaps anxious, perhaps skeptical. The encoding is rushed and the decoding is partial.

  • In the POCT model, you sit together and look at the result. You explain the meaning, let the patient ask questions, maybe print or display the result.

Information theory doesn’t usually worry about semantics, but modern decision-and-behavioral frameworks do:

  • The probability that the patient truly updates their internal model of their health state (call it (P(\text{belief update}))) is higher when the message is delivered with richer coding: verbal explanation + visual result + interactive Q&A.

  • Adherence is essentially the probability that the action A you intend is the action A’ they actually implement. POCT makes A ≈ A’ more often.

From an information perspective, you’re:

  • Increasing the mutual information between your intended treatment plan and the patient’s behavior.

  • Reducing the behavioral “noise” between the order and the real-world implementation.

Again, nothing to do with “plus/minus 24 hours” per se; it’s about coding and decoding quality in the patient–physician communications channel.


7. System-level bits: throughput, triage, and network information

If you zoom out to the clinic or health system as the “receiver,” POCT alters the network information dynamics:

  • Triage and routing: A same-visit result enables immediate routing (admit vs discharge, specialist referral, isolation vs no isolation). That is, the system can classify patients into the right “queues” with fewer ambiguous cases clogging intermediate states.

  • Queue stability: In queuing theory, delayed information about job type leads to inefficient scheduling. Real-time classification (POCT) reduces the entropy of the queue composition, enabling more stable throughput.

Shannon-wise: you’re increasing the information available to the scheduler (front desk, bed manager, nurse team) at the time they must make allocation decisions. That can reduce system-level “congestion noise” (misallocated resources, unnecessary revisits), even if at the patient level you’ve only moved the test up by hours.


8. When POCT is not better in info terms

For completeness, there are regimes where POCT is not an improvement from a strict Shannon/decision standpoint:

  • If the POCT device has meaningfully worse sensitivity/specificity, its mutual information I(D;T_POCT) may be lower than I(D;T_lab).

  • If the POCT environment introduces new noise (poor training, frequent operator error, bad quality control), then conditional on S you may have more residual uncertainty H(D|S,T_POCT) than with lab-based testing.

  • If the clinical situation is not time sensitive and follow-up is robust (e.g., stable outpatient thyroid testing in a highly adherent population), then much of the advantage in terms of erasures, dynamics, and feedback evaporates, and you’re left mainly with cost and logistics.

So the information-theoretic argument for POCT is strongest where:

  • The disease dynamics are nontrivial over short time scales;

  • Follow-up channels (phone, portal) are unreliable;

  • The initial encounter is a rich information environment that benefits from feedback-driven refinement; and

  • The POCT device’s analytic information content is comparable to the central lab.


9. How you might phrase this for a XYZ-style white paper

If you ever want to operationalize this for your client, the punchlines could be framed roughly as:

  • POCT is not just “faster lab”; it is a higher-value protocol for using lab information. It converts a one-way, delayed message into a live, feedback-driven interaction between patient, physician, and test.

  • That protocol:

    • Increases the effective information yield per encounter via sequential, targeted questioning and testing.

    • Reduces information loss from missed follow-up and patient non-contact (erasure channel).

    • Protects diagnostic signal from cognitive/contextual noise by interpreting results in the full fidelity of the live encounter.

    • Aligns the timing of information with disease dynamics, so the same bits lead to higher expected clinical utility.

    • Improves the fidelity of communication to the patient, raising the mutual information between physician intent and patient behavior.

All of which are much stronger claims than “the result is back before lunch,” and they’re quite defensible in a Shannon-plus-modern-decision-theory frame.