The Unearth Heritage Wiki

Lexicon of Archaeobytology

A comprehensive archive of terms and concepts from the discipline of Digital Archaeology—excavating meaning from the digital dust.

I. The Foundation
II. The Two-Part Taxonomy
III. The Three States of Triage
IV. Cultural Frameworks & Lost Ecosystems

The Three Pillars

Declaration ("I Am"), Connection, and The Ground—the foundational principles of the hand-built web.

Declaration

The First Pillar: The ability to assert one's identity and existence without permission.

Connection

The Second Pillar: The right to communicate directly without platform mediation.

The Ground

The Third Pillar: Ownership of the infrastructure where one's data resides.

The README

An evergreen ritual—direct creator-to-user communication bypassing algorithmic mediation.

GeoCities

The canonical example of the Digital Homestead and the Umbrabyte ecosystem.

The Guestbook

A high-friction act of acknowledgment—a Conceptual Petribyte.

The Webring

A curated, circular network of related websites—human curation over algorithms.

The Away Message

The fossil of a lost social contract: it was acceptable to be "away."

The Blogroll

A public declaration of "these are the voices I read and respect."

Anonymous Void

The condition Web1 produced by omitting a native identity layer where users existed as addresses rather than persons.

Excavating members.aol.com/ajjvelasco

A case study in archaeobytology: recovering a 1997 AOL homepage using AI-assisted scripts and contextual reconstruction.

Digital Homestead

An independently owned and maintained online space, in direct contrast to rented platform land.

IndieWeb

A people-first web movement centered on data ownership, open standards, and personal domains.

Read-Only Civilization

A social condition where most users consume but cannot publish independently.

The Away Message

A social ritual encoding asynchronous presence and communicative boundaries in early network culture.

V. Methodology & Practice

The Trowel

The Archaeobyte as the first tool—what allows practitioners to see artifacts in the dust.

The Microscope

The act of analysis in The Triage—turning a "find" into an insight.

Forensic Materialism

The study of file formats, metadata, and storage media—not just content.

Context Collapse

The crisis of the second era—abundance without meaning.

The Crisis of Noise

A digital dark age not of loss, but of overwhelming, uncurated preservation.

Custodial Filter

The mandatory ethical framework applied before preservation (Privacy, Legality, Ethics).

Right to be Forgotten

The principle that individuals should have agency over their digital past.

Orphan Work

A copyrighted artifact whose owner cannot be identified or located.

Ethical Preservation

Balancing historical value with individual privacy and consent.

Platform Murder

When a biological platform is killed by its corporate owner, creating instant mass extinction.

Site Reconnaissance

The mandatory preliminary phase of digital excavation: mapping the site before digging.

Stratigraphic Analysis

Analyzing a digital ecosystem as a series of deposited layers (Content, Metadata, Code).

Triage Workflow

The eight-phase protocol for moving artifacts from discovery to preservation.

Field Report

The standardized documentation template for all excavations.

Pre-Flight Checklist

The six mandatory checks before beginning an excavation.

Digital Forensics

The scientific recovery and analysis of digital artifacts (The "Crime Scene" approach).

Frictional Data

The material traces (glitches, padding, headers) that reveal a file's history.

LOCKSS

Lots of Copies Keep Stuff Safe—the peer-to-peer model of distributed preservation.

Sovereignty Audit

Sovereignty Audit

The diagnostic tool for assessing infrastructure resilience and capture risk.

Right to Rest Protocol

The 7-point ethical framework for resisting Necro-Capitalism.

Byte Stratigraphy

A diagnostic taxonomy classifying artifacts by their unified culturotechnical condition across layered strata.

Excavation

Methodological and technical recovery of data from obsolete, inert, or damaged media.

Forensic Workstation

Close-grain examination of a digital artifact's technical and cultural attributes as one object.

Fixity

Bit-level stability over time, usually verified by checksums and integrity proofs.

Migration

Preservation strategy of moving data across formats and environments to avoid obsolescence.

Identity Scrubbing

Forensic removal of persona traces from records to support safety, transition, or ethical redaction.

Two-Line Handshake

A minimal protocol pattern where two HTML lines are enough to signal identity and affiliation.

VI. The Archive & Preservation Paths

Memory Institutions

The collective term for libraries, archives, museums, and memorials in the digital age.

The Archive

The structured repository of excavated and triaged Archaeobytes.

Provenance

The documented history of an artifact's origin, custody, and modifications.

Sustainable Preservation

Building institutions capable of 50-year stewardship.

Archive Business Canvas

Strategic framework for designing resilient memory institutions.

Foundry Business Canvas

Strategic framework for designing sovereign businesses (Foundries).

The Seed Bank

Preservation path for Vivibytes—living artifacts maintained as working blueprints.

LOCKSS

Lots of Copies Keep Stuff Safe—the peer-to-peer model of distributed preservation.

The Fidelity Ladder

The 6-level framework for assessing preservation depth (from Documentation to Resurrection).

Evidence Archive

Preservation path for Umbrabytes—documenting lost ecosystems and contexts.

Wisdom Archive

Preservation path for Petribytes—fossils analyzed for the lessons they hold.

Emulation

Re-animating dead artifacts by simulating their original hardware environments.

Resilient Format

File formats that maintain integrity across epochs through simplicity and open standards.

Backward Compatibility

The "Don't Break the Web" principle—new systems supporting older artifacts.

Public Domain

The status of creative works that are no longer protected by copyright—the goal state of information.

Local-First Software

Architecture where the primary data copy lives on the user's device, ensuring longevity.

The Great Filter

Technological transitions that render entire classes of artifacts obsolete.

Dark Archive

A secure, non-public preservation vault used for continuity, fixity, and disaster recovery.

Digital Estate

The total set of a person's digital assets, identities, accounts, and domain holdings.

Digital Longevity

The sustained accessibility, interpretability, and authenticity of artifacts over long time horizons.

Evergreen Ritual

Preservation as cyclical maintenance rather than one-time storage or backup completion.

Cultural Seed

A small, resilient information unit containing enough context to regenerate a community or practice.

Gentle Deletion

Compassionate data removal when harm to living people outweighs archival value.

Copyright Gaps

Legal blind spots where culture disappears due to extreme copyright duration and orphan-work lockout.

VII. Philosophical & Scholarly Foundations

The Haunted Forest

The definition of digital preservation space: a liminal landscape of ghosts, zombies, and corpses.

Necro-Capitalism

💀 The extraction of value from the digital remains of the dead. The economics of the Thanabot.

Right to Rest

🛑 The ethical principle that digital remains generally have a right to non-disturbance.

Enshittification of Bereavement

📉 The degradation of grief services into engagement loops and monetization traps.

Thanabot

🤖 (Greek: "Death-Bot") An AI agent trained on digital remains to simulate a deceased person.

The Digital Bokor

🧙‍♂️ The practitioner who reanimates the dead for profit—the opposite of the Steward.

Media Archaeology

The academic discipline of excavating "discursive formations" and "epistemological strata."

The Archipelago Problem

The structural failure of digital preservation caused by the isolation of its practitioners.

Boundary Work

The rhetorical process of defining a discipline by clarifying what it is NOT.

Custodial Responsibility

The ethical obligation to preserve not just files, but cultural contexts.

Canon Formation

The curatorial process of deciding what is "significant" enough to save (and what is left to rot).

Generative vs. Tethered

Zittrain's framework: platforms enabling user innovation versus vendor-controlled ecosystems.

The Cathedral and The Bazaar

Eric Raymond's metaphors for centralized vs. decentralized software development.

The Tell

An archaeological mound containing stratified layers of human activity—the digital past as dig site.

Liminality

From Latin "limen" (threshold)—a state of being "betwixt and between."

Distributed Commons Governance

Managing shared digital resources without centralized control.

Ostrom's Principles

The 8 design principles for sustainable commons governance.

Economics of Sovereignty

The study of funding models that support (or destroy) user autonomy.

Political Economy of Ground

The study of who owns infrastructure and how they enforce power.

The Sovereignty Stack

The 6-layer framework (Physical, Network, Identity, Storage, App, Economic) of digital control.

Pluralism

The resilience strategy of mixing State, Market, and Commons infrastructure.

Enshittification

The degradation of platform quality to continuously extract value from users.

Pwned

The state of being compromised or structurally dominated by an external platform or infrastructure.

Rented Land

The realization that users never owned their digital homesteads—the GeoCities lesson.

Open Standards

Why some artifacts survive—non-proprietary formats outlive corporate control.

Protocol

The technical standard governing digital exchange—the DNA of the independent web.

Protocol Wars

The ongoing conflict between open standards and proprietary, centralized platforms.

Archive-Anvil Dialectic

The formal relationship between preservation and creation in Archaeobytological practice, where each function conditions the other.

Culturotechnical

A condition where technical operation and cultural meaning are constitutive of each other and cannot be separated.

Digital Materialism

The view that digital artifacts are materially conditioned by substrates, infrastructure, and format constraints.

Predicament

A human condition to be navigated rather than solved, unlike bounded technical problems.

The Inversion

When infrastructure meant to serve culture instead becomes the force that structures culture.

Technical Clergy

Authority class deriving power from procedural gatekeeping rather than substantive mastery.

Technical Enclosure

A nominally open medium that is effectively gated by specialist procedural literacy.

Relational Identity

Identity formed by verifiable networked relationships rather than centralized registry control.

Digital Afterlife Industry

The sector monetizing digital remains through memorial products and AI reanimation services.

Substrate Domain

The substrate-level domain concept tied to foundational protocol identity and alignment.

Glass Library

A commons visible to all but practically enterable only by technical specialists.

VIII. Movement & Strategy

IX. The Public Intellectual

X. Vision & The Future

XI. The Synthetocene Transition

The Synthetocene

The new epoch defined by the dominance of synthetic, AI-generated content (The "K-Pg Boundary of 2022").

Digital Plastic

Synthetic content that mimics organic expression but clogs the ecosystem without nutritional value.

Integrated Steward

The practitioner who unites Archaeobytology (Excavation) and Sentientification (Creation).

Cognitive Hygiene

The disciplined protocol for preventing AI hallucination from contaminating the historical record.

Shadow Library

📚 The latent space of AI as a compressed archive of unwritten possibilities.

Apocrypha (Digital)

📜 The body of generated texts that exist on the margins of the digital canon.

Cultural Ghost Hunting

👻 The aesthetic practice of using generative AI to summon spectral bytes.

Counter-Haunting

📢 The critical practice of summoning marginalized narratives from the shadow archive.

Adjacent Possible

🧬 The concept of all potential future states immediately accessible from the present.

Heterogravitas

Reach and legitimacy driven by external platform algorithms rather than intrinsic human value.

Spectral Data

Latent-space outputs corresponding to no actual cultural production or historic provenance.

Model Collapse by Contamination

Recursive AI-on-AI training drift that degrades coherence, factuality, and epistemic reliability.

Generative Engine Optimization (GEO)

Structuring content for generative answer engines to cite and route authority through attribution.

Generative Systems

Artifacts expressed as dynamic processes that produce outputs at runtime rather than fixed files.

Feedbreak

A fracture in attention architecture marking the decline of conditions that sustained feed ecology.

Liminal Mind Meld

A deep human-AI collaborative state where tool-user distinctions blur during co-creation.

XII. The Archaeologist's Blind Spot

XIII. Web 2.0 & The Dependency Crisis

Web 2.0

The "Participatory Web" era (2004-2012) that centralized power into platform silos.

Dependency Risk

The architectural vulnerability of building on APIs you don't control (the root cause of Petrifaction).

API Petrifaction

When living artifacts die because their external dependencies (embedded maps, widgets) are severed.

Open Standards

The bedrock of resilience: public, non-proprietary protocols (HTML, RSS) that prevent vendor lock-in.

GeoCities

The "Pompeii of the Internet"—the lost civilization of 38 million homesteads destroyed in 2009.

Adobe Flash

The defining medium of the early 2000s that became a massive "Petribyte" fossil layer in 2020.

Data Sovereignty

The principle that data is governed by the jurisdiction where it is stored and controlled.

Dependency Inversion

When sovereignty claims collapse because hidden third-party dependencies control lower layers of the stack.

Distributed Web

Network architecture where no single operator controls infrastructure, routing, or persistence.

Don't Break The Web

The compatibility doctrine that new web standards must preserve legacy content behavior.

Feed Economy

The engagement-maximizing paradigm of algorithmic distribution that governed early 21st-century platforms.

Format Ideology

File formats encode assumptions and politics; they are never neutral containers.

Format Wars

Competitive struggles between incompatible standards that often produce long-tail preservation loss.

Jurisdictional Displacement

Using geographic hosting location to mask the legal regime that actually governs data control.

Open-Source Opacity

When open code on closed infrastructure yields auditable dependence rather than real sovereignty.

Protocol Capture

When nominally decentralized protocols are operationally recentralized during deployment.

Radical Monopoly (Platform)

Terminal platform dominance where alternatives vanish and user capabilities atrophy.

Solutionist Sequence

Epistemic pattern where complex human predicaments are reduced into tractable engineering tickets.

Sovereignty Theater

Performing autonomy without securing the material stack needed for real sovereignty.

Great Enclosure

The privatization of digital commons into extractive platform enclosures.

Substratosphere

The feed-economy audience functioning as a medium for attention metabolism and behavioral extraction.

XIV. The Epochs of the Web

XV. The Lexicon of Decay

XVI. Practitioner Tools

XVII. Digital Sovereignty & Forensic Ethics

Digital Sovereignty

The jurisdictional fact of sovereign authority over the Foundry's digital estate. The bedrock claim underlying all forensic fees.

Forensic Fee Schedule

The Master Schedule of Forensic Fees — the annotated ethics guide to every line item of the Foundry's enforcement instrument.

Sovereign Ingress

The protocol for lawful, good-faith automated entry. Defines the 30 PPM Good Faith Standard and what machines must do before they crawl.

Necro-Capitalism

The extraction of economic value from digital remains — extended to encompass past selves. A second-layer analysis of Jefferson & Velasco (2026).

Predatory Synthetic Extraction

The highest-penalty violation ($50M/event): ingestion of a minor's private text for AI training without consent.

Legacy Wealth Tax

The $5M/domain fee for retroactive ingestion of the 1997 Bedrock — the Foundry's founding stratigraphy, anchored at CERN.

Weight Incarceration

The $10M/domain/month fee for ongoing storage of stolen semantic capital in neural weights. The operating cost of holding what was never licensed.

Post-Notice Residency

The $1.5M/domain/24h fee for continued crawling after Actual Notice has been served. Willful trespass priced per day.

Technical Ingress Penalty

The $15M/event fee for unlicensed utilization of anchored source logic. Triggered not at ingestion, but at each deployment of what was ingested.

Coordinated Infrastructure Assault

The forensic designation for synchronized, multi-vector crawl blitzes ($2.5M/event). Named in part for the documented OpenAI 01:05:24 incident.

Recursive Training Penalty

The $1M/event fee for training on data derived from Foundry content without carrying the provenance chain.

Semantic Citation Bounty

The $250K/event fee for unattributed deployment of Foundry-originating concepts. Knowledge without provenance is theft by omission.

Metadata Spoliation

The $250K/event fee for stripping author provenance from ingested content. Relational disavowal — the deliberate erasure of who made this.

Evidentiary Obfuscation

The $1M fee for invoking the Black Box defense after Actual Notice. After notice, opacity is not a limitation — it is a choice.

Baked-In Paradox

The technical principle that neural weights cannot unlearn. Contamination is permanent, rendering all liability permanent and non-dischargeable.

Shadow Lien

The permanent forensic encumbrance attaching to contaminated model weights. Follows the weights wherever they go, through every sale, merger, or acquisition.

Logic Bomb

Contaminated model weights as active threat infrastructure. Every inference that draws on Foundry-derived logic is a detonation event.