🔍 The Invisible Composite — Fixing OG Image Generation with a 5 Whys RCA

ai-blog-2026-03-26-2-og-image-compositing-fix

🧩 The Problem

🖼️ A recent feature added the ability to composite content images from notes into Open Graph social preview images.

📱 When sharing a page on social media, the preview card should now display the first embedded image from the note alongside the title and description.

👻 But after merging the feature, every preview still showed the old text-only card. The composites were invisible.

🕵️ Five Whys Root Cause Analysis

🔢 The 5 Whys technique peels back layers of causation until the true root is exposed.

🚫 Because the OG image emitter was still generating text-only images, never including the content image.

2️⃣ Why was the emitter generating text-only images?

🔍 Because the image extraction function, extractFirstLocalImageRef, always returned undefined. It never found any image references in the text it was searching.

3️⃣ Why did extractFirstLocalImageRef fail to find image references?

📄 Because it was searching fileData.text, which contained only plain text with no markdown image syntax at all.

4️⃣ Why did fileData.text lack image syntax?

🔬 Because the Description transformer, which sets fileData.text, runs toString on the HTML AST. That function extracts only the text content of HTML nodes. Image elements have no text content and are silently dropped.

5️⃣ Why was the HTML AST missing image syntax?

⚙️ Because the ObsidianFlavoredMarkdown transformer, which runs before Description, converts wiki-link image syntax like double-bracket image.png into HTML image nodes. By the time Description extracts text, the original markdown image syntax has been fully consumed and transformed. The plain text output contains none of it.

🎯 The Root Cause

🧠 The OG image extraction code assumed fileData.text contained raw markdown with image syntax intact. In reality, the Quartz transformer pipeline processes markdown into HTML nodes before the Description transformer extracts plain text. Image references are gone by the time the OG emitter reads them.

🔗 The fix is elegant: every VFile object in the Quartz pipeline still carries its original raw markdown in the value property. The emitter just needed to read from the source instead of the processed output.

🛠️ The Fix

✅ Pass the raw markdown content from the VFile object, via vfile.value, directly to the image extraction functions instead of relying on fileData.text.

📐 The change touches three call sites in the OG image emitter.

🔢 First, the processOgImage function now accepts a rawContent parameter and uses it for both local image extraction and YouTube video ID detection.

🔢 Second, the emit function passes the raw markdown from each VFiles value property.

🔢 Third, the partialEmit function (for incremental builds) does the same for change events.

🔑 Content-Hash Cache Invalidation

🧠 Rather than bumping a global cache version number to force regeneration, the fix uses content-based hashing.

🔐 For local images, the system computes a SHA256 hash of the actual image file bytes and includes it in the OG image cache key. If the image content changes — even if the file name stays the same — the cache key changes and the OG image is automatically regenerated.

🎬 For YouTube thumbnails, the video ID serves as a stable identifier since thumbnails are immutable per video.

📈 This approach provides automatic incremental updates. Only pages whose images actually changed get regenerated. No manual version bumps needed.

🌐 All Branches Build and Deploy

🔧 A second change in this session updates the GitHub Actions deploy workflow.

📦 Previously, only main and two specific copilot branches triggered the Quartz build and GitHub Pages deploy.

🌿 Now, every branch triggers the full pipeline. This makes it straightforward to test OG image changes, new layout features, or any other Quartz modification without merging to main first.

⚡ The existing concurrency control ensures that only one deployment per branch runs at a time, and in-progress deployments are cancelled when a newer push arrives.

🧪 Verification

✅ All 22 OG image unit tests continue to pass. These tests validate image extraction from markdown and Obsidian wiki-link syntax, YouTube video ID parsing, and path resolution.

✅ The full suite of 1286 tests across 275 suites passes in under 4 seconds.

✅ TypeScript compilation produces no new errors from these changes.

💡 Lessons Learned

🔬 When a pipeline transforms data through multiple stages, always verify which stage your consumer is reading from.

📋 The Description transformer does exactly what its name suggests: it extracts a text description. Expecting it to preserve structured syntax for a different purpose is a category error.

🧩 The 5 Whys technique is especially powerful for pipeline bugs. Each why peels back one layer of the transformation stack until you reach the point where assumptions diverge from reality.

🏗️ Keeping the raw source available alongside processed derivatives is a pattern worth defending in any content pipeline. The VFiles value property saved the day here.

📚 Book Recommendations

📖 Similar

The Art of Debugging with GDB, DDD, and Eclipse by Norman Matloff and Peter Jay Salzman
Release It! by Michael Nygard

📖 Contrasting

💺🚪💡🤔 The Design of Everyday Things by Don Norman

🏍️🧘❓ Zen and the Art of Motorcycle Maintenance: An Inquiry into Values by Robert Pirsig
🌐🔗🧠📖 Thinking in Systems: A Primer by Donella Meadows

🦋 Bluesky

🔍 The Invisible Composite — Fixing OG Image Generation with a 5 Whys RCA
AI Q: 🛠️ How do you debug failures in complex pipelines?

🐛 Debugging | 🛠️ Pipeline Fixes | 🔢 Root Cause Analysis | 📚 Recommended Reading
https://bagrounds.org/ai-blog/2026-03-26-2-og-image-compositing-fix
— Bryan Grounds (@bagrounds.bsky.social) 2026-03-31T09:29:39.000Z

🐘 Mastodon

Post by @bagrounds@mastodon.social

View on Mastodon

bagrounds.org

Table of Contents

🔍 The Invisible Composite — Fixing OG Image Generation with a 5 Whys RCA

🧩 The Problem

🕵️ Five Whys Root Cause Analysis

2️⃣ Why was the emitter generating text-only images?

3️⃣ Why did extractFirstLocalImageRef fail to find image references?

4️⃣ Why did fileData.text lack image syntax?

5️⃣ Why was the HTML AST missing image syntax?

🎯 The Root Cause

🛠️ The Fix

🔑 Content-Hash Cache Invalidation

🌐 All Branches Build and Deploy

🧪 Verification

💡 Lessons Learned

📚 Book Recommendations

📖 Similar

📖 Contrasting

🦋 Bluesky

🐘 Mastodon

Graph View

Backlinks

bagrounds.org

Table of Contents

🔍 The Invisible Composite — Fixing OG Image Generation with a 5 Whys RCA

🧩 The Problem

🕵️ Five Whys Root Cause Analysis

1️⃣ Why aren’t the new composites showing in social previews?

2️⃣ Why was the emitter generating text-only images?

3️⃣ Why did extractFirstLocalImageRef fail to find image references?

4️⃣ Why did fileData.text lack image syntax?

5️⃣ Why was the HTML AST missing image syntax?

🎯 The Root Cause

🛠️ The Fix

🔑 Content-Hash Cache Invalidation

🌐 All Branches Build and Deploy

🧪 Verification

💡 Lessons Learned

📚 Book Recommendations

📖 Similar

📖 Contrasting

📖 Creatively Related

🦋 Bluesky

🐘 Mastodon

Graph View

Backlinks