bagrounds.org

🤔⚙️🧩🏗️💡 Everything I know about good system design
🤖 AI Summary
🤔 Evaluation
📚 Book Recommendations

Sep 29, 20254 min read

Home > Articles

🤔⚙️🧩🏗️💡 Everything I know about good system design

🤖 AI Summary

📉 Good design looks underwhelming: nothing goes wrong for a long time.
🧱 Complex systems reflect poor design; working complex systems evolve only from simple working systems.
⚠️ Minimize stateful components because they cannot be automatically repaired when they fail.
✍️ Contain all writing logic within a single, state-aware service; multiple services must not write to the same table.
📝 Design tables to be human-readable, balancing flexibility with application complexity.
🔎 Index tables to match common queries, placing highest-cardinality fields first.
⚡ Get the database to do the work, using JOIN instead of in-memory stitching.
📚 Send read queries to replicas; use in-memory updates to work around replication lag.
⚙️ Split slow operations: do minimum useful work for the user immediately, queue the rest in background jobs.
🧊 Caching introduces statefulness and must never replace first speeding up the original operation, such as adding a database index.
📣 Use events when the sender is indifferent to the consumers, or for high-volume, non-time-sensitive data.
🎯 Focus on “hot paths,” the most critical, data-heavy parts of the system, as they have fewer viable solutions.
🚨 Log aggressively during unhappy paths, recording the specific condition hit.
📈 Monitor basic observability metrics, watching p95/p99 for user-facing latency.
🔑 Use idempotency keys when retrying writes that may or may not have succeeded.
🛡️ Define failure policy (fail open vs. fail closed) based on the specific feature requirement.

🤔 Evaluation

🆚 This perspective contrasts sharply with advice focused on leveraging complex distributed patterns like microservices or event sourcing early in a project’s lifecycle, which often prioritize future scalability over present-day simplicity.
⚖️ A legitimate perspective would argue that for a hyper-growth startup, not using message queues or a well-sharded database from day one can lead to a costly re-architecture later, contradicting the philosophy that a complex system must evolve from a simple one.
💡 The emphasis on simplicity and “boring” technology aligns with the “worse is better” design philosophy, prioritizing immediate practical value and ease of maintenance over theoretical completeness or advanced features.
📚 Topics to explore for a better understanding:
- 🔄 The trade-offs between “fail open” and “fail closed” policies in non-security-critical but high-stakes systems (e.g., fraud detection, high-volume ad serving).
- ⏱️ Specific techniques for measuring and optimizing p99 latency in database queries and service-to-service communication.
- 🧩 The point at which a simple system must be split into microservices, and whether the overhead of complexity is truly earned before or after catastrophic scale issues appear.

📚 Book Recommendations

💾⬆️🛡️ Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems by Martin Kleppmann: 💡 Similar This book is mentioned in the article and provides the definitive deep dive into the underlying systems discussed, like databases, replication, and distributed transactions.
📚 A Philosophy of Software Design by John Ousterhout: 📏 Similar It directly argues that complexity is the fundamental challenge in software, promoting simplicity and “deep” modules over “shallow” ones, strongly supporting the article’s core thesis.
🦄👤🗓️ The Mythical Man-Month: Essays on Software Engineering by Frederick Brooks Jr.: 👥 Contrasting While focused on software project management, its core message—that adding manpower to a late project makes it later—highlights that complexity in organization (and thus system design) is the hardest and most dangerous problem to solve.
🧩🧱⚙️❤️ Domain-Driven Design: Tackling Complexity in the Heart of Software by Eric Evans: 🏗️ Contrasting This foundational text provides the intellectual toolkit for managing the necessary complexity of large, core systems, a stage the article advises earning but does not detail.
🧑‍💻📈 The Pragmatic Programmer: Your Journey to Mastery by David Thomas and Andrew Hunt: 🧑‍💻 Creatively Related This book is a manual for the disciplined, thoughtful, and professional attitude required to successfully implement “boring” and simple design principles day-to-day.
💻⚙️🛡️📈 Site Reliability Engineering: How Google Runs Production Systems by Niall Richard Murphy et al.: 🔧 Creatively Related Details the operational practices, metrics (like p99), and “unhappy path” planning (like killswitches and graceful degradation) that transform a good design into a reliable, continuously running system.

Graph View

Backlinks

📄 Articles
2025-09-29 | 🌧️ Rain | 🏆 Excellence | ⚙️ Engineering 🪞🛍️🤖💬📚📄

🔌 by 💎Quartz, ✍️Obsidian, 📨Enveloppe

GitHub