Danh sách tài liệu · từ các bài viết

Tài liệu

Mọi cuốn sách, bài báo, tiêu chuẩn, và ý tưởng được đề cập trong các bài viết, cùng ghi chú về lý do mỗi tài liệu xuất hiện ở đây.

293 tài liệu

A Theory of Psychological ReactanceBook

Brehm, Jack W. · 1966

The source of the psychological reactance concept used in the post's Sidenote. Brehm's core finding: when people perceive their freedom to choose being threatened, they experience a motivational state that drives them to restore that freedom — often by wanting the restricted option more strongly. The post applies this to advice-giving: insisting that your own experience is the right path for someone else tends to generate more resistance, not more adoption.

Xuất hiện trongBản đồ của mình

Philosophy

Truth and MethodBook

Gadamer, Hans-Georg · 1960

The source of the horizon of experience concept. Gadamer's argument that understanding another person requires a 'fusion of horizons' (Horizontverschmelzung) — not projecting your own perspective onto theirs, but moving far enough out of your own vantage point to see theirs. The post uses this to show why advice-giving that assumes a shared terrain between advisor and recipient systematically fails.

Xuất hiện trongBản đồ của mình

Philosophy

Psychological Safety and Learning Behavior in Work TeamsPaper

Edmondson, Amy · 1999

The paper that established psychological safety as the primary condition for team learning and performance. Edmondson's paradox — that teams with the highest error-reporting rates were the highest performers, because they felt safe to speak — is the empirical anchor of the post's central argument.

Xuất hiện trongSurvey Trên Nền Không Khí Đứa Bé Trong Phòng Họp

Philosophy

Organizational Culture and LeadershipBook

Schein, Edgar H. · 1985

The argument that organizational culture is transmitted primarily through what leaders attend to, how they behave under crisis, and what they reward or punish — not through what they declare. Schein's three-layer model (artifacts, espoused values, basic assumptions) is the structural backbone of the post's claim that posted core values are artifacts, not lived culture.

Xuất hiện trongVăn hóa không phải là chiếc lồng

Philosophy

The Fifth Discipline: The Art and Practice of the Learning OrganizationBook

Senge, Peter M. · 1990

The source of reinforcing feedback loops as an analytical lens for organizational dynamics. The argument that most management interventions fail because they address the symptoms of a loop rather than its root cause — the insight the post applies to team culture repair.

Xuất hiện trongSurvey Trên Nền Không Khí

Philosophy

Man's Search for MeaningBook

Frankl, Viktor E. · 1946

The source of the observation that between stimulus and response there is always a space — and in that space lies human freedom and choice. The post uses this as the philosophical foundation of emotional regulation: inner stability is not the absence of feeling but the preservation of that space.

Xuất hiện trongSurvey Trên Nền Không Khí Đứa Bé Trong Phòng Họp

Philosophy

Dare to LeadBook

Brown, Brené · 2018

The source of the armor of leadership concept: the protective behaviors leaders adopt to appear invulnerable, which paradoxically create the distance that makes teams feel unsafe. Brown's argument that true courage requires the capacity for vulnerability is the direct counterpoint to command-and-control notions of strength.

Xuất hiện trongSurvey Trên Nền Không Khí

Philosophy

Daring GreatlyBook

Brown, Brené · 2012

The source of Brown's observation that you cannot selectively numb emotion: when you numb pain, you numb joy. The post uses this to argue against performed composure in leadership — a leader who suppresses what they feel does not protect the team; they simply broadcast a more confusing signal. The distinction between vulnerability (conscious exposure) and oversharing (unfiltered expression) is the hinge the post turns on.

Xuất hiện trongĐứa Bé Trong Phòng Họp

Philosophy

Mindsight: The New Science of Personal TransformationBook

Siegel, Daniel J. · 2010

The source of the concept of mindsight — the capacity to perceive one's own mind and the minds of others as distinct objects rather than fusing with them. Siegel argues that self-awareness and empathy share the same neural substrate: developing one naturally develops the other. The post uses this to reverse the usual EQ curriculum: you cannot reliably read others until you can observe your own emotional state without being run by it.

Xuất hiện trongĐứa Bé Trong Phòng Họp

Philosophy

Emotional ContagionBook

Hatfield, Elaine; Cacioppo, John T.; Rapson, Richard L. · 1993

The foundational theoretical account of emotional contagion — the mechanism by which one person's emotional state spreads automatically to others through unconscious mirroring of facial expression, posture, and vocal tone. Later leadership research (Goleman, Edmondson) built on this foundation to show that the effect is amplified by authority: a leader's emotional state propagates to the group disproportionately.

Xuất hiện trongĐứa Bé Trong Phòng Họp

Philosophy

Discourses and EnchiridionBook

Epictetus · ~108 CE

The source of prohairesis — the Stoic concept of the genuine scope of one's own choice, distinguished from what lies outside it. Epictetus's eph' hēmin (what is up to us) is not a counsel of passivity but a precise boundary: your outputs (how you show up, how you respond in the first seconds of difficulty) are yours to shape; other people's feelings are not. The post uses this as a more exact frame for emotional responsibility in leadership than any EQ checklist.

Xuất hiện trongĐứa Bé Trong Phòng Họp

Philosophy

Primal Leadership: Unleashing the Power of Emotional IntelligenceBook

Goleman, Daniel; Boyatzis, Richard; McKee, Annie · 2013

The research grounding for emotional contagion in organizational contexts: a leader's mood propagates disproportionately to the group through neurological mirroring, not symbolic authority. The post draws on this to argue that a leader's internal state is not private — it shapes the ambient emotional climate of the team.

Xuất hiện trongSurvey Trên Nền Không Khí

Philosophy

The Burnout SocietyBook

Byung-Chul Han · 2010

The diagnosis of the achievement society (Leistungsgesellschaft): modern depression and burnout are caused not by external repression but by the injunction to achieve. The subject becomes both exploiter and exploited, and believes itself to be free.

Xuất hiện trongNiềm vui thuần túy

Philosophy

Nicomachean EthicsBook

Aristotle · ~350 BCE

The source of dynamis, energeia, eudaimonia — and skholē. The argument that flourishing comes from practising virtue, not merely possessing it. Also the origin of theoria: contemplation as an end in itself, not a means to anything else.

Xuất hiện trongNiềm vui thuần túy

Philosophy

Waiting for GodBook

Simone Weil · 1951

The source of the idea that attention is the highest form of generosity — a sentence that tends not to leave once encountered.

Philosophy

The Second SexBook

Simone de Beauvoir · 1949

The conception of authentic love as two independent people choosing each other freely, renewed daily — not dissolution into the other, and not dependency.

Xuất hiện trongYêu Để Cưới

Philosophy

The Art of LovingBook

Erich Fromm · 1956

The argument that love is an art requiring knowledge and effort, not a passive feeling that happens to you. Most people get love backwards: they worry about being loved rather than learning to love.

Xuất hiện trongYêu Để Cưới

Philosophy

The Course of LoveBook

Alain de Botton · 2016

A corrective to romantic idealism: marriage is the beginning of a long piece of work, not the end of a story. Romantic culture confuses early excitement with love itself.

Xuất hiện trongYêu Để Cưới

Philosophy

The Market for "Lemons": Quality Uncertainty and the Market MechanismPaper

Akerlof, George A. · 1970

The foundational paper on information asymmetry in markets. Akerlof showed that when buyers cannot verify quality before purchase, the market for high-quality goods can collapse, leaving only the lemons. The post borrows this framework to argue that the failure of meetings is rarely about lack of structure inside the room, but about the asymmetric distribution of information before anyone arrives.

Xuất hiện trongCái Xô Rỗng Nút Cancel Không Có Ở Đó

Philosophy

Psychological TypesBook

Jung, Carl G. · 1921

The work that introduced introversion and extraversion into psychology. Jung framed them as tendencies along a continuum, not fixed categories: a person who was purely one or the other would, in his view, be psychologically pathological. The post leans on this point to argue that the workplace habit of sorting people into boxes misreads what Jung actually meant.

Xuất hiện trongHướng nội

Philosophy

Quiet: The Power of Introverts in a World That Can't Stop TalkingBook

Cain, Susan · 2012

The book that put introversion onto the cultural map as a real disposition rather than a deficit. Cain documents the disproportionate share of introverts in senior leadership roles and argues that most have had to perform extraversion at high energy cost. The post uses this to name what is usually invisible: the cost of acting against one's own rhythm in environments designed for the opposite.

Xuất hiện trongHướng nội

Philosophy

Rethinking the Extraverted Sales Ideal: The Ambivert AdvantagePaper

Grant, Adam M. · 2013

Grant's Wharton study showing that ambiverts (people in the middle of the introversion-extraversion spectrum) outperform both pure introverts and pure extroverts in sales roles. The likely mechanism is flexibility: ambiverts can shift between listening and asserting depending on the situation. The post cites this finding to argue that the spectrum view is not just truer to Jung's original framing but also empirically supported.

Xuất hiện trongHướng nội

Philosophy

Color Imaging Array (US Patent 3,971,065)Paper

Bryce E. Bayer · 1976

The original patent describing the RGGB color filter array that became the universal standard for digital image sensors. Bayer's key insight was the 2:1 green-to-red/blue ratio, which mirrors human visual sensitivity. Nearly every consumer camera sensor made since then uses a variant of this pattern.

Xuất hiện trongCảm biến máy ảnh không thấy màu

Science

Image Sensors and Signal Processing for Digital Still CamerasBook

Junichi Nakamura (ed.) · 2005

The most comprehensive technical reference on CCD and CMOS image sensor design, covering photodiode physics, CFA design, microlens arrays, noise sources, and signal processing. The demosaicing chapter surveys bilinear, median-based, and edge-directed algorithms with quantitative comparisons.

Xuất hiện trongCảm biến máy ảnh không thấy màu

Science

Image Demosaicing: A Systematic SurveyPaper

Xin Li, Bahadir Gunturk, Lei Zhang · 2008

A rigorous survey of demosaicing algorithms from bilinear interpolation through edge-directed methods (AHD, LMMSE) to frequency-domain approaches. Provides quantitative PSNR comparisons on standard test images, making clear why algorithm choice matters significantly at high spatial frequencies and color boundaries.

Xuất hiện trongCảm biến máy ảnh không thấy màu

Science

Time, Clocks, and the Ordering of Events in a Distributed SystemPaper

Leslie Lamport · 1978

The proof that in a distributed system there can be no perfect global clock — each node lives in its own timeline. Every TOCTTOU race condition is a consequence of this fundamental fact, and the happened-before relation defined here is the foundation of every subsequent distributed-systems theorem.

Xuất hiện trongTime-Of-Check to Time-Of-Use Cái Switch Không Biết Mình Là Anh Hùng Thời gian — thứ distributed systems không thực sự có

Science

IEEE Standard for Floating-Point Arithmetic (IEEE 754)Standard

IEEE · 1985

The forty-year-old standard that governs every floating-point calculation your hardware will ever perform. The reason 0.1 + 0.2 gives the same wrong answer on every machine on Earth.

Xuất hiện trongBa Mươi Hai Bit Để Giữ Vô Cực

Science

Patriot Missile Defense: Software Problem Led to System Failure (IMTEC-92-26)Report

U.S. Government Accountability Office · 1992

The official documentation of the Patriot missile system failure in Dhahran. A floating-point accumulation error in the system clock, compounding over 100 hours of operation, caused a 0.34-second timing gap — enough to miss a Scud.

Xuất hiện trongBa Mươi Hai Bit Để Giữ Vô Cực

Science

The Magical Number Seven, Plus or Minus Two: Some Limits on Our Capacity for Processing InformationPaper

Miller, George A. · 1956

The paper that established working memory capacity as a measurable, bounded resource. Miller's 7±2 figure is the source of the common claim about short-term memory limits, and is cited in the post's Sidenote as the older reference that later cognitive-science work has refined downward. Useful as the historical anchor for the chapter's claim that human reading is constrained by a fixed-capacity buffer.

Xuất hiện trongCognitive Load

Science

Cognitive Load During Problem Solving: Effects on LearningPaper

Sweller, John · 1988

The paper that introduced Cognitive Load Theory and its distinction between intrinsic load (inherent to the material), extraneous load (added by how the material is presented), and germane load (the effort spent building durable mental models). The Friction Map chapter uses the intrinsic-versus-extraneous split as the criterion for what the map measures and what it deliberately ignores.

Xuất hiện trongCognitive Load Friction Map

Science

The Magical Number 4 in Short-Term Memory: A Reconsideration of Mental Storage CapacityPaper

Cowan, Nelson · 2001

The reconsideration that revised working memory capacity downward from Miller's 7±2 to roughly 4 chunks. Cowan's argument is the basis for the chapter's specific claim that a reader can hold about four slots of information at once, which in turn supports the mechanism by which extraneous variable names push out room needed for actual logic.

Xuất hiện trongCognitive Load

Science

The Birthday ProblemPaper

Philippe Flajolet, Peter Grabner, Peter Kirschenhofer, Helmut Prodinger · 1994

The formal mathematical treatment of the birthday problem and its generalizations. The result 1.18√N for the 50% threshold appears here with full proof. Useful for understanding why birthday attacks against cryptographic hash functions require only √N trials.

Xuất hiện trongNghịch lý sinh nhật

Science

Scaling Knowledge Access and Retrieval at AirbnbArticle

Airbnb Engineering · 2018

Part of Airbnb's engineering blog series on their internal knowledge graph, which connects listings to places, experiences, and events so that search and recommendation can treat 'a ski holiday near Tokyo' as an intent rather than a keyword match. The chapter draws on it for the ontology side of modeling: a team whose question space keeps growing with the product, where the hardest debates are about what kinds of things exist in the system (is a city a node or a property of a listing?) rather than about algorithms. The city example in the chapter is an illustrative reconstruction of the kind of decision these posts describe.

Xuất hiện trongNode và edge là quyết định thiết kế

Software

FlockDB: A Distributed, Fault-Tolerant Graph DatabaseDocumentation

Twitter Engineering · 2010

Twitter's home-grown graph store, announced and open-sourced in 2010 with an engineering blog introduction. The chapter cites it for one deliberate design decision: FlockDB serves very large adjacency lists with one-step set-operation queries at high throughput, and intentionally does not support multi-hop traversal. That refusal is the point — the team had determined which part of the follow problem was a graph problem and which was not, and built only for the first part. The build-versus-buy lesson in FlockDB's later fate belongs to a later chapter.

Xuất hiện trongDấu hiệu nhận biết Graph trong production, Postgres, Neo4j, hay tự xây

Software

Zen: Pinterest's Graph Storage ServiceTalk

Xun Liu, Raghavendra Prabhu · 2014

Presented at @Scale 2014 (September 2014) and QCon San Francisco 2014. Describes Zen, a graph service built at Pinterest that layers a node/edge/index API over HBase, chosen because the team already operated HBase at scale and could not afford the operational learning curve of a new database during rapid growth. Cited in Chapter 9 as the canonical case for the operational-capacity axis winning over the data-model axis.

Xuất hiện trongGraph trong production, Postgres, Neo4j, hay tự xây

Software

How NASA Finds Critical Data Through a Knowledge GraphTalk

David Meza · 2016

Presented at GraphConnect San Francisco, October 13, 2016, by David Meza (Chief Knowledge Architect, NASA Johnson Space Center). Describes how NASA rebuilt its Lessons Learned Information System — previously a SQL database allowing only keyword, date, and center filters — as a knowledge graph on Neo4j. Most concrete comparison in public sources: the same Orion uprighting query that returned 3 relevant documents after 8 days of search in the old system returned over 30 results in Neo4j. No specific query-development-time metric appears in the public sources; the chapter's comparison stays at the level of unit of measure (weeks to hours).

Xuất hiện trongGraph trong production, Postgres, Neo4j, hay tự xây

Software

eBay ShopBot: Graph-Powered Conversational CommerceDocumentation

eBay Engineering / Ajinkya Kale, Anuj Vatsa · 2018

Case study published on Neo4j Engineering Blog (August 2018), originally presented at GraphConnect New York (October 2017). eBay built a knowledge graph of over 500 million nodes and 20 billion relationships to power product-attribute-relationship queries for ShopBot, replacing rule-based logic that could not scale across 20,000+ product categories. Cited in Chapter 9 as a second case where the depth-and-pattern axis dominated: query structure was complex and variable enough that the team accepted the operational cost of learning a new system because the productivity difference was measurable daily.

Xuất hiện trongGraph trong production, Postgres, Neo4j, hay tự xây

Software

Zanzibar: Google's Consistent, Global Authorization SystemPaper

Pang, Ruoming, et al. · 2019

Published at USENIX ATC 2019. Google restates authorization — 'can this user view this document' — as relation tuples, and the permission check as a path-existence question over those tuples, then builds a global system serving hundreds of services on that phrasing. The chapter uses it as the counter-case to FlockDB: a problem that looks nothing like a graph (permissions sound like a row filter) revealing transitive, unbounded-depth structure once the requirement is read carefully.

Xuất hiện trongDấu hiệu nhận biết Traversal và reachability: câu hỏi "cái gì chạm tới cái gì"

Software

Why Google Stores Billions of Lines of Code in a Single RepositoryPaper

Potvin, Rachel; Levenberg, Josh · 2016

Published in Communications of the ACM. The public account of Google's monorepo: its scale, the trunk-based workflow, and the tooling investment that makes it viable. The chapter cites it as the context for the Bazel/Blaze case: thousands of engineers committing in parallel into one repository, asking 'what must be rebuilt' at industrial frequency, which is what makes an explicitly declared, queryable dependency graph worth its daily maintenance price.

Xuất hiện trongTraversal và reachability: câu hỏi "cái gì chạm tới cái gì"

Software

Bazel DocumentationDocumentation

Google

The public documentation of Bazel, the open-source version of Google's internal build system Blaze. The chapter draws on it for the mechanics of the declared build graph: every target explicitly declares its dependencies in a BUILD file, undeclared dependencies are invisible and fail the build at the author's desk, and dependency cycles are rejected at declaration time, making the build graph a DAG by law rather than by hope. The transferable principle is pay-at-write-time, not the tool itself.

Xuất hiện trongTraversal và reachability: câu hỏi "cái gì chạm tới cái gì"

Software

OpenLineageDocumentation

OpenLineage Project

The open standard for collecting data lineage from processing systems, built on a model of jobs, datasets, and runs. The chapter cites it, alongside Uber's Databook, as the excavation approach to dependency graphs: instead of requiring thousands of pipeline authors to declare what they read and write (a cultural war that cannot be won), the graph is inferred from query logs and job metadata, which is why every edge carries provenance and the whole graph is an honest approximation rather than a declared truth.

Xuất hiện trongTraversal và reachability: câu hỏi "cái gì chạm tới cái gì"

Software

The PayPal Wars: Battles with eBay, the Media, the Mafia, and the Rest of Planet EarthBook

Jackson, Eric M. · 2004

An insider's account of PayPal's early years by one of its first marketing executives. The chapter draws on it for one specific thread: the fraud war against organized crime, and the internal tool Igor that let human investigators trace networks of linked accounts. The detail that matters is what the team did not say: nobody called it graph theory. They called it following accounts that stick together, which is the chapter's central observation about how graph problems appear in the wild without their mathematical name.

Xuất hiện trongBạn đã giải graph problem cả tuần nay

Software

kik, left-pad, and npmReport

npm, Inc. · 2016

npm's official postmortem of the March 2016 incident in which the eleven-line left-pad package was unpublished and builds across the JavaScript ecosystem failed within hours. The chapter uses it as the source for the third case: the transitive dependency graph existed in full inside lockfiles, machine-readable line by line, yet invisible to humans until one edge vanished. The postmortem also documents the unpublish policy change that followed.

Xuất hiện trongBạn đã giải graph problem cả tuần nay

Software

LinkedIn Engineering on People You May Know and Graph InfrastructureArticle

LinkedIn Engineering

A body of engineering blog posts and talks, spanning from offline Hadoop precomputation to online graph-serving systems, in which LinkedIn recounts how the friend-of-friend question behind People You May Know became one of the heaviest workloads in the company and a driving reason graph infrastructure became core infrastructure. The chapter cites it for the second case: a fixed two-hop question that changes character when multiplied by hundreds of millions of members, with the fair caveat that traversal is only PYMK's candidate-generation stage beneath a ranking layer.

Xuất hiện trongBạn đã giải graph problem cả tuần nay

Software

The Anatomy of a Large-Scale Hypertextual Web Search EnginePaper

Brin, Sergey & Page, Lawrence · 1998

The paper that introduced Google and PageRank. The chapter reads it as a modeling story rather than an algorithm story: the web as a graph, each link as a vote, and the refusal to count votes equally — a vote from an important page weighs more, with importance defined by the same rule, recursively. The random surfer model that makes the recursive definition converge intuitively is stated in the paper itself, along with the jump probability later known as the damping factor. The chapter cites it as the case where a well-chosen definition of "important" beat pure in-degree counting badly enough to build a company on the gap.

Xuất hiện trongCentrality, ai quan trọng trong mạng lưới

Software

Centrality in Social Networks: Conceptual ClarificationPaper

Freeman, Linton C. · 1978

The paper (with its 1977 companion on betweenness) in which Freeman systematized centrality into distinct families — degree, betweenness, closeness — and made explicit that each formalizes a different intuition about what it means for a position in a network to matter. The chapter leans on exactly that framing: centrality is not one measure with several formulas but a family of questions wearing one word, and betweenness in particular stands literally on the shortest-path concept, so changing the definition of edge weight changes the list of bottlenecks.

Xuất hiện trongCentrality, ai quan trọng trong mạng lưới

Science

Problems of Monetary Management: The U.K. ExperiencePaper

Goodhart, Charles A. E. · 1975

The original statement of what became Goodhart's law: any observed statistical regularity tends to collapse once pressure is placed upon it for control purposes. The familiar phrasing — when a measure becomes a target, it ceases to be a good measure — is Marilyn Strathern's later paraphrase. The chapter cites it for the second half of the PageRank case: a centrality score that wins becomes a target, link farms were the direct attack on the winning definition, and centrality is not exempt from the law because centrality is a metric scoring something valuable.

Xuất hiện trongCentrality, ai quan trọng trong mạng lưới

Philosophy

Stripe Radar Documentation and Engineering Posts on Network SignalsDocumentation

Stripe

Stripe's public documentation and blog posts describing how Radar scores payment risk using signals from across the Stripe network — where a device, card, or email has appeared before — as one layer of features in its machine-learning models. The chapter cites it at exactly the level the sources state: network-position signals enter the scoring model as features alongside hundreds of others, producing a continuous risk score that routes transactions to review or additional verification rather than issuing verdicts on its own. Internal model architecture is not public, and the chapter does not speculate past the sources.

Xuất hiện trongCentrality, ai quan trọng trong mạng lưới

Software

Fast Unfolding of Communities in Large NetworksPaper

Blondel, Vincent D.; Guillaume, Jean-Loup; Lambiotte, Renaud; Lefebvre, Etienne · 2008

The paper introducing the Louvain method, the greedy modularity-optimization algorithm the chapter tells in human language: each node tries moving to a neighbor's community and stays where the shared score rises most, then each community is compressed into a super-node and the game repeats a level up. The chapter cites it for the cost claim that made the method ubiquitous — community detection on graphs of hundreds of millions of edges on a single machine, startlingly cheap next to the all-pairs price of betweenness from the previous chapter — and notes that the ubiquity is exactly why Louvain's pitfalls became the whole industry's pitfalls.

Xuất hiện trongCommunity detection, khi cụm tự nổi lên từ dữ liệu

Software

From Louvain to Leiden: Guaranteeing Well-Connected CommunitiesPaper

Traag, Vincent A.; Waltman, Ludo; van Eck, Nees Jan · 2019

The paper that documented a real structural defect in Louvain — it can return communities that are internally disconnected, a cluster made of pieces with no edges between them — and introduced Leiden, which keeps the same intuition, adds a refinement step that guarantees well-connected communities, and runs faster. The chapter cites it for both halves of its practical recommendation: use Leiden for new work, understand Louvain to read older systems and to know why Leiden exists.

Xuất hiện trongCommunity detection, khi cụm tự nổi lên từ dữ liệu

Software

Resolution Limit in Community DetectionPaper

Fortunato, Santo & Barthélemy, Marc · 2007

The paper showing that plain modularity optimization cannot see communities smaller than a threshold that depends on the total edge count of the graph — small real clusters get swallowed into larger ones simply because the graph is big. The chapter cites it as the reason modern implementations expose a resolution parameter, and for the consequence it insists on: the number of clusters is not a constant of the data but partly a choice of the asker, a line the library output never prints.

Xuất hiện trongCommunity detection, khi cụm tự nổi lên từ dữ liệu

Software

Every Noise at OnceDocumentation

McDonald, Glenn

Glenn McDonald's public genre map, built at The Echo Nest (acquired by Spotify in 2014) from listening-behavior data: a scatter-plot atlas of more than six thousand named micro-genres, most of which exist on no record-store shelf. The chapter cites it as the public window into the 'listening galaxies' — communities of artists that emerge from co-listening structure rather than industry labels — and as evidence for the naming step at industrial scale: thousands of emergent clusters were given human names, some matching old genres, some invented because the listening community had never been named by the industry. The site has been frozen since McDonald left Spotify in late 2023, which itself illustrates the chapter's caveat that an emergent map is a snapshot, not a permanent portrait.

Xuất hiện trongCommunity detection, khi cụm tự nổi lên từ dữ liệu

Software

Quantexa and Linkurious Public Case Studies on Network Analytics in AMLReport

Quantexa; Linkurious

A body of public case studies and product documentation — Quantexa's work with Danske Bank being a named example — describing how banks use entity resolution and network analysis in anti-money-laundering investigations. The chapter cites this material at exactly the level the sources state: community-scale signals (unusually dense, closed clusters sharing entities) generate candidates with human-readable reasons, and the candidates feed investigators, not an automatic account-closing button. Detection-rate figures are not published, so the chapter keeps that part qualitative.

Xuất hiện trongCommunity detection, khi cụm tự nổi lên từ dữ liệu Pattern matching, đi tìm hình thù của gian lận

Software

Computers and Intractability: A Guide to the Theory of NP-CompletenessBook

Garey, Michael R.; Johnson, David S. · 1979

The standard reference catalog for NP-complete problems, building on Cook's 1971 proof that SAT is NP-complete. The book lists subgraph isomorphism (GT48) among the classical NP-complete problems and is the canonical citation for the claim that finding a pattern graph inside a larger graph is intractable in the general case. The chapter cites this as the formal grounding for the intuition that the number of ways to embed a pattern into a graph grows exponentially with pattern size — while explaining why real fraud-detection workloads stay practical through small patterns, sparse graphs, and selective anchoring.

Xuất hiện trongPattern matching, đi tìm hình thù của gian lận

Software

GQL: A Property Graph Query Language StandardStandard

ISO/IEC JTC 1/SC 32 · 2024

ISO/IEC 39075:2024, the first ISO standard for a graph query language and the first new ISO database query language standard since SQL. GQL standardizes the property graph query model, drawing on openCypher (Neo4j) and the SQL:2023 property graph extensions (ISO/IEC 9075-16). The chapter cites this as evidence that Cypher is a language family, not a single product — the same declarative pattern-description model that lets an investigator read a query as a whiteboard sketch now has an open international standard behind it.

Xuất hiện trongPattern matching, đi tìm hình thù của gian lận

Software

GraphScope: A Unified Distributed Graph Computing PlatformPaper

Fan, Wenfei et al. (Alibaba DAMO Academy) · 2021

Alibaba's public research on GraphScope, a distributed graph computing platform that covers graph analytics, interactive graph queries, and graph learning in a single system. The broader body of work from the same group (including papers on real-time graph fraud detection presented at VLDB and SIGMOD) describes the design decisions behind Ant Group's use of graph pattern matching for financial fraud: running pattern queries inside the transaction approval window (milliseconds), the trade-off between pattern complexity and latency, and the two-tier architecture of fast online pattern matching for simple shapes and batch offline matching for richer patterns. The chapter cites this at the level the sources state: the existence of the real-time matching capability and the latency-complexity trade-off — not internal implementation details.

Xuất hiện trongPattern matching, đi tìm hình thù của gian lận

Software

Re: prototypes vs classes was: Re: Sun's HotSpotArticle

Kay, Alan · 2003

The email Kay sent to the POOP (Principled Object-Oriented Programming) mailing list in July 2003 containing the direct quote: "I'm sorry that I long ago coined the term 'objects' for this topic because it gets many people to focus on the lesser idea. The big idea is 'messaging'." The chapter opens with this email because it names the precise gap between what OOP was named after and what the mainstream came to practice.

Xuất hiện trongOOP không phải về class Message-based OOP — Alan Kay đã đúng từ đầu

Software

The Early History of SmalltalkPaper

Kay, Alan · 1993

Published in ACM SIGPLAN Notices, this is Kay's own account of the design decisions behind Smalltalk and the intellectual origins of OOP as he understood it. He names the three sources that shaped his thinking: biology (cells as autonomous units), LISP (code and data as unified), and Sketchpad (objects with constraints). The chapter cites this as the primary source for understanding Kay's model — his message-passing vision — as distinct from the Simula lineage that entered mainstream practice through C++ and Java.

Xuất hiện trongOOP không phải về class Object khi ranh giới là network

Software

AnemicDomainModelArticle

Fowler, Martin · 2003

Fowler's blog post naming and diagnosing the Anemic Domain Model anti-pattern. He frames it as domain objects that carry data but have none of the business behavior — all the behavior is pushed into a service layer that reads from and writes back to the passive objects. The chapter uses this framing to show what Simula-model OOP produces in practice: procedure masquerading as object-oriented design.

Xuất hiện trongOOP không phải về class Tại sao code của bạn là OOP giả

Software

The Pragmatic Programmer: From Journeyman to MasterBook

Thomas, David; Hunt, Andrew · 1999

The source of the Tell, Don't Ask principle, which the chapter uses via a Sidenote to name the design move at the core of Kay's OOP: instead of asking an object for its state and making decisions externally, tell the object what to do and let it decide internally. The principle names the difference between the two Order implementations in the chapter's code contrast.

Xuất hiện trongOOP không phải về class Encapsulation — ẩn quyết định, không phải ẩn dữ liệu

Software

Clean Code: A Handbook of Agile Software CraftsmanshipBook

Martin, Robert C. · 2008

The source of the often-cited 10:1 read-to-write ratio used in the post's Sidenote. Martin's broader argument frames code as a document the team reads many times for every time it is written, which is the operational reason behind the chapter's 'two audiences' frame: optimizing only for the writer's moment ignores the larger surface where the real cost is paid. Chapter 6 ('Objects and Data Structures') is cited separately in Chapter 2 of the OOP series for Martin's clean distinction: objects hide data behind abstractions and expose behavior; data structures expose data and have no meaningful behavior. An anemic domain model is, by this definition, a data structure named with class syntax.

Xuất hiện trongThe Two Audiences Tại sao code của bạn là OOP giả

Software

Extreme Programming Explained: Embrace ChangeBook

Beck, Kent · 2000

The source of YAGNI ('You Aren't Gonna Need It'), Beck's principle against building flexibility for requirements you don't have yet. OOP Chapter 3 uses YAGNI as a foil to clarify a specific argument: the chapter agrees with YAGNI's caution against speculative generality, but distinguishes it from having no opinion about change at all. A codebase with no opinion about its axis of change is not a YAGNI codebase — it is a codebase deferring the axis question without realizing it. The distinction is between 'don't build flexibility you can't justify' and 'bet carefully based on real domain signal.'

Xuất hiện trongThứ bạn đang thiết kế cho là sự thay đổi

Software

Code Complete: A Practical Handbook of Software ConstructionBook

McConnell, Steve · 2004

Chapters 31 and 32 ("Layout and Style") collect empirical work from the 1980s and 1990s on how visual layout of code affects readability and comprehension speed. The Structure and Layout chapter cites it as the anchor that the connection between spatial structure and readability is not a recent preference but an old observation with measurement behind it.

Xuất hiện trongStructure và Layout

Software

Writing Testable CodeArticle

Hevery, Misko · 2008

Hevery's guide, written while at Google, catalogs the four design flaws that make code untestable: constructor does real work, collaborator lookup (Service Locator), brittle global state, and class does too much. The core argument is that testability is a consequence of good design, not a goal in itself — a unit that cannot be tested in isolation is a unit whose dependencies are not declared honestly. The "Constructor does Real Work" flaw is the precise pattern shown in Chapter 7's opening scene.

Xuất hiện trongDependency — triết học trước, framework sau

Software

Dependency Injection Principles, Practices, and PatternsBook

Seemann, Mark; van Deursen, Steven · 2019

The most thorough working treatment of DI as a design discipline rather than as a framework feature. The book separates the underlying principle (a unit's dependencies should be declared in its interface) from the various ways teams choose to deliver those dependencies (constructor injection, method injection, container-based wiring), and is careful about when each is worth the cost. The Maintainability chapter cites the book as the deep reference for readers who want the full pattern language, while restricting itself to the smaller, prior intuition: whatever a function needs in order to run, let the signature say so.

Xuất hiện trongMaintainability Dependency — triết học trước, framework sau

Software

TypeScript Handbook: Literal TypesDocumentation

TypeScript Team

The official reference for literal types and `as const`. The const assertion section is particularly useful — it explains why TypeScript widens literals in mutable contexts and how `as const` freezes inference at every level of a nested structure.

Xuất hiện trongLiteral Types và const assertions

Software

TypeScript Handbook: Union and Intersection TypesDocumentation

TypeScript Team

The official reference for how TypeScript models 'one of these' (union) and 'all of these' (intersection). The intersection section covers the key gotcha: intersecting incompatible types produces `never`, not a compile error at the definition site.

Xuất hiện trongUnion và Intersection Types

Software

TypeScript Handbook: NarrowingDocumentation

TypeScript Team

The official reference for TypeScript's narrowing mechanisms — type guards, assignment narrowing, reachability, discriminated unions, and the `never` type. The section on discriminated unions and exhaustiveness checking is where the design implications of narrowing become clearest.

Xuất hiện trongNarrowing

Software

TypeScript Handbook: Type CompatibilityDocumentation

TypeScript Team

The official reference for how TypeScript's structural type checker determines compatibility — including covariance, function parameter checking, and the excess property check quirk that applies to object literals but not assigned variables.

Xuất hiện trongStructural Typing và Nominal Typing

Software

Go Specification: Type DeclarationsDocumentation

The Go Authors

The precise specification of Go's distinction between a type definition (type UserId string — creates a new type) and a type alias (type UserId = string — just another name). One character of difference, entirely different compiler behavior.

Xuất hiện trongStructural Typing và Nominal Typing

Software

TypeScript GitHub Issue #202: Nominal TypingArticle

TypeScript Community · 2014

The longest-running discussion in TypeScript's history about whether to add native nominal typing. Reading the thread is the fastest way to understand why TypeScript chose structural typing — and why the question has never fully been settled.

Xuất hiện trongStructural Typing và Nominal Typing

Software

Total TypeScriptArticle

Matt Pocock

The most practical writing on TypeScript type safety patterns, including branded types, opaque types, and type-level programming. Pocock's explanations of why structural typing bites you in large codebases — and how to work around it — are the clearest available.

Xuất hiện trongStructural Typing và Nominal Typing

Software

TypeScript Handbook: Type AssertionsDocumentation

TypeScript Team

The official reference for `as T` and `<T>value`. The handbook describes the rule TypeScript enforces on assertions (the type must be at least as wide or at least as narrow as the source) and the escape valve through `unknown` that bypasses that rule. The page is also where the standard advice to prefer type declarations over assertions is anchored.

Xuất hiện trongHướng dẫn thật về any, unknown, và as

Software

Effective TypeScript: 62 Specific Ways to Improve Your TypeScriptBook

Vanderkam, Dan · 2024

Item 9 ("Prefer Type Declarations to Type Assertions") is the canonical statement of the distinction this chapter draws: a declaration forces the value to structurally satisfy the type at the point of assignment, an assertion overrides what the compiler currently thinks. The book is the most reliable source for the patterns experienced TypeScript developers reach for once `strict` no longer surprises them.

Xuất hiện trongHướng dẫn thật về any, unknown, và as

Software

How Do Committees Invent?Article

Conway, Melvin E. · 1968

The original statement of Conway's Law, published in Datamation in April 1968: "organizations which design systems are constrained to produce designs which are copies of the communication structures of these organizations." Conway was writing about committee-designed systems, but the observation has become foundational for software architecture, including the smaller-scale version this post applies to type design, where the field-by-field decisions in a struct quietly encode the team boundaries of the organization.

Xuất hiện trongType design là system design DDD và microservices

Software

No Silver Bullet: Essence and Accidents of Software EngineeringPaper

Brooks, Frederick P. Jr. · 1986

The canonical statement of the distinction between accidental complexity (complexity contributed by tools, languages, and environment, which better tooling can remove) and essential complexity (complexity inherent in the problem domain itself, which no tool can remove). Brooks predicted no single technical advance would deliver an order-of-magnitude productivity gain on essential complexity, and decades later the prediction still holds. This chapter cites Brooks to mark the line: TypeScript removes a large share of accidental complexity, but the remaining work is essential, and it is where engineering judgment lives.

Xuất hiện trongĐiều TypeScript không làm được cho bạn

Software

The Design of Everyday ThingsBook

Norman, Donald A. · 1988

Norman is cited across this site for two complementary ideas. (1) The slips-versus-mistakes distinction borrowed in the TypeScript chapter: a slip is an execution failure (you knew what to do, but did it wrong), a mistake is a planning failure (you did exactly what you intended, but the intention itself was wrong). (2) The concept of "affordance": a well-designed artifact signals how it should be used before the user reads any instruction. The Structure and Layout chapter applies that idea to code: structure is the affordance of a file, the signal the reader picks up before reading any line.

Xuất hiện trongĐiều TypeScript không làm được cho bạn Structure và Layout

Software

Zod: TypeScript-first schema validation with static type inferenceDocumentation

Colin McDonnell

The most widely used answer to the gap between "the compiler trusts" and "the runtime verifies". A Zod schema is simultaneously the type the compiler uses to check downstream code and the validator the runtime uses to check incoming data, so the two cannot drift apart. It is the standard close on the `as Config` pattern this chapter calls insufficient.

Xuất hiện trongHướng dẫn thật về any, unknown, và as

Software

Flow: The Psychology of Optimal ExperienceBook

Mihaly Csikszentmihalyi · 1990

Burnout is not caused by working too much — it is caused by the absence of flow: no challenge, no meaning, no absorption. Flow requires intrinsic motivation; activities done for external rewards rarely produce it.

Xuất hiện trongBảy Năm Trong Ngành Phần Mềm

Software

The Mythical Man MonthBook

Frederick P. Brooks Jr. · 1975

The tar pit of software complexity — all sufficiently complex systems have hidden parts far larger than visible ones. Adding people to a late project makes it later. Still the clearest book about why software is hard.

Xuất hiện trongBảy Năm Trong Ngành Phần Mềm

Software

The CraftsmanBook

Richard Sennett · 2008

The philosophical case for craft: doing something well for its own sake. What distinguishes a good craftsman from a long-tenured engineer is not time served but attitude toward the work.

Xuất hiện trongBảy Năm Trong Ngành Phần Mềm

Software

Parkinson's LawBook

C. Northcote Parkinson · 1957

The Law of Triviality: people spend the most time on the things easiest to understand, regardless of importance. In code review, this means many comments on variable names and silence on architecture.

Xuất hiện trongVề Code Review

Software

Accelerate: The Science of Lean Software and DevOpsBook

Forsgren, Humble, Kim · 2018

Research-backed evidence that high-performing software teams do lightweight, fast code review — not heavy, thorough review. The difference is PR clarity and role clarity, not review volume. Also the source of the four DORA metrics (deployment frequency, lead time, change failure rate, mean time to restore) — the only software delivery metrics with empirically validated correlation to business outcomes across thousands of organizations.

Xuất hiện trongVề Code Review Cái Thước Không Có Đơn Vị

Software

Agile Estimating and PlanningBook

Cohn, Mike · 2005

The book that popularized story points and velocity as Agile planning tools. Cohn's own framing is precise: story points measure relative effort and complexity, not time; velocity is a team's historical reference for its own sprint planning, not a productivity metric. Both of his warnings — against comparing velocity across teams, and against using velocity as a management KPI — are systematically ignored in most implementations.

Xuất hiện trongCái Thước Không Có Đơn Vị

Software

Google Engineering Practices: Code ReviewDocumentation

Google

The explicit statement that the goal of code review is to improve code health over time, not to achieve perfection. Perfectionism in review is the enemy of velocity.

Xuất hiện trongVề Code Review

Software

Designing Data-Intensive ApplicationsBook

Martin Kleppmann · 2017

The most thorough treatment of distributed systems tradeoffs in print. Chapter 8 on "The Trouble with Distributed Systems" is essential: the fundamental problem is that you cannot distinguish a slow node from a failed one. Chapter 9 on consistency and consensus is the canonical companion reading for any chapter that touches CAP, linearizability, or eventual consistency. The chapters on replication and partitioning are also the canonical reference for the trade-offs that event-driven architectures inherit.

Xuất hiện trongOOP, state, và những thứ bạn không thể control Object khi ranh giới là network Message-based OOP — Alan Kay đã đúng từ đầu Time-Of-Check to Time-Of-Use Event-Driven Architecture Vật lý của hệ thống phân tán Log là nguồn sự thật duy nhất không thể nói dối Một message không chỉ được đọc một lần Message là gì, và tại sao nó không phải là HTTP Data Consistency, Cái Giá Của Sự Thật Thời gian — thứ distributed systems không thực sự có Đọc một hệ thống như đọc bản đồ failure modes Replication — Nhân bản không phải sao chép Partitioning — Cắt thế nào thì sống với hệ quả đó

Software

Design Patterns: Elements of Reusable Object-Oriented SoftwareBook

Gamma, Helm, Johnson, Vlissides (Gang of Four) · 1994

The original catalog of 23 patterns, still in print after three decades. The warning "don't apply these patterns blindly" appears in the book itself — the source of every pattern museum.

Xuất hiện trongVề Design Pattern Inheritance và Composition — không phải là "dùng cái nào"

Software

Refactoring: Improving the Design of Existing CodeBook

Martin Fowler · 1999

Do not refactor because the code looks bad. Refactor when you have a clear reason — usually, because a change you need to make is hard. Context decides the tool, not the other way around.

Xuất hiện trongVề Design Pattern Tradeoffs SOLID — nguyên tắc hay quyết định?Code review bằng mắt OOP

Software

Working Effectively with Legacy CodeBook

Feathers, Michael · 2004

Feathers famously defines legacy code as code without tests, because without tests no change can be verified as safe. The OOP chapter on code review reads the same definition at another layer: legacy code is code whose wrong abstraction has accumulated enough dependents that fixing it has become a project of its own. The book is a catalog of techniques for breaking dependencies so tests (and therefore change) become possible — seams, sprout methods, characterization tests. The chapter cites it to mark what PR review is the last cheap moment before: once merged, every day adds callers, workarounds, and assumptions that seal the abstraction in place.

Xuất hiện trongCode review bằng mắt OOP Những quyết định không có sách nào dạy

Software

The WyCash Portfolio Management SystemPaper

Cunningham, Ward · 1992

The OOPSLA 1992 experience report where the technical debt metaphor first appears: shipping imperfect code to learn faster is like borrowing money, useful as long as the loan is deliberate and the interest gets paid. The chapter leans on two later clarifications Cunningham made (notably in his 2009 "Debt Metaphor" video): the metaphor was meant to explain to business people why refactoring is valuable, not to justify shipping bad code, and carelessly written code that nobody recorded as a decision is not debt at all but cruft. That distinction between deliberate, recorded debt and invisible accident is the backbone of the chapter's explicit-versus-implicit debt argument.

Xuất hiện trongNhững quyết định không có sách nào dạy

Software

Strangler Fig ApplicationArticle

Fowler, Martin · 2004

The bliki entry that named and popularized the incremental migration pattern, after Fowler observed literal strangler figs in Australia: build the new system around the edges of the old one and let the old one shrink as traffic routes away from it, instead of attempting a big-bang rewrite. The chapter cites it for the origin of the name and for the observation underneath the pattern: it is not an invention but a description of how large codebases actually migrate successfully in practice, in contrast to rewrite projects that routinely fail. The pattern's enabling mechanism is the seam, which is why the chapter pairs it with Feathers.

Xuất hiện trongNhững quyết định không có sách nào dạy

Software

Structured Programming with go to StatementsPaper

Donald E. Knuth · 1974

The original source of the "premature optimization is the root of all evil" quote, almost always cited without its qualifier: "Yet we should not pass up our opportunities in that critical 3%." Knuth was not arguing against optimization, he was arguing against optimizing the parts that do not matter. The full passage frames the decision as a question about where the cost actually lives, which is the same frame this chapter applies to performance-vs-clarity tradeoffs.

Xuất hiện trongTradeoffs

Software

A Behavioral Model of Rational ChoicePaper

Herbert A. Simon · 1955

The paper that introduced satisficing as a model of how people actually make decisions under bounded cognitive resources. Simon argued that good decisions are not the result of finding the optimal solution but of finding a solution that is good enough given real constraints on time and information. The chapter applies that intuition to code quality: "enough" is not an absolute, it is a calibration against context and cost of error.

Xuất hiện trongKhi nào thì đủ

Software

The Paradox of ChoiceBook

Barry Schwartz · 2004

Extends Simon's satisficing into the psychological cost of always seeking the optimal: paralysis, regret, dissatisfaction. The book frames maximizers and satisficers as two stances toward decision-making, not two skill levels. The chapter uses this to name the source of over-engineering, optimization for an imagined reader instead of the real one, as a stance problem rather than a discipline problem.

Xuất hiện trongKhi nào thì đủ

Software

A Philosophy of Software DesignBook

John Ousterhout · 2018

Ousterhout's argument that complexity is the central problem of software design, and that complexity accumulates in places nobody notices: shallow modules that hide too little, layers added for "cleanliness" that increase total cost, abstractions that are made before the use cases that would justify them. The OOP Chapter 3 references his distinction between tactical programming (solving today's problem fastest) and strategic programming (investing in keeping complexity low), as the backdrop for why code with no opinion about its axis of change is not flexible but deferred. The "when is it enough" post uses his over-engineering argument.

Xuất hiện trongKhi nào thì đủ Thứ bạn đang thiết kế cho là sự thay đổi SOLID — nguyên tắc hay quyết định?Code review bằng mắt OOP

Software

Agile Software Development, Principles, Patterns, and PracticesBook

Robert C. Martin · 2002

The formal treatment of SOLID principles, as a set of questions for recognizing problems in code, not formulas to apply mechanically. The principles predate the patterns. The book also contains the canonical story of how ISP was extracted from work on Xerox printer software, where a single Job interface forced every job type to implement dozens of unrelated methods.

Xuất hiện trongVề Design Pattern SOLID là ngôn ngữ, không phải luật SOLID — nguyên tắc hay quyết định?Dependency — triết học trước, framework sau

Software

Clean Architecture: A Craftsman's Guide to Software Structure and DesignBook

Robert C. Martin · 2017

Martin's late restatement of SRP, made necessary by how widely the principle had been misread. The book replaces "a class should do only one thing" with "a module should be responsible to one, and only one, actor," and uses the word actor to mean a group of people who share goals and ownership over part of the system. The book also generalizes Cockburn's Hexagonal Architecture into Martin's own framing, with concentric circles of entities, use cases, interface adapters, and frameworks, all governed by the same one-way dependency rule that points inward toward the domain.

Xuất hiện trongSOLID là ngôn ngữ, không phải luật Domain modeling — từ bài toán thật đến class

Software

Practical Object-Oriented Design in Ruby (and talks)Book

Sandi Metz · 2012

The cost of abstraction: every abstraction reduces one kind of complexity while adding another. The right question is not "is this pattern good?" but "is this trade-off worth it for the problem I actually have?"

Xuất hiện trongVề Design Pattern SOLID — nguyên tắc hay quyết định?

Software

Manifesto for Agile Software DevelopmentDocumentation

Beck, Kent; Beedle, Mike; van Bennekum, Arie; Cockburn, Alistair; Cunningham, Ward; Fowler, Martin; Grenning, James; Highsmith, Jim; Hunt, Andrew; Jeffries, Ron; Kern, Jon; Marick, Brian; Martin, Robert C.; Mellor, Steve; Schwaber, Ken; Sutherland, Jeff; Thomas, Dave · 2001

The original manifesto written by seventeen practitioners at a Utah ski resort in February 2001. Four values and twelve principles, framed as priorities ("X over Y") rather than rejections. The post turns on this distinction: Agile is a statement about what to weight more heavily, not a set of ceremonies to perform. The principle of "maximizing the amount of work not done" is the one most often skipped in real implementations.

Xuất hiện trongDanh Nghĩa Agile

Software

How Amazon Web Services Uses Formal MethodsPaper

Newcombe, Chris; Rath, Tim; Zhang, Fan; Munteanu, Bogdan; Brooker, Marc; Deardeuff, Michael · 2015

Published in ACM Queue. The case study that brought TLA+ into mainstream engineering discourse. AWS engineers describe finding subtle defects in S3, DynamoDB, and EBS designs that had survived code review and testing for years, then discuss why the cost of writing a specification paid back as protection against production-time race conditions on systems that handle trillions of requests. The post's opening anecdote is taken from this paper.

Xuất hiện trongKẻ Trốn Trong Khoảng Trống

Software

Practical TLA+: Planning Driven DevelopmentBook

Wayne, Hillel · 2018

The book that turned TLA+ from a niche academic tool into something working engineers could actually pick up. Wayne's central contribution is the 'abstraction sweet spot' — small enough for the model checker to finish, precise enough to catch real bugs. The post borrows several framings from this book, including the recommendation to learn PlusCal first and only drop to raw TLA+ when needed.

Xuất hiện trongKẻ Trốn Trong Khoảng Trống

Software

The Chubby Lock Service for Loosely-Coupled Distributed SystemsPaper

Burrows, Mike · 2006

OSDI 2006. The original paper describing the distributed lock service that sits underneath much of Google's infrastructure. The reason every distributed lock built afterward uses leases and epoch numbers: the paper traces the bugs that forced those design choices, exactly the corner cases the post's distributed-lock example surfaces through TLA+.

Xuất hiện trongKẻ Trốn Trong Khoảng Trống

Software

RFC 791: Internet ProtocolStandard

Postel, Jon · 1981

The specification that defines IPv4. Forty-five years old and still the foundation of most internet traffic. RFC 791's design choice that nothing about IP is reliable — not delivery, not ordering, not integrity — is the choice every layer above IP has had to work around ever since.

Xuất hiện trongMười Bốn Bước Trước Khi HTML Đến

Software

RFC 793: Transmission Control ProtocolStandard

Postel, Jon · 1981

The original TCP specification. Defines the three-way handshake, sequence numbers, retransmission, flow control, and the connection termination dance. Every behavior that makes TCP feel "reliable" lives here. The post traces how each of these mechanisms is a specific answer to one specific guarantee that IP underneath refuses to make.

Xuất hiện trongMười Bốn Bước Trước Khi HTML Đến

Software

RFC 9110: HTTP SemanticsStandard

Fielding, Roy T.; Nottingham, Mark; Reschke, Julian · 2022

The current consolidated specification of HTTP semantics, version-independent. Replaces the older RFC 2616 and clarifies what HTTP means across HTTP/1.1, HTTP/2, and HTTP/3. The post borrows the version-agnostic framing: the wire format keeps changing, but the request-response semantics have been stable for thirty years.

Xuất hiện trongMười Bốn Bước Trước Khi HTML Đến

Software

RFC 8446: The Transport Layer Security (TLS) Protocol Version 1.3Standard

Rescorla, Eric · 2018

The current TLS specification. Reduces the handshake to one round-trip in the common case and zero round-trips on resumption. The single largest change to how HTTPS feels in twenty years, and the answer to most of the latency complaints about earlier TLS versions.

Xuất hiện trongMười Bốn Bước Trước Khi HTML Đến

Software

RFC 9114: HTTP/3Standard

Bishop, Mike · 2022

The specification of HTTP/3, the version that abandons TCP entirely and runs over QUIC on UDP. The reason: TCP's in-order delivery creates head-of-line blocking that no amount of application-layer multiplexing can fix. The only escape was replacing the transport layer. The post uses HTTP/3 as the final example of the 'do not trust the layer below' pattern.

Xuất hiện trongMười Bốn Bước Trước Khi HTML Đến

Software

Solutio problematis ad geometriam situs pertinentisPaper

Euler, Leonhard · 1741

Published in *Commentarii Academiae Scientiarum Petropolitanae*. The paper that founded graph theory by solving the Königsberg bridges problem — and by inventing the abstraction needed to solve it. Euler called the new field *geometria situs*, geometry of position; the name "graph theory" came much later. The piece traces the move he made (throw away the map, keep only nodes and edges) as the founding gesture of network science.

Xuất hiện trongBảy Cầu, Sáu Bước, Và Cấu Trúc Ẩn Của Mọi Thứ

Science

On Random Graphs IPaper

Erdős, Paul; Rényi, Alfréd · 1959

Published in *Publicationes Mathematicae*. The founding paper of random graph theory, introducing the G(n, p) model and discovering the phase transition at p = 1/n where a giant component suddenly appears. The model describes real networks poorly, but the concepts it introduced — percolation threshold, giant component, phase transition — survive in every more accurate model that followed.

Xuất hiện trongBảy Cầu, Sáu Bước, Và Cấu Trúc Ẩn Của Mọi Thứ

Science

Collective dynamics of 'small-world' networksPaper

Watts, Duncan J.; Strogatz, Steven H. · 1998

Published in *Nature*. One of the most cited papers in network science. Shows that a tiny fraction of random shortcuts collapses average path length without breaking local clustering, which is why six-degrees-of-separation is mathematically inevitable in any network with even a few long-range links. The empirical evidence covers C. elegans, the US power grid, and the actor collaboration graph.

Xuất hiện trongBảy Cầu, Sáu Bước, Và Cấu Trúc Ẩn Của Mọi Thứ

Science

Emergence of Scaling in Random NetworksPaper

Barabási, Albert-László; Albert, Réka · 1999

Published in *Science*. The paper that introduced the scale-free network concept and the preferential-attachment model. Has over 40,000 citations and reframed how people analyze the web, citation networks, biological networks, and the structural origins of inequality in growing systems.

Xuất hiện trongBảy Cầu, Sáu Bước, Và Cấu Trúc Ẩn Của Mọi Thứ Graph ở quy mô lớn: nơi sách giáo khoa dừng lại

Science

The Strength of Weak TiesPaper

Granovetter, Mark · 1973

Published in *American Journal of Sociology*. Among the most cited papers in all of sociology. Documents that most people find new jobs not through close friends but through acquaintances — and explains the result structurally: acquaintances bridge communities and carry non-redundant information, which is a function of betweenness, not closeness.

Xuất hiện trongBảy Cầu, Sáu Bước, Và Cấu Trúc Ẩn Của Mọi Thứ

Science

Final Report on the August 14, 2003 Blackout in the United States and CanadaReport

U.S.-Canada Power System Outage Task Force · 2004

The official forensic report on the cascade failure that started with a tree branch in Ohio and left 55 million people without power across eight US states and Ontario. The post uses it as the anchoring real-world example of cascade failure in a network operating near its percolation threshold with hidden interdependencies between the physical grid and the SCADA control system.

Xuất hiện trongBảy Cầu, Sáu Bước, Và Cấu Trúc Ẩn Của Mọi Thứ

Science

Thinking, Fast and SlowBook

Kahneman, Daniel · 2011

Kahneman's synthesis of forty years of research with Amos Tversky, written for non-specialists but without simplifying away the substance. The post draws on it for the System 1 / System 2 framing, the bat-and-ball problem, planning fallacy, the curriculum-writing anecdote, and Kahneman's own confession that decades of studying biases did not make him noticeably better at avoiding them.

Xuất hiện trongNgười Kinh Tế Không Tồn Tại Structure và Layout

Philosophy

Prospect Theory: An Analysis of Decision under RiskPaper

Kahneman, Daniel; Tversky, Amos · 1979

Published in *Econometrica*, one of the most-cited papers in all of social science. Replaces Expected Utility Theory with a model grounded in three empirical departures: reference dependence, an S-shaped value function with loss aversion built into its geometry, and a probability weighting function. The Nobel Committee cited this work when awarding Kahneman the Economics prize in 2002.

Xuất hiện trongNgười Kinh Tế Không Tồn Tại

Philosophy

Nudge: Improving Decisions About Health, Wealth, and HappinessBook

Thaler, Richard H.; Sunstein, Cass R. · 2008

The book that turned choice architecture from an academic concept into a policy framework adopted by governments and corporations. The post borrows the Save More Tomorrow case study, the organ-donation default contrast between Austria and Germany, and the "libertarian paternalism" framing. The 2021 revised edition addresses common criticisms, including the dark-patterns problem of choice architecture designed against the user.

Xuất hiện trongNgười Kinh Tế Không Tồn Tại

Philosophy

Misbehaving: The Making of Behavioral EconomicsBook

Thaler, Richard H. · 2015

Thaler's own account of building behavioral economics from a list of "weird things people do that economic theory cannot explain" into a recognized field that won him the 2017 Nobel. Part memoir, part intellectual history, candid about the early decades when the work was dismissed as "not economics" by mainstream academia.

Xuất hiện trongNgười Kinh Tế Không Tồn Tại

Philosophy

OCI Image Format SpecificationStandard

Open Container Initiative

The standard that defines how container images are packaged: manifest format, layer encoding, content-addressable storage. The reason an image built by Docker runs unchanged on Podman, containerd, or any other OCI-compliant runtime — and the reason `docker push` to a registry transfers only the layers the registry does not already have.

Xuất hiện trongCái Mà Bạn Nghĩ Bạn Đang Chạy

Software

OCI Runtime SpecificationStandard

Open Container Initiative

The interface between container managers (containerd) and low-level runtimes (runc, crun, kata). Every Dockerfile instruction and every `docker run` flag eventually compiles to the JSON document this spec defines, and that JSON is what the kernel actually sees.

Xuất hiện trongCái Mà Bạn Nghĩ Bạn Đang Chạy

Software

CIS Docker BenchmarkDocumentation

Center for Internet Security

The reference checklist for hardening a Docker host and the containers running on it. Run `docker-bench-security` against any production host and you get a list of concrete things to fix — usually the first time a team learns how many of its container defaults are unsafe.

Xuất hiện trongCái Mà Bạn Nghĩ Bạn Đang Chạy

Software

Developers Should Abandon AgileArticle

Jeffries, Ron · 2018

Written by one of the original Manifesto signatories. Jeffries is not arguing against Agile's principles. He is arguing that what gets called "Agile" inside organizations has become a top-down compliance regime imposed on developers, the inverse of what the Manifesto intended. The post borrows this distinction to separate Agile-as-principles from Agile-as-control-language.

Xuất hiện trongDanh Nghĩa Agile

Software

A Universal Modular Actor Formalism for Artificial IntelligencePaper

Hewitt, Carl; Bishop, Peter; Steiger, Richard · 1973

The original paper that introduced the Actor Model. Three rules govern an actor on receiving a message: send a finite number of messages, create a finite number of new actors, and decide its behavior for the next message. The paper was framed for AI reasoning in the Planner language, not distributed systems — the application to telecom and fault tolerance came later through Armstrong's Erlang work at Ericsson.

Xuất hiện trongCái Switch Không Biết Mình Là Anh Hùng

Software

Making Reliable Distributed Systems in the Presence of Software ErrorsPaper

Armstrong, Joe · 2003

Joe Armstrong's PhD thesis, submitted 19 years after he started building Erlang. The argument is mathematical but its core is simple: fault tolerance is not a feature you add — it emerges from isolation. If two processes share no state, a crash in one cannot corrupt the other. The thesis formalises why a supervision tree works, and codifies 'let it crash' as a design philosophy rather than a defeatist slogan.

Xuất hiện trongCái Switch Không Biết Mình Là Anh Hùng

Software

Brewer's Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web ServicesPaper

Gilbert, Seth; Lynch, Nancy · 2002

The formal proof of Eric Brewer's 1998 CAP conjecture: a distributed system cannot simultaneously guarantee consistency, availability, and partition tolerance. The popular reading 'pick two of three' misses the point. Network partitions are unavoidable in practice, so the real choice is between consistency and availability when a partition happens. Erlang and OTP systems typically choose AP, which is the right answer for telecom and most user-facing systems.

Xuất hiện trongCái Switch Không Biết Mình Là Anh Hùng Vật lý của hệ thống phân tán Pattern không phải câu trả lời, pattern là câu hỏi đã đông cứng

Software

Dynamo: Amazon's Highly Available Key-value StorePaper

DeCandia, Giuseppe; Hastorun, Deniz; Jampani, Madan; Kakulapati, Gunavardhan; Lakshman, Avinash; Pilchin, Alex; Sivasubramanian, Swaminathan; Vosshall, Peter; Vogels, Werner · 2007

The original Dynamo paper. Notable not for its data structures but for its design philosophy: Amazon chose availability over consistency as a deliberate product decision, then designed conflict resolution as a feature with an explicit failure model ("a duplicated item in the cart is preferable to a missing one"). Section 4 (System Interface) and Section 5 (Implementation) describe the always-writeable model and semantic reconciliation that the post draws on. The paper is the technical ancestor of the public DynamoDB service released in 2012.

Xuất hiện trongVật lý của hệ thống phân tán Pattern là câu trả lời, constraint là câu hỏi Cái Giá Của Sự Đơn Giản Pattern không phải câu trả lời, pattern là câu hỏi đã đông cứng Partitioning — Cắt thế nào thì sống với hệ quả đó Cold-reading một hệ thống bạn chưa bao giờ thấy

Software

Consistent Hashing and Random Trees: Distributed Caching Protocols for Relieving Hot Spots on the World Wide WebPaper

Karger, David; Lehman, Eric; Leighton, Tom; Panigrahy, Rina; Levine, Matthew; Lewin, Daniel · 1997

The original paper that introduced consistent hashing, presented at ACM STOC 1997. The motivating context was web caching: how to distribute cached content across many cache servers without a central directory, in a way that would survive the addition and removal of servers as the web grew. The paper is the source of the hash-ring construction, the proof of the 1/(N+1) migration bound, and the random-tree variant that is less known but also proposed in the same paper. Daniel Lewin, one of the authors, went on to co-found Akamai, which used the algorithm at production scale for CDN content distribution.

Xuất hiện trongCái Giá Của Sự Đơn Giản Pattern là câu trả lời, constraint là câu hỏi

Software

A Name-Based Mapping Scheme for RendezvousPaper

Thaler, David; Ravishankar, Chinya · 1996

The introduction of what is now called Rendezvous Hashing or Highest Random Weight (HRW). Predates Karger's consistent hashing by a year, but received much less attention because it was framed as a multicast routing technique rather than a caching primitive. Each key independently computes a score for every server and picks the highest, yielding a distribution with no shared state at the cost of O(N) lookup. Used today in some DNS-based load balancers and as an internal mechanism in Riak.

Xuất hiện trongCái Giá Của Sự Đơn Giản

Software

A Fast, Minimal Memory, Consistent Hash AlgorithmPaper

Lamping, John; Veach, Eric · 2014

A three-page Google paper introducing Jump Consistent Hash. The whole algorithm fits in roughly ten lines: a linear congruential generator drives a sequence of "jumps" that lands directly on the final bucket a key belongs to as buckets are added one at a time. No ring, no lookup table, no sorted list, O(log N) lookup with O(1) space. The trade-off is that buckets must be contiguous and numbered from 0; removal from the middle is not natively supported. Eric Veach is notable for having won an Oscar Technical Achievement Award for prior work on rendering algorithms before this paper, which is a hint at the kind of cross-domain elegance the paper has.

Xuất hiện trongCái Giá Của Sự Đơn Giản

Software

Maglev: A Fast and Reliable Software Network Load BalancerPaper

Eisenbud, Daniel E. et al. · 2016

The Google paper at NSDI 2016 describing the software load balancer that handles essentially all traffic into Google. The contribution of interest for consistent hashing is the Maglev hashing algorithm: a permutation-based scheme that builds a fixed-size lookup table of size M (typically a large prime like 65537) where each backend occupies roughly M/N cells and rebuilding the table after a backend change preserves most cells in place. O(1) lookup, deterministic, and tuned for the harder constraint of keeping packets of the same TCP connection on the same backend even as backends churn.

Xuất hiện trongCái Giá Của Sự Đơn Giản

Software

Consistent Hashing with Bounded LoadsPaper

Mirrokni, Vahab; Thorup, Mikkel; Zadimoghaddam, Morteza · 2017

Google Research paper that proves a clean bound on a long-standing weakness of consistent hashing: even with virtual nodes, expected load is balanced but worst-case load is not. The paper introduces a capacity constraint (1+epsilon) times average load and proves that with epsilon = 0.25 the algorithm still keeps O(K/N) keys moving per topology change, while no server ever exceeds 1.25x average load. Used internally at Google for service-mesh load balancing where hot-key risk is real.

Xuất hiện trongCái Giá Của Sự Đơn Giản

Software

Ketama Consistent HashingArticle

Jones, Richard · 2007

The Last.fm engineering blog post that introduced ketama, a client-side consistent-hashing library for Memcached. The post is the first widely-read practical write-up of consistent hashing aimed at backend engineers rather than researchers, and it is the route through which the algorithm entered the mainstream during the late-2000s scaling era. Many Memcached client libraries still expose a "ketama hashing" option as a named alternative to plain modular hashing.

Xuất hiện trongCái Giá Của Sự Đơn Giản

Software

Spanner: Google's Globally-Distributed DatabasePaper

Corbett, James C. et al. · 2012

The Spanner paper. The interesting move is not the database itself but the TrueTime API: by giving every datacenter atomic clocks and GPS receivers, Google bounds clock uncertainty to a few milliseconds, enough to implement external consistency at planetary scale. Section 3 (TrueTime) and Section 4 (Concurrency Control) make the cost of strong consistency concrete: a commit-wait window that adds latency to every transaction. Read alongside the Dynamo paper to see two opposite answers to the same CAP question.

Xuất hiện trongVật lý của hệ thống phân tán Pattern không phải câu trả lời, pattern là câu hỏi đã đông cứng Thời gian — thứ distributed systems không thực sự có Những gì mình vẫn không chắc

Software

Fallacies of Distributed Computing ExplainedPaper

Rotem-Gal-Oz, Arnon · 2006

The paper that first gave Peter Deutsch's list its canonical name. Deutsch wrote down seven assumptions at Sun Microsystems in 1994 (James Gosling added the eighth); the list circulated without a title until Rotem-Gal-Oz collected and expanded them in this paper. The expanded discussion of each fallacy, including 'the network is reliable' and 'latency is zero,' is still the standard reference for why distributed-systems failure modes do not get fixed by faster hardware.

Xuất hiện trongVật lý của hệ thống phân tán Tại sao distributed systems phá vỡ cách bạn vẫn nghĩ Object khi ranh giới là network

Software

Stevy's Google Platforms RantArticle

Yegge, Steve · 2011

Originally posted internally at Google, then accidentally shared publicly, then left up. The piece is best known for being the closest thing we have to a public record of the 2002 Bezos API mandate. Yegge lays out the five points of the mandate from memory, including the closing "Anyone who doesn't do this will be fired," and then uses the rest of the rant to compare Amazon's platform discipline with Google's lack of one. The NSA Contract-First chapter cites this as the primary source for the Bezos mandate, the historical anchor that ties the chapter's central argument (every assumption between teams that does not exist as an interface is a hidden dependency) to a dated, public, organizational decision.

Xuất hiện trongTrade-off không phải lựa chọn, đó là bản chất Contract-First Architecture

Software

Adrian Cockcroft on Netflix's Migration to Cloud and MicroservicesArticle

Cockcroft, Adrian · 2011-2014

Adrian Cockcroft led Netflix's migration from a datacenter monolith to AWS-based microservices, and gave dozens of talks at QCon and AWS re:Invent in the years that followed. The recurring point in these talks is that the migration was not motivated by traffic scale (the database outage of 2008 was the proximate trigger) but by the coordination overhead of a single shared codebase across many fast-growing engineering teams. The chapter uses this as a clean example of an organizational assumption (deployment independence > coordination efficiency) being chosen deliberately and held long enough for its consequences to show.

Xuất hiện trongTrade-off không phải lựa chọn, đó là bản chất

Software

Summary of the AWS Service Event in the Northern Virginia (US-EAST-1) RegionReport

Amazon Web Services · 2021

The public post-mortem for the December 7, 2021 us-east-1 incident. The root cause was a change in internal network capacity scaling logic that produced a retry storm between internal services, but the part most often cited in the industry afterward was that the AWS Console itself was degraded during the outage, because it depended on the very region being affected. The chapter on designing for failure uses this as the canonical example of a failure mode that was discovered rather than designed: the unasked question was "if this region runs into trouble, do the things we use to intervene depend on it?"

Xuất hiện trongThiết kế cho sự thất bại

Software

Site Reliability Engineering: How Google Runs Production SystemsBook

Beyer, Betsy; Jones, Chris; Petoff, Jennifer; Murphy, Niall Richard (eds.) · 2016

The canonical articulation of the SLI/SLO/error budget framework. Chapter 3 ("Embracing Risk") and Chapter 4 ("Service Level Objectives") are the load-bearing pieces. Two posts in this collection draw on the book from different angles. The TTA designing-for-failure chapter reads the error budget as a currency, capital for risky deploys when full, a stop when empty. The NSA SLO-as-Language chapter reads the same framework as an interface artifact, the value of an SLO is not better measurement but shared language between engineering and business, an agreement signed before incidents that both sides will honor when the number moves. The book is freely readable online.

Xuất hiện trongThiết kế cho sự thất bại SLO Như Một Ngôn Ngữ Observability — Bạn không sở hữu cái bạn không nhìn thấy

Software

Principles of Chaos EngineeringArticle

Basiri, Ali; Behnam, Niosha; de Rooij, Ruud; Hochstein, Lorin; Kosewski, Luke; Reynolds, Justin; Rosenthal, Casey · 2015

The short manifesto written by Netflix engineers in 2015 that introduced the term Chaos Engineering and the practice of running controlled failure experiments in production. The chapter cites it for the framing that comes before the tooling: Chaos Engineering is the discipline of discovering blast radius before production discovers it for you. The five principles (build a hypothesis around steady-state behavior, vary real-world events, run experiments in production, automate continuously, minimize blast radius) are the canonical reference any team starting reliability injection work should read first.

Xuất hiện trongThiết kế cho sự thất bại Failure Mode Thinking

Software

Chaos Engineering: System Resiliency in PracticeBook

Rosenthal, Casey; Jones, Nora · 2020

The expanded book treatment of Chaos Engineering, written by two of the practitioners who built the discipline at Netflix and elsewhere. The book is structured as case studies from teams running real failure-injection programs (Netflix, LinkedIn, Capital One, Google, Microsoft), with the practical material on how to start small, how to scope blast radius, and how to integrate failure injection with normal engineering workflow. Worth reading in pairs with the Google SRE book: SRE supplies the vocabulary (SLO, error budget), Chaos Engineering supplies the verb (how to actually test blast radius rather than assume it).

Xuất hiện trongThiết kế cho sự thất bại Failure Mode Thinking

Software

The Field Guide to Understanding Human ErrorBook

Dekker, Sidney · 2006

Dekker's central reframe: the question 'who was wrong' is the wrong question. The right question is, what conditions made this decision reasonable at the time. Dekker is writing about aviation, healthcare, and other high-risk operational domains, but the frame travels straight into software engineering. The NSA Explicit-Over-Implicit chapter draws on Dekker to make the structural point about ADRs and runbooks, when a new engineer six months later looks at an old decision and concludes someone made a mistake, they are almost always missing the conditions that made the original decision reasonable. The fix is not better engineers or stricter discipline; it is artifacts that preserve the conditions of reasonable decision so the person who comes after can stand where the original decision-maker stood.

Xuất hiện trongExplicit Over Implicit

Software

Building Event-Driven Microservices: Leveraging Organizational Data at ScaleBook

Bellemare, Adam · 2020

The most complete practitioner treatment of event-driven architecture as an organizational data model, from event schema design to consumer group patterns. The chapter cites Bellemare for the term "consumer blindness": the failure mode of producer-driven schemas, where the producer assumes consumers will adapt to schema changes while having no visibility into how many consumers exist or which fields they depend on. His framing of the event schema as a public API with the same versioning discipline is the backbone of the chapter's contract section.

Xuất hiện trongMessage-based OOP — Alan Kay đã đúng từ đầu

Software

The Computer Revolution Hasn't Happened YetArticle

Kay, Alan · 1997

Kay's OOPSLA 1997 keynote, the talk where he repeats "the big idea is messaging" in front of the community that had spent a decade building class hierarchies. The chapter cites it for the biological cell analogy: cells keep internal state private and communicate by emitting chemical signals without knowing which receptors will respond, the property the chapter maps onto event-driven producers that emit facts without knowing their consumers. The keynote is also the source for late binding as the essential property of messaging.

Xuất hiện trongMessage-based OOP — Alan Kay đã đúng từ đầu

Software

Pact DocumentationDocumentation

Pact Foundation · 2013

The official documentation for Pact, the consumer-driven contract testing framework originating at realestate.com.au in 2013 and now maintained by the Pact Foundation under an Apache-2.0 license. The chapter cites Pact as the canonical implementation of consumer-driven contract testing at mid-size scale (roughly five to fifteen services). The strategic point is not the tool, but the relationship inversion the tool enforces: consumers publish expectations, the producer's CI verifies its implementation against every published expectation, and a breaking change cannot reach production unless every existing consumer is ready for it. Pact is also a useful reference for the broker/registry pattern that schema registries later generalize to events.

Xuất hiện trongContract-First Architecture Message-based OOP — Alan Kay đã đúng từ đầu

Software

Confluent Schema Registry DocumentationDocumentation

Confluent · 2014

The official documentation for Confluent Schema Registry, the original schema-registry implementation built around the Apache Kafka ecosystem starting in 2014 and now the de facto reference for the pattern. The chapter cites it as the canonical large-scale answer to API contracts in event-driven systems, where hundreds of producers and consumers share a central source of truth for schemas and a configurable compatibility mode (backward, forward, full) is enforced at CI time. Reading the compatibility-modes section is the fastest way to understand what 'breaking change is impossible to deploy by accident' actually means in operational terms.

Xuất hiện trongContract-First Architecture

Software

Observability Engineering: Achieving Production ExcellenceBook

Majors, Charity; Fong-Jones, Liz; Miranda, George · 2022

The book-length treatment of high-cardinality observability by the team that established the term in software engineering. The book's central argument: traditional monitoring tools were built when pre-aggregation was necessary because storage and compute were expensive, and the mental model of monitoring stayed in place after that constraint disappeared. The chapter draws on three load-bearing claims from the book, that observability is a property of the system rather than a category of tool, that the limit of useful observability is set by the cardinality of the data you preserve rather than the cleverness of the dashboard, and that observability is a design decision made on day one, not a layer added when the team has time.

Xuất hiện trongHệ Thống Tự Mô Tả Bản Thân Observability — Bạn không sở hữu cái bạn không nhìn thấy

Software

Dapper, a Large-Scale Distributed Systems Tracing InfrastructurePaper

Sigelman, Benjamin H.; Barroso, Luiz André; Burrows, Mike; Stephenson, Pat; Plakal, Manoj; Beaver, Donald; Jaspan, Saul; Shanbhag, Chandan · 2010

The Google technical report that formalized distributed tracing as an engineering discipline. Dapper defines the trace-and-span data model, the context propagation mechanism (a small header carried through every RPC call), and the sampling argument (100% trace rate is unaffordable at Google's scale; head-based adaptive sampling with a small per-request probability is the practical answer). The chapter draws on Dapper for three claims: that trace context propagation must be carried by application code, not just infrastructure; that sampling is a necessity rather than an optimization; and that the choice between head-based and tail-based sampling is a bet on the trade-off between simplicity and signal completeness. The paper is the ancestor of OpenTracing, OpenCensus, and OpenTelemetry.

Xuất hiện trongObservability — Bạn không sở hữu cái bạn không nhìn thấy

Software

On the General Theory of Control SystemsPaper

Kálmán, Rudolf E. · 1960

The original paper in which Rudolf Kálmán introduced the formal definition of observability in control theory, a system is observable if its complete state can be determined from a finite sequence of its outputs. The software-engineering use of the word, popularized by Charity Majors and colleagues, is a deliberate borrowing of this technical sense, not a metaphor. The chapter cites Kálmán to anchor the distinction between monitoring (verify the system is doing what you expected) and observability (recover internal state from external output, including when the state is unexpected). Without the Kálmán framing, the word collapses back into marketing-flavored 'better monitoring.'

Xuất hiện trongHệ Thống Tự Mô Tả Bản Thân

Software

OpenTelemetry DocumentationDocumentation

Cloud Native Computing Foundation · 2019

The open standard for distributed tracing, metrics, and logs in cloud-native systems, formed in 2019 by merging the OpenTracing and OpenCensus projects under the CNCF. The chapter cites OpenTelemetry as the right moment of investment at mid-size scale (roughly five to fifteen services), specifically because it separates instrumentation from backend, the team writes the instrumentation once and can ship the data to Honeycomb, Jaeger, Tempo, or any compliant store without rewriting code. The point is not the tool but the standard, the strategic property of OpenTelemetry is that it makes the choice of observability backend reversible.

Xuất hiện trongHệ Thống Tự Mô Tả Bản Thân

Software

The Netflix Simian ArmyArticle

Bennett, Cory; Tseitlin, Ariel · 2011

The original Netflix Tech Blog post in which the engineers who built Chaos Monkey describe what it does and why. Bennett and Tseitlin's framing is the one the chapter draws on, Chaos Monkey was not built to find bugs but to make instance failure a regular feature of the system, so that the organization had to build every service to survive it. The post also introduces the broader Simian Army (Latency Monkey, Conformity Monkey, Janitor Monkey, etc.), the canonical archetype of a chaos engineering toolset before the term itself had stabilized.

Xuất hiện trongFailure Mode Thinking Resilience — Thiết kế để thất bại đẹp, không phải thất bại im lặng

Software

Introducing Lyft's Envoy: Open-source L7 proxy and communication busArticle

Klein, Matt · 2016

The September 2016 announcement post in which Matt Klein, Envoy's author, describes why Lyft built an out-of-process proxy rather than a shared library. The key argument: cross-language, cross-team consistency in retry logic, timeout enforcement, TLS, and observability cannot be achieved through a library that each team adopts differently. Moving all of that into a sidecar process makes the behavior uniform across every service regardless of language or team. The post also explains why client-side load balancing is architecturally superior to server-side load balancers for service meshes: routing intelligence belongs in the distributed caller, not in a centralized bottleneck.

Xuất hiện trongService Communication — Coupling ẩn trong network

Software

gRPC: A high performance, open-source universal RPC frameworkArticle

Google · 2015

Google's 2015 announcement of gRPC as the public version of Stubby, its internal RPC system. The post explains the two design decisions that define gRPC: HTTP/2 as transport (multiplexing eliminates head-of-line blocking that made HTTP/1.1 expensive at Google's internal call volumes) and Protocol Buffers as the schema format (field-numbered messages with compile-time backward-compatibility enforcement). The chapter uses this as the primary anchor for why schema contract enforcement, not performance, is the main argument for gRPC over REST in internal service communication.

Xuất hiện trongService Communication — Coupling ẩn trong network

Software

Hystrix Wiki: How It WorksDocumentation

Netflix · 2013

The primary engineering documentation for Netflix Hystrix, the circuit breaker library Netflix built and open-sourced. The wiki is the most complete published account of why each design decision was made — the 1,000ms default execution timeout derived from observed dependency latency profiles, the 50% failure threshold with a 20-request minimum before tripping, the three-tier fallback classification (fail-silent, static, stubbed), and the half-open probe recovery model. The library is deprecated (Resilience4j is the successor), but the wiki remains the canonical reference for circuit breaker design rationale because it documents not just what the defaults are but why those values were chosen from production data.

Xuất hiện trongResilience — Thiết kế để thất bại đẹp, không phải thất bại im lặng

Software

Exponential Backoff And JitterArticle

Brooker, Marc · 2015

Marc Brooker's 2015 AWS Architecture Blog post presenting empirical data on retry strategies under correlated failure. The post compares naive retry, fixed backoff, exponential backoff, and three jitter strategies (equal jitter, full jitter, decorrelated jitter) using both simulation and production observation at Amazon. The key finding — that full jitter (random delay over the entire backoff window) outperforms exponential backoff alone on lock contention and system throughput when many callers are correlated — is the primary source for the chapter's argument that backoff alone does not prevent thundering herd. The post also provides code samples for each strategy.

Xuất hiện trongResilience — Thiết kế để thất bại đẹp, không phải thất bại im lặng

Software

Slack's Outage on January 4th 2021Report

Slack Engineering · 2021

Slack's public engineering write-up of the January 4, 2021 outage that began with a routine routing configuration change and degraded into a system-wide retry storm. The post-mortem is unusual in how deeply it walks the propagation path rather than stopping at a single root cause, showing how every individual service was behaving correctly by its own design while the interaction between services produced a positive feedback loop. The chapter uses Slack 2021 as the opening case study for failure-mode thinking, the failure that lives in the interaction between components rather than inside any one component.

Xuất hiện trongFailure Mode Thinking Observability — Bạn không sở hữu cái bạn không nhìn thấy

Software

Release It!: Design and Deploy Production-Ready SoftwareBook

Nygard, Michael · 2018

Michael Nygard's book on production stability, originally published in 2007 with a substantially revised second edition in 2018. The book formalizes the circuit breaker, bulkhead, and timeout patterns that the chapter on designing for failure references. The book is also where the temporal-coupling diagnosis is first stated in operational rather than purely conceptual terms: systems where the sender controls cadence are reliably more fragile than systems where the receiver can say when it is ready. The central position is that most production failures are not bugs in any single service but unintended interactions between services under abnormal conditions, and that stability patterns are design decisions about integration boundaries, not implementation details bolted on after the fact.

Xuất hiện trongThiết kế cho sự thất bại Message là gì, và tại sao nó không phải là HTTP

Software

What Do You Mean By "Event-Driven"?Article

Fowler, Martin · 2017

Fowler's working post on the family of patterns that get bundled under 'event-driven,' including the distinction between event notification, event-carried state transfer, event sourcing, and CQRS. The piece is the cleanest published account of why asynchronous messaging is not one design but four, and why teams that say 'we should be event-driven' are usually saying four different things at once. The chapter cites it as the standard reading for the next layer beyond the synchronous-vs-asynchronous split.

Xuất hiện trongMessage là gì, và tại sao nó không phải là HTTP Event-Driven — Thoát khỏi nô lệ thời gian thực

Software

How FriendFeed Uses MySQL to Store Schema-less DataArticle

Slatkin, Bret · 2009

The FriendFeed engineering write-up that has become the canonical early lesson on schema-less design over a relational database. The team stored data as serialized blobs to avoid migration cost during fast product change, then discovered that they had not avoided migration cost, only deferred it into application-layer complexity that lacked any of the tooling a real schema would have provided. The piece is worth reading not as a verdict on the choice itself but as one of the cleanest contemporary descriptions of how a data decision becomes a commitment that outlives the team's original intent. FriendFeed was acquired by Facebook the same year the post was written.

Xuất hiện trongData là contract, không phải implementation

Software

Designing Robust and Predictable APIs with IdempotencyArticle

Leach, Brandur · 2017

Stripe engineering post on the idempotency-key pattern, written by Brandur Leach. The piece is more interesting than the standard "how to design an API" essay because it argues, with examples from production, that idempotency is not a feature you can bolt onto an API after the fact. The storage shape it requires (O(1) lookup by key, deterministic expiry, conflict resolution under concurrent load) has to be designed into the data model from day one. The chapter uses this as the textbook example of a behavioral commitment flowing back into storage choices, instead of the other way around.

Xuất hiện trongData là contract, không phải implementation API là giao kèo

Software

How Figma's Databases Team Lived to Tell the ScaleArticle

Cui, Sammy · 2022

Figma engineering blog write-up on running Postgres at multi-terabyte scale before sharding, and on the decision-making process when sharding finally became necessary. The most useful section is the part where the team lists every current access pattern and projects what access patterns will look like in three years, then picks a sharding key (document ID, not user ID) on the basis of which queries it preserves as same-shard. The chapter uses this as the canonical example of a sharding key being a domain decision, not a load-balancing decision.

Xuất hiện trongData là contract, không phải implementation

Software

Domain-Driven Design: Tackling Complexity in the Heart of SoftwareBook

Evans, Eric · 2003

Eric Evans's foundational book on Domain-Driven Design. The chapter cites it specifically for the anti-corruption layer pattern, which Evans framed as a deliberate boundary between two bounded contexts, particularly when one is a legacy or external system you do not control. The cost is boilerplate mapping code; the benefit is that the domain model is not contaminated by another system's assumptions about identity, lifecycle, or constraints. The book itself is much broader; the ACL pattern is one of the more practically reusable pieces, and the easiest to retrofit into an existing architecture. Chapter 2 ("Communication and the Use of Language") is the canonical source for the Ubiquitous Language argument that the DDD series opens with: that the same word used by two parties at a boundary will mean different things until the team makes a deliberate decision to share a vocabulary.

Xuất hiện trongOOP, state, và những thứ bạn không thể control Khi OOP là câu trả lời sai Data là contract, không phải implementation Cái tên sai làm hỏng cả hệ thống Domain là gì Ubiquitous Language Model Bounded Context Context Mapping Aggregate Entity, Value Object, và câu hỏi về danh tính Domain Events Application Layer Repository DDD và microservices Event Sourcing và CQRS Khi domain thay đổi Legacy codebase Lan tỏa Ranh giới của DDD Tại sao code của bạn là OOP giả Domain modeling — từ bài toán thật đến class Message-based OOP — Alan Kay đã đúng từ đầu Code review bằng mắt OOP

Software

Implementing Domain-Driven DesignBook

Vernon, Vaughn · 2013

Vernon's practical companion to Evans. Chapter 6 on Value Objects provides the clearest statement of why immutability is not a technical preference but a domain requirement: a value object "measures, quantifies, or describes a thing in the domain." The OOP series cites this specifically for the test distinguishing Value Object from Entity — if two instances with identical attributes are interchangeable, it is a Value Object; if the domain needs to track them individually across time, it is an Entity.

Xuất hiện trongDomain modeling — từ bài toán thật đến class

Software

Growing Object-Oriented Software, Guided by TestsBook

Freeman, Steve; Pryce, Nat · 2009

Freeman and Pryce argue that objects should communicate through protocols, not through data. Where a data model exposes internal state for callers to interrogate, a domain model exposes operations for callers to invoke. The OOP series cites this for the data model vs domain model distinction: when a Wallet exposes balance as a number, callers communicate through data; when Wallet exposes debit() and deposit(), callers communicate through protocol. The book is also the origin of the "Tell, Don't Ask" principle as a testability heuristic that doubles as a design signal.

Xuất hiện trongDomain modeling — từ bài toán thật đến class

Software

Introducing EventStormingBook

Brandolini, Alberto

The source of Event Storming, the workshop format the chapter reframes. Brandolini's central claim is that the goal of the session is not the wall of sticky notes but the shared understanding produced when domain experts and developers narrate a business flow together and discover where their language silently diverges. The chapter draws on the method as a machine for manufacturing that linguistic collision, not as a diagramming technique, and treats the resulting diagram as sediment rather than product.

Xuất hiện trongLan tỏa

Software

Latency Numbers Every Programmer Should KnowArticle

Dean, Jeff · 2009

Jeff Dean's table of reference latencies, from L1 cache (about 0.5 ns) through RAM, SSD, same-datacenter round-trip, and cross-region round-trip (about 150 ms). The list spans more than six orders of magnitude across a handful of rows. The point of the table is not the absolute numbers, which have moved as hardware improved. It is the relative scale: an operation that sits at the wrong order of magnitude in a hot path changes the character of the system, not just its latency. The chapter on back-of-envelope thinking uses this as the canonical example of numbers worth knowing well enough to spot an unreasonable design decision on sight. A community-maintained, updated version is kept at https://gist.github.com/jboner/2841832.

Xuất hiện trongTư duy từ số

Software

Why Discord is Switching from Go to RustArticle

Howarth, Jesse · 2020

The Discord engineering write-up that documents how a single observation, namely periodic latency spikes from Go's garbage collector visible on a Grafana dashboard, drove the decision to rewrite the Read States service in Rust. The piece is one of the few public examples of a company narrating the path from "a number on a dashboard" to "an architectural decision," with the binary question (is GC pause approaching the SLA threshold at our growth rate) set explicitly before the rewrite was committed to. The chapter on back-of-envelope thinking uses it as the counterpoint to greenfield estimation: the same three-step technique, applied in reverse against an existing production system.

Xuất hiện trongTư duy từ số

Software

How Discord Stores Trillions of MessagesArticle

Discord Engineering · 2023

The follow-up to the Go-to-Rust write-up, describing the migration of Discord's message storage from Cassandra (on the JVM) to ScyllaDB (in C++). The trigger was the same kind of order-of-magnitude observation: Cassandra GC pauses against trillions of messages had crossed the threshold beyond which a patch would not be enough. Read alongside the Go-to-Rust post to see how the same team applies back-of-envelope thinking iteratively against a single production system over years.

Xuất hiện trongTư duy từ số Pattern là câu trả lời, constraint là câu hỏi Cái Giá Của Sự Đơn Giản Quyết định khi không có câu trả lời đúng

Software

The Reflective Practitioner: How Professionals Think in ActionBook

Schön, Donald A. · 1983

Schön's central distinction is between "espoused theory" (what professionals say they do when asked) and "theory-in-use" (what they actually do in the work). The gap between the two is not dishonesty. It is that most professional knowledge is tacit and only surfaces when forced into contact with a concrete situation. The chapter on reading requirements uses this directly: a PM can articulate the espoused requirement, but their theory-in-use about users, load, and failure mode only comes out when an engineer asks a sharp, situated question. Schön also introduced "reflection-in-action" as the practice of catching one's own tacit assumptions while still inside the work, which is what constraint extraction tries to operationalize for system design.

Xuất hiện trongĐọc requirement như một architect

Software

Software Engineering EconomicsBook

Boehm, Barry W. · 1981

The original source for the empirical claim that the cost of changing a software defect rises sharply as it moves through the SDLC stages (requirements, design, code, test, production). Boehm's original numbers, drawn from large IBM and TRW projects, are domain-specific and somewhat dated, but the structural relationship has been reproduced repeatedly. The chapter cites this as the economic argument for surfacing disagreement at the requirement stage rather than letting it move into code. Modern CI/CD has compressed some of the original stages, but it has not changed the direction of the curve.

Xuất hiện trongĐọc requirement như một architect

Software

Documenting Architecture DecisionsArticle

Nygard, Michael · 2011

The post that formalized the Architecture Decision Record (ADR) format: Context, Decision, Consequences. The most important field is Consequences, because it forces the writer to articulate the trade-off, not just record the choice that was made. Two chapters in this collection draw on Nygard from different angles. The TTA requirements chapter uses ADRs as the mitigation for treating constraint extraction as a one-time activity, when the assumption behind a constraint shifts, the ADR is the first place that should be revisited. The NSA Explicit-Over-Implicit chapter takes the framing further, distinguishing ADRs from documentation in kind, not in style, ADRs capture reasoning, documentation captures output, and a team without ADRs is making every future decision in the dark even when their documentation is excellent.

Xuất hiện trongĐọc requirement như một architect Explicit Over Implicit Những quyết định không có sách nào dạy

Software

Deconstructing the MonolithArticle

Westeinde, Kirsten · 2019

A Shopify Engineering blog post that introduces the term "modular monolith" and describes the Componentization project. The argument is that the boundaries of a monolith can be modularized internally to gain most of the team-autonomy benefits of microservices without paying the distributed-systems tax. The chapter uses this as the deliberate counter-example to Netflix: same era, comparable scale, the same team-coordination problem, but a different bet about which cost was worth paying.

Xuất hiện trongTrade-off không phải lựa chọn, đó là bản chất Quyết định khi không có câu trả lời đúng

Software

Jepsen: Distributed Systems Safety AnalysesArticle

Kingsbury, Kyle

Kyle Kingsbury's ongoing series of independent consistency analyses. The recurring finding: distributed databases that claim strong guarantees frequently fail under partition, clock skew, or node restart. Jepsen makes the gap between specification and implementation visible, which is what turns 'pick C or A' from a textbook decision into an operational one. Treat any consistency claim as provisional until a Jepsen analysis has been run.

Xuất hiện trongVật lý của hệ thống phân tán Những gì mình vẫn không chắc

Software

A Comprehensive Study of Convergent and Commutative Replicated Data TypesPaper

Shapiro, Marc; Preguiça, Nuno; Baquero, Carlos; Zawirski, Marek · 2011

The foundational survey of Conflict-free Replicated Data Types. The central insight: design a data structure so that two replicas can be merged without coordination, and the result is deterministic regardless of update order. Counter, set, register variants are catalogued and proved convergent. This is the technical foundation under decentralised distributed registries (Horde) and under databases like Riak that survive network partitions without losing writes.

Xuất hiện trongCái Switch Không Biết Mình Là Anh Hùng

Software

Solution of a Problem in Concurrent Programming ControlPaper

Dijkstra, Edsger W. · 1965

The paper that posed the mutual exclusion problem and gave it its first software solution. Dijkstra's framing led to semaphores, monitors, and eventually every lock-based concurrency primitive in modern operating systems. Tony Hoare later recast the same problem as 'Dining Philosophers' to make deadlock visible in undergraduate teaching.

Xuất hiện trongCái Switch Không Biết Mình Là Anh Hùng

Software

Kafka: A Distributed Messaging System for Log ProcessingPaper

Kreps, Jay; Narkhede, Neha; Rao, Jun · 2011

The original LinkedIn paper that introduced Kafka. The technical contribution that mattered most was not throughput numbers but a reframing: messaging treated as a distributed commit log instead of a queue. That reframing is what enables independent consumer groups, replay, and the architectural style that became event streaming.

Xuất hiện trongEvent-Driven Architecture Pattern là câu trả lời, constraint là câu hỏi Cold-reading một hệ thống bạn chưa bao giờ thấy

Software

The Log: What every software engineer should know about real-time data's unifying abstractionArticle

Kreps, Jay · 2013

Kreps's argument that the commit log is the unifying primitive underneath replication, change data capture, event sourcing, and stream processing — not a Kafka-specific feature but a general structure that distributed systems converge on. The post made the case that databases, messaging, and stream processing are different views of the same underlying idea, and shaped a generation of event-driven architecture thinking.

Xuất hiện trongEvent-Driven Architecture Pattern là câu trả lời, constraint là câu hỏi Log là nguồn sự thật duy nhất không thể nói dối Khi Kafka không phải câu trả lời Đọc một hệ thống như đọc bản đồ failure modes Partitioning — Cắt thế nào thì sống với hệ quả đó Event-Driven — Thoát khỏi nô lệ thời gian thực

Software

Exactly-Once Semantics Are Possible: Here's How Kafka Does ItArticle

Wang, Guozhang; Mehta, Apurva; Kreps, Jay · 2017

Confluent's engineering writeup of how the idempotent producer and transactional API work under the hood: per-producer-ID sequence numbers, producer epochs to fence off zombies, and the broker-side dedup logic that lets a producer retry safely without consumers seeing duplicates. The piece is the canonical reference for understanding that idempotence is not a duplicate filter bolted on after the fact; it is the protocol layer that also restores ordering when retries interleave with in-flight requests. Load-bearing for any chapter that argues `enable.idempotence` is more than a convenience flag.

Xuất hiện trongGửi tin không phải chỉ là gọi send()

Software

Event SourcingArticle

Fowler, Martin · 2005

Fowler's formalization of the pattern: instead of storing current state alongside a parallel audit trail, store the sequence of state-changing events as the single source of truth and derive current state from that sequence. The piece names what most enterprise systems do badly (a mutable table plus a desynchronized audit_log) and what the correct inversion looks like (the log is primary, state is derived). It is the natural companion to any chapter that argues mutable state cannot be internally consistent with its own history.

Xuất hiện trongLog là nguồn sự thật duy nhất không thể nói dối Event-Driven — Thoát khỏi nô lệ thời gian thực

Software

Designing Event-Driven SystemsBook

Stopford, Ben · 2018

A free book published by Confluent that walks through the architectural implications of putting Kafka at the center of a service ecosystem. Stopford's central point is that commit-log semantics — not queue semantics — are what make event-driven architecture qualitatively different from "async messaging," and what enables decoupling at the level of knowledge, not just runtime.

Xuất hiện trongEvent-Driven Architecture

Software

Kafka: The Definitive Guide (2nd Edition)Book

Shapira, Gwen; Palino, Todd; Sivaram, Rajini; Petty, Krit · 2021

The most comprehensive operational reference for running Kafka in production. Chapter 10 (Monitoring Kafka) is the canonical source for broker-level JMX metrics, thresholds, and the patterns that distinguish leading indicators from lagging ones. Chapter 14 (Stream Processing) covers Kafka Streams internals, including state store mechanics, changelog topic recovery, windowing semantics, and the exactly-once configuration story. The book is the operational frame this series leans on whenever a chapter argues that the right metric to alert on is the one that moves first, or that the right stateful processing decision is the one that survives the next ungraceful crash.

Xuất hiện trongMonitoring Kafka như người đã từng thấy nó chết Khi xử lý không còn là đọc rồi ghi

Software

Apache Kafka Made Simple: A First Glimpse of a Kafka Without ZooKeeperArticle

Confluent Engineering · 2021

The announcement and rationale for KRaft (Kafka Raft), the move to embed consensus directly in Kafka rather than delegating it to an external ZooKeeper cluster. The framing matters: the problem was not that ZooKeeper was unreliable, but that it was a separate failure domain. If ZooKeeper lost quorum, Kafka's controller could not be elected — meaning no broker failover, no leader election, no partition rebalancing — even while all Kafka brokers were running normally. KRaft brings the controller state inside the Kafka cluster, removing a class of incidents where 'Kafka is down' actually means 'ZooKeeper is down.'

Xuất hiện trongĐọc một hệ thống như đọc bản đồ failure modes Consensus — Khi các node phải đồng ý với nhau

Software

Monitoring KafkaDocumentation

Confluent

Confluent's reference for production Kafka monitoring: the MBean paths for the broker-level JMX metrics (UnderReplicatedPartitions, RequestHandlerAvgIdlePercent, NetworkProcessorAvgIdlePercent, ISR shrink and expand rates), recommended alert thresholds, and the cluster-health signals to watch before consumer lag confirms an incident is already in flight.

Xuất hiện trongMonitoring Kafka như người đã từng thấy nó chết Đọc một hệ thống như đọc bản đồ failure modes

Software

Prometheus JMX ExporterDocumentation

Prometheus Authors

The standard tool for exposing JVM-side JMX metrics as Prometheus scrape targets. For Kafka, it is the load-bearing piece between the broker (which emits everything important through JMX) and any modern observability stack (which speaks Prometheus). The point this series cites it for is operational: the exporter has to be wired in on the day the cluster is deployed, not added after the first incident, because the metrics it carries are leading indicators and only useful when their baseline is already known.

Xuất hiện trongMonitoring Kafka như người đã từng thấy nó chết

Software

Apache Kafka Streams DocumentationDocumentation

Apache Software Foundation

The official reference for Kafka Streams: the DSL, state stores, changelog topics, interactive queries, windowing semantics (tumbling, hopping, session), and the processing-guarantee configuration. The state-store and changelog sections are the canonical anchor for the claim that local state plus a Kafka-backed changelog is the architectural shape Streams imposes, not an implementation detail that can be swapped out.

Xuất hiện trongKhi xử lý không còn là đọc rồi ghi

Software

KIP-447: Producer scalability for exactly once semanticsDocumentation

Apache Kafka contributors · 2020

The improvement proposal that introduced exactly_once_v2 in Kafka 2.5. The original exactly_once setting required one transactional producer per input partition, which capped how far a Streams application could scale before broker-side resources became the bottleneck. KIP-447 lets a single producer cover all input partitions of a task, dropping the broker-side cost and making EOS practical at the scale most real workloads run at. It is the reason this chapter says exactly_once_v2 is always the right choice on Kafka 2.5 and later: it is not just newer, it is what makes EOS not feel like a luxury.

Xuất hiện trongKhi xử lý không còn là đọc rồi ghi

Software

How to Choose the Number of Topics/Partitions in a Kafka Cluster?Article

Kreps, Jay · 2013

The Confluent post that engineers reach for when sizing a cluster. The piece names what most documentation does not: that partition count is a cluster-wide budget, not a per-topic one, and that the cost lives in controller metadata size and failover time when a broker dies. The 2020 update keeps the framing intact but adjusts the numerical envelope to what modern Kafka can handle. The chapter cites it as the canonical anchor for the claim that the question is not how many partitions per topic, but how many across the cluster, and how close that total sits to the failover-time ceiling the team is willing to accept.

Xuất hiện trongKafka ở production

Software

Apache Kafka Documentation: Log Compaction and Offset ManagementDocumentation

Apache Software Foundation

Two sections of the Apache Kafka reference docs that the chapter leans on: the Log Compaction section, which describes the compaction window (the gap between produce and cleanup during which multiple values per key coexist), and the Offset Management section, which documents `__consumer_offsets`, its default 50 partitions, `offsets.topic.num.partitions`, and `offsets.retention.minutes`. Both sections describe behavior accurately. Both are also where the operationally important consequences live one click deeper than the getting-started reader will go.

Xuất hiện trongKafka ở production

Software

How Discord Stores Billions of MessagesArticle

Vishnevskiy, Stanislav · 2017

Discord's engineering writeup of the decision to move message storage from MongoDB to Cassandra (with later updates documenting the use of Kafka strictly for event streaming pipelines rather than for message persistence). The piece is the canonical anchor for the claim that the right answer is rarely a single tool; Discord's architecture pairs Cassandra for write-heavy keyed storage with Kafka where ordered replayable event streams are actually needed. The chapter cites it as evidence that knowing where not to use Kafka is a sign of operational maturity, not of incomplete understanding.

Xuất hiện trongKhi Kafka không phải câu trả lời

Software

PostgreSQL Documentation: Write-Ahead Logging and Streaming ReplicationDocumentation

PostgreSQL Global Development Group

Two sections of the PostgreSQL reference documentation that underpin Chapter V of Distributed Patterns. The WAL section explains why PostgreSQL writes to the log before applying changes to data files — the durability guarantee that makes crash recovery possible and streaming replication practical. The Streaming Replication section documents synchronous_commit and its levels (off, local, remote_write, remote_apply, on), which determine exactly where in the write path PostgreSQL considers a transaction durable. The distinction between remote_write (follower received the WAL record) and remote_apply (follower applied it to data files) is the precise point where assumptions about follower staleness can be wrong.

Xuất hiện trongReplication — Nhân bản không phải sao chép

Software

Patroni: A Template for PostgreSQL HA with ZooKeeper, etcd or ConsulDocumentation

Patroni Contributors

The reference documentation for Patroni, the most widely deployed tool for automated PostgreSQL failover. The key configs for Chapter V are ttl (time-to-live for the leader lock in etcd or Consul) and loop_wait (polling interval for cluster health checks). In the worst case, failover time is ttl + loop_wait — by default 30 + 10 = 40 seconds of write unavailability. Lowering ttl trades faster failover for higher risk of spurious failovers when the leader is merely slow, not dead. The documentation also explains the fencing mechanism that prevents split-brain: a node that loses its distributed lock must step down even if it can still reach clients, trading temporary unavailability for correctness.

Xuất hiện trongReplication — Nhân bản không phải sao chép

Software

Apache Cassandra Documentation: How Cassandra Reads and Writes DataDocumentation

Apache Software Foundation

The section of the Cassandra reference documentation covering write path mechanics (commit log, memtable, SSTable), quorum reads and writes, hinted handoff, and read repair. Key operational details: hinted handoff has a default TTL of 3 hours — if a node is down longer than that and the cluster accepted writes during the outage, the recovered node will be missing those writes and must be repaired via nodetool repair. The documentation also covers the LWW conflict resolution mechanism and the recommendation to use client-supplied timestamps rather than server clocks, for exactly the reason Chapter III established: server clock skew can invert the intended write ordering.

Xuất hiện trongReplication — Nhân bản không phải sao chép

Software

Facebook MySQL Semi-Synchronous ReplicationArticle

Facebook Engineering

Facebook's approach to MySQL replication at social graph scale: semi-synchronous replication, where the leader writes the binlog and waits for at least one replica to confirm receipt before acknowledging the client — not full application, just receipt. The key distinction the chapter draws from this: receive lag and apply lag are two separate metrics measuring two different guarantees. A dashboard showing green on binlog receive lag can co-exist with a growing apply lag that is invisible until follower reads start returning stale data from events queued but not yet applied. Facebook chose semi-sync because full synchronous replication was too slow for their write volume and pure async was unacceptable for durability — the middle ground trades read consistency (replicas may be behind on apply) for durability on leader crash (at least one replica has the data). When no replica replies within the timeout, MySQL automatically downgrades to async to avoid blocking writes, making semi-sync a best-effort guarantee with a controlled fallback rather than an absolute one.

Xuất hiện trongReplication — Nhân bản không phải sao chép

Software

Sharding & IDs at InstagramArticle

Instagram Engineering · 2012

The primary source for Chapter VI's Instagram anchor. The post documents the full sharding scheme: PostgreSQL schemas as logical shards (not separate databases), the 64-bit ID format encoding 41 bits of timestamp + 13 bits of logical shard ID + 10 bits of sequence number, and why encoding the shard ID in the ID itself eliminates the need for a routing table or centralized metadata service. The shard lookup becomes O(1) pure bit manipulation, with no external dependency and no single point of failure for routing. Unusually honest about what is difficult: the migration timeline (months with live traffic, double-write period, backfill in background) is documented alongside the happy path, making it one of the clearest public records of what committing to a partition scheme actually costs.

Xuất hiện trongPartitioning — Cắt thế nào thì sống với hệ quả đó

Software

Amazon DynamoDB: Best Practices for Designing and Using Partition KeysDocumentation

Amazon Web Services

Two sources that must be read together. The AWS documentation explains the 'what to avoid' side: partition keys need high cardinality and uniform distribution of access — not just high cardinality of key space but uniform distribution of actual reads and writes across those keys. The AWS re:Invent 2019 talk 'Amazon DynamoDB Under the Hood' explains the 'why it happens' side at the implementation layer: throughput is allocated per partition, not per table, so a hot partition exhausts its quota while other partitions sit idle with unused capacity. Adaptive capacity (ability to borrow throughput from cold partitions) is documented with its limits: it reduces the impact of hot partitions but cannot eliminate throttling when a single partition receives traffic that exceeds the total table throughput. The key practical consequence: monitor ThrottledRequests metric, not 5xx error rate, because hot partition failure is latency degradation before it is errors.

Xuất hiện trongPartitioning — Cắt thế nào thì sống với hệ quả đó

Software

Amazon DynamoDB: Best Practices for Using Sort Keys to Organize DataDocumentation

Amazon Web Services

The DynamoDB documentation on composite sort key design as a co-location strategy. The core pattern: if the partition key identifies an entity (user_id) and the sort key identifies an event within that entity (order_timestamp), a query for 'all orders by user X between date A and date B' is a single-partition range scan rather than a cross-partition scatter-gather. This is the concrete alternative to scatter-gather for range queries in DynamoDB — a schema design decision made upfront, not an optimization layer added later. Chapter VI cites this alongside Instagram's fan-out approach as two instances of the same principle: data that is accessed together should live together, and achieving that co-location is a design decision, not an infrastructure setting.

Xuất hiện trongPartitioning — Cắt thế nào thì sống với hệ quả đó

Software

PostgreSQL Documentation: LISTEN / NOTIFYDocumentation

PostgreSQL Global Development Group

Postgres ships an asynchronous notification system that lets sessions wait for events posted by other sessions through NOTIFY. The mechanism does not buffer history, does not partition, and does not scale to high-throughput event streaming, which is precisely why it is the right answer for the workloads where Kafka's operational overhead would not pay for itself. The chapter cites it as the canonical example that the default rebuttal to 'should we use Kafka here?' is not 'use something fancier' but 'the database you already run probably has what you need.'

Xuất hiện trongKhi Kafka không phải câu trả lời

Software

Life beyond Distributed Transactions: an Apostate's OpinionPaper

Helland, Pat · 2007

Presented at CIDR 2007. The argument that scalable distributed systems cannot rely on two-phase commit and must be redesigned around eventual consistency, idempotent operations, and compensating actions instead of global atomicity. The paper is the architectural foundation under which the saga pattern, idempotency keys, and outbox writes make sense as a coherent design discipline rather than a collection of tricks.

Xuất hiện trongOOP, state, và những thứ bạn không thể control Event-Driven Architecture Pattern là câu trả lời, constraint là câu hỏi Transactions phân tán — Cái giá của atomicity qua nhiều service

Software

SagasPaper

Garcia-Molina, Hector; Salem, Kenneth · 1987

The original paper, presented at ACM SIGMOD 1987. The motivating context was not microservices but long-running database transactions, where holding locks for hours degraded concurrency for every other transaction. Garcia-Molina and Salem proposed breaking the long transaction into a sequence of shorter local transactions, each with an explicit compensating transaction that semantically undoes its effect. Two decades later, microservices reproduced essentially the same failure mode at the architecture scale, and the community rediscovered the paper. The pattern is now taught as a microservices technique, but the underlying constraint, ACID across a boundary you cannot lock, is older than the architecture it now serves.

Xuất hiện trongPattern là câu trả lời, constraint là câu hỏi Transactions phân tán — Cái giá của atomicity qua nhiều service

Software

The Sunset HTTP Header Field (RFC 8594)Standard

Wilde, Erik · 2019

The IETF standard for the `Sunset` HTTP response header, which lets a server signal an upcoming retirement date for a resource. Paired with the in-progress `Deprecation` header draft, the two provide a machine-readable channel for telling consumers that an endpoint will go away, when, and where the replacement lives. The value of the spec for API design is not the syntax but the discipline: deprecation announcements are most useful when they appear in the consumer's development tooling at the moment of call, not in a provider blog post three months earlier.

Xuất hiện trongAPI là giao kèo

Software

Consumer-Driven Contracts: A Service Evolution PatternArticle

Robinson, Ian · 2006

Ian Robinson's essay on Martin Fowler's site, which inverted the conventional view of API contracts. The standard assumption is that the provider defines the contract and consumers conform; Robinson argued that, in evolution, the contract that matters is the union of what consumers actually depend on, and the provider's responsibility is to verify its behavior against that union. The Pact framework later operationalized the idea as a testing tool. The essay is the load-bearing reference for any argument that schema diff is insufficient and that behavioral compatibility has to be tested from the consumer's side.

Xuất hiện trongAPI là giao kèo

Software

REST in Practice: Hypermedia and Systems ArchitectureBook

Webber, Jim; Parastatidis, Savas; Robinson, Ian · 2010

A pragmatic treatment of building distributed systems with REST. The chapters on evolvability and coupling are where the post borrows the language of spatial versus temporal coupling, and the framing that hypermedia and async messaging can each remove different kinds of coupling between services.

Xuất hiện trongEvent-Driven Architecture

Software

Building Microservices: Designing Fine-Grained SystemsBook

Newman, Sam · 2021

The second edition is the practical reference for service decomposition, API contracts, and the failure modes that arise when boundaries are drawn wrong. The chapters on coupling and on integration patterns are where the post draws the distinction between physical decoupling (process boundaries) and logical decoupling (knowledge boundaries). Chapter 4 on communication styles is the load-bearing reference for the temporal-coupling argument: synchronous calls are not just slower than asynchronous ones, they couple lifetimes in a way that retries and circuit breakers cannot dissolve.

Xuất hiện trongOOP, state, và những thứ bạn không thể control Event-Driven Architecture Bạn đã từng làm cho hệ thống chờ nhau

Software

Microservices Patterns: With Examples in JavaBook

Richardson, Chris · 2018

The primary reference for the saga pattern in a microservices context. Chapter 4, "Managing transactions with sagas," defines the two variants the OOP chapter leans on: choreography, where each service reacts to events, and orchestration, where a central saga object drives the flow. Richardson's practical heuristic — choreography suffices for flows of two or three steps, orchestration wins beyond that because "the flow of events is not obvious from the code" — is quoted directly as the justification for centralizing coordination in an orchestrator object.

Xuất hiện trongOOP, state, và những thứ bạn không thể control

Software

Enterprise Integration Patterns: Designing, Building, and Deploying Messaging SolutionsBook

Hohpe, Gregor; Woolf, Bobby · 2003

The pattern language for message-based integration. The book named, catalogued, and gave the working vocabulary to most of what readers now take for granted: point-to-point channels, publish-subscribe, message routers, content-based routing, the request-reply pattern, the dead-letter channel. Reading the book is the fastest way to see that the terminology of 'message queues' hides a deep design space that pre-dates RabbitMQ by years and that RabbitMQ inherits almost wholesale. Chapter 3 (Messaging Systems) is the standard reference for the distinction between request-reply and fire-and-forget the series leans on from chapter II onward, and the Message Router and Guaranteed Delivery patterns are the direct conceptual ancestors of AMQP exchanges and acknowledgments.

Xuất hiện trongMessage-based OOP — Alan Kay đã đúng từ đầu Bạn đã từng làm cho hệ thống chờ nhau Message là gì, và tại sao nó không phải là HTTP AMQP, ngôn ngữ mà RabbitMQ nói Direct exchange, gửi đúng người Fanout và topic exchange, một nguồn nhiều đích

Software

AMQP 0-9-1 Model ExplainedDocumentation

RabbitMQ

RabbitMQ's reference document for the AMQP 0-9-1 protocol model: the connection / channel / exchange / queue / binding / message vocabulary, the default exchange behavior, the four exchange types (direct, fanout, topic, headers), and the acknowledgment commands (`basic.ack`, `basic.nack`, `basic.reject`). The chapter cites it as the anchor for the claim that acknowledgment is not an optional setting bolted onto a queue but a first-class part of the protocol vocabulary, and that the distinction between AMQP and HTTP is at the level of what each protocol has terms for.

Xuất hiện trongAMQP, ngôn ngữ mà RabbitMQ nói Fanout và topic exchange, một nguồn nhiều đích

Software

How Complex Systems FailPaper

Cook, Richard I. · 1998

Cook's eighteen-point treatise from the safety-engineering field, adapted in the years since by SRE and resilience-engineering communities for software. The central argument is that complex systems are inherently hazardous, that failure is never the result of a single cause, and that what looks like an incident is the surfacing of multiple latent failures that had been coexisting safely until one shift made them visible together. The paper is the standard intellectual anchor for failure-first design questions: you do not look for a single bug, you ask which latent failures the design currently accepts.

Xuất hiện trongCâu hỏi để tự phỏng vấn thiết kế của mình The Illusion of Understanding Data Consistency, Cái Giá Của Sự Thật Khi Mọi Thứ Đúng Mà Vẫn Sai Hệ Thống Của Bạn Đang Nói Chuyện Với Bạn Không

Software

More details about the October 4 outageReport

Facebook Engineering (Janardhan, Santosh) · 2021

Facebook's public engineering write-up of the October 4, 2021 outage that took Facebook, Instagram, WhatsApp, and Oculus offline for roughly six hours. A routine BGP audit script ran with a parameter that, in an edge case, withdrew every Facebook BGP advertisement at once, removing Facebook from the global routing table. Diagnosis took about one hour. Recovery took six, because every tool used to recover (remote access, DNS, authentication, badge readers) sat inside the network that had just disconnected itself from the internet. The report is the canonical public reference for the dependency-inversion problem in recovery paths, the tools you need to recover live on the infrastructure that is failing.

Xuất hiện trongKhi Mọi Thứ Đúng Mà Vẫn Sai

Software

Metastable Failures in Distributed SystemsPaper

Bronson, Nathan; Aghayev, Abutalib; Charapko, Aleksey; Zhu, Timothy · 2021

HotOS 2021. The first formal naming of a failure class engineers had encountered for years without a shared vocabulary. The authors define a metastable failure as one where, even after the original trigger has gone, the system sustains itself in the degraded state through a positive feedback loop, retry storms being the canonical example. The contribution is not the phenomenon, which had been described informally many times, but a unified name that lets a postmortem say "this was a metastable failure" without having to re-derive the definition. The chapter cites the paper to give the failure class a stable label and a primary source.

Xuất hiện trongKhi Mọi Thứ Đúng Mà Vẫn Sai

Software

October 21 post-incident analysisReport

GitHub Engineering · 2018

GitHub's public post-mortem of the October 21 to 22, 2018 incident in which a 43-second loss of connectivity between two East Coast data centers caused Orchestrator to failover the MySQL writer to a West Coast replica. When the network healed, the two sides of the cluster had diverged. GitHub spent more than 24 hours in degraded mode while engineers reconciled writes from both partitions before resuming normal service. The report is the canonical public reference for what a CP-versus-AP decision looks like when an automated topology manager enforces a policy that, on reflection, was not the behavior the team would have chosen had the question been asked explicitly.

Xuất hiện trongData Consistency, Cái Giá Của Sự Thật

Software

TAO: Facebook's Distributed Data Store for the Social GraphPaper

Bronson, Nathan and others · 2013

USENIX ATC 2013. Facebook engineers describe TAO, the read-optimized graph data store sitting between memcached and MySQL that serves Facebook's social graph. The paper is the most cited public source for the consistency model Facebook actually runs against, eventual consistency by default with explicit read-after-write guarantees scoped to a single user where the product needed it. For the consistency chapter it is the source of the reconstructed comment-disappearing scene. For the graph series it is the canonical example of a deliberately minimal data model: only objects and typed, directed associations, every association maintained with its inverse, time as the one first-class edge attribute, all bent around a known, stable set of product questions.

Xuất hiện trongData Consistency, Cái Giá Của Sự Thật Node và edge là quyết định thiết kế Graph ở quy mô lớn: nơi sách giáo khoa dừng lại

Software

Pregel: A System for Large-Scale Graph ProcessingPaper

Malewicz, Grzegorz and others · 2010

SIGMOD 2010. Google engineers describe Pregel, a distributed graph computation system built on the Bulk Synchronous Parallel (BSP) model. Computation proceeds in supersteps: each vertex receives messages from the previous step, updates its state, and sends messages to neighbors. The barrier between supersteps eliminates coordination overhead but introduces the straggler problem — the whole computation waits for the slowest machine in each round. The paper is the canonical reference for vertex-centric distributed graph computation and the trade-offs of the BSP execution model at scale.

Xuất hiện trongGraph ở quy mô lớn: nơi sách giáo khoa dừng lại

Software

Timelines at ScaleTalk

Krikorian, Raffi · 2012

QCon San Francisco 2012. Twitter VP of Engineering describes the architectural evolution of the Twitter timeline, specifically the decision to move away from pure fan-out-on-write for celebrity accounts (those with tens of millions of followers) to a hybrid model: fan-out-on-write for ordinary users, fan-out-on-read (fetching and merging at read time) for supernodes. The talk is the primary public source for the three-number judgment framework — degree distribution, read/write ratio, and hottest-to-median ratio — that determines which delivery strategy is appropriate for which account tier.

Xuất hiện trongGraph ở quy mô lớn: nơi sách giáo khoa dừng lại

Software

CAP Twelve Years Later: How the "Rules" Have ChangedArticle

Brewer, Eric · 2012

IEEE Computer, February 2012. Brewer revisits the CAP theorem he proposed as a conjecture at PODC 2000, and corrects the most common misreading, the "pick two of three" framing. His point is that partition tolerance is not a knob a designer chooses, it is a property of any real network, and the actual engineering decision is what to give up between consistency and availability when a partition occurs. The chapter cites this paper as the source of the practical CAP framing, not the cleaner but misleading triangle taught in introductory textbooks.

Xuất hiện trongData Consistency, Cái Giá Của Sự Thật Pattern không phải câu trả lời, pattern là câu hỏi đã đông cứng

Software

Consistency Tradeoffs in Modern Distributed Database System Design: CAP is Only Part of the StoryArticle

Abadi, Daniel J. · 2012

IEEE Computer, February 2012. Abadi argues that CAP, even read correctly, captures only the partition case, when the system has to choose between consistency and availability. In the absence of partitions, distributed systems still face a different trade-off, latency versus consistency, because keeping replicas in sync at low latency requires coordination that itself adds latency. PACELC names both axes, if a Partition occurs, choose between Availability and Consistency, Else choose between Latency and Consistency. The chapter cites PACELC as the more honest version of CAP for systems that spend most of their time not partitioned.

Xuất hiện trongData Consistency, Cái Giá Của Sự Thật

Software

The Tacit DimensionBook

Polanyi, Michael · 1966

The slim volume in which Polanyi gives his most accessible formulation of tacit knowledge, summarized in the line "we can know more than we can tell." The argument is that explicit knowledge always rests on a tacit substrate, and that the substrate is what gives propositional knowledge its meaning. The NSA series cites Polanyi as the source for why documentation does not capture reasoning, because reasoning lives in the tacit dimension and only enters the explicit register through a specific cognitive act that most engineering workflows do not have rituals for. The DDD series uses the same argument from a different angle: a glossary cannot carry what a domain expert actually knows about when a "detection" counts as real, when a confidence score is suspicious, when a signal should escalate. The propositional residue lands on the page; the practice that produced the residue stays in the room with the people who were there.

Xuất hiện trongHệ Thống Chỉ Người Viết Mới Hiểu Ubiquitous Language

Philosophy

In the Matter of Knight Capital Americas LLC (SEC Administrative Proceeding File No. 3-15570)Report

U.S. Securities and Exchange Commission · 2013

The SEC's formal administrative proceeding documenting the August 1, 2012 Knight Capital incident in which a missed deployment to one of eight production servers caused dormant 2003-era routing code (Power Peg) to execute, generating millions of unintended trades and approximately $460 million in losses within 45 minutes. The document is dense with technical detail, the SMARS routing system, the repurposing of the RepoStopTest flag, and the deployment mechanism, which makes it the canonical public reference for cognitive technical debt at scale. The NSA series uses Knight Capital twice from two angles: Chapter 3 reads it as cognitive surprise (the context for Power Peg had decayed across the years), Chapter 4 reads it as organizational surprise (four internal groups each watched a slice of the 45 minutes and no group could combine the views in time).

Xuất hiện trongHệ Thống Chỉ Người Viết Mới Hiểu Hai Thực Tại Trong Một Công Ty

Software

Normal Accidents: Living with High-Risk TechnologiesBook

Perrow, Charles · 1984

The foundational sociology of failure in high-risk technologies. Perrow's central argument is that systems exhibiting both _tight coupling_ (changes propagate quickly, with little slack) and _complex interaction_ (effects appear in non-obvious places) make some accidents structural rather than incidental. Failure surfaces at the seam between components, where no single operator was responsible. Three Mile Island is Perrow's canonical case study, but the frame travels into distributed software, into organizational coordination, and into any setting where each part can be locally correct while the whole misbehaves. The NSA series uses Perrow as the intellectual ancestor of organizational surprise; Knight Capital 2012, where four internal groups were independently correct and the four correctnesses summed to systemic failure, is a system accident in Perrow's sense.

Xuất hiện trongHai Thực Tại Trong Một Công Ty

Software

Summary of the Amazon EC2 and Amazon RDS Service Disruption in the US East RegionReport

Amazon Web Services · 2011

AWS's public post-mortem of the April 21, 2011 us-east-1 outage that began at 0:47 AM and degraded service across thousands of customers for several days. The chapter uses it as the canonical operational-surprise incident: signals were emitted from the very start, but the metrics being watched were external-facing while the failure was unfolding inside the EBS internal network during a re-mirroring storm. The report's value for NSA is the temporal evidence, the gap between when the system started saying something and when anyone heard it.

Xuất hiện trongOperational Surprise

Software

The Challenger Launch Decision: Risky Technology, Culture, and Deviance at NASABook

Vaughan, Diane · 1996

The book that gave the concept of normalization of deviance its name. Vaughan, a sociologist, spent years inside the Challenger investigation archives and concluded that the disaster was not caused by individual recklessness but by an organizational process in which anomalous O-ring erosion was gradually treated as routine across 24 successful flights. The NSA chapter cites her as the structural ancestor of alert fatigue, the same drift from signal to noise operating on a much smaller scale and a much faster tempo in software operations.

Xuất hiện trongOperational Surprise Trade-offs Không Có Đáp Án Đúng

Software

Scaling to Millions of Simultaneous ConnectionsArticle

Reed, Rick · 2012

Erlang Factory 2012 talk by WhatsApp engineer Rick Reed. The counterpoint to DynamoDB and Spanner: 450 million users served by 50 engineers using Erlang on FreeBSD, with minimal distributed complexity. The talk is the primary source for the WhatsApp architecture used throughout the series as the canonical example that operational simplicity is itself a deliberate trade-off, not a failure to achieve sophistication.

Xuất hiện trongPattern không phải câu trả lời, pattern là câu hỏi đã đông cứng Nói không

Software

Scaling up the Prime Video audio/video monitoring service and reducing costs by 90%Article

Chitilian, Varouj; Encrypted, Márton; et al. · 2023

Amazon Prime Video Tech Blog post describing the migration of their audio/video monitoring service from a distributed architecture (AWS Step Functions + Lambda + S3) to a monolithic process running on EC2, resulting in a 90% cost reduction and significant throughput improvement. The team's key finding: the problem had no distributed properties — media analysis is a sequential pipeline with strong data locality requirements, where each step needs the previous step's output as input. The distributed architecture moved data across the network at every step, which was precisely what the problem's physical shape contradicted. No component was broken; the mismatch was between the solution space and the problem space. The post generated controversy when first published because it was widely misread as a manifesto against microservices; Prime Video clarified that it described one specific service, not their overall architecture.

Xuất hiện trongNói không

Software

Postmortem of database outage of January 31Report

GitLab · 2017

GitLab's openly published post-mortem of the January 31, 2017 production database incident, in which an engineer trying to relieve a lagging replica ran a delete command against the production primary instead of staging. 300GB of data was lost in seconds. The report's value is not the human error at 0:37 AM but the cascade that followed: four independently designed backup mechanisms all failed at the same moment, each for a different reason. The post uses the incident as the canonical example that the gap between knowing the system as code and knowing the system as behavior is structural, not the fault of any particular engineer.

Xuất hiện trongThe Illusion of Understanding Tại sao distributed systems phá vỡ cách bạn vẫn nghĩ Những gì mình vẫn không chắc

Software

Logical Physical Clocks and Consistent Snapshots in Globally Distributed DatabasesPaper

Kulkarni, Sandeep; Demirbas, Murat; Madappa, Deepak; Avva, Bharadwaj; Leone, Marcelo · 2014

The paper introducing Hybrid Logical Clocks (HLC), presented at OPODIS 2014. The core contribution: a timekeeping algorithm that combines a physical clock (wall time) with a logical counter to achieve causal ordering without synchronization, without requiring GPS receivers or atomic clocks. The uncertainty interval can be wider than TrueTime's GPS-bounded window, but the approach is deployable on any commodity hardware. CockroachDB adopted HLC as its clock mechanism. The paper is relevant to any system that needs causally-ordered timestamps but cannot provision Google-scale physical infrastructure. Read alongside the Spanner paper (Corbett et al.) to see two different answers to the same problem of bounding time uncertainty in a distributed system.

Xuất hiện trongNhững gì mình vẫn không chắc

Software

Durable Objects: Consistent Storage for the Cloudflare Workers PlatformArticle

Harris, Greg · 2020

Cloudflare's technical introduction to Durable Objects — a stateful serverless primitive that gives each compute unit persistent, strongly consistent storage within its own scope. The key property: each Durable Object is a globally unique, single-threaded actor; operations on a single Object are serialized by construction, eliminating mutual exclusion problems within that scope. The post is the canonical reference for what stateful serverless means in practice as opposed to theory. The open question it leaves unresolved — how cross-Object coordination should work when consistency semantics are needed across Object boundaries — is still not settled as of the Chapter 16 writing (2026). Read alongside the Spanner and HLC papers to see the same trade-off (physical infrastructure vs. algorithmic approaches to bounded uncertainty) appearing in a serverless context.

Xuất hiện trongNhững gì mình vẫn không chắc

Software

Performing a Project PremortemArticle

Klein, Gary · 2007

Klein's Harvard Business Review piece formalizing a cognitive technique that had been used informally for decades: before a project starts (or a system goes live), assume it has already failed, and have the team brainstorm why. The trick is that the framing 'it failed' bypasses the optimism that normally blocks failure imagination at the planning stage. Pre-mortem is the project-management cousin of failure-first design questions: same cognitive move, different domain. The chapter cites it as the formal name for the habit it is teaching readers to build.

Xuất hiện trongCâu hỏi để tự phỏng vấn thiết kế của mình

Software

RabbitMQ Documentation: Consumer PrefetchDocumentation

RabbitMQ

RabbitMQ's reference page for consumer prefetch and the `basic.qos` semantics that drive it. The most operationally important part is the distinction between per-channel and per-consumer prefetch, which controls how a single prefetch setting applies when a channel hosts multiple consumers. The page also documents the relationship between prefetch and the `consumer_utilisation` metric exposed by the management plugin, which is the standard signal for whether prefetch is too low (consumer idle, waiting on broker) or appropriately tuned (consumer near 100% busy). Chapter VIII cites it as the official anchor for the claim that back-pressure in AMQP is opt-in from the consumer side, not automatic from the broker side.

Xuất hiện trongKhi consumer không theo kịp

Software

RabbitMQ Documentation: Publisher ConfirmsDocumentation

RabbitMQ

RabbitMQ's reference page for publisher confirms. The most operationally important section is 'When is a Message Confirmed,' which spells out the subtle dependency that most blog posts skip: with durable queues and persistent messages, the broker only confirms after the message is fsynced to disk; with non-durable queues or transient messages, the broker confirms once the message is enqueued. Same mechanism, different semantics, and the difference is exactly the gap chapter VII is trying to make visible. The page also covers pipelined and async confirm handlers, which are how production systems actually use confirms without paying full round-trip latency per publish.

Xuất hiện trongPublish thành công có nghĩa là gì

Software

RabbitMQ Documentation: Dead Letter ExchangesDocumentation

RabbitMQ

RabbitMQ's reference page for the dead-letter mechanism. It documents the `x-dead-letter-exchange` and `x-dead-letter-routing-key` queue arguments, the three conditions that send a message to the configured DLX (rejection with `requeue: false`, message TTL expiry, queue length limit reached), and the way the original routing key, exchange, and the `x-death` header are preserved when the message is dead-lettered. The chapter cites it as the anchor for the claim that a DLQ is a normal queue made special only by configuration on the source queue, not by any property of its own.

Xuất hiện trongDead letter queue, khi message không thể xử lý

Software

RabbitMQ Documentation: Publishers and RoutingDocumentation

RabbitMQ

RabbitMQ's reference pages on publishing semantics and routing behavior. The Publishers page documents the `mandatory` flag, publisher confirms, and the `basic.return` event the broker fires when a message cannot be routed. The Routing page covers exchange types and binding match rules. Together they are the official anchor for the unroutable-drop behavior chapter IV diagnoses: the broker drops without erroring because the protocol contract is that the publisher opts in to return-on-unroutable rather than the broker opting in to alert on it.

Xuất hiện trongDirect exchange, gửi đúng người

Software

On the Criteria To Be Used in Decomposing Systems into ModulesPaper

Parnas, David L. · 1972

The paper that argued module boundaries should follow information hiding rather than functional decomposition. Three series draw on Parnas from different angles. The RabbitMQ chapter applies it to time, two modules that compile separately and share no types can still be tightly coupled if they have to be alive at the same moment for either to function. The NSA Contract-First chapter applies it to organizational scale, Parnas's claim that a module should hide its implementation behind an interface is exactly what Bezos's 2002 mandate enforced thirty years later between Amazon teams. The OOP Chapter 3 uses his framing directly: 'design decisions which are likely to change' is what the chapter calls the axis of change, and decomposing around that axis is how localized change cost becomes possible.

Xuất hiện trongBạn đã từng làm cho hệ thống chờ nhau Contract-First Architecture Thứ bạn đang thiết kế cho là sự thay đổi Encapsulation — ẩn quyết định, không phải ẩn dữ liệu SOLID — nguyên tắc hay quyết định?Khi OOP là câu trả lời sai

Software

Domain Modeling Made Functional: Tackle Software Complexity with Domain-Driven Design and F#Book

Wlaschin, Scott · 2018

Wlaschin demonstrates, in F# but with arguments that travel across languages, that functional programming is not anti-OOP: it simply puts pure functions and algebraic composition first. The chapter cites his central claim that for domains where validation and transformation are the core rather than entity behavior, modeling with pure functions and algebraic types makes the design clearer, not more complicated. The book is the most complete worked example of the functional-core style applied to a real business domain.

Xuất hiện trongKhi OOP là câu trả lời sai

Software

Data-Oriented Design and C++ (CppCon 2014)Article

Acton, Mike · 2014

The seminal conference talk on data-oriented design, delivered from Acton's vantage as engineering director at Insomniac Games. The core argument the chapter leans on: "if you don't understand the data, you don't understand the problem." Acton makes the cost of the object abstraction physical: CPUs read cache lines, array-of-structs layouts waste most of every line loaded, and reorganizing the same data as struct-of-arrays yields order-of-magnitude speedups with no change to the algorithm. The talk is the standard citation for why paradigm choice stops being aesthetic at certain scales and access patterns.

Xuất hiện trongKhi OOP là câu trả lời sai

Software

BoundariesArticle

Bernhardt, Gary · 2012

The talk (and the accompanying Destroy All Software screencasts) that named the Functional Core, Imperative Shell pattern. Bernhardt's formulation gives the chapter its working definition of the boundary: business decisions live in pure functions that take values and return values, while a thin imperative shell owns IO, state, and coordination. The chapter credits him for the naming because the pattern is easy to practice unknowingly and hard to discuss without the name.

Xuất hiện trongKhi OOP là câu trả lời sai

Software

Why Isn't Functional Programming the Norm?Article

Feldman, Richard · 2019

Feldman's ElmConf 2019 talk argues that functional programming's minority status is mostly historical path dependence rather than inherent difficulty: mainstream education followed the C++ and Java lineage, each generation learned from the previous generation's codebases, and familiarity compounded. The chapter uses this as a counterpoint in a footnote: "popular" and "natural for this problem" are independent properties, and the dominance of one paradigm says little about its fit for any particular problem shape.

Xuất hiện trongKhi OOP là câu trả lời sai

Software

Monolith to Microservices: Evolutionary Patterns to Transform Your MonolithBook

Newman, Sam · 2019

The companion volume to Building Microservices. The framing of the distributed monolith — many services, one fate — and the case for the strangler fig migration approach are both drawn from this book. Newman is unusually honest that the most common outcome of a microservices project is failure caused by underestimating coupling.

Xuất hiện trongEvent-Driven Architecture Legacy codebase

Software

CQRSArticle

Fowler, Martin · 2011

Fowler's bliki entry on CQRS. The piece is short but does the important work of warning that CQRS is one of the most over-applied patterns in distributed systems: most systems do not need it, and adding it without need doubles complexity for no gain. The post inherits that caution when it sequences CQRS as a level-2 commitment, not a default.

Xuất hiện trongEvent-Driven Architecture

Software

Eventually ConsistentArticle

Vogels, Werner · 2008

Published in ACM Queue. Vogels, then CTO of Amazon, argued that consistency in distributed systems is a spectrum rather than a binary, and that most user-facing systems can tolerate eventual consistency if the convergence window is short. The essay is the source of the practical language used to talk about read-your-writes, monotonic reads, and the trade-offs the post builds on in the §6 discussion of consistency.

Xuất hiện trongEvent-Driven Architecture Data Consistency, Cái Giá Của Sự Thật Pattern không phải câu trả lời, pattern là câu hỏi đã đông cứng

Software

What Every Programmer Should Know About Object-Oriented DesignBook

Page-Jones, Meilir · 1996

The book that gave software design the connascence taxonomy — a ranking of coupling forms by how strongly they bind components. The post borrows connascence of meaning specifically: when two services must agree on what a value means, not just that the field exists, the coupling is deeper than the API contract makes visible.

Xuất hiện trongEvent-Driven Architecture

Software

Object-Oriented Software ConstructionBook

Meyer, Bertrand · 1988

The book that introduced Command Query Separation as a method-level principle: a routine should either change state or return information, never both. Greg Young later lifted the principle to the architectural level as CQRS. The same book is the original source of the Open-Closed Principle (a module should be open for extension, closed for modification) and of Design by Contract (preconditions, postconditions, invariants), the framework that LSP is most precisely stated in.

Xuất hiện trongEvent-Driven Architecture SOLID là ngôn ngữ, không phải luật Encapsulation — ẩn quyết định, không phải ẩn dữ liệu SOLID — nguyên tắc hay quyết định?Code review bằng mắt OOP

Software

A Behavioral Notion of SubtypingPaper

Liskov, Barbara and Wing, Jeannette · 1994

The formal statement of the principle Liskov first sketched in her 1987 OOPSLA keynote, Data Abstraction and Hierarchy. The paper defines subtype substitutability in the language of preconditions, postconditions, and invariants from Meyer's Design by Contract: a subtype may weaken what callers must provide (preconditions) and strengthen what it promises in return (postconditions), but never the reverse. The Rectangle-Square example most readers know comes from Martin's later interpretation for object-oriented practice, not from this paper.

Xuất hiện trongSOLID là ngôn ngữ, không phải luật Inheritance và Composition — không phải là "dùng cái nào"

Software

Effective JavaBook

Bloch, Joshua · 2001

A practical engineering guide to writing correct, robust Java. The third edition (2018) updates coverage for Java 8 and later. Item 18, 'Favor composition over inheritance,' is the engineering argument the chapter draws on: inheritance is fragile because it exposes a subclass to the superclass's internal implementation. Bloch's formulation — 'design and document for inheritance, or else prohibit it' — captures the key insight: a class that permits subclassing without explicitly designing for it is making implicit promises it cannot reliably keep when it evolves. Chapter 5 of the OOP series cites Item 18 as a practical lens for the same point made formally by Mikhajlov and Sekerinski: the problem is not that inheritance is bad, but that an undocumented superclass silently commits to implementation details that subclasses then depend on.

Xuất hiện trongInheritance và Composition — không phải là "dùng cái nào"

Software

A Formal Model of the Dynamic Dispatch for the Fragile Base Class ProblemPaper

Mikhajlov, Mikhail and Sekerinski, Emil · 1998

Presented at ECOOP 1998. The paper formally proves that the fragile base class problem is not a failure of discipline but an inherent property of open inheritance systems with mutable state: no amount of 'careful' superclass writing can eliminate the risk that a future internal change will break subclasses that never modified their own code. The proof is based on a formal model of dynamic dispatch that shows exactly which method call sequences a subclass can observe, and how changing the superclass's call sequencing — without changing its public interface — can invalidate subclass invariants. The OOP Chapter 5 Footnote cites this to distinguish fragile base class from bad code style: the problem is structural, and the engineering mitigations (design for inheritance or prohibit it; prefer composition) address it at the design level rather than the implementation level.

Xuất hiện trongInheritance và Composition — không phải là "dùng cái nào"

Software

Applying UML and Patterns: An Introduction to Object-Oriented Analysis and DesignBook

Larman, Craig · 2004

Source of the protect variation principle, Larman's generalization of OCP. Larman argued that many of the canonical Gang of Four patterns (Strategy, Observer, Adapter, Facade) are concrete expressions of the same underlying idea: wrap a stable interface around the points in a system where the implementation is known to vary. Reframing OCP as protect variation moves the question from "does this code follow the principle?" to "has this point been shown to vary often enough to justify the abstraction?"

Xuất hiện trongSOLID là ngôn ngữ, không phải luật

Software

Patterns of Enterprise Application ArchitectureBook

Fowler, Martin · 2002

The reference where Active Record and Data Mapper are named and contrasted as the two main strategies for connecting a domain object to its persistence. Fowler does not claim one is universally better; he describes the trade-off explicitly. Active Record (Django, Rails, TypeORM's active-record mode) keeps domain and persistence in the same class, trading testability for developer velocity. Data Mapper (Hibernate, SQLAlchemy classical) separates them and is preferred when domain complexity requires testing business rules independently of the database. Chapter 9 (Domain Logic Patterns) also documents the Service Layer pattern, which is cited in Chapter 2 of the OOP series as the architectural convention that normalized putting business logic in service classes rather than domain objects — creating the conditions for the anemic domain model to become the industry default.

Xuất hiện trongSOLID là ngôn ngữ, không phải luật Pattern không phải câu trả lời, pattern là câu hỏi đã đông cứng Tại sao code của bạn là OOP giả Domain modeling — từ bài toán thật đến class

Software

Inversion of Control Containers and the Dependency Injection patternArticle

Fowler, Martin · 2004

The article Fowler wrote because the community had begun conflating Inversion of Control, Dependency Injection, and the Dependency Inversion Principle into a single idea, after the rise of IoC containers like Spring. Fowler's key clarification is the nesting: IoC is the broadest concept (a control-flow inversion), Dependency Injection is one mechanism for achieving IoC, and DIP is a design decision about which direction abstractions point. A team can use the most sophisticated DI container on the market and still violate DIP if the abstractions live in the wrong package.

Xuất hiện trongSOLID là ngôn ngữ, không phải luật Dependency — triết học trước, framework sau

Software

Hexagonal ArchitectureArticle

Cockburn, Alistair · 2005

The article that proposed Ports and Adapters, posted on Cockburn's personal site and now archived at alistair.cockburn.us. Cockburn preferred the name Ports and Adapters because it describes the mechanism, while Hexagonal Architecture only describes the diagram's shape. The substance is the rule that infrastructure depends on domain, never the reverse, with ports owned by the domain and adapters living in the infrastructure layer. Later interpretations by Martin (Clean Architecture, 2012) and Palermo (Onion Architecture, 2008) carry the same one-way dependency rule with different layer vocabularies.

Xuất hiện trongSOLID là ngôn ngữ, không phải luật

Software

A Mathematical Theory of CommunicationPaper

Shannon, Claude E. · 1948

The founding paper of information theory. Shannon's central move was to deliberately remove meaning from the equation: information is measured by how much uncertainty a message resolves, not by what it means. From that move he derived entropy, channel capacity, source coding theorem, and noisy channel coding theorem — the four pillars on which every digital communication system, every compression codec, and every cross-entropy loss in modern machine learning still rests.

Xuất hiện trongNgôn Ngữ Của Sự Bất Ngờ

Science

Transmission of InformationPaper

Hartley, Ralph V. L. · 1928

The first paper to propose that information can be quantified in a technical, measurable sense, defined as proportional to the log of the number of distinguishable symbols. Hartley's H = n·log(s) was the right starting point — Shannon's later contribution was to extend it to account for non-uniform probability distributions and noisy channels.

Xuất hiện trongNgôn Ngữ Của Sự Bất Ngờ

Science

The Mathematical Theory of CommunicationBook

Shannon, Claude E. and Weaver, Warren · 1949

The book-length expansion of Shannon's 1948 paper, with an extensive introduction by Warren Weaver for a broader audience. Weaver introduced the three-level framework — A: technical (do the symbols arrive correctly?), B: semantic (do they carry the intended meaning?), C: effectiveness (do they produce the intended behavior?). Shannon's theory completely resolves Level A and is silent on B and C by design, which is why it generalizes to every transmission system in nature.

Xuất hiện trongNgôn Ngữ Của Sự Bất Ngờ

Science

The Information Bottleneck MethodPaper

Tishby, Naftali; Pereira, Fernando C.; Bialek, William · 2000

Introduced the information bottleneck framework: learn a compressed representation Z of input X that preserves information about target Y, formalized as the Lagrangian L = I(X; Z) − β·I(Z; Y). The framework gives a principled answer to 'what is a good representation?' that is independent of any specific architecture. Tishby and Schwartz-Ziv's 2017 follow-up extended this to a (contested) theory of why deep neural networks generalize, claiming that training proceeds in two phases — fitting then compression.

Xuất hiện trongNgôn Ngữ Của Sự Bất Ngờ

Science

Channel Polarization: A Method for Constructing Capacity-Achieving Codes for Symmetric Binary-Input Memoryless ChannelsPaper

Arıkan, Erdal · 2009

The first explicit construction of a code provably achieving Shannon channel capacity, with O(n log n) encoding and decoding complexity. The key insight is channel polarization: combining n channel uses in a recursive structure produces virtual channels that split into 'almost perfect' and 'almost useless' sets, and you simply transmit over the perfect ones. Polar codes are now part of the 5G NR standard — sixty-one years after Shannon proved such codes had to exist.

Xuất hiện trongNgôn Ngữ Của Sự Bất Ngờ

Science

Probability Theory: The Logic of ScienceBook

Jaynes, Edwin T. · 2003

Posthumously published treatise arguing that probability is best understood as extended logic — a quantitative measure of credence under uncertainty — and that the maximum entropy principle is the unique consistent rule for assigning probabilities given only partial constraints. Jaynes's framework reframes Shannon entropy not just as a measure of uncertainty but as the foundation of inductive reasoning: when you know only some moments, the maximum-entropy distribution is the one that adds no information beyond what you have.

Xuất hiện trongNgôn Ngữ Của Sự Bất Ngờ

Science

Generative Adversarial NetsPaper

Goodfellow, Ian; Pouget-Abadie, Jean; Mirza, Mehdi; Xu, Bing; Warde-Farley, David; Ozair, Sherjil; Courville, Aaron; Bengio, Yoshua · 2014

Introduced Generative Adversarial Networks. The proof showed that with an optimal discriminator, the GAN training objective is equivalent to minimizing the Jensen-Shannon divergence between the data distribution and the generator's distribution. This was the first construction to connect adversarial training rigorously to an information-theoretic divergence — and the failure mode (mode collapse) is a direct consequence of JSD's mode-seeking behavior, which Wasserstein GAN later sidestepped by replacing JSD with a transport-based metric.

Xuất hiện trongNgôn Ngữ Của Sự Bất Ngờ

Science

CausalityBook

Pearl, Judea · 2000

The foundational treatment of causal inference as a formal mathematical discipline distinct from probability and statistics. Pearl's central argument is that information theory and traditional statistics, however powerful, operate purely on observed probability distributions and therefore cannot answer interventional questions ('if I do X, what happens to Y?'). The do-calculus he developed gives a precise language for this gap. The post uses this to mark where mutual information stops — at correlation; causation requires a different framework.

Xuất hiện trongNgôn Ngữ Của Sự Bất Ngờ

Science

Non-Cooperative GamesPaper

Nash, John F. · 1950

The 28-page Princeton PhD dissertation that founded modern game theory. Nash proved that every finite game (any number of players, any number of pure strategies each) has at least one Nash Equilibrium when mixed strategies are allowed, using Kakutani's fixed-point theorem. The result extended von Neumann's minimax theorem from zero-sum two-player games to arbitrary finite games, giving game theory the mathematical foundation it lacked. Awarded the 1994 Nobel in Economics jointly with Harsanyi and Selten.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

The Logic of Animal ConflictPaper

Maynard Smith, John & Price, George R. · 1973

The paper that introduced the Evolutionarily Stable Strategy (ESS) concept and bridged game theory with evolutionary biology. Using the Hawk-Dove game, they showed why animals rarely fight to death even when capable: when the cost of injury exceeds the value of the resource, the stable population mix favors restraint. ESS extends Nash Equilibrium from rational conscious agents to strategies competing under natural selection, opening the entire field of evolutionary game theory. Maynard Smith expanded the framework in *Evolution and the Theory of Games* (1982).

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

The Tragedy of the CommonsPaper

Hardin, Garrett · 1968

Hardin's parable: each herder on a common pasture has a private incentive to add cattle, even though it leads to collective depletion. The structure is the Prisoner's Dilemma generalized to many players sharing a common-pool resource. Hardin concluded the commons must inevitably be destroyed without privatization or external control, a claim Elinor Ostrom later disproved empirically. Despite the contested policy implications, the paper crystallized a recurring failure mode of decentralized choice that game theory had to grapple with.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

Governing the Commons: The Evolution of Institutions for Collective ActionBook

Ostrom, Elinor · 1990

The empirical counter to Hardin. Ostrom documented dozens of long-lived commons-management systems — Swiss alpine pastures, Spanish irrigation networks, Japanese village forests, Maine lobster fisheries — that had successfully avoided the tragedy without privatization or state control. From these cases she extracted eight design principles for sustainable commons institutions: clearly defined boundaries, congruence between rules and local conditions, collective-choice arrangements, monitoring, graduated sanctions, conflict-resolution mechanisms, recognized rights to organize, and nested enterprises. Awarded the 2009 Nobel in Economics, the first woman to receive it.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

Über ein Paradoxon aus der VerkehrsplanungPaper

Braess, Dietrich · 1968

Originally written in German and largely unnoticed for years. Braess proved that adding a road to a traffic network can shift the Nash Equilibrium of route choice to a worse state, making everyone slower. The paradox was later popularized through the work of Hal Varian and Tim Roughgarden. Real-world inverse examples include the 2003 demolition of the Cheonggyecheon highway in Seoul, the 1969 closure of streets in Stuttgart, and shifts after 9/11 in New York. The mechanism appears in internet routing, electrical grids, and financial networks.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

Counterspeculation, Auctions, and Competitive Sealed TendersPaper

Vickrey, William · 1961

The paper that introduced the second-price sealed-bid auction (Vickrey auction): each bidder submits a sealed bid, the highest bidder wins but pays the second-highest bid. Vickrey proved that truthful bidding becomes a weakly dominant strategy, and the item ends up with whoever values it most — a mechanism where the Nash Equilibrium coincides with the social optimum (Price of Anarchy = 1). This is one of the cleanest demonstrations of how good mechanism design can sidestep the failures of laissez-faire. Vickrey received the 1996 Nobel in Economics three days before his death.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

Professionals Play MinimaxPaper

Palacios-Huerta, Ignacio · 2003

An empirical test of mixed-strategy Nash Equilibrium using 1,417 penalty kicks from European professional football leagues. Palacios-Huerta found that both kickers and goalkeepers play very close to the theoretical equilibrium, including the indifference condition: success rates of penalty kicks do not differ significantly between shooting directions, exactly as the theory predicts. The result is striking because no player is consciously solving equations — thousands of matches filtered out optimal patterns through experience. One of the strongest field validations of game-theoretic prediction in real human behavior.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

The Strategy of ConflictBook

Schelling, Thomas · 1960

A foundational text of strategic-bargaining theory, introducing focal points (now called Schelling points), the strategic value of commitment, the role of threats that leave something to chance, and the bargaining logic of deterrence. Schelling wrote not as a pure theorist but as someone trying to understand how Cold War crises like Berlin and Cuba could be navigated without disaster. The book reframes credibility as a structural property rather than a psychological one: a threat is credible when carrying it out is rational at the moment, not because of willpower. Awarded the 2005 Nobel in Economics jointly with Aumann.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

The Evolution of CooperationBook

Axelrod, Robert · 1984

Axelrod ran two computer tournaments of strategies for the repeated Prisoner's Dilemma. The winner of both was Tit-for-Tat, submitted by Anatol Rapoport: four lines of code, the simplest strategy in either tournament. Axelrod identified four properties of successful strategies — nice (does not defect first), retaliatory, forgiving, and clear — and showed through evolutionary simulation that cooperation can emerge in a sea of defectors if it appears in dense enough clusters. The book is the empirical and computational anchor for the Folk Theorem's claim that cooperation in repeated games is rational, not just hopeful.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

An Experimental Analysis of Ultimatum BargainingPaper

Güth, Werner; Schmittberger, Rolf; Schwarze, Bernd · 1982

The paper that introduced the ultimatum game and produced the first systematic empirical evidence that humans deviate from Nash Equilibrium predictions in consistent ways. Players reject offers below ~20-30% of the pot even though that means both go home empty-handed — punishing perceived unfairness at personal cost. The result is robust across cultures, ages, and stakes, and launched the field of behavioral game theory. The lesson is not that humans are irrational but that they optimize a richer objective function than material payoff alone.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

Prospect Theory: An Analysis of Decision under RiskPaper

Kahneman, Daniel & Tversky, Amos · 1979

The most-cited paper in the history of economics journals. Prospect theory replaces expected-utility maximization with a descriptive model of how real humans evaluate risky options: people are loss-averse (losses hurt about twice as much as equivalent gains feel good), they overweight small probabilities and underweight large ones, and they evaluate outcomes relative to a reference point rather than in absolute terms. The framework explains many systematic deviations from classical game theory's rational-agent assumption. Kahneman received the 2002 Nobel in Economics; Tversky had died in 1996.

Xuất hiện trongNơi Không Ai Muốn Đến, Nhưng Không Ai Rời Đi

Science

The Complexity of Theorem-Proving ProceduresPaper

Cook, Stephen A. · 1971

The paper that opened the modern theory of NP-completeness. Cook proved that Boolean Satisfiability (SAT) is NP-complete by showing that any polynomial-time nondeterministic computation can be encoded as a SAT formula whose satisfiability mirrors the computation's acceptance. The proof established the first concrete NP-complete problem and gave a method for proving others by reduction. The post uses Cook's result as the structural moment when P vs NP stops being a question about one problem and becomes a question about an entire class.

Xuất hiện trongKhám Phá và Kiểm Chứng

Science

Reducibility Among Combinatorial ProblemsPaper

Karp, Richard M. · 1972

Published a year after Cook's theorem, Karp showed that twenty-one familiar combinatorial problems (TSP, Knapsack, Graph Colouring, Clique, Vertex Cover, Hamiltonian Cycle, Subset Sum, and more) are all NP-complete by reducing them to one another and to SAT. The paper revealed that decades of unrelated optimisation problems engineers had been struggling with separately were in fact the same problem in different disguises, and it gave the field a single explanation for the absence of efficient algorithms. One of the most cited papers in computer science.

Xuất hiện trongKhám Phá và Kiểm Chứng

Science

On the Computational Complexity of AlgorithmsPaper

Hartmanis, Juris & Stearns, Richard E. · 1965

The founding paper of computational complexity theory as a discipline distinct from computability. Hartmanis and Stearns defined time-bounded computation, proved hierarchy theorems showing that more time allows strictly more problems to be solved, and gave the field its basic vocabulary. The authors received the 1993 Turing Award for this work. The post cites it as the moment complexity theory crystallised as a serious branch of mathematics, six years before Cook would sharpen the question to P vs NP.

Xuất hiện trongKhám Phá và Kiểm Chứng

Science

Relativizations of the P = ?NP QuestionPaper

Baker, Theodore P. & Gill, John & Solovay, Robert · 1975

The first identified barrier to settling P vs NP. The authors constructed two oracles: one relative to which P = NP, and another relative to which P ≠ NP. Any proof technique that "relativizes", that gives the same conclusion under arbitrary oracles, therefore cannot decide the actual question. Since most classical computability and diagonalisation arguments do relativize, the result told the community that the existing toolbox was insufficient and that any future proof must look inside the structure of problems, not treat them as black boxes.

Xuất hiện trongKhám Phá và Kiểm Chứng

Science

Natural ProofsPaper

Razborov, Alexander A. & Rudich, Steven · 1994

The second great barrier. Razborov and Rudich isolated a structural pattern shared by most known circuit-complexity lower-bound arguments and called it a "natural proof". They then showed that if strong pseudorandom generators exist (an assumption widely believed and implied by certain standard cryptographic conjectures), no natural proof can establish superpolynomial circuit lower bounds. Since P ≠ NP would also imply such pseudorandom generators, the result has the disturbing flavour: P ≠ NP might be true but unprovable by the kind of technique most of the field has been using. Eliminates a vast region of proof space.

Xuất hiện trongKhám Phá và Kiểm Chứng

Science

Algebrization: A New Barrier in Complexity TheoryPaper

Aaronson, Scott & Wigderson, Avi · 2009

The third barrier. After relativization, complexity theorists turned to algebraic techniques and produced major results that did not relativize (notably Shamir's IP = PSPACE in 1992). Aaronson and Wigderson generalised the oracle construction to "algebraic oracles" (extending boolean functions to low-degree polynomials over finite fields) and showed that most known algebraic methods "algebrize", and that algebrizing proofs likewise cannot resolve P vs NP. With relativization, natural proofs, and algebrization, the three barriers together rule out the great majority of techniques that have ever been tried; the post uses these as the explanation for why fifty years of effort have not produced a proof.

Xuất hiện trongKhám Phá và Kiểm Chứng

Science

Space/Time Trade-offs in Hash Coding with Allowable ErrorsPaper

Bloom, Burton H. · 1970

The original Bloom filter paper, 7 pages in Communications of the ACM. Bloom proposed treating membership errors as a tunable parameter rather than a defect: by hashing each key to several bit positions in a shared array and accepting an explicit false-positive rate, the same query that costs O(b) per key in a hash table becomes O(k) for a constant k. The post takes its title from a single word in this paper's title: "allowable". The 50-year arc from this 7-page paper to Chrome Safe Browsing, Cassandra, and Ethereum is a worked example of how making error explicit, rather than minimising it, can change what is buildable.

Xuất hiện trongCái Máy Tính Biết Cách Nói Dối Có Kiểm Soát

Software

Summary Cache: A Scalable Wide-Area Web Cache Sharing ProtocolPaper

Fan, Li & Cao, Pei & Almeida, Jussara & Broder, Andrei · 2000

Introduced the counting Bloom filter in the context of proxy-cache sharing across wide-area networks. The problem: cooperating proxies need to know what their peers are caching so they can fetch from each other rather than the origin, and the set of cached items changes constantly as entries are evicted. A standard Bloom filter cannot delete; the counting variant trades roughly 4× memory for safe decrement-on-eviction. The post uses this paper as the canonical answer to the question "can we keep Bloom's space efficiency while supporting deletion?"

Xuất hiện trongCái Máy Tính Biết Cách Nói Dối Có Kiểm Soát

Software

Cuckoo Filter: Practically Better Than BloomPaper

Fan, Bin & Andersen, David G. & Kaminsky, Michael & Mitzenmacher, Michael D. · 2014

The 2014 CoNEXT paper that gave probabilistic membership filters their modern challenger. Cuckoo filters store short fingerprints in cuckoo-hashed buckets, with the crucial trick of partial-key hashing: the second candidate bucket is computed as $i_2 = i_1 \oplus \mathrm{hash}(\text{fingerprint})$, an XOR that makes the lookup symmetric and avoids ever storing the original key. The result supports deletion (Bloom cannot), uses comparable or less space at false-positive rates below 3%, and answers lookups in $O(1)$ memory accesses regardless of the false-positive rate.

Xuất hiện trongCái Máy Tính Biết Cách Nói Dối Có Kiểm Soát

Software

HyperLogLog: the analysis of a near-optimal cardinality estimation algorithmPaper

Flajolet, Philippe & Fusy, Éric & Gandouet, Olivier & Meunier, Frédéric · 2007

The paper that gave Redis its `PFCOUNT`, BigQuery its approximate `COUNT(DISTINCT)`, and most modern analytics stacks their cardinality primitive. HyperLogLog refines the older Flajolet-Martin and LogLog estimators with two ideas: stochastic averaging across $m$ registers (each tracking the maximum leading-zero count of items hashed to it), and the harmonic mean as the way to combine register estimates while dampening the influence of lucky outliers. The paper proves a standard error of $\approx 1.04/\sqrt{m}$ and exhibits a configuration that estimates billion-element cardinalities to within 1% using a few kilobytes. The post points to this paper as the cleanest example of "scarce memory + tunable error" giving you something exact counting cannot.

Xuất hiện trongCái Máy Tính Biết Cách Nói Dối Có Kiểm Soát

Software

An Improved Data Stream Summary: The Count-Min Sketch and its ApplicationsPaper

Cormode, Graham & Muthukrishnan, S. · 2005

The paper that defined the Count-Min sketch and gave streaming frequency estimation a clean asymmetric guarantee: the estimate is never less than the true frequency, and exceeds it by more than $\varepsilon N$ with probability at most $\delta$, where $N$ is the total weight inserted. The structure is a $d \times w$ counter matrix with $d$ independent hash functions, one per row; lookup takes the minimum across rows. Cormode and Muthukrishnan derive the sizes $w = \lceil e/\varepsilon\rceil$ and $d = \lceil \ln(1/\delta)\rceil$ from a Markov-inequality argument. The same paper sketches the heavy-hitters extension and the use of Count-Min for inner-product estimation, quantile sketches, and natural-join size estimation.

Xuất hiện trongCái Máy Tính Biết Cách Nói Dối Có Kiểm Soát

Software

Probabilistic Counting Algorithms for Data Base ApplicationsPaper

Flajolet, Philippe & Martin, G. Nigel · 1985

The earlier paper from which HyperLogLog eventually descended. Flajolet and Martin introduced the use of the leading-zero count of hashed values as a cheap, mergeable statistic for cardinality. Their analysis showed that the maximum leading-zero count over a stream is concentrated around $\log_2 n$, giving an estimator that requires only $O(\log \log n)$ bits per register. The original estimator had high variance, which the LogLog (Durand–Flajolet 2003) and HyperLogLog (2007) refinements drove down with averaging and harmonic-mean correction. The post cites this paper as the conceptual root of all modern cardinality sketches.

Xuất hiện trongCái Máy Tính Biết Cách Nói Dối Có Kiểm Soát

Software

Calculated Risks: How to Know When Numbers Deceive YouBook

Gigerenzer, Gerd · 2002

A book about how to think clearly when probabilities and errors are involved, framed around concrete medical-screening and forensic-statistics examples. Gigerenzer's central observation is that asymmetric error costs (a false negative on a cancer screen costs much more than a false positive) are systematically obscured by conventional statistical training, and that almost any decision-making process improves when those two costs are made explicit. The post uses this in the closing section to point out that the engineering habit probabilistic data structures encode (decide which kind of error you can afford, then design for that asymmetry) is the same habit Gigerenzer recommends for medical and policy decisions.

Xuất hiện trongCái Máy Tính Biết Cách Nói Dối Có Kiểm Soát

Science

Deceptive DesignBook

Brignull, Harry · 2023

The full-length treatment of the field Brignull founded with his 2010 darkpatterns.org site. The book consolidates the taxonomy of named patterns, walks through case studies (Amazon Prime, Facebook privacy settings, Booking.com urgency claims, GDPR consent banners), traces the regulatory response from FTC actions through the EU Digital Services Act, and articulates the central distinction the field rests on: the difference between an interface that is hard to use because of constrained resources and an interface that is hard to use because someone decided it should be hard to use in exactly this way. The post takes the taxonomy from this book and its predecessor site as its scaffolding for Sections 3–4.

Xuất hiện trongNút Cancel Không Có Ở Đó

Software

Dark Patterns at Scale: Findings from a Crawl of 11K Shopping WebsitesPaper

Mathur, Arunesh & Acar, Gunes & Friedman, Michael J. & Lucherini, Eli & Mayer, Jonathan & Chetty, Marshini & Narayanan, Arvind · 2019

The first systematic empirical study of dark patterns. The Princeton team built an automated crawler that visited 11,000 shopping sites, captured HTML and screenshots, and used a combination of automated detection and manual annotation to classify dark patterns across 15 types in 7 groups. They found dark patterns on 11.1% of sites, but the finding the post leans on most is the correlation: larger sites by Alexa traffic rank used more dark patterns, not fewer. The lower-bound caveat is also load-bearing — the crawler only captured static page state, so dynamic and interaction-triggered dark patterns (exit-intent popups, multi-step cancellation flows) were systematically undercounted.

Xuất hiện trongNút Cancel Không Có Ở Đó

Software

Value Sensitive Design and Information SystemsPaper

Friedman, Batya & Kahn, Peter H. Jr. & Borning, Alan · 2008

The fullest articulation of the VSD methodology, written for *The Handbook of Information and Computer Ethics*. The chapter sets out the three interleaved investigations (conceptual, empirical, technical), the direct/indirect stakeholder distinction, the envisioning-cards practice, and the insistence that value tensions must be made explicit and adjudicated rather than resolved silently by whichever direction is most convenient for the implementing organisation. The post uses VSD in Section 11 as the named alternative to the implicit "optimise conversion and call the result good UX" approach that produces dark patterns in the first place.

Xuất hiện trongNút Cancel Không Có Ở Đó

Software

The Age of Surveillance Capitalism: The Fight for a Human Future at the New Frontier of PowerBook

Zuboff, Shoshana · 2019

The most systematic analysis of the economic structures that produce dark patterns at platform scale. Zuboff argues that surveillance capitalism is a distinct economic logic, qualitatively different from industrial capitalism: behavioural data is no longer a by-product of service provision but the primary product, with services existing as the apparatus to extract it. Dark patterns, in this frame, are not isolated design failures but local manifestations of an economic system that monetises asymmetric information about users. The post borrows Zuboff's framing for the closing argument that individual remedies (user vigilance, ethical designers) are insufficient against a structural condition.

Xuất hiện trongNút Cancel Không Có Ở Đó

Philosophy

Judgment under Uncertainty: Heuristics and BiasesPaper

Tversky, Amos & Kahneman, Daniel · 1974

The paper that founded the modern behavioural-economics research programme on cognitive bias. Tversky and Kahneman documented systematic ways in which human judgement diverges from the rational-agent model — availability, representativeness, anchoring, and others — and argued that these are not deficits to be trained away but heuristics that work in the environments where they evolved. The post uses this work as the conceptual scaffolding for Section 5, where the same heuristics that produce reasonable decisions in ordinary contexts become exploitable surfaces inside interfaces designed to fire them in the wrong direction.

Xuất hiện trongNút Cancel Không Có Ở Đó

Philosophy

Tractatus Logico-PhilosophicusBook

Wittgenstein, Ludwig · 1922

Wittgenstein's early work, written in the trenches of the First World War and published in 1922. The DDD series draws on its most cited line, proposition 5.6, "Die Grenzen meiner Sprache bedeuten die Grenzen meiner Welt" (The limits of my language mean the limits of my world). The Tractatus argues that the structure of language and the structure of the world it can describe are bound to each other, what we have no words for, we cannot think clearly about. Chapter 4 borrows this to frame a model as the language of a code context: the limits of the model are the limits of what the code can reason about. The point is not that programmers need to read Wittgenstein. The point is that omission from a model is not an empty space, it is the boundary that defines what the context can speak of at all. Note: Wittgenstein later rejected much of the Tractatus in the Philosophical Investigations, but this particular insight about the binding of language and world survives across both works.

Xuất hiện trongModel

Philosophy

Philosophical InvestigationsBook

Wittgenstein, Ludwig · 1953

The posthumous work in which Wittgenstein abandons the picture theory of his earlier *Tractatus* and replaces it with the idea that the meaning of a word is its use within a specific form of life, what he called a 'language game.' Sections §43 and §66 to §67 are the canonical loci. The DDD series draws on this argument to explain why the same identifier in code can be technically correct in two contexts and still mean two different things at the boundary: meaning never detaches from the practice in which the word is used, so two practices using the same word are not in fact sharing it. The point is not that programmers should read Wittgenstein. The point is that the language gap inside a codebase is not a coding problem in disguise; it is a feature of how meaning works.

Xuất hiện trongCái tên sai làm hỏng cả hệ thống Entity, Value Object, và câu hỏi về danh tính

Philosophy

Why getter and setter methods are evilArticle

Holub, Allen · 2003

Holub's JavaWorld essay arguing that getters and setters violate encapsulation and should not exist. The position is more extreme than most practitioners hold, but it identifies something real: when a developer reflexively adds a getter for every field, they are exposing implementation and calling it an interface. The OOP Chapter 4 Footnote uses the piece as a calibration point, acknowledging that getters for display and serialization are legitimate while agreeing with the underlying diagnosis — the reflex is the problem, not the method form.

Xuất hiện trongEncapsulation — ẩn quyết định, không phải ẩn dữ liệu

Software

Dark Patterns after the GDPR: Scraping Consent Pop-ups and Demonstrating their InfluencePaper

Nouwens, Midas & Liccardi, Ilaria & Veale, Michael & Karger, David & Kagal, Lalana · 2020

The empirical case for why GDPR did not reduce dark patterns in cookie consent but redirected them. The team scraped 680 UK websites' consent pop-ups and found only 11.8% used designs compliant with GDPR's strictest interpretation, while 57.4% used at least one dark pattern in the consent flow itself. The paper is the canonical citation for the broader claim that compliance theatre and dark patterns can coexist inside the same banner, and that an entire industry of Consent Management Platforms grew up after May 2018 selling "optimisation" for accept rates rather than informed consent.

Xuất hiện trongNút Cancel Không Có Ở Đó

Software

From Local to Global: A Graph RAG Approach to Query-Focused SummarizationPaper

Edge, Darren; Trinh, Ha; Cheng, Newman; et al. · 2024

Microsoft Research. The paper describes GraphRAG, an architecture that addresses the systematic failure of naive RAG on global synthesis questions — those requiring a view across an entire corpus rather than retrieval of a specific passage. The pipeline extracts entities and relationships from the corpus via LLM calls, builds a knowledge graph, runs community detection (Leiden algorithm) to organize the graph into a topic hierarchy at multiple resolutions, and generates community reports at each level. Retrieval operates in two modes: local search traverses around entities mentioned in the question; global search synthesizes across community reports to answer questions that require a full-corpus view. The paper is the canonical public reference for the architecture and its trade-offs: strong improvement on global synthesis benchmarks, substantial indexing cost from LLM-based extraction, and negligible benefit over naive RAG on simple lookup questions.

Xuất hiện trongGraph làm trí nhớ có cấu trúc cho LLM

Software

Knowledge Graph-Augmented RAG for Customer SupportArticle

LinkedIn Engineering · 2024

LinkedIn engineering post on applying a structured graph layer to historical customer support tickets. Rather than building a full knowledge graph from the corpus, the team structured each ticket into a small sub-graph with three node types — Issue (the problem the user reported), Cause (the identified technical cause), Fix (the confirmed resolution) — connected by typed edges: CAUSED_BY, RESOLVED_BY, HAS_WORKAROUND. Retrieval finds the nearest Issue node via vector search, then traverses the graph to return Cause and Fix as a complete context in a single response. The case demonstrates the architecture spectrum in practice: domain-specific entity-linking and traversal is sufficient when the question type is well-defined — a full knowledge graph with community detection is not always required. Source details unverified at time of writing; confirm post URL and accuracy of Issue-Cause-Fix schema before citing as primary source.

Xuất hiện trongGraph làm trí nhớ có cấu trúc cho LLM

Software

Google Knowledge Graph and FreebaseArticle

Google · 2012

Google acquired Freebase in 2010 and launched the Knowledge Graph in May 2012, seeded from Freebase, Wikipedia infoboxes, and semantic web triples curated by editorial staff. Freebase was wound down in 2015 and its data partially migrated to Wikidata. The chapter cites this as the canonical example of first-generation knowledge graph construction: editorially curated, high-trust, scope-limited by what a human team could process. The defining characteristic of this generation's error profile is visibility — what is absent is absent by design, so 'not found' is a known state rather than a silent one.

Xuất hiện trongLLM làm công cụ xây graph, và bài toán graph sai âm thầm

Software

AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of TypesPaper

Zhu, Yunyao; et al., Amazon Research · 2020

Published at KDD 2020. AutoKnow automatically extracts structured product knowledge from three complementary sources — seller descriptions, customer Q&A, and customer reviews — constrained by Amazon's product taxonomy acting as the extraction schema. Entity resolution merges the same physical product across multiple seller representations. Human review is applied selectively: high-confidence, low-impact extractions are accepted automatically; low-confidence or high-impact claims (especially compatibility assertions) are routed to human spot-check. The chapter cites the human-allocation framework as the key lesson: the question is not whether to automate but where to spend a fixed human review budget to remove the most consequential errors.

Xuất hiện trongLLM làm công cụ xây graph, và bài toán graph sai âm thầm

Software

The State of AIReport

McKinsey Global Institute · 2024

Annual McKinsey report tracking enterprise AI adoption rates, project outcomes, and the gap between AI experimentation and production deployment. Consistently documents the discrepancy between the number of AI projects initiated and the number that reach sustained operational infrastructure.

Xuất hiện trongProduction-grade hay prototype mãi mãi

Software

Gartner Hype Cycle for Artificial IntelligenceReport

Gartner · 2024

Annual Gartner research tracking the maturity and adoption trajectory of AI technologies. Distinguishes technologies at the peak of inflated expectations from those reaching the plateau of productivity, providing an industry-acknowledged framework for assessing whether a technique is proven, maturing, or still speculative.

Xuất hiện trongProduction-grade hay prototype mãi mãi

Software

Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL TaskPaper

Yu, Tao; et al., Yale University · 2018

Published at EMNLP 2018. Spider introduced a cross-domain, complex text-to-SQL benchmark requiring models to generalize across 200 databases covering 138 domains, becoming the dominant benchmark for measuring natural language to SQL translation quality. The chapter cites it to illustrate the gap between benchmark performance on clean schemas and production performance on schemas with exceptions, aliases, and edge cases that accumulate in real deployment history.

Xuất hiện trongProduction-grade hay prototype mãi mãi

Software

Được cập nhật khi có thêm bài viết mới.