AI & Engineering Blog | Celestino Salim

When to Kill an AI Feature: Product Decisions Most Teams Get Wrong

Adding AI to a product is easy. Deciding whether to keep it is hard. Here is the decision framework I use for the AI feature lifecycle: when to add, how to measure, and when to kill.

Mar 30, 2026

product-engineering
ai-strategy

Guardrails Are Not Optional: A Production Safety Implementation Guide

Most teams add guardrails after the first incident. By then, data has leaked or the agent has run up a $2,000 API bill. This is the guide for building guardrails from day one.

Mar 25, 2026

guardrails
safety

Observability for AI Systems: What to Log When Everything Is Probabilistic

Traditional observability was built for deterministic systems. AI systems are probabilistic -- same input, different output. Here is how I built monitoring that actually works for LLM-powered production systems.

Mar 20, 2026

observability
monitoring

Building AI Agents That Actually Work: An Orchestration Playbook

Most agent tutorials show a toy ReAct loop that works on 3 test cases. Production agents need tool boundaries, retry logic, cost caps, and human-in-the-loop checkpoints. This is the playbook I use.

Mar 15, 2026

agents
orchestration

Fine-Tuning vs RAG: The Decision Framework Nobody Talks About

Most teams treat fine-tuning and RAG as alternatives. They are not. They solve different problems and carry different costs, and sometimes you need both. Here is the decision framework I use in production.

Mar 10, 2026

fine-tuning
rag

Postgres Is All You Need: pgvector as Production AI Infrastructure

The vector database market wants you to adopt Pinecone or Weaviate. For most teams, Postgres with pgvector eliminates an entire service from your stack -- and performs within 5% of dedicated solutions at 1M vectors.

Mar 5, 2026

postgres
pgvector

Why Your RAG System Is Bleeding Money (And How to Fix It)

Most RAG prototypes cost $2-5 per query. At 10,000 queries/day, that is $360K/year -- for a single feature. I cut retrieval costs by 99% in production. Here is the four-strategy playbook with real before/after numbers.

Feb 25, 2026

rag
cost-optimization

Evals Are the Unit Tests of AI: A Production Playbook

We don't deploy code without tests. Why are we deploying AI with nothing but gut feelings? Here is the eval harness I use to catch hallucinations before users do -- with code, CI/CD gates, and the reliability flywheel that lifted impressions by 482%.

Feb 20, 2026

evaluation
reliability

The Vendor Off-Ramp: How I Cut $60K/Month in AI Spend Without Rewriting the Stack

Vendor lock-in in AI is existential. One pricing change rewrites your unit economics overnight. Here is the three-layer architecture pattern that makes provider choice a routing decision -- with TypeScript code, real cost breakdowns, and the judgment call on when NOT to abstract.

Feb 15, 2026

architecture
cost-optimization

Systems Thinking for AI Engineers: Why the Model Is Never the Problem

The API times out Thursday night. The model hallucinates a legal citation. The bill arrives at 3x forecast. The problem was never the model -- it was that nobody designed the system. Here is the production engineering mindset that fixes it.

Feb 10, 2026

systems-thinking
architecture

Notes on Exploring & Shipping.
