what good looks like in the wild
The PM had been running the in-app recommendation engine for eight months. It had shipped on time, the infrastructure was solid, and the team had worked hard on it. Usage data showed it was being seen — the widget loaded, the recommendations appeared — but click-through was running at 0.6%, against a modeled 4%. They'd run three rounds of iteration: A/B tested the copy, changed the placement, tried personalization. Nothing moved the number more than a fraction of a percent.
At the six-month mark, quietly, the team stopped talking about the feature in standups. Engineers had moved to other things. The PM kept including it in the roadmap update slides — listed under "optimization in progress." Leadership had stopped asking about it.
The call came at month 8, not from the PM, but from a VP who was cleaning up the quarterly review deck and noticed the widget was still consuming 12% of the backend's personalization budget. "What's the plan here?" The PM explained. The VP said "let's park it." The PM agreed. It was logged as a "strategic deprioritization."
That's not Kill.
Kill looks like this: at month 5, same PM, same evidence. The PM writes a short doc — two paragraphs, not a post-mortem, not a lengthy autopsy. "Here's what we thought would happen. Here's what happened instead. The gap is too large to close with iteration — the underlying assumption about how customers discover content inside the app was wrong. I'm proposing we kill the recommendation engine, move the two engineers to the search refactor, and use the personalization budget to test a different surface. Here's the spec for that test." Sent to the team. Sent to the VP. Posted in the product channel.
Reaction: one person pushed back, asked for another month. The PM said: "I understand the instinct. I had it too. But we set our signal threshold before we started, and we've passed it. If we find evidence in six months that I was wrong about the surface, I'll be the first to propose we revisit." The conversation was over in one thread.
The recommendation engine was off the roadmap by month 6. The search refactor shipped by month 9. Two of the engineers from the original team contributed to it. In the retro, one of them said: "I trusted the call because she made it before she had to."
“That is Kill at L4. The value wasn't in being right. It was in owning the wrong, early, clearly, in writing, in a way that freed the team to build the right thing.