What if the hammer has a problem where the handle breaks off randomly? Same thing happens with AI. Sometimes it breaks, randomly, and without any way of predicting it.
Another way to look at it is that the operator of the hammer has an immediate feedback loop and will not continue with a broken hammer. AI as it stands rarely has that feedback on the consequences of its decisions, and lacks the ability to react appropriately.