That's assuming they're using one of the generic models like ChatGPT and not something custom they've created specifically to do this.
Edit: they are in fact using their own as per the article
I'm aware they're not using a generic model, but that's not much better. Current custom-made models still fuck up significantly more than humans, and in less predictable ways.
Even if their custom model is slightly incorrect 1% of the time, that's still a major problem in critical systems like those.
That's assuming they're using one of the generic models like ChatGPT and not something custom they've created specifically to do this.
Edit: they are in fact using their own as per the article
I'm aware they're not using a generic model, but that's not much better. Current custom-made models still fuck up significantly more than humans, and in less predictable ways.
Even if their custom model is slightly incorrect 1% of the time, that's still a major problem in critical systems like those.
Which models are those?