Nothing went 'wrong' per se. they just massively underestimated how much work it would be.
Stuff was badly documented. The documentation that existed was outdated compared to the existing code. Not all executable matched the source code they were supposedly built from. It was unclear if this was because of old compiler bugs, new compiler bugs or (probably) because the source code in the backups was not the actual source code used to compile the exes and that sourcecode was long since lost to time. etc etc
Thanks for sharing. I wonder where AI-generated codebases land in terms of documentation. I bet it starts out okay or even great, but maybe degrades over time.