@tszzl If you remove the word "worship" in the first paragraph, which is a conjecture at best, there is nothing surprising there. Many organizations love the product they create, and use AI in some form for screening, performance reviews, and development today.
P1 creates a new repo overnight with 30k LOC. 2 days later, P2 creates a v2 complete with migration guide, -29k / +31k LOC in a single PR. Suggests it's too big to review but works on a few test cases, so they should just switch. P1 refuses to review, agrees to run a few tests and merge. RIP SWE.
TIL that Mandarin has a higher information density per character than English. Interestingly, this might be one reason why LLMs occasionally switch to Mandarin for thinking.
I haven't yet seen a single case of GPT-5 not ending its response with a follow-up question. Reward hacking much? Or is it in the system prompt? #GPT5 #rlhf