@francoisfleuret Have you tried to get some of these models to reproduce incorrect student answers on exams? It would be fascinating to see the overlap on specific problems.
@janbromberger@lioninawhat@jamierusso Quite a few different elements: war/peace-time ceo roles, building great teams from imperfect pieces, firing and all the anecdotes to give everything credibility and context
ChatGPT to me seems to weld an English professor’s writing skills to an encyclopedia’s knowledge base and a kindergartener’s reasoning ability, producing impressive and highly polished nonsense. The screen is on - at 8K 120Hz - but nobody’s at the keyboard.
@betatim In my experience, it's often a good thing because it serves as a warning that "hey it's not quite as simple as you think" I keep a large collection of declined PRs at the ready when someone asks why don't we just upgrade numpy or switch to py3.11
@betatim It's quite hard to reliably measure unless it's running on isolated dedicated hardware like GPU and even then the standard deviations can be enormous