We hope you had a wonderful time at PyCon China 2024.🐍🎉
Huge thanks to all our sponsors, volunteers, partners, and everyone who made this event unforgettable!
See you next year for PyCon China 2025! #PyCon #PyConChina #PyConChina2024
o1 is not an upgrade of gpt-4o; it’s more like a sibling that performs better in “complicated” situations. Now the problem is: how do you identify these “complicated” scenarios automatically?
https://t.co/QNTzFJRqtT provides three more agents: Assistant Gru helps users solve standalone technical issues and is now in public use, Test Gru generates unit test code automatically, and Babel Gru assists in building end-to-end projects.
https://t.co/QNTzFJRqtT ranked first with a score of 45.2% in the latest results released for SWE-bench Verified, a widely used benchmark for evaluating AI models on software engineering tasks, built as a collaboration between the SWE-bench team and OpenAI. #GruAI #OpenAI #SWEBench
Behind our winning score is Bug Fix Gru, an agent designed to automatically fix bugs based on user issues. Here is a video showing how Bug Fix Gru works. https://t.co/ByLZApkXMc
@nexteacc Hello, thank you for your attention!
Babel Gru is still in the lab stage and is not yet publicly available; only a small amount of closed testing has been done so far.
Our score was just officially accepted by SWE-bench, and https://t.co/QNTzFJRqtT received a score of 35.67%! This ranks us first among the teams that provided a trajectory. Well done, Gru!
https://t.co/XaH6t51vQo
#SWEBench
As a developer, you often face a lot of tedious tasks. That's where https://t.co/QNTzFJRqtT comes in to help. Let's check out two examples.
https://t.co/YrGapFW73Y