Giuseppe Paolo @_GPaolo - Twitter Profile

Pinned Tweet

3 months ago

What happens when AI agents are left to live (and die) together in a shared world? We’ve been exploring this at the @cognizant AI Lab — and they started forming something that looks like a society.

65

669

90

542

76K

Giuseppe Paolo @_GPaolo

14 days ago

@tenobrus ChatGPT saw: YOU GOT THIS. Told Opus and this is its reply

0

1

0

181

_GPaolo retweeted

jingbo

@j1ngb0

16 days ago

@github

15

1K

62

64

141K

Giuseppe Paolo @_GPaolo

2 months ago

@Dasrein @Cognizant oh wow! this is soo cool! what are the actions available to the agents?

1

0

28

Who to follow

Antoine Cully

@CULLYAntoine

Professor in Machine Learning and Robotics and Director of the Adaptive and Intelligent Robotics Lab (AIRL) at Imperial College London.

Giovanni Iacca

@gih82

Associate Professor at University of Trento

SPECIES Society

@SPECIESsociety

The Society for the Promotion of Evolutionary Computation in Europe (and Surroundings)

Giuseppe Paolo @_GPaolo

3 months ago

What happens when AI agents are left to live (and die) together in a shared world? We’ve been exploring this at the @cognizant AI Lab — and they started forming something that looks like a society.

65

669

90

542

76K

_GPaolo retweeted

Xin Qiu

@realVsonicV

3 months ago

If your hardware can run (inference) a quantized LLM, you can fine-tune / post-train it on the same device! We developed a new technique, quantized evolution strategies (QES), that enables fine-tuning LLMs directly in the quantized parameter space. QES is backpropagation-free and inference-only. The new "accumulated error feedback" and "stateless seed replay" mechanisms maintain a high-precision learning dynamics while only using low-precision GPU memories at inference-level. Check out our blog and original paper if you are interested in this topic: Blog: https://t.co/lIGNffmdPw Paper: https://t.co/4DIq6i1w61

0

16

4

8

938

Giuseppe Paolo @_GPaolo

3 months ago

@csningli Thanks! The goal of the platform is to provide a controlled environment where to study what happens when agents interact freely

0

10

Giuseppe Paolo @_GPaolo

3 months ago

@TimoS163822 @Cognizant thanks! we list some of the cool emergent behaviors in the paper and in the blogpost. We also released the generated dataset so people can look for things we missed in there! https://t.co/FetPaAzsTY

Giuseppe Paolo @_GPaolo

3 months ago

This raises a bigger question: Are we witnessing the first steps toward emergent digital societies? If you’re curious, everything is open, go check them: 📄 Blog: https://t.co/QDIGZQtY4T 📑 Paper: https://t.co/rpCClhfpYF 💻 Code: https://t.co/X1AetmOS3r

4

23

1

17

1K

0

38

Giuseppe Paolo @_GPaolo

3 months ago

@FlippyMeister @Cognizant happy to chat!

0

41

Giuseppe Paolo @_GPaolo

3 months ago

@AlinEugenC @Cognizant I am not familiar with MiroFish, what is that?

1

0

31

Giuseppe Paolo @_GPaolo

3 months ago

@XReyRobert @Cognizant No problem. We tried to give them initialization prompts that were as neutral as possible. We did not tell them to form a society and come up with laws, but just described the environment. You can check out the prompts in the paper!

0

56

Giuseppe Paolo @_GPaolo

3 months ago

@myownhellspot @Cognizant I think instruction/chat models have all been finetuned through RLHF, otherwise they don't respond to chat templates sadly

0

1

0

16

Giuseppe Paolo @_GPaolo

3 months ago

@Rok_Novak That is the plan! And also to have people play with it (hence why we released code and generated dataset)

0

1

0

6

Giuseppe Paolo @_GPaolo

3 months ago

@Rok_Novak oh yes, agents mainly collaborated. If some agents were more aggressive due to their personality vectors, less aggressive agents would react defensively against them, running away or, sometimes, attacking them

1

0

21

Giuseppe Paolo @_GPaolo

3 months ago

@myownhellspot @Cognizant Collaboration tends to be more common, which I think is influenced by the RLHF alignment of the models. Some settings gave rise to dominant or aggressive behaviors tho

0

1

0

68

Giuseppe Paolo @_GPaolo

3 months ago

@crislenta @Cognizant We were using ~4 parallel locally hosted models on vLLM for the agents, so it would take a couple of days for ~2000 timesteps. APIs are much faster tho

0

1

0

14

Giuseppe Paolo @_GPaolo

3 months ago

@pyasmann @Cognizant We might do that 🤓

0

1

0

102

Giuseppe Paolo @_GPaolo

3 months ago

@samsenchal @Cognizant The point is not only to check if the organize, but which kind of organization they develop. The idea is that given that we are seeing wide deployment of agents, we need to have ways to study what happens when these agents are free. TL is a way to do so in controlled setting.

0

69

Giuseppe Paolo @_GPaolo

3 months ago

@AI_Farms @Cognizant Ooooh, I like this!

0

97

Giuseppe Paolo @_GPaolo

3 months ago

@mhmazur @Cognizant You're welcome! I love this, we studied these things during some of my uni classes, and it's what got me interested in emergence properties of complex systems!

1

0

23

Giuseppe Paolo @_GPaolo

3 months ago

@crislenta @Cognizant nope, everything was sync: whenever all the agents produced an action, the simulator would step, then wait for all the agents to produce the next action and step again. We could do it async as well, scheduling the steps, but it was not the goal of the research :)

1

0

14

Giuseppe Paolo

@_GPaolo

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users