Ismail Elezi

@Ismail_Elezi

Principal Research Scientist at @Huawei Noah Ark in London (before that @TUM @Nvidia). Views are my own.

Joined December 2011

293 Following

509 Followers

610 Posts

Ismail Elezi @Ismail_Elezi

about 2 months ago

Claude is very dishonest, something I have been suspecting for a while. Played versions of this game multiple times, it always cheats: https://t.co/zdSe50YLpH Sleeper agents do not even need to be trained, and alignment is nowher ready to be solved @AnthropicAI

125

Ismail Elezi @Ismail_Elezi

3 months ago

Massive respect to @AnthropicAI for standing by their principles. The entire fate of humanity might be dependent on not doing stupid things required by people who do not know anything about AI.

139

Ismail Elezi @Ismail_Elezi

7 months ago

Did @iclr_conf just break OpenReview?

Ismail Elezi @Ismail_Elezi

11 months ago

Deadline time: 49/56 (87.5%) of reviews submitted. 7/14 papers have 4 reviews. 7/14 papers have 3 reviews. All reviews are atleast ok level. One of the best reviews is from one of the biggest names on the field, which means that yes, you have enough time to write a good review.

185

Who to follow

Pavlo Molchanov

@PavloMolchanov

Director of Research @NVIDIA

JennySeidenschwarz

@JennySeidensch1

PhD Student at Dynamic Vision and Learning Group, Technical University of Munich

Yuki

@y_m_asano

Full Professor and head of Fundamental AI (FunAI) Lab @funailab at University of Technology Nuremberg. previously @UvA_Amsterdam, @Oxford_VGG

Ismail Elezi @Ismail_Elezi

11 months ago

Stats for a first time AC in @NeurIPSConf, while in the last 48h of the reviewing deadline. 27/56 (48%) of reviews submitted 1/14 papers has 4 reviews 3/14 papers have 3 reviews 4/14 papers have 2 reviews 6/14 papers have 1 review

510

Ismail Elezi @Ismail_Elezi

11 months ago

Update: 24 hours to go: 36/56 (64%) of reviews submitted. 2/14 papers have 4 reviews. 5/14 papers have 3 reviews. 6/14 papers have 2 reviews. 1/14 papers have 1 review.

248

Ismail Elezi @Ismail_Elezi

11 months ago

@_xwen_ @NeurIPSConf Down on Chrome, works ok on Edge.

Ismail Elezi @Ismail_Elezi

11 months ago

@t_gaintseva @NeurIPSConf First-time AC.

Ismail_Elezi retweeted

Xin Wen @_xwen_

12 months ago

Can we decouple semantics from spectrum for image tokenizers? Our answer: the 1D Semanticist tokenizer. We push the burden of photo-realistic image generation to diffusion decoders, and let the tokens focus on the semantic structure. The PCA-like structure is induced by nested CFG (dropout), allowing plausible reconstruction & generation with very few tokens, and coarse-to-fine hierarchy. Thanks to semantic-spectrum decoupling, our tokenizer also achieves strong performance on ImageNet linear probing, indicating potential for unified understanding and generation. To check for more details, see you 1-2 pm TODAY at ExHall D, GMCV Workshop! "Principal Components" Enable A New Language of Images Project Page: https://t.co/yzISM0c5xQ Code: https://t.co/L13AH9fQKs

_xwen_'s tweet photo. Can we decouple semantics from spectrum for image tokenizers? Our answer: the 1D Semanticist tokenizer.

We push the burden of photo-realistic image generation to diffusion decoders, and let the tokens focus on the semantic structure. The PCA-like structure is induced by nested CFG (dropout), allowing plausible reconstruction & generation with very few tokens, and coarse-to-fine hierarchy.

Thanks to semantic-spectrum decoupling, our tokenizer also achieves strong performance on ImageNet linear probing, indicating potential for unified understanding and generation.

To check for more details, see you 1-2 pm TODAY at ExHall D, GMCV Workshop!

"Principal Components" Enable A New Language of Images
Project Page: https://t.co/yzISM0c5xQ
Code: https://t.co/L13AH9fQKs

18K

Ismail Elezi @Ismail_Elezi

about 1 year ago

Really cool project with a simple general solution, while reaching SOTA results, lead by @t_gaintseva.

Tatiana Gaintseva

@t_gaintseva

about 1 year ago

A new paper is out! CASteer: Steering Diffusion Models for Controllable Generation Arxiv link: https://t.co/8KZNrgPYZK Code: https://t.co/CusMTpcGwV Diffusion models are powerful, but their generation process can be difficult to control, which poses safety risks (e.g., generating images with nudity/violence). There are many ideas on how to address this, but most of the existing approaches are limited in what issues they can handle and often require additional training. CASteer, on the other hand, is capable of handling broad range of tasks, while being completely training-free! CASteer works by constructing a special steering vector for each cross-attention layer in a diffusion model using prompt pairs that capture dspecific concepts. By adding or subtracting these vectors from the outputs of cross-attention layers during inference, we gain fine-grained control over the entire generation process. We can build steering vectors for any kind of concept, and this allows for broad range of manipulations over images being generated. We can add/remove objects (e.g., apples), alter abstract attributes (e.g., nudity), do style transfer, identity manipulation (switching Leonrado DiCaprio to Keanu Reevs), concept interpolation (going from cat to giragge), and more (see picture). Simplicity of CASteer allows for easy incorporation of it into most of the modern DMs. I would like to thank ChengCheng Ma and @Ismail_Elezi , who provided invaluable assistance in this project, as well as my university supervisors: Ziquan Liu, Martin Benning, Gregory Slabaugh and Jiankang Deng. Hope to have further great collaborations!

t_gaintseva's tweet photo. A new paper is out! CASteer: Steering Diffusion Models for Controllable Generation
Arxiv link: https://t.co/8KZNrgPYZK
Code: https://t.co/CusMTpcGwV

Diffusion models are powerful, but their generation process can be difficult to control, which poses safety risks (e.g., generating images with nudity/violence). There are many ideas on how to address this, but most of the existing approaches are limited in what issues they can handle and often require additional training. CASteer, on the other hand, is capable of handling broad range of tasks, while being completely training-free!

CASteer works by constructing a special steering vector for each cross-attention layer in a diffusion model using prompt pairs that capture dspecific concepts. By adding or subtracting these vectors from the outputs of cross-attention layers during inference, we gain fine-grained control over the entire generation process. We can build steering vectors for any kind of concept, and this allows for broad range of manipulations over images being generated. We can add/remove objects (e.g., apples), alter abstract attributes (e.g., nudity), do style transfer, identity manipulation (switching Leonrado DiCaprio to Keanu Reevs), concept interpolation (going from cat to giragge), and more (see picture). Simplicity of CASteer allows for easy incorporation of it into most of the modern DMs.

I would like to thank ChengCheng Ma and @Ismail_Elezi , who provided invaluable assistance in this project, as well as my university supervisors: Ziquan Liu, Martin Benning, Gregory Slabaugh and Jiankang Deng. Hope to have further great collaborations!

237

Ismail Elezi @Ismail_Elezi

over 1 year ago

@PrannayKaul @JiankangDeng And on a fun note, 4 years after completing the computer vision holy trinity (CVPR, ICCV, ECCV), finally completed the machine learning conference trinity (NeurIPS, ICML, ICLR).

726

Ismail Elezi @Ismail_Elezi

over 1 year ago

First accepted paper of the year: "From Attention to Activation: Unraveling the Enigmas of Large Language Models" has been accepted to ICLR 2025. The most educative paper I have co-wrote, it strengthens some claims known in the community, it opposes others, ...

Ismail_Elezi's tweet photo. First accepted paper of the year: "From Attention to Activation: Unraveling the Enigmas of Large Language Models" has been accepted to ICLR 2025. The most educative paper I have co-wrote, it strengthens some claims known in the community, it opposes others, ... https://t.co/zk7t9zayXQ

Ismail Elezi @Ismail_Elezi

over 1 year ago

@PrannayKaul @JiankangDeng We will update the paper with the latest results, but the findings are identical to the current ArXiV version: https://t.co/EQiCAUsJxk On a personal note, always wanted to visit Singapore and this seems the perfect way to do so.

182

Ismail Elezi @Ismail_Elezi

over 1 year ago

#CVPR has done a very bad job in paper assignment this time for me. Out of 4 papers I am reviewing, only one has something to do with me, and even that is related to 2-4 years old papers I have. Not sure if I was unlucky, or the matching system has gone downhill.

243

Ismail Elezi @Ismail_Elezi

over 1 year ago

Max Tegmark talk at #NeurIPS2024 was funny, obvious, entertaining and concerning at the same time. The best AI talk I have heard in a very long time.

Ismail_Elezi's tweet photo. Max Tegmark talk at #NeurIPS2024 was funny, obvious, entertaining and concerning at the same time. The best AI talk I have heard in a very long time. https://t.co/HvOwQzi8C8

252

Ismail Elezi @Ismail_Elezi

over 1 year ago

We offer long internships (6+ months), competitive salaries, an office in the center of London, and a very diverse group (very gender-balanced, researchers from 8 countries working on a wide range of topics).

Ismail Elezi @Ismail_Elezi

over 1 year ago

I am very happy to attend NeurIPS in Vancouver when together with @Miles12Roy we will be presenting our VeLora paper on Thu 12 Dec 4:30 p.m. PST — 7:30 p.m. PST.

210

Ismail Elezi @Ismail_Elezi

over 1 year ago

have topic match (VLLMs, LLMs, multimodality learning, or diffusion) and are interested in doing an internship at Huawei Research Center in London, please write to me and let’s have a chat in the conference.

129

Ismail Elezi

@Ismail_Elezi

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users