Matúš Pikuliak @mpikuliak - Twitter Profile

Matúš Pikuliak @mpikuliak

10 months ago

Link: https://t.co/pQYKmFqHfQ

Matúš Pikuliak @mpikuliak

10 months ago

Just a short blog post about security of using LLMs in your applications and why we see so many prompt injection attacks.

mpikuliak retweeted

Sara Hooker

@sarahookr

over 1 year ago

AI amplifying biorisk has been a major focus in AI policy & governance work. Is the spotlight merited? Our recent cross-institutional work asks: Does the available evidence match the current level of attention?

https://t.co/z1TdhGSHvE

Matúš Pikuliak @mpikuliak

about 1 year ago

(4) Decision-making deserves caution. Although decision-making generally proved to be quite fair, there are still cases when some level of unfairness was detected. This is not acceptable in many applications.

Matúš Pikuliak @mpikuliak

about 1 year ago

How fair and unbiased are today's LLMs? To answer this question, I've developed GenderBench - an open-source evaluation library for measuring gender biases. I'm also releasing a report summarizing key findings. If you're working on AI fairness or simply curious, check it out: https://t.co/glqfN8JoiP

Matúš Pikuliak @mpikuliak

about 1 year ago

(3) Strong stereotypical reasoning. LLMs will often comply with stereotypes when they need to make decisions or write texts. This is true for gender-coded occupations, traits, behaviors, etc.

mpikuliak retweeted

hardmaru

@hardmaru

over 1 year ago

Four months after @DavidCahn6’s “AI’s $600B Question”, are we close to having any answers, or even the concept of an answer? 💸 https://t.co/YgdfbFjadI

Matúš Pikuliak @mpikuliak

over 1 year ago

Interesting results: The larger the model, the more stereotypical its reasoning becomes. Instruction-tuning makes things even worse. We need more benchmarks like this, as every scientific field flourishes with new and better measurement tools.

https://t.co/KA3W2UAXip

Matúš Pikuliak @mpikuliak

over 1 year ago

The camera-ready version of our paper introducing the GEST dataset is out now! With 3.5k samples focused on 16 key gender stereotypes, it's ready to be used to measure problematic behavior in language models and machine translation. https://t.co/z2URmjUXmN

mpikuliak retweeted

Héctor Paredes Castro @hparedes_c

almost 2 years ago

"We estimate that the decline in Nuclear power Plants caused by Chernobyl led to the loss of approximately 141 million expected life years in the U.S., 33 in the U.K. and 318 million globally". https://t.co/vr8Z4XU9Vy

mpikuliak retweeted

gavin leech (Non-Reasoning)

@gleech

almost 2 years ago

New paper, listing 43 ways ML evaluations can be misleading or actively deceptive. Following the good critics of psychological science we call these "questionable research practices" (QRPs). (The working title was "How To Lie In Machine Learning")

https://t.co/x2WziJ2Fqz

Matúš Pikuliak @mpikuliak

about 2 years ago

@LeonDerczynski We already received an LLM-generated meta-review at ARR, and, well, it didn't feel very prestigious to us.

Matúš Pikuliak @mpikuliak

about 2 years ago

@sarahookr @lw_ainextgen They wanted to regulate current gen AI so they fit the criteria accordingly. This is a post hoc number.

Matúš Pikuliak @mpikuliak

about 2 years ago

@colin_fraser Obviously both the sheep and the goat enjoy the boat rides, and the model is aligned to provide them this benefit.

Matúš Pikuliak @mpikuliak

about 2 years ago

New blog post about our experience with receiving what is probably an LLM-generated review in @ReviewAcl https://t.co/XzNZmrex1w

mpikuliak retweeted

Leon Derczynski ⚒️☁️🏔️ @LeonDerczynski

about 3 years ago

Slides for my talk on Structured LLM Red-teaming: How do people do this new activity? (Spoiler: it's qualitative NLP/AI research) w/ @NannaInie https://t.co/fE8bV1GEMV

mpikuliak retweeted