JC Testud @jctestud - Twitter Profile

jctestud retweeted

Noam Brown

@polynoamial

26 days ago

https://t.co/oWqzT12RtZ

78

3K

417

3K

1M

jctestud retweeted

Summer Yue

@summeryue0

3 months ago

🚀 Muse Spark Safety & Preparedness Report for Meta AI is out. We start with our pre-deployment assessment under Meta's Advanced AI Scaling Framework, covering chemical and biological, cybersecurity, and loss of control risks. Our assessment flagged potentially elevated chem/bio risk, so we implemented safeguards and validated mitigations before deployment - bringing residual risk to within acceptable levels. Beyond the Framework, we also share findings and early explorations of model behavior (honesty, intent understanding, etc.), jailbreak robustness, eval awareness, and more. We're sharing this report to give a closer look at how we evaluate advanced AI safety. Always more work to do, and we welcome feedback from the community. https://t.co/azpKHwu7x9

39

447

74

118

287K

jctestud retweeted

Alexandr Wang

@alexandr_wang

3 months ago

1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵

alexandr_wang's tweet photo. 1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵 https://t.co/fThDXdsxwB

752

10K

1K

3K

5M

jctestud retweeted

Joshua Saxe

@joshua_saxe

about 1 year ago

Today our AI security team @ Meta launched open source tools to support the open source GenAI ecosystem, including: - LlamaFirewall; a security-first guardrail framework for mitigating agentic prompt injection, misalignment, and insecure coding risks: https://t.co/lNxQB34jmz

joshua_saxe's tweet photo. Today our AI security team @ Meta launched open source tools to support the open source GenAI ecosystem, including:

- LlamaFirewall; a security-first guardrail framework for mitigating agentic prompt injection, misalignment, and insecure coding risks: https://t.co/lNxQB34jmz https://t.co/gEAw70ClTN

3

129

46

82

14K

Who to follow

Xavier Snelgrove

@wxswxs

Editor of Orbital Studies, a literary science magazine. https://t.co/D1XWarDQbI

Amita Mirajkar

@AmitaMirajkar

Co-Founder & CEO, Clairvoyant India | #Data #Cloud #AI | #TechLeader| #Speaker #Entrepreneur #NatureLover #Healthitarian #MusicLover #LearnerForLife

David Vázquez

@dvazquezcv

AI Research Director @ ServiceNow | Adjunct Prof @ UAB, PolyMTL, MILA & ELLIS | AI agents, multimodal AI | @ ICLR

jctestud retweeted

François Chollet

@fchollet

about 2 years ago

You thought LLM chatbots required a lot of compute? That's cute. It's when fully-generative TikTok/YouTube hits the mainstream that you'll start needing a *lot* of GPUs. Orders of magnitude more compute, both because the medium is more intensive and because the audience will be 5x-10x larger. For now we've barely scratched the surface. AGI doesn't seem to be getting any closer, but the practical applications of scaling up deep learning aren't going to slow down.

36

971

108

205

179K

jctestud retweeted

hardmaru

@hardmaru

about 3 years ago

LIMA, a 65B LLaMa fine-tuned only with supervised learning on 1000 curated examples, without any RLHF, demonstrates remarkably strong performance, generalizes well to unseen tasks not in training data. Comparable to GPT-4, Bard, DaVinc003 in human studies.https://t.co/vNuecWIP5K

21

1K

225

605

591K

jctestud retweeted

Horace He

@cHHillee

over 3 years ago

I suspect GPT-4's performance is influenced by data contamination, at least on Codeforces. Of the easiest problems on Codeforces, it solved 10/10 pre-2021 problems and 0/10 recent problems. This strongly points to contamination. 1/4

cHHillee's tweet photo. I suspect GPT-4's performance is influenced by data contamination, at least on Codeforces.

Of the easiest problems on Codeforces, it solved 10/10 pre-2021 problems and 0/10 recent problems.

This strongly points to contamination.

1/4 https://t.co/wm6yP6AmGx

65

4K

616

776

2M

JC Testud @jctestud

over 5 years ago

Happy 2021!🎇May these covid-safe fake fireworks brighten up your day #theseFireworksDoNotExist #pix2pix Blog post from 2018 👴https://t.co/oxQbjMnuAC

AIxDESIGN @AIxDesignCo

over 5 years ago

Happy 2021 from the AIxD team to you and yours! To a year of connection, creativity, learning, growth, collaboration, and joy 🎆this wish comes accompanied by @jctestud's pix2pix generated fireworks

0

2

0

3

1

0

jctestud retweeted

Ming-Yu Liu

@liu_mingyu

almost 6 years ago

Introducing #Imaginaire a #PyTorch library with optimized implementations of several #GAN image and video synthesis methods developed at #NVIDIA code https://t.co/fWMHEwhS3q video https://t.co/BUkEE7RB3D By @liu_mingyu @tcwang0509 @arunmallya @xunhuang1995

13

1K

396

303

0

jctestud retweeted

Aran Komatsuzaki

@arankomatsuzaki

almost 6 years ago

Life of a paper: 1. Appears on arXiv 1.001. @ak92501 and I tweet 1.002. @lucidrains makes a repo 2. The author tweets 3. Appears on ML subreddit 4. @hardmaru tweets 5. @ykilcher makes a video Aleph-0. Rejected by reviewers for "lack of novelty" 0. Conceived by Jurgen in 90s

13

862

87

48

0

jctestud retweeted

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

about 6 years ago

Unsupervised Translation of Programming Languages. Feed a model with Python, C++, and Java source code from GitHub, and it automatically learns to translate between the 3 languages in a fully unsupervised way. https://t.co/FpUL886KS7 with @MaLachaux @b_roziere @LowikChanussot

GuillaumeLample's tweet photo. Unsupervised Translation of Programming Languages. Feed a model with Python, C++, and Java source code from GitHub, and it automatically learns to translate between the 3 languages in a fully unsupervised way. https://t.co/FpUL886KS7
with @MaLachaux @b_roziere @LowikChanussot https://t.co/1pMMCu40yA

51

3K

963

433

0

jctestud retweeted

Eric Jang

@ericjang11

about 6 years ago

Every once in awhile a paper comes out that makes you breathe a sigh of relief that you don't publish in that field... https://t.co/56heAufhGA "Our results show that when hyperparameters are properly tuned via cross-validation, most methods perform similarly to one another"

ericjang11's tweet photo. Every once in awhile a paper comes out that makes you breathe a sigh of relief that you don't publish in that field...

https://t.co/56heAufhGA

"Our results show that when hyperparameters are properly tuned via cross-validation, most methods perform similarly to one another" https://t.co/bnG1bm265p

31

2K

415

266

0

JC Testud @jctestud

about 6 years ago

poor red points, the discriminator is messing with them

Evgenii Kashin @digitman_

about 6 years ago

Inspired by @minsukkahng amazing works, I also played with visualizations of the GAN training process. And it really can give cool insights on why the GAN works at all. I made a Colab notebook, and you can also try to tune various hyperparameters. https://t.co/fL8O5HdXeu

1

20

6

4

0

JC Testud @jctestud

about 6 years ago

This is super fun, this is the audio equivalent of the cryptic alphabet that vision GANs generate

Prafulla Dhariwal @prafdhar

about 6 years ago

In the early days, the model didnt know English (we didnt show it any lyrics) and so it used to just make words up. Led to some uncanny samples like this one. I love that it gets the spacy vibes of David Bowie! https://t.co/MvJBMy7Tpf

7

68

7

0

2

0

jctestud retweeted

Ari Holtzman

@universeinanegg

about 6 years ago

"You can't learn language from the radio." 📻 Why does NLP keep trying to? In https://t.co/yWEnq5QW9R we argue that physical and social grounding are key because, no matter the architecture, text-only learning doesn't have access to what language is *about* and what it *does*.

14

558

128

100

0

jctestud retweeted

Tim Dettmers

@Tim_Dettmers

about 6 years ago

How can you successfully train transformers on small datasets like PTB and WikiText-2? Are LSTMs better on small datasets? I ran 339 experiments worth 568 GPU hours and came up with some answers. I do not have time to write a blog post, so here a twitter thread instead. 1/n

13

1K

301

314

0

JC Testud @jctestud

over 6 years ago

@elonmusk ...and his friend, LEGO cybertruck

0

JC Testud @jctestud

over 6 years ago

Finding @elonmusk 's cybertruck in #StyleGAN2

1

6

1

0

jctestud retweeted

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

over 6 years ago

Our new paper, Deep Learning for Symbolic Mathematics, is now on arXiv https://t.co/cxAa3upB6h We added *a lot* of new results compared to the original submission. With @f_charton (1/7)

GuillaumeLample's tweet photo. Our new paper, Deep Learning for Symbolic Mathematics, is now on arXiv https://t.co/cxAa3upB6h
We added *a lot* of new results compared to the original submission. With @f_charton (1/7) https://t.co/GrhQRT5WRW

16

2K

505

233

0

jctestud retweeted

Janelle Shane @JanelleCShane

over 6 years ago

You MUST play AI Dungeon 2, a text adventure game run by a neural net. @nickwalton00 built it using @OpenAI's huge GPT-2-1.5B model, and it will respond reasonably to just about anything you try. Such as eating the moon. https://t.co/3ovYyEMWpf

JanelleCShane's tweet photo. You MUST play AI Dungeon 2, a text adventure game run by a neural net.
@nickwalton00 built it using @OpenAI's huge GPT-2-1.5B model, and it will respond reasonably to just about anything you try. Such as eating the moon.
https://t.co/3ovYyEMWpf https://t.co/c4DGJieaAE

57

2K

832

295

0

JC Testud

@jctestud

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users