Artem Artemev @aptemav - Twitter Profile

Pinned Tweet

over 3 years ago

Check out our work "Memory Safe Computations with XLA compiler" at #NeurIPS2022 (with Yuze An, @dyedgreen, @markvanderwilk). The paper and PR can be found at https://t.co/AzTMGZRxIo and https://t.co/4IXt5hLFPo. The poster is https://t.co/fwrPx1LTOG. Some details in short [1/8]

1

16

5

2

0

Artem Artemev @aptemav

about 3 years ago

@krzysztof_rus @PatrickKidger @ezyang I'm not sure how Enzyme is going to help here, even with MLIR support. User still needs an interface, in some form, to autodiff.

0

78

Artem Artemev @aptemav

over 3 years ago

@markvanderwilk will be at the #NeurIPS2022 presenting the poster. If you are at #NeurIPS pop in and say hello. Thanks! [8/8]

0

Artem Artemev @aptemav

over 3 years ago

Check out our work "Memory Safe Computations with XLA compiler" at #NeurIPS2022 (with Yuze An, @dyedgreen, @markvanderwilk). The paper and PR can be found at https://t.co/AzTMGZRxIo and https://t.co/4IXt5hLFPo. The poster is https://t.co/fwrPx1LTOG. Some details in short [1/8]

1

16

5

2

0

Who to follow

Mark van der Wilk

@markvanderwilk

Associate Professor in Machine Learning at the University of Oxford. Interested in automatic inductive bias selection using Bayesian tools.

Adam Golinski

@adam_golinski

ML research @Apple, prev @OxCSML @InfAtEd, part of @MLinPL & @polonium_org 🇵🇱, sometimes funny

Adrià Garriga-Alonso

@AdriGarriga

Funemployed, planning out what to do next. Previously mechanistic interpretability and friendly AI research at FAR AI (@farairesearch).

Artem Artemev @aptemav

over 3 years ago

We also applied eXLA to the language transformer model, and in the experiment we modified the sequence length which in turn controls the size of the self-attention block. Out of the box TF implementation fails with OOM with lengths more than 2k, and eXLA runs up to 7k. [7/8]

aptemav's tweet photo. We also applied eXLA to the language transformer model, and in the experiment we modified the sequence length which in turn controls the size of the self-attention block. Out of the box TF implementation fails with OOM with lengths more than 2k, and eXLA runs up to 7k. [7/8] https://t.co/N9djd6BJsZ

1

0

aptemav retweeted

Alexander Terenin @avt_im

over 3 years ago

When working with a Gaussian process, have you ever wondered why Cholesky factorization failed, or a CG solve did not converge? Answer: it's because you've got redundant, overlapping data points. And that's just the starting point! On arXiv now! https://t.co/L9gUFxk4HM

2

150

14

61

0

aptemav retweeted

Stat.ML Papers @StatMLPapers

over 4 years ago

Wide Mean-Field Bayesian Neural Networks Ignore the Data. (arXiv:2202.11670v1 [cs.LG]) https://t.co/aDaNlaF66J

0

13

4

1

0

aptemav retweeted

Mark van der Wilk @markvanderwilk

over 4 years ago

I am still welcoming PhD applicants for 2022 at Imperial College London. We are a growing research group, with clear goals on what new abilities we want to develop in ML and neural networks. Topics: Invariances, neural arch search, (Bayesian) model selection, Gaussian processes.

8

426

128

65

0

aptemav retweeted

Vincent Dutordoir @vdutor

over 4 years ago

We are organizing a small-scale, offline #NeurIPS2021 satellite event in Cambridge (UK) on the 8th of December. If you are interested in NeurIPS content and are in the neighborhood, this is your chance to connect with your local machine learning community https://t.co/FJ36rx8dlp

4

114

33

7

0

aptemav retweeted

Mark van der Wilk @markvanderwilk

almost 5 years ago

Join us to discuss Conjugate Gradient based GP approximations! We make training easier by automatically setting approximation parameters like CG tolerance using marginal likelihood bounds. Today 5pm (London) / 9am PDT. Long talk and poster available at https://t.co/zaSS2NzKkX.

2

37

6

4

0

aptemav retweeted

Mark van der Wilk @markvanderwilk

about 5 years ago

Current Conjugate Gradient Gaussian Processes require manual tuning to trade off accuracy and speed. Existing guidelines can give suboptimal results, without clear warnings. Our method tunes automatically, runs fewer CG steps, and performs better: https://t.co/YtnnilK5FT 👇1/6

markvanderwilk's tweet photo. Current Conjugate Gradient Gaussian Processes require manual tuning to trade off accuracy and speed. Existing guidelines can give suboptimal results, without clear warnings.

Our method tunes automatically, runs fewer CG steps, and performs better: https://t.co/YtnnilK5FT 👇1/6 https://t.co/ql0e5naCej

1

61

7

11

0

aptemav retweeted

Mark van der Wilk @markvanderwilk

about 5 years ago

I'm looking forward to speaking tomorrow. I will share some thoughts on: - How Gaussian processes can help deep learning - Recent work on accurate GP inference - What makes a method "exact", and to what extent recent methods live up to this Link below if you want to join!

2

60

4

2

0

aptemav retweeted

Mark van der Wilk @markvanderwilk

over 5 years ago

Tomorrow 10 Dec at 11am GMT I will speak at the Bayesian Deep Learning Meetup about **Bayesian Model Selection** and how it can help architecture search. In a short 20 minutes we will discuss why we (Bayesians ∪ Deep Learners) should care, and approaches from now and the past.

markvanderwilk's tweet photo. Tomorrow 10 Dec at 11am GMT I will speak at the Bayesian Deep Learning Meetup about **Bayesian Model Selection** and how it can help architecture search.

In a short 20 minutes we will discuss why we (Bayesians ∪ Deep Learners) should care, and approaches from now and the past. https://t.co/rKGqRCM8vd

3

208

24

35

0

aptemav retweeted

Vincent Adam @vincentadam87

almost 6 years ago

Come and chat with the authors of our paper: Doubly sparse variational gaussian processes! https://t.co/8kcWz9Uv73 #AISTATS2020 @stefanos_ele @aptemav @NicolasDurrande @jameshensman @PROWLER_IO We are more friendly than we look in the video ;)

0

22

5

1

0

aptemav retweeted

Arno Solin @arnosolin

almost 6 years ago

Come and work with me. I'm recruiting doctoral students (see https://t.co/zs8td4KXMl, DL Aug 17) and post-docs (see https://t.co/kTUp0oZDpo) @CSAalto

2

73

21

15

0

aptemav retweeted

Arno Solin @arnosolin

almost 6 years ago

My #ICML2020 tutorial videos on "Machine Learning with Signal Processing" are now freely available: I: https://t.co/FmXlqrlZC5 II: https://t.co/Gxr1dZLzSb III: https://t.co/NaWrZD3ZFY IV: https://t.co/wJMUcpKsVJ Slides: https://t.co/kTUp0oZDpo

arnosolin's tweet photo. My #ICML2020 tutorial videos on "Machine Learning with Signal Processing" are now freely available:
I: https://t.co/FmXlqrlZC5
II: https://t.co/Gxr1dZLzSb
III: https://t.co/NaWrZD3ZFY
IV: https://t.co/wJMUcpKsVJ
Slides: https://t.co/kTUp0oZDpo https://t.co/3t0F2CKuzp

0

256

68

62

0

Artem Artemev

@aptemav

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users