Aseem Raj Baranwal @aseemrb - Twitter Profile

Aseem Raj Baranwal @aseemrb

2 months ago

The Bloody Origins of the Number Zero https://t.co/X2lIzy79i9 via @YouTube

0

41

aseemrb retweeted

Gaurav Sahu

@dem_fier

3 months ago

wanted to make a few clarifications on openleaf as there’s a lot of love from people (thanks❤️!) but also some misunderstanding:

1. "this encourages blind citation" -- openleaf links every suggested paper for a reason. you're supposed to read it before citing (the paper link is right there). it's a discovery tool, and most def not a "cite for me" button. also, its ranking is purely content-based -- no citation count, no popularity metrics -- specifically to avoid unfair concentration of citations to a select few papers/institutions.

2. "if you do your lit search after writing a paragraph, you're doing it wrong" -- agree! but the demo showed a simplified flow. the real use case: you've read 20 papers, but there are 1000s published monthly. you will miss relevant ones. openleaf helps you find them.

already working on improvements to make it even better:
- reading your existing .bib so it's aware of what you already cite
- analyzing full paper text, not just abstracts
- better reasoning

track progress, suggest features, or pick up an issue! https://t.co/xs1pAhN0so

0

34

4

23

8K

aseemrb retweeted

Kimon Fountoulakis

@kfountou

8 months ago

I am hiring one PhD student.

Subject: Reasoning and AI, with a focus on computational learning for long reasoning processes such as automated theorem proving and the learnability of algorithmic tasks.

Preferred background: A mathematics student interested in transitioning to computer science and machine learning. However, I will also consider engineering and computer science students with a strong mathematical background.

13

505

93

250

39K

aseemrb retweeted

Kimon Fountoulakis

@kfountou

8 months ago

On the Statistical Query Complexity of Learning Semiautomata: a Random Walk Approach

Work with @ggiapitz, @EshaanNichani and @jasondeanlee.

We prove the first SQ hardness result for learning semiautomata under the uniform distribution over input words and initial states, without relying on parity gadgets or adversarial inputs. The hardness is structural: it arises purely from the transition structure, not from hard languages.

We show that SQ hardness can be established when both the alphabet size and input length are polynomial in the number of states.
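
For readers unfamiliar with the setting, a semiautomaton is a finite set of states with a transition function and no accepting states. The sketch below is illustrative only: it draws a random transition table, a uniform initial state, and a uniform input word, then runs the machine. The names `random_semiautomaton` and `run`, the random choice of transitions, and the sizes are assumptions made for this example and are not taken from the paper.

```python
import random

def random_semiautomaton(num_states, alphabet_size, rng):
    """Return a transition table delta[state][symbol] -> next state.

    Entries are drawn uniformly at random. This is just one way to get
    an example machine, not the construction used in the paper.
    """
    return [[rng.randrange(num_states) for _ in range(alphabet_size)]
            for _ in range(num_states)]

def run(delta, start_state, word):
    """Feed `word` (a sequence of symbol indices) through the semiautomaton."""
    state = start_state
    for symbol in word:
        state = delta[state][symbol]
    return state

rng = random.Random(0)
num_states, alphabet_size, word_length = 5, 3, 8  # hypothetical sizes
delta = random_semiautomaton(num_states, alphabet_size, rng)

# Uniform initial state and uniform input word, matching the input
# distribution described in the tweet.
start = rng.randrange(num_states)
word = [rng.randrange(alphabet_size) for _ in range(word_length)]
final_state = run(delta, start, word)
print(f"start={start}, word={word}, final={final_state}")
```

Reading a random word from a random start state traces a walk over the states, which is presumably the sense of "random walk" in the title; the learning problem is to predict where such walks end without knowing `delta`.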

2

59

12

42

59K

aseemrb retweeted

Kimon Fountoulakis

@kfountou

10 months ago

Waterloo Computational Learning Lab it is: https://t.co/HzPiVnnL6m! I rebranded our lab after six years to better reflect the work we do and will continue to do in the future.

0

8

1

6

675

aseemrb retweeted

Sushant Agarwal @_sushantagarwal

11 months ago

Presenting "Optimal Fair Learning Robust to Adversarial Distribution Shift" at #ICML2025 (https://t.co/LhPEbnjNS8)

📍East Exhibition Hall A-B #E-1001
⏲️16th July, 4:30-7PM

Please have a look, and do stop by if it sounds interesting to you!
RTs appreciated 😊 Summary to follow.

1

17

7

1

832

aseemrb retweeted

Kimon Fountoulakis

@kfountou

11 months ago

My former PhD student, Aseem Baranwal, won the PhD Dissertation Award from the Department of Computer Science at the University of Waterloo for his thesis, “Statistical Foundations for Learning on Graphs.” Aseem is the first PhD student I graduated, and I couldn't be happier for him.

8

230

12

91

19K

aseemrb retweeted

Sebastien Bubeck

@SebastienBubeck

over 1 year ago

Enjoy! https://t.co/JX1iSALRWb

92

4K

395

1K

631K

Aseem Raj Baranwal @aseemrb

over 1 year ago

@kfountou This classifier is implementable using a message-passing GNN and is the best of both worlds (an MLP for a noisy graph and a GCN for an informative graph) across the range of SNR in the edges/features on synthetic data. Work is pending to make it scalable for use on real data.

0

1

0

163

Aseem Raj Baranwal @aseemrb

over 1 year ago

My PhD thesis is now available on UWspace: https://t.co/YrdI3Nupjq. Thanks to my advisors @kfountou and Aukosh Jagannath for their support throughout my PhD. We introduce a statistical perspective for node classification problems. Brief details are below.

3

8

2

525

Aseem Raj Baranwal @aseemrb

over 1 year ago

@kfountou Following these analyses, we define a precise notion of Bayes optimality for node classification problems and compute the optimal classifier for arbitrary distributions of the node features and edge connectivity.

0

1

0

134

Aseem Raj Baranwal @aseemrb

over 1 year ago

@kfountou We analyze GNNs from this statistical perspective. For GCN architectures, we isolate the convolutions from the layers to understand their variance-reduction effect on the data. For GAT, we identify regimes of the SNR of the node features where attention helps or does not help.
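
The variance-reduction effect of a graph convolution can be seen in a toy experiment. The sketch below is only an illustration under assumed settings: a two-class random graph, one-dimensional Gaussian features, and mean aggregation over each neighborhood. The model, the parameter values, and the helper `class_variance` are assumptions for this example and do not reproduce the analysis in the thesis.

```python
import random
import statistics

rng = random.Random(0)
n, mu, sigma = 400, 1.0, 2.0   # hypothetical size, signal, and noise levels
p_in, p_out = 0.5, 0.05        # hypothetical intra/inter-class edge probabilities

labels = [i % 2 for i in range(n)]
# One-dimensional features: the class mean is -mu or +mu, plus Gaussian noise.
x = [rng.gauss(mu if y else -mu, sigma) for y in labels]

# Sample an undirected two-block random graph; every node has a self-loop.
neighbors = [[i] for i in range(n)]
for i in range(n):
    for j in range(i + 1, n):
        p = p_in if labels[i] == labels[j] else p_out
        if rng.random() < p:
            neighbors[i].append(j)
            neighbors[j].append(i)

# One graph convolution with mean aggregation over each neighborhood.
x_conv = [sum(x[j] for j in nbrs) / len(nbrs) for nbrs in neighbors]

def class_variance(values, cls):
    """Population variance of the values belonging to one class."""
    return statistics.pvariance([v for v, y in zip(values, labels) if y == cls])

var_before = class_variance(x, 1)
var_after = class_variance(x_conv, 1)
print(f"class-1 variance before: {var_before:.3f}, after: {var_after:.3f}")
```

Because most neighbors share the node's class in this setup, averaging keeps the class means apart while shrinking the spread around them, which is the intuition behind studying the convolution separately from the layers.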

0

104

aseemrb retweeted

Kimon Fountoulakis

@kfountou

over 1 year ago

Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning

We propose calculating the attention weights in Transformers using only fixed positional encodings (referred to as positional attention). These positional encodings remain the same across layers, and no other data is used to compute attention weights. We call this architecture positional Transformer; an illustration is shown in the attached figure.

Contribution 1: We show that positional Transformer achieves an average improvement of 1000x (ranging from 400x to 3000x) in out-of-distribution (OOD) value generalization (informally defined below) compared to traditional Transformers during end-to-end training on various algorithmic tasks.

Value generalization describes an OOD setting where the input lengths remain the same, but the values in the test set differ in magnitude or are larger than those seen during training. This is particularly important because, when learning to solve an algorithmic task, the model is expected to perform the task across a range of numbers that it may not have encountered during training. It also serves as an indication that the model is truly learning to solve the underlying problem.

We present OOD generalization results on five tasks: cumulative sum, cumulative minimum, cumulative median, sorting, and cumulative maximum sum subarray. Our results are compared to standard self-attention and various configurations of traditional Transformers, and different positional encodings.

Contribution 2: We prove that positional Transformers can simulate any algorithm defined in a parallel computation model.

Our motivation for positional attention for algorithmic reasoning stems from the following two facts. First, many problems are solved by parallel algorithms, where data communication does not depend on the values of the data but solely on the positions (IDs) of each machine. Second, it is also known that Transformers can function as parallel computers, with data communication managed through attention mechanisms.

arXiv link: https://t.co/X1TljZy31W
code: https://t.co/V9UuAA1ihv
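
The core idea, attention weights that depend only on positions, can be sketched in a few lines. This is a minimal single-head illustration, not the authors' implementation (their code is linked above): the one-hot positional encodings, the random projection matrices, the absence of an MLP or residual connections, and all function names are assumptions made for this example.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax_rows(m):
    """Apply a numerically stable softmax to every row."""
    out = []
    for row in m:
        shift = max(row)
        exps = [math.exp(v - shift) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out

def random_matrix(rows, cols, rng):
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]

def positional_attention_weights(pos, w_q, w_k):
    """Queries and keys come from the positional encodings only, never from x."""
    q, k = matmul(pos, w_q), matmul(pos, w_k)
    scale = math.sqrt(len(w_q[0]))
    scores = [[sum(a * b for a, b in zip(qi, kj)) / scale for kj in k] for qi in q]
    return softmax_rows(scores)

def positional_attention(x, pos, w_q, w_k, w_v):
    """Mix the value projections of x using position-only attention weights."""
    weights = positional_attention_weights(pos, w_q, w_k)
    return matmul(weights, matmul(x, w_v))

rng = random.Random(0)
n, d_k, d_model = 4, 2, 3  # hypothetical sequence length and widths
pos = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]  # one-hot
w_q, w_k = random_matrix(n, d_k, rng), random_matrix(n, d_k, rng)
w_v = random_matrix(d_model, d_model, rng)

x = random_matrix(n, d_model, rng)
x_large = [[10.0 * v for v in row] for row in x]  # same length, larger values

weights = positional_attention_weights(pos, w_q, w_k)
out = positional_attention(x, pos, w_q, w_k, w_v)
out_large = positional_attention(x_large, pos, w_q, w_k, w_v)
print(weights[0])
```

In this single-layer sketch the weights are the same for `x` and `x_large`, so scaling the input by 10 scales the output by 10. That is only a property of this toy layer and is not a claim about how the full architecture achieves value generalization.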

10

301

59

238

33K

aseemrb retweeted

Kimon Fountoulakis

@kfountou

over 1 year ago

This paper was just accepted at NeurIPS. I am particularly happy about this because it originated as a course project by Robert in our Graph Neural Networks course.

1

32

3

4

4K

aseemrb retweeted

Kimon Fountoulakis

@kfountou

almost 2 years ago

If you are at #ICML2024 and interested in the theory of graph neural networks, come by our poster 'Graph Attention Retrospective.'

conference link: https://t.co/ZOURgL7Jmz
paper: https://t.co/sB7nHPIWZd
relevant blog: https://t.co/4rq6vkQFm0

1

40

8

15

3K

aseemrb retweeted

Kimon Fountoulakis

@kfountou

almost 2 years ago

I guess that as of today I can also announce that I have been promoted to the rank of Associate Professor. I am mostly making this post to publicly thank all the people who have supported me during my career, especially my first two PhD students, @aseemrb and @shenghao_yang (in alphabetical order), for the great work we have done, the numerous lengthy meetings and discussions, and their devotion to learning.

6

42

3

2

5K

aseemrb retweeted

Artur @backdeluca

almost 2 years ago

For those participating in the Complex Networks in Banking and Finance Workshop, I’ll be presenting our work on Local Graph Clustering with Noisy Labels tomorrow at 9:20 AM EDT at the Fields Institute. Hope to see you there :) https://t.co/hzXIlTyKWt

0

4

0

574

Aseem Raj Baranwal @aseemrb

almost 2 years ago

@PIBHomeAffairs @narendramodi @AmitShah Thanks for the great initiative! But unfortunately, it doesn't work for me currently. My Indian passport has a Canadian address (enforced by the Indian consulate) and the portal allows only Indian addresses.

0

18

aseemrb retweeted

Kimon Fountoulakis

@kfountou

about 2 years ago

Paper: Simulation of Graph Algorithms with Looped Transformers (revised version @icmlconf) + Multi-tasking (Remark 6.5) + Discussion on the role of ill-conditioning for the ability of Transformers to simulate algorithms. Link: https://t.co/vPqqIKPPG5

1

23

4

13

4K

aseemrb retweeted

Kimon Fountoulakis

@kfountou

about 2 years ago

Paper: Analysis of Corrected Graph Convolutions

We study the performance of a vanilla graph convolution from which we remove the principal eigenvector to avoid oversmoothing.

1) We perform a spectral analysis for k rounds of corrected graph convolutions, and we provide results for partial and exact classification.

2) For partial classification, we show that each round of convolution can reduce the misclassification error exponentially up to a saturation level, after which performance does not worsen.

3) For exact classification, we show that the separability threshold can be improved exponentially up to O(log n/log log n) corrected convolutions.

link: https://t.co/8Mb1ZbBKun

P.S.: That's the first paper produced as part of the graduate course (CS886, 2024) on graph neural networks that I am teaching!
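
As a rough illustration of what "removing the principal eigenvector" can look like, the sketch below applies k rounds of a corrected convolution to scalar node features. It assumes the symmetrically normalized adjacency D^{-1/2} A D^{-1/2}, whose top eigenvector on a connected graph is proportional to the square roots of the degrees; the paper may use a different normalization, and the function names and the small example graph are assumptions for this example.

```python
import math

def normalized_adjacency(adj):
    """Symmetric normalization D^{-1/2} A D^{-1/2} of a dense adjacency matrix."""
    deg = [sum(row) for row in adj]
    n = len(adj)
    return [[adj[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def principal_eigenvector(adj):
    """Unit vector proportional to sqrt(degree).

    For a connected graph this is the top eigenvector (eigenvalue 1)
    of the symmetrically normalized adjacency.
    """
    root = [math.sqrt(sum(row)) for row in adj]
    norm = math.sqrt(sum(r * r for r in root))
    return [r / norm for r in root]

def corrected_convolution(adj, x, rounds=1):
    """Apply (A_hat - v v^T) to the feature vector x, `rounds` times."""
    a_hat = normalized_adjacency(adj)
    v = principal_eigenvector(adj)
    for _ in range(rounds):
        ax = [sum(a * xi for a, xi in zip(row, x)) for row in a_hat]
        proj = sum(vi * xi for vi, xi in zip(v, x))
        x = [axi - proj * vi for axi, vi in zip(ax, v)]
    return x

# Hypothetical example: two triangles joined by the single edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4), (3, 5), (4, 5)]
n = 6
adj = [[0.0] * n for _ in range(n)]
for i, j in edges:
    adj[i][j] = adj[j][i] = 1.0

x = [1.2, 0.8, 1.1, -0.9, -1.0, -1.3]  # made-up noisy scalar features
v = principal_eigenvector(adj)
x_corrected = corrected_convolution(adj, x, rounds=2)
overlap = sum(vi * xi for vi, xi in zip(v, x_corrected))
print(x_corrected, overlap)
```

After any number of rounds the output has no component along `v`, the direction that repeated vanilla convolutions would otherwise collapse every node onto, which is one way to read the oversmoothing remark above.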

0

22

3

15

6K
