Manish Dhakal @mns_dkl - Twitter Profile

Pinned Tweet

Manish Dhakal @mns_dkl

over 9 years ago

त्यो पारीको फूल बनी बसिराख्नु म एक दिन आउनेछु अनि टप्प टिपि लानेछु। #mns

0

5

0

mns_dkl retweeted

Ruth Hook

@ruth_hook_

4 months ago

meanwhile in science

22

29K

3K

1K

333K

Manish Dhakal @mns_dkl

7 months ago

@jxmnop CVPR conference with 2.5 hrs shift for 500 papers, approx. 18secs/paper skimming time. Just enough to read titles, considering you zero break time. It was good tho that similar papers were clustered together. NIPS is even wilder I guess.

0

1

0

640

Manish Dhakal @mns_dkl

10 months ago

@openreviewnet is there to remind us that "procrastination has consequences" by crashing just before deadlines. 😂

0

44

Who to follow

'(Bibek Panthi)

@bpanthi977

a maths, physics and AI enthusiast; wants to understand and create intelligent systems

Sudeep Bhandari

@sudeephb_

Platform Engineer @northflank | Water Drinker

Supriya Khadka

@SupriyaKhadka1

Rise and Shine ✨ Tech | Books | Movies

Manish Dhakal @mns_dkl

10 months ago

Had a hilarious chat with Gemini while debugging code! 😂 Got a random "Covid-19 Safety Measures" image mid-conversation. Gotta love those overused pretrained data surprises!

mns_dkl's tweet photo. Had a hilarious chat with Gemini while debugging code! 😂 Got a random "Covid-19 Safety Measures" image mid-conversation. Gotta love those overused pretrained data surprises! https://t.co/N0pD7DPTTA

0

2

0

256

Manish Dhakal @mns_dkl

12 months ago

When I tweak 20 parameters in an experiment, my brain goes full debug mode, knowing for sure it will break. Then, aha! It runs. 🚀 That feeling:

0

5

0

216

Manish Dhakal @mns_dkl

about 1 year ago

(1) the reasoning paths learnt via RLVRs are not novel, i.e., they already exist in the base model, (2) RLVRs make the reasoning path narrower, while the base model has wider reasoning paths. Paper Link: https://t.co/p2p2iGOLuD

0

81

Manish Dhakal @mns_dkl

about 1 year ago

A paper that breaks down the shortcomings of RLVRs (e.g., GRPO), which have been the go-to methods for training reasoning models these days. The authors have interesting findings...

mns_dkl's tweet photo. A paper that breaks down the shortcomings of RLVRs (e.g., GRPO), which have been the go-to methods for training reasoning models these days. The authors have interesting findings... https://t.co/TXTTaW7Yqp

1

2

0

217

Manish Dhakal @mns_dkl

about 1 year ago

[2/2] Paper: Learning to Reason without External Rewards ArXiv: https://t.co/oxcHWYX1Sx TL;DR: LLM's confidence score can be used to realize RL; no cost for labeling preference data.

0

1

0

100

Manish Dhakal @mns_dkl

about 1 year ago

[1/n] Here is a fresh paper, Reinforcement Learning with Internal Feed (RLIF), that I found this week. This paper claims that LLMs can use their own confidence score as a reward signal to optimize the preferred outputs, without relying on external rewards or labeled data. TL;DR..

mns_dkl's tweet photo. [1/n]
Here is a fresh paper, Reinforcement Learning with Internal Feed (RLIF), that I found this week. This paper claims that LLMs can use their own confidence score as a reward signal to optimize the preferred outputs, without relying on external rewards or labeled data.
TL;DR.. https://t.co/pUBchg8Ube

1

7

0

1

200

Manish Dhakal @mns_dkl

about 1 year ago

@Dovahkiin_08 Congratulations 🎉👏

0

1

0

31

Manish Dhakal @mns_dkl

about 1 year ago

https://t.co/nPTCSxoa30

0

65

Manish Dhakal @mns_dkl

about 1 year ago

[1/n] An interesting paper called "ICLR" 👀 published at ICLR'25: LLMs can capture semantics in their layers' representations (token features) based on their pretraining data. While LLMs typically reflect the semantics they’ve seen during pretraining, they’re also capable ....

mns_dkl's tweet photo. [1/n]
An interesting paper called "ICLR" 👀 published at ICLR'25:
LLMs can capture semantics in their layers' representations (token features) based on their pretraining data. While LLMs typically reflect the semantics they’ve seen during pretraining, they’re also capable .... https://t.co/ATeCO34jDq

2

5

0

205

Manish Dhakal @mns_dkl

about 1 year ago

[n/n] Their findings show that LLMs do adjust their representations to reflect the same graph. Also, scaling the context (using longer context prompts) helps to make the graph more refined.

0

1

0

59

Manish Dhakal @mns_dkl

about 1 year ago

[2/n] of in-context learning — meaning they can pick up new context from the input prompt itself. Authors investigated how these newly introduced, unseen contexts affect the model’s internal representation structure. To study this, they designed a toy graph tracing experiment.

1

0

68

Manish Dhakal @mns_dkl

over 1 year ago

Understanding tensors' shape is the biggest key to manipulating deep learning models. print(X.shape) is enough.

0

1

0

111

Manish Dhakal @mns_dkl

over 1 year ago

A few days ago, my phone alerted me to bad weather, urging caution and advising against long-distance travel. If only Nepal govt. had acted similarly months ago, 100s of flood deaths could have been avoided. Instead, this was our PM's shameless comment: https://t.co/qAX6JYPQwB

0

144

Manish Dhakal @mns_dkl

over 1 year ago

Mind-muscle connection is real. It shapes muscle tension, posture, and range of motion. By tweaking these details, we can create the right stretch. Fun fact: You can engage muscles with your mind—even while standing still.

0

2

0

96

Manish Dhakal @mns_dkl

over 1 year ago

@toughresearcher To sum your point, papers should outline their spectrum of impact. With wider spectrum, you must push towards exploration, whereas narrower spectrum can push to exploitation.

0

2

0

23

Manish Dhakal @mns_dkl

over 1 year ago

[Discussion] The competition for getting SOTA results for specific datasets is huge in ML research. They employ massive hyperparameter tunings to surpass their predecessors. Also, they report the best result among multiple seeds. Is this flow of research healthy or unhealthy?

1

7

0

185

Manish Dhakal @mns_dkl

over 1 year ago

Today, we (6 of us friends) were in 3-hour long video call with constant laughter 😂 and jokes. We were taking turns to roast one guy at a time. Laughed like that after a long time.

1

9

1

0

351

Manish Dhakal

@mns_dkl

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users