AmirM @amirm11 - Twitter Profile

Pinned Tweet

over 5 years ago

My first arXiv submission, an empirical study of applying a Bayesian GLM to bankruptcy prediction with an expert in the loop https://t.co/uBi319LFxW #bayesianml :) #rstanarm @mcmc_stan

0

14

2

5

0

AmirM @amirm11

3 months ago

https://t.co/dj1fUB4xyi

0

8

AmirM @amirm11

10 months ago

@jaredlander What about Spark?

0

5

AmirM @amirm11

about 1 year ago

@xuanalogue Interesting topic, would like to attend the Zoom.

0

11


amirm11 retweeted

over 1 year ago

I'm happy to announce that v2 of my RL tutorial is now online. I added a new chapter on multi-agent RL, improved the sections on 'RL as inference' and 'RL+LLMs' (although the latter is still WIP), fixed some typos, etc. https://t.co/dWe5uNgcgp

17

2K

283

2K

115K

AmirM @amirm11

over 1 year ago

@bindureddy Have you or anyone carried out a red teaming test on the R1 model yet?

0

6

amirm11 retweeted

Max Welling @wellingmax

over 1 year ago

Truly excellent piece on entropy. Source: Quanta Magazine https://t.co/v2nX2KYebh

9

374

67

377

46K

amirm11 retweeted

Kevin Patrick Murphy

@sirbayes

over 1 year ago

I am happy to announce that the first draft of my RL tutorial is now available. https://t.co/SjMdabl0yW

72

4K

719

4K

321K

amirm11 retweeted

Griffiths Computational Cognitive Science Lab @cocosci_lab

over 1 year ago

(1/5) Very excited to announce the publication of Bayesian Models of Cognition: Reverse Engineering the Mind. More than a decade in the making, it's a big (600+ pages) beautiful book covering both the basics and recent work: https://t.co/5dnLpcMQzu

20

2K

443

2K

176K

amirm11 retweeted

merve

@mervenoyann

over 1 year ago

Microsoft released a groundbreaking model that can be used for web automation, with MIT license 🔥👏 OmniParser is a state-of-the-art UI parsing/understanding model that outperforms GPT4V in parsing. 👏

30

3K

368

4K

473K

amirm11 retweeted

Rohan Paul

@rohanpaul_ai

over 1 year ago

Nice paper for a long read across 114 pages: "Ultimate Guide to Fine-Tuning LLMs". Some of the things they cover:

📊 Fine-tuning Pipeline: Outlines a seven-stage process for fine-tuning LLMs, from data preparation to deployment and maintenance.

🧠 Advanced Fine-tuning Methods: Covers techniques like Proximal Policy Optimization (PPO) and Direct Preference Optimization (DPO) for aligning LLMs with human preferences.

🛠️ Parameter-Efficient Fine-Tuning (PEFT) Techniques: Discusses methods like LoRA, QLoRA, and adapters that enable efficient fine-tuning by updating only a subset of model parameters.

🔬 Evaluation metrics and benchmarks for assessing fine-tuned LLMs: Includes perplexity, accuracy, and task-specific measures. Benchmarks like GLUE, SuperGLUE, TruthfulQA, and MMLU assess various aspects of LLM performance. Safety evaluations using frameworks like DecodingTrust are also crucial for ensuring responsible AI deployment.

💻 Explores various deployment approaches and optimization techniques to enhance LLM performance and efficiency in real-world applications.

🌐 Examines the extension of fine-tuning techniques to multimodal models and domain-specific applications in fields like medicine and finance.

27

3K

500

5K

241K

amirm11 retweeted

Dennis Ulmer 🦋 @dnnslmr

over 1 year ago

My dissertation "On Uncertainty In Natural Language Processing" is on arXiv! 🥳🎓 Check out my monograph for a background section summarizing statistical & linguistic views on UQ, a broad overview of methods used in #ML & #NLProc and so much more! https://t.co/c4XFBjcWUH

5

118

15

44

7K

amirm11 retweeted

Nando de Freitas

@NandoDF

over 1 year ago

I’d like to make more people aware of these books by @dvgodoy. They provide an excellent overview of deep learning, including convnets, dropout, normalisation, RNNs, sequence-to-sequence, attention, ViTs, encoder-decoder transformers and more. In many ways, the three volumes cover a good part of the history of AI from 2012 until GPT2. They also explain many important aspects of AI in practice with @PyTorch, such as optimisation, learning rate scheduling, visualising activations, datasets, loaders, training loops, etc. I highly recommend these books and the associated code by Daniel Godoy for summer schools and other introductory courses. @DeepIndaba @Khipu_AI

1

27

5

38

7K

amirm11 retweeted

Scholarship for PhD

@ScholarshipfPhd

over 1 year ago

How to write a research proposal (1/4)

2

572

125

628

48K

amirm11 retweeted

Yan Chen @HCI_Prof_YC

over 1 year ago

Transformer: Multi-Head Attention ~ Math vs Code 🔢💻 ~ I made this visualization to show you how to implement the multi-head attention math in PyTorch within 50 LoC. Multi-Head Attention is what makes the Transformer's performance outstanding. It captures and represents more diverse linguistic relationships and patterns, and attends to different learned input embedding spaces. The parallel computing design also makes the model more efficient.

10

310

54

278

33K

amirm11 retweeted

FAR.AI

@farairesearch

almost 2 years ago

"Please learn from our mistakes. Don't do exactly the same things that we did, or you'll end up in ten years with having nothing to show for it." — Nicholas Carlini urging AI researchers to avoid the pitfalls of past adversarial ML research at the Vienna Alignment Workshop 2024.

89

3K

382

1K

2M

amirm11 retweeted

Nando de Freitas

@NandoDF

almost 2 years ago

The Llama 3 paper is a must-read for anyone in AI and CS. It’s an absolutely accurate and authoritative take on what it takes to build a leading LLM, the tech behind ChatGPT, Gemini, Copilot, and others. The AI part might seem small in comparison to the gargantuan work on *data* and *scale engineering*. I hope professors in distributed systems, high performance computing, algorithms, databases, HCI, etc use it as an example of bleeding edge CS in their classes. So many exciting open problems! @UBC_CS @CompSciOxford @berkeley_ai @Cambridge_Eng @WitsUniversity @NSERC_CRSNG @NSF @ERC_Research @UKRI_News

11

2K

281

2K

160K

amirm11 retweeted

Max Welling @wellingmax

almost 2 years ago

Check this out: a completely free book written by my physics colleague at U. Amsterdam on "all of physics". It's a beautifully illustrated and highly accessible gem to take the deep dive into (classical & quantum) physics. Thanks Sander Bais! https://t.co/lbqQAceQCs

0

146

30

128

12K

AmirM @amirm11

about 2 years ago

@ChristophMolnar Link please

0

12

amirm11 retweeted

Caglar

@caglar_ee

about 2 years ago

Video lectures, UC Berkeley CS 188 Introduction to Artificial Intelligence spring 2023, by Pieter Abbeel https://t.co/bSqmu7kgWw

2

345

76

334

29K

amirm11 retweeted

Soledad Galli

@Soledad_Galli

about 2 years ago

A probability cut-point of 0.5 is almost never the best choice. But how to find the optimal threshold? sklearn has just released a transformer that does just that. Finds the best threshold based on a performance metric. https://t.co/S6N5hO0BAY

0

9

2

3

246
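
The threshold search described in the tweet above can be sketched in a few lines. This is a minimal illustration in plain Python, not the scikit-learn tool the tweet refers to: the function names `f1_score` and `best_threshold` and the toy labels and probabilities are assumptions made up for this example. It tries each predicted probability as a candidate cut-point and keeps the one that maximizes a chosen metric (F1 here).

```python
def f1_score(y_true, y_pred):
    """F1 for binary labels (1 = positive). Returns 0.0 when there are no true positives."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def best_threshold(y_true, probs, metric=f1_score):
    """Hypothetical helper: scan every distinct probability as a cut-point
    and return (threshold, score) for the highest metric value.
    Ties are resolved in favor of the lowest threshold."""
    best_t, best_score = 0.5, float("-inf")
    for t in sorted(set(probs)):
        # Predict positive when the probability reaches the candidate threshold.
        y_pred = [1 if p >= t else 0 for p in probs]
        score = metric(y_true, y_pred)
        if score > best_score:
            best_t, best_score = t, score
    return best_t, best_score


# Toy data (invented for illustration): an imbalanced problem where
# two of the three positives receive probabilities below 0.5.
y_true = [0, 0, 0, 0, 1, 0, 1, 1]
probs = [0.05, 0.10, 0.20, 0.30, 0.35, 0.40, 0.45, 0.80]

default_score = f1_score(y_true, [1 if p >= 0.5 else 0 for p in probs])
tuned_t, tuned_score = best_threshold(y_true, probs)
print(f"F1 at 0.5: {default_score:.3f}; best threshold {tuned_t} gives F1 {tuned_score:.3f}")
```

On this toy data the default 0.5 cut-point misses two positives, while a lower threshold recovers them at the cost of one false positive. In practice the threshold should be chosen on held-out or cross-validated predictions rather than on the data used to fit the model, otherwise the reported score will be optimistic.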
