Armin Buescher @armbues - Twitter Profile

about 2 years ago

@lucataco93 Nice! FYI peak memory during training depends on batch size and # tokens in the dataset. You should give SiLLM a try for training with LoRA/DPO: https://t.co/hf4uTSX6xV

0

12

Armin Buescher @armbues

about 2 years ago

@DaveDC22 There was a bug that was fixed in the app. Pull the latest source from the repo and it should work 👍

1

0

70

Armin Buescher @armbues

about 2 years ago

Running Llama-3-8B-Instruct on Mac with the SiLLM framework powered by MLX... just took some fiddling with the tokenizer & template to get it to run 😁

5

50

6

39

9K

Armin Buescher @armbues

about 2 years ago

@DaveDC22 You need to point it at a directory with model files that you want to run. What type of model are you trying to load?

1

0

103

Armin Buescher @armbues

about 2 years ago

@awnihannun I could not agree more! 💯 Fantastic job by the team working on this! 👏

0

1

0

32

Armin Buescher @armbues

about 2 years ago

@alew3 @awnihannun An out-of-the-box solution to run/train LLMs on Apple Silicon built on top of MLX: https://t.co/hf4uTSX6xV

2

5

1

8

329

Armin Buescher @armbues

about 2 years ago

Early version bump for SiLLM to 0.1.1 with some bugfixes and support for Llama-3 models. https://t.co/LQpOjOTlbA https://t.co/hf4uTSX6xV

0

4

0

1

152

Armin Buescher @armbues

about 2 years ago

@adithyan_ai Just loading the model needs about 87 GB and then you’d need a bit more for inference.

0

1

18
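As a rough sanity check on the 87 GB figure in the reply above, the sketch below estimates the weight footprint of a 4-bit quantized model. Every number in it is an assumption rather than something stated in the thread: a total of roughly 141B parameters for an 8x22B mixture-of-experts model, groups of 64 weights, and one 16-bit scale plus one 16-bit bias stored per group. The function name is hypothetical and is not part of SiLLM or MLX.

```python
def estimate_quantized_weights_gb(n_params, bits=4, group_size=64,
                                  scale_bits=16, bias_bits=16):
    """Estimate the size in GB (10^9 bytes) of group-wise quantized weights.

    Assumes every parameter is quantized and that each group of
    `group_size` weights stores one scale and one bias alongside it.
    """
    # Per-weight overhead contributed by the per-group scale and bias.
    overhead_bits = (scale_bits + bias_bits) / group_size
    bits_per_weight = bits + overhead_bits
    return n_params * bits_per_weight / 8 / 1e9


# Assumed total parameter count for an 8x22B mixture-of-experts model.
ASSUMED_PARAMS = 141e9

print(f"{estimate_quantized_weights_gb(ASSUMED_PARAMS):.1f} GB")  # prints 79.3 GB
```

Under these assumptions the quantized weights alone come to about 79 GB. Layers that stay unquantized and runtime buffers would plausibly add to that, which fits the reply's point that inference needs more memory than loading does.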

Armin Buescher @armbues

about 2 years ago

Running WizardLM-2-8x22B 4-bit quantized on a Mac Studio with SiLLM powered by Apple MLX

1

18

2

8

1K

Armin Buescher @armbues

about 2 years ago

@ivanfioravanti @awnihannun Just the product of lots of tinkering and trying to port DPO & losses over to MLX 😁 Might have some bugs that I'm not seeing 🙈 Example code with the DPO-mix dataset here: https://t.co/h7GvQ6rat2

1

4

0

1

43

Armin Buescher @armbues

about 2 years ago

A huge thank you to @awnihannun @angeloskath and the rest of the team for developing the MLX framework that SiLLM relies on! Also big kudos to all the contributors of the MLX Examples project 👏

1

5

1

0

480

Armin Buescher @armbues

about 2 years ago

I'm excited to share a new open-source project: the Silicon LLM Training & Inference Toolkit, or SiLLM for short. Check out the project on GitHub here: https://t.co/hf4uTSX6xV

1

67

16

84

37K

Armin Buescher @armbues

about 2 years ago

The repository includes several code examples:
- LoRA training with the Nvidia HelpSteer dataset
- DPO fine-tuning with the DPO Mix 7K dataset
- Implementation of the MMLU benchmark
- Calculating perplexity scores of a model for a sample dataset

1

4

1

0

545
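For readers unfamiliar with the last example in the tweet above, perplexity is the exponential of the average negative log-likelihood per token. The snippet below is a minimal, framework-free sketch of that definition. It is not taken from the SiLLM repository, and the function name and input format (a list of natural-log token probabilities) are assumptions made for illustration.

```python
import math


def perplexity(token_logprobs):
    """Compute perplexity from per-token natural-log probabilities.

    `token_logprobs` holds log p(token | preceding tokens) for every
    token in the evaluated text, as produced by some language model.
    """
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    # Average negative log-likelihood per token.
    avg_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(avg_nll)


# A model that assigns probability 0.25 to every token has a
# perplexity of about 4: it is as uncertain as a uniform choice
# among four options.
print(perplexity([math.log(0.25)] * 4))
```

Lower values mean the model assigns higher probability to the sample text. Scores are only comparable between models that use the same tokenizer and dataset.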

armbues retweeted

CARO Workshop 2027 @caroworkshop

about 3 years ago

One of the reasons to attend the #CARO2023 is the food for thought that is delivered in talks, conversations, and of course keynotes. Armin Büscher @armbues will share his technical perspective about innovation and disruption in cybersecurity. https://t.co/PDawMvhhv8

0

4

3

0

418

armbues retweeted

Socially Distant Jerry @Maliciouslink

over 3 years ago

Y’all have a home on https://t.co/F199jufn1i if you need it ❤️

11

130

47

5

0

armbues retweeted

StupidBird @Legen78695928

over 3 years ago

This file leaked a Security Enterprise VirusTotal API key before！But now it's expired because someone leaked the key😅 ITW:07c4a75b1422a22ec29c5102e0b67055 API Key:d10468bead05da1685629a0abcfed5f963d6adbc7e6bb2b2fc343dbb36be0349 unbelievable！

https://t.co/bd4AbgGpcK

0

8

3

2

0

armbues retweeted

Joe Desimone

@dez_

almost 4 years ago

We just released 1000+ yara rules and 200+ endpoint behavior rules https://t.co/33PMWZIlky

14

994

353

199

0

Armin Buescher @armbues

almost 4 years ago

I'll be traveling to Vegas for #blackhat2022 and #DEFCON next week. Looking forward to hanging out with many infosec folks I haven't seen in a long time 🥳

0

5

0
