Markus Frohmann @FrohmannM - Twitter Profile

3 months ago

Excited to share that I'll be joining @thomsonreuters Labs in Zug, Switzerland, as an Applied AI Scientist Intern from April to September 2026! 🏔️ A nice bridge between finishing my MSc at JKU Linz and starting my PhD later this year - let me know if you're around!

0

2

0

121

Markus Frohmann @FrohmannM

3 months ago

@usmasfr Cool use case! You can start with our default setup, using 30 epochs, here: https://t.co/5Ybn8dpQMh Otherwise, I'd focus more on clean segmented training data than tuning epochs; >100 segmented training sentences would be good. You can also tune LoRA rank a bit if needed.

1

0

24

Markus Frohmann @FrohmannM

3 months ago

wtpsplit now supports length-constrained segmentation ✂️ min/max chunk length (chars) while preserving semantic chunks - should be great for RAG! Example (≤30 chars): [Landing 5pm → Beimen.] [Let's meet at: Ximen Exit 6.] [Then: Ningxia Night Market...] [Late-night snack!']

Markus Frohmann @FrohmannM

almost 2 years ago

Introducing 🪓Segment any Text! 🪓 A new state-of-the-art sentence segmentation tool! Compared to existing tools (and strong LLMs!), our models are far more: 1. efficient ⚡ 2. performant 🔝 3. robust 🚀 4. adaptable 🎯 5. multilingual 🗺

FrohmannM's tweet photo. Introducing 🪓Segment any Text! 🪓

A new state-of-the-art sentence segmentation tool!
Compared to existing tools (and strong LLMs!), our models are far more:
1. efficient ⚡
2. performant 🔝
3. robust 🚀
4. adaptable 🎯
5. multilingual 🗺 https://t.co/rV1FuYs3An

2

180

26

135

20K

2

0

209

Markus Frohmann @FrohmannM

3 months ago

More info + how to use: https://t.co/KOCMBIrICl

0

37

Who to follow

Rishav Dhar

@iamrishav111

Aspiring Product | Current SDE @Amazon | Ex-SDE Intern @Samsung

Abdul Basit H., PhD

@abdulbasitds

Machine learning | MlOps | Data Engineering | PhD in AI

Y Taha

@Ytaha64718072

tweets about research in ML & computational biology

Markus Frohmann @FrohmannM

7 months ago

I'm at #EMNLP2025 in Suzhou this year! Looking forward to connecting with the community after a year's break and spending some time abroad。。。再見！

1

16

0

2K

Markus Frohmann @FrohmannM

10 months ago

🤖🗣️ Double Entendre will be presented today, 18:00-19:30 at Hall 4/5 by @m_schedl! Check it out if you're at #ACL2025!

Markus Frohmann @FrohmannM

11 months ago

Excited to share two new papers on AI-generated music detection from my research internship at @Deezer, published in @ismir_conf #ISMIR2025 and @aclmeeting #ACL2025 Findings! 🎶🤖 The problem: most AI music detectors are impractical or unreliable in real-world settings.

5

3

0

467

5

2

0

255

Markus Frohmann @FrohmannM

11 months ago

@Deezer @aclmeeting I had a great time working on this with @deezer in Paris! Big thanks to my mentors @evpure, @Gabolsgabs, and @m_schedl! 💻 Code: https://t.co/mVoJWH1tZ7 📄 ISMIR Paper (foundation): https://t.co/vXtQZp4xce 📄 ACL Paper (Multi-View Double Entendre): https://t.co/SkmKsqFS7b

0

1

0

94

Markus Frohmann @FrohmannM

11 months ago

Excited to share two new papers on AI-generated music detection from my research internship at @Deezer, published in @ismir_conf #ISMIR2025 and @aclmeeting #ACL2025 Findings! 🎶🤖 The problem: most AI music detectors are impractical or unreliable in real-world settings.

5

3

0

467

Markus Frohmann @FrohmannM

11 months ago

@Deezer @aclmeeting I view this work as an important extension of current single-modality detectors while maintaining flexibility and modularity. It's not production-ready, but it highlights key paradigms for detection: Using all available information from just the audio and a focus on robustness.

1

0

92

Markus Frohmann @FrohmannM

about 1 year ago

Wtpsplit, our text segmentation tool, just reached ⭐️1000 stars⭐️ on GitHub! Excited to see it is proving useful! Check it out here: https://t.co/IHH1GVemv3 🎉

FrohmannM's tweet photo. Wtpsplit, our text segmentation tool, just reached ⭐️1000 stars⭐️ on GitHub! Excited to see it is proving useful!
Check it out here: https://t.co/IHH1GVemv3 🎉 https://t.co/lWu9qvPud2

Markus Frohmann @FrohmannM

almost 2 years ago

Introducing 🪓Segment any Text! 🪓 A new state-of-the-art sentence segmentation tool! Compared to existing tools (and strong LLMs!), our models are far more: 1. efficient ⚡ 2. performant 🔝 3. robust 🚀 4. adaptable 🎯 5. multilingual 🗺

2

180

26

135

20K

0

7

1

0

515

FrohmannM retweeted

Benjamin Minixhofer

@bminixhofer

about 1 year ago

We created Approximate Likelihood Matching, a principled (and very effective) method for *cross-tokenizer distillation*! With ALM, you can create ensembles of models from different families, convert existing subword-level models to byte-level and a bunch more🧵

bminixhofer's tweet photo. We created Approximate Likelihood Matching, a principled (and very effective) method for *cross-tokenizer distillation*!

With ALM, you can create ensembles of models from different families, convert existing subword-level models to byte-level and a bunch more🧵 https://t.co/ufdCcrsUJC

2

87

26

29

7K

Markus Frohmann @FrohmannM

over 1 year ago

Curious about our SoTA text segmentation tool? 🪓 It's gonna help you across all kinds of NLP tasks! Learn more at our poster session: Tuesday, 4pm, Jasmine room at #EMNLP2024! 🗓️ See you there! I'll be attending the whole conference - happy to connect with everyone! 👋

Markus Frohmann @FrohmannM

almost 2 years ago

Introducing 🪓Segment any Text! 🪓 A new state-of-the-art sentence segmentation tool! Compared to existing tools (and strong LLMs!), our models are far more: 1. efficient ⚡ 2. performant 🔝 3. robust 🚀 4. adaptable 🎯 5. multilingual 🗺

2

180

26

135

20K

0

5

0

238

Markus Frohmann @FrohmannM

over 1 year ago

Excited to share that I joined @researchdeezer as a research intern to work with @evpure and @Gabolsgabs on detecting AI-generated lyrics !🎶 The first few weeks have been amazing, and I am excited about what is to come—life in Paris certainly has unparalleled charm!

0

6

0

204

Markus Frohmann @FrohmannM

over 1 year ago

This was an awesome summer! I can only recommend ETH's summer research fellowship program 🏔️ Also happy about the project's progress - integrating videos into existing architectures is quite exciting, stay tuned! Super grateful to Ryan Cotterell and @glnmario for supervising me.

Markus Frohmann @FrohmannM

almost 2 years ago

Excited to share that I joined @ETH Zürich as a summer research fellow, supervised by Prof. @ryandcotterell, working on ✨Multimodal LLMs! ✨ The first few weeks have been a blast, and I'm looking forward to the weeks ahead! 📽️

0

13

0

1K

1

6

0

370

FrohmannM retweeted

Cohere Labs

@Cohere_Labs

over 1 year ago

Congratulations to C4AI Research Grant recipient @FrohmannM and all authors of "Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation" for their EMNLP acceptance!🥳

1

7

2

1

1K

Markus Frohmann

@FrohmannM

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users