David Smith @dasmiq - Twitter Profile

about 2 years ago

Now that the semester’s over, I wanted to share the readings that @giulia_taurino, the students in our seminar on Artificial Intelligence as an Archival Science, and I put together. We had a great time, and they wrote good papers. Hopefully this will be useful. https://t.co/887QMzvQx5


David Smith @dasmiq

over 2 years ago

@Dorialexander OK, try this branch: https://t.co/ithNnn9zzN
from passim import align_passages
align_passages(src, trg)
align_passages(src, trg, n=5)
Options are mostly the same as the command-line version.


David Smith @dasmiq

over 2 years ago

@Dorialexander Nice! I'd be interested to see what kind of classifier would work to guide inference with a large enough source corpus. Are you just expecting this to work contrastively in the beam? Anyway, I'll have some free time on Friday to sketch the passim solution.


David Smith @dasmiq

over 2 years ago

In the 1990s, part of it housed the Max Planck Institute for the History of Science. That's their conference room on the lower left, where, before Google Docs, they hooked three keyboards up to one computer to co-edit papers.

John Paul Newman @johnpaul_newman

over 2 years ago

Czechoslovak embassy, Berlin, 1970s.


dasmiq retweeted

Rahul B (@[email protected]) @rahulbot

over 2 years ago

Work with data in newsrooms, libraries, CSOs, museums, govt, or community? Excited to share I'm working on a book for *you* about creative data literacy and storytelling in pro-social settings. Tentatively titled "Community Data". Coming fall '24 from @OxUniPress 💡+🧑🏾‍💻=📗


David Smith @dasmiq

over 2 years ago

@giovanni1085 @Unibo @BoldhUnibo @UniboDHARC Congratulations! More computer scientists in philology!


dasmiq retweeted

Manuel Burghardt @8urghardt

over 2 years ago

So proud of my Computational Humanities Group @UniLeipzig + special guest @SarahALang – 5 submissions have been accepted for CHR conf. 2023 in Paris. Props to everybody in the group and many thanks to the PC and reviewers for doing such a great job! https://t.co/ySxGLHqqYH


David Smith @dasmiq

over 2 years ago

@nyhabash I am so sorry, Nizar. Peace be with you and your family.


dasmiq retweeted

David Smith @dasmiq

over 2 years ago

Last but not least, in EMNLP Findings, Liwen Hou continues her brilliant line of work on diachronic syntax by investigating how we can probe language models trained on different time periods. https://t.co/lMfr3NYpZS


dasmiq retweeted

David Smith @dasmiq

over 2 years ago

Next in CHR, @muther22 and Mathew Barber use language models to probe modern and mediaeval citation practices. A citation is a query in a noisy channel model that the author of a target text thinks might help you find the source. https://t.co/lD2gZkdybR


dasmiq retweeted

David Smith @dasmiq

over 2 years ago

Caroline Craig, @kartik_goyal_, @farnooshamsian, and @PhilologistGRC have a CHR paper on getting document-level sentence alignment to work for the ancient Greek and Latin corpus to track multiple translations into English, French, German, Persian, etc. https://t.co/JO3OkklB4f


dasmiq retweeted

David Smith @dasmiq

over 2 years ago

Our OCR team @Open_ITI (Jake Murel, @Mar_Musa, and @M_T_Miller) has a new Computational Humanities Research paper on transcribing Arabic and Persian manuscripts without any annotated manuscript data, Automatic Collation for Diversifying Corpora (ACDC): https://t.co/pQlB0kRLeJ


dasmiq retweeted

kartik goyal @kartik_goyal_

over 2 years ago

New CHR paper with an amazing set of collaborators: we find that high-recall bitext mining and sentence alignment is actually kinda tricky for messy historical literary text. Multilingual embeddings like LaBSE and friends work surprisingly well for literary ancient Greek though!


David Smith @dasmiq

over 2 years ago

Good work to everyone at @NUlabTMN and beyond!


David Smith @dasmiq

over 2 years ago

I don't post over here much anymore, but I want to point out the great work of my talented coauthors in some recent papers on #NLProc, #DH, #HTR, and historical linguistics.


David Smith @dasmiq

over 2 years ago

A lot of past work on historical syntax involved treebanking text from different time periods. Instead, Liwen compares language models trained on different time periods on modern tagging and parsing tasks to detect language change.
