Associate Professor, School of Information, UC Berkeley. NLP, computational social science, digital humanities. Not active here; find me at @dbamman.bsky.social
From @dbamman (co-authored w/@rach_scholcomm) new paper today relying on #copyright exemption for decrypting DVDs to conduct #textdatamining. Using the exemption, authors built a collection of film to measure representation for gender and race/ethnicity https://t.co/iJBy3JEb8b
Hi friends, colleagues, followers.
I am on the faculty job market! I am a PhD student @BerkeleyISchool + @berkeley_ai. I work on NLP, and I believe all language, whether AI- or human-generated, is ✨social and cultural data✨. My work includes: 🧵
How might one do classification in the era of LLMs for humanities research? 🤔
@dbamman, @KentKChang, @NaitianZhou & I apply LLMs on ten tasks from prior cultural analytics lit. Larger LMs are competitive w/ older methods on established tasks, but perform less well on new ones.
In cultural analytics, accuracy is often not the only (or even primary) objective.
Here, we explore the myriad ways CA uses classification, how LLMs compare to other commonly used methods, and how they might enable new approaches to sensemaking from text data.
My group just finished up a new paper that I'm excited to get out into the world: "On Classification with Large Language Models in Cultural Analytics" (to be published at CHR): https://t.co/xtCAhSd53i. More info here! https://t.co/l7gALaOg5H
Big congrats to @KentKChang for passing his qualifying exam today! Lots of super exciting work on measuring social interactions in culture in the pipeline --
@hila_gonen @Terra @alisawuffles @luke @nlpnoah Cool work! I partly want to take this as suggestive of models following gricean maxims (e.g. "why are you telling me about yellow and asking about a job? o this must be a joke")
It’s an extraordinary pleasure and honor to teach alongside @dbamman and his wonderful students of NLP, now doubly so to have my small part recognized by @BerkeleyISchool & UC Berkeley.
Hey NLPals, I'll be at #NAACL2024 this upcoming week! Let's chat about sociocultural NLP, what it means to study culture, and finding variation in unusual places (like memes!)
I'll be presenting this memes paper at the first poster session.
I’m headed to NAACL to present this paper!
I’m around mostly Sunday evening thru Tuesday. This fall I’ll be doing some thinking about what to do after my PhD; if you have advice/thoughts about this definitely chat with me!
Looking forward to seeing people at #NAACL2024 this week! Today, be sure to check out @NaitianZhou's poster on the sociolinguistics of memes (11am) and @lucy3_li's talk on concepts of fairness in NLG systems at 2:36pm (ethics/bias/fairness 1)
Join us on Monday, 2/26 at 4:30 pm for a lecture by @dbamman: The Promise and Peril of Large Language Models for Cultural Analytics.
RSVP: https://t.co/HUzV6CMPse
More info: https://t.co/MrwB3NPbJm
Co-sponsored by @PrincetonPLI.
Very excited to announce the launch of our citizen science initiative "The Lives of Literary Characters" hosted @the_zooniverse. This is the first ever literary citizen science project that aims to promote story understanding. A Thread 🧵 https://t.co/YDrPPeFa3o
New preprint! 📜 We investigate how ten “quality” and English langID filters, drawn from prior lit on LLM pretraining data curation pipelines, affect webpages linked to self-descriptions of their creators.
Paper: https://t.co/8w81ztd6TA
Data: https://t.co/SpPUJzsc9U 🧵(1/6)
🚨NLP+CSS workshop is back and will be at NAACL 2024!
Paper submission deadline: March 24 https://t.co/Wbipcl0Xhl
Organizing team: @anjalie_f @dallascard @dirk_hovy and myself