Andy Halterman @ahalterman - Twitter Profile

about 1 year ago

Currently in FirstView: “Synthetically generated text for supervised text analysis.” @ahalterman proposes a method for using LLMs to generate synthetic training data for training smaller, traditional supervised text models. https://t.co/cfcKao0vRj

1

47

17

27

6K

Andy Halterman @ahalterman

almost 3 years ago

@arthur_spirling @cbarrie @brendan642 Unfortunately pretty much none of them do, because of licensing/copyright issues. The best I've seen is a news source + headlines, which you can (sometimes, painfully) use to get the original text, but it's not easy.

0

1

0

80

ahalterman retweeted

brendan o'connor @brendan642

almost 3 years ago

Reminder - for the terrific interdisciplinary Text as Data conference, abstract submissions coming up - due Aug 4! https://t.co/BjegNt4qpv It's a great, small, non-archival conference to discuss emerging work with folks across social sciences, humanities, and computer science.

0

66

40

17

26K

Andy Halterman @ahalterman

almost 3 years ago

Or if you're interested in forecasting civil war as a latent variable, you can come talk to me about that too.

0

1

0

426

Andy Halterman @ahalterman

almost 3 years ago

MSU political science is hiring in methods! I'm at PolMeth today if you want to chat about it. https://t.co/VLhaVM1Lkf

1

25

13

1

7K

Andy Halterman @ahalterman

about 3 years ago

Blog post: https://t.co/CEartyQPu6 Live demo: https://t.co/2PW2x1dtxP Paper: https://t.co/V7vI8buzpp

0

4

1

2

517

Andy Halterman @ahalterman

about 3 years ago

How can we categorize political actors extracted from text without dictionaries or lots of hand labeling? We can use a "soft dictionary" approach with a small set of hand-written patterns and a transformer model.

1

8

2

1

1K

ahalterman retweeted

Arthur Spirling @arthur_spirling

about 3 years ago

OK, here it is: a line in the sand (in @Nature). I am very wary about scientists---including political scientists---embracing/pushing proprietary LLMs. Let's try an open science approach. Hope this take is a useful one. https://t.co/jvb4tK8lE2

16

360

117

96

399K

Andy Halterman @ahalterman

about 3 years ago

Big epiphanies today in the grad causal inference class reviewing all the methods we've covered so far.

1

8

2

0

1K

Andy Halterman @ahalterman

about 3 years ago

No ChatGPT here (yet), only the finest, handcrafted, artisanal features and good old RoBERTa.

0

7

0

314

Andy Halterman @ahalterman

about 3 years ago

I've (finally) released an update to my text geoparsing library. Mordecai 3 lets you pass in a document, and returns the place names from the text and their geographic coordinates. It's built on #spacy and Geonames and uses a new neural similarity model. https://t.co/E1EfjNLEmL

2

42

4

12

5K

Andy Halterman @ahalterman

about 3 years ago

Mordecai 3 is open source, runs offline, and is available via pip or on GitHub: https://t.co/RVPnCux6vT. Check out the paper for more details and performance comparisons with other geoparsers: https://t.co/zLhn2IjUfj

1

6

0

318

ahalterman retweeted

cs.CL Papers @arxiv_cs_cl

about 3 years ago

https://t.co/dqZ3kuhaIh Creating Custom Event Data Without Dictionaries: A Bag-of-Tricks. (arXiv:2304.01331v1 [https://t.co/HW5RVw4UkE]) #NLProc

0

1

844

Andy Halterman @ahalterman

about 3 years ago

@a_strezh It’s a lie!

0

159

ahalterman retweeted

Andy Halterman @ahalterman

about 3 years ago

@emollick Not a special issue, but I have a working paper on using synthetically generated text to train supervised classifiers (with poli sci applications). Paper: https://t.co/YWcMpuuh8k Poster: https://t.co/qWIvb3HSIa

1

0

200

Andy Halterman @ahalterman

about 3 years ago

@emollick The main guidance is on how to guide the text generation (when to prompt vs. when to fine tune) and how to handle the output (hand label or zero shot). ChatGPT is definitely moving things toward prompt+zero shot.

0

46

ahalterman retweeted

Niklas Stoehr @niklas_stoehr

over 3 years ago · Abu Dhabi

@ben_j_radford and @hurrial kicking off the second day of the #CASE Workshop (Extraction of Socio-political Events from Text) @emnlpmeeting. @ahalterman @tiancheng_hu @yaoyao_dai @HristoTanev2 🙌 https://t.co/sqbdeUCOIB

0

5

2

0

ahalterman retweeted

MIT SSP

@MIT_SSP

over 3 years ago

Much of IR is concerned with understanding the behavior of elites. That’s nice from a natural language processing perspective, as elite decision-making tends to get written down. — SSP alum @ahalterman on natural language processing in IR research. https://t.co/4saIhRmpiA

0

3

1

0

Andy Halterman @ahalterman

over 3 years ago

@arnicas Every time I sit down to make a Streamlit demo, I think “this will probably take a couple hours”, and then it takes like 20 minutes. It’s a really nice tool!

1

2

0
