@TheStalwart Yes, rerun the whole thing. Flash 2.5 is generally better than Flash 2.0. So is Flash 3.1 Lite, which is less expensive. I've been using these models for my startup and would be happy to help you with this project. We follow each other and have the same name!
@LangburdWright@bocowgill@pablhern I really enjoyed reading this paper! The experiment made sense, the results were interesting, and the writing quality was very high.
@chi_t_williams@KelseyTuoc It's about extracting various types of signals, including style, topics and potentially PII. They even run text through an LLM to try to disguise it, not entirely successfully. I'm now convinced that if you have enough public and anonymous online writing, LLMs can dox you.
@chi_t_williams@KelseyTuoc This paper I read https://t.co/4svs8fMAiT suggests identification isn’t about lots of your text, it’s about whether a stable “style signature” can be inferred. Even limited writing can be enough once those statistical patterns show up.
@FederalAsh14@johnjhorton@MeganTStevenson Qianfan OCR looks interesting. Here's what's different about it from other OCR models. It is a bigger model, but it can also handle, document understanding and key information extraction. It takes a prompt, and uses thinking tokens. I listened to the paper with Paper2Audio!
@FederalAsh14@johnjhorton@MeganTStevenson I'm starting to look into Qianfan OCR as well. It is 4B parameters, but claims to go beyond the other recent "OCR" models to handle document understanding tasks (like chart summaries). It looks interesting and I'm going to read its paper this week too.
@MeganTStevenson@Ljt019117161@johnjhorton The problem is that earlier tools had much lower accuracy narrating a complex doc like a research paper PDF. Paper2Audio accurately narrates your docs using high quality voices, building upon recent AI progress in multiple areas. Happy to discuss or answer more questions.