Important win in the lawsuit we filed against Deere for blocking farmers from easily fixing their equipment.
Great to see continued momentum towards ensuring that Americans have the freedom to fix their own products or use repair shops of their choice.
https://t.co/DRehBgykIF
The Washington state legislature is sending TWO right to repair bills to the governor to sign -- and both passed overwhelmingly (total of 2 no votes across 4 full chamber tallies!) https://t.co/yNcFQQDJD2
@teknium instructor. func calling w/pydantic has been fantastic, and made getting useful outputs out of data extraction tasks dead simple and nearly 100% accurate. Bonus, just found out it works with GPT4-V model too using MD_JSON mode.
@bhallcpa Paolo Bacigalupi is my favorite in the genre - the Water Knife, Ship Breaker, Windup Girl. Apocolyptic Sci-fi. Don't really follow the "major outage" trope - more like the apocalypse has happened, now what does the world look like.
@ArtirKel The quality of the PDF, and complexity of the content, is the biggest factor in RAG performance.
If they are well formatted pdfs, not much complexity - most out of the box solutions mentioned will work. Otherwise you need to OCR. Vision models have helped, but v. expensive.
The Polish hackers who heroically fixed a series of passenger trains that the manufacturer had artificially bricked with DRM are now being threatened with lawsuits. OEM demanding the trains be removed from service:
https://t.co/dR7pRVuRTq
New from 404 Media: you may have seen that a group of Polish hackers repaired trains the manufacturers had bricked (hackers were successful and helped). Now the train company is threatening to sue them. We spoke to the hackers.
https://t.co/XfkMQdiKvf
When farmers don't have access to what they need to fix their tractors, serious costs & downtime ensue.
In my latest for @uspirg, I calculated that #RightToRepair could save U.S. farmers $4.2 BILLION a year.
Here's a few of the key stories and findings from the report:
@Peter_0_0_g @abacaj I’ve had success using agents as simple controllers,deciding the path to continue based on fuzzy context, and letting it go a few levels deep. But letting them run wild any further gets into loops quickly in my experience.
@Ishaank1999@langchain@pinecone How have the costs been on pinecone?
Been a fan of pinecone so far but struggling to understand their pricing.
Have about 250k pdfs ranging in size from 10-150 pages.
@xathis@tito@LPitsoulis@pinecone@langchain It seems to depend heavily on the subject content.
Larger chunks for text heavy docs works best, but smaller chunks works better with graph or table heavy docs. No hard data, just experience at this point. Very interested in research on this
@xathis@tito@LPitsoulis@pinecone@langchain I rarely use overlap.
The only time is search across pdf pages. I use it to capture sentence continuation past the page end, but I’m not convinced some arbitrary number, token based overlap is the right method. paragraph / sentence boundary seems better.