Spider Cloud 🕷️ @spider_rust - Twitter Profile

Pinned Tweet

about 2 years ago

You can now use Spider in @llama_index as a web reader! Crawl/scrape URLs and format the HTML into LLM-ready markdown! Spider is the fastest web crawler built for AI Agents and LLMs. h/t @WilliamEspegren for the PR

1

6

2

6

2K

spider_rust retweeted

Troy

@troyaitken_

11 days ago

Building a scraping stack from scratch. Here's what we actually use across hundreds of outbound campaigns:

- Instantly Data Scraper - directories. When you need to pull from G2, Capterra, industry lists. Fast. No code.
- Playwright + Claude Code - custom sites. Anything with a weird structure or login wall. Claude writes the scraper. You run it.
- Firecrawl - full site crawls. When you need everything on a domain. Pricing pages, case studies, team pages. 10 minutes, not 10 hours.
- Jina / https://t.co/pAKTBWthlG - scale. 10K+ pages. LLM-ready output. This is where most teams underinvest.
- Browserbase - agentic browsing. For flows a static scraper can't handle. Session persistence. Works where everything else breaks.
- BrightData - bot-protected sites. Yes it costs more. Yes it's worth it. LinkedIn. Amazon. Anything that actively fights you.

Finding the directories is one thing, understanding the use case and framing is another thing. A company listed on a niche directory already told you something. They chose to be found. They're actively positioning in that category. They want buyers to discover them. That's not a cold lead anymore.

Apollo gives you a list of people who fit a description. Directories give you a list of people who took action. Those are not the same signal.

Most funded teams spend 3 weeks debating which database to use. Then overpay for ZoomInfo since that’s what they did at their last company. The teams booking meetings on day one? They scraped intent from niche directories before anyone else thought to.

A client recently funded their Series A. Board wanted a consistent pipeline in 90 days. We skipped the generic Apollo list. Scraped 5,600 schools from vertical-specific directories. Matched them against relevant signals. Sent 7,000 emails. Booked 55 meetings in 31 days. Same offer. Same copy. Different list quality.

The scraping stack matters. But knowing where to point it matters more. Directories are intent. Treat them that way.

7

19

2

33

1K

spider_rust retweeted

Troy

@troyaitken_

3 months ago

Before writing cold email copy:

- Jina → homepage
- Spider → full crawl
- Claygent → pricing/careers
- Built With → tech stack
- Google News → triggers
- LinkedIn → profiles

All synthesized into ONE research column. Then we write.

1

23

1

21

1K
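
A minimal sketch of the "ONE research column" step from the tweet above, assuming each source has already returned a short text note. The source keys, the `build_research_column` function, and the sample notes are hypothetical names chosen for illustration; none of them come from Jina, Spider, Claygent, or any other tool mentioned.

```python
# Hypothetical source keys, ordered as in the tweet (homepage first, profiles last).
SOURCE_ORDER = [
    "homepage",         # e.g. fetched with Jina
    "full_crawl",       # e.g. fetched with Spider
    "pricing_careers",  # e.g. gathered with Claygent
    "tech_stack",       # e.g. looked up on Built With
    "news_triggers",    # e.g. found via Google News
    "profiles",         # e.g. collected from LinkedIn
]


def build_research_column(findings: dict[str, str]) -> str:
    """Merge per-source notes into one text column, skipping empty sources."""
    sections = []
    for source in SOURCE_ORDER:
        note = findings.get(source, "").strip()
        if note:
            sections.append(f"[{source}] {note}")
    return "\n".join(sections)


if __name__ == "__main__":
    # Made-up notes for a fictional company, only to show the output shape.
    sample = {
        "homepage": "Sells scheduling software to schools.",
        "tech_stack": "   ",  # blank note, so this source is skipped
        "full_crawl": "Three case studies, all K-12.",
    }
    print(build_research_column(sample))
```

Keeping the sources in a fixed order means every row of the research column has the same layout, which makes it easier to hand to whoever (or whatever) writes the copy.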

spider_rust retweeted

Dify

@dify_ai

10 months ago

Dify v1.8.0 is live. This release makes it easier to refine prompts, fix code, and manage workflows. Refine or repair right inside LLM and Code nodes with an agent.

Prompt and Code
- Prompt Optimization: use {{last_run}} with your ideal outputs to quickly refine prompts and keep iterations under control.
- Code Fix: auto repair captures {{current_code}} and {{error_message}} to generate corrected versions so you spend less time on manual debugging.
- Version Management: every optimization and fix is saved as a version so you can compare and roll back any time.

Workflow and agent upgrades
- Multi model credentials: configure and switch between multiple keys for the same provider or a custom model.
- MCP with OAuth: connect to MCP servers with OAuth, including token expiry control and callback allowlists.
- Default values for workflow variables: all start node variable types now support defaults for faster setup.
- Agent node token usage: track token usage in agent nodes for better monitoring and optimization.

Navigation and experience
- Knowledge base sorting: sort documents by status for smoother management.
- Extensible goto anything commands: a new architecture for faster navigation across projects.

Plus performance, security, and infra improvements across the board. Full changelog: https://t.co/0tFENc67NJ Happy building!

2

47

16

11K
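
The release note above mentions placeholders such as {{current_code}} and {{error_message}}. As a rough illustration of how double-brace placeholders can be filled into a repair prompt, here is a small sketch. It is not Dify's implementation: the `render` function, the regular expression, and the prompt wording are all assumptions made for this example.

```python
import re

# Matches {{name}} where name is letters, digits, or underscores.
PLACEHOLDER = re.compile(r"\{\{(\w+)\}\}")


def render(template: str, values: dict[str, str]) -> str:
    """Replace each {{name}} with values[name]; leave unknown names untouched."""
    def substitute(match: re.Match) -> str:
        name = match.group(1)
        return values.get(name, match.group(0))

    return PLACEHOLDER.sub(substitute, template)


if __name__ == "__main__":
    # Hypothetical repair prompt; the wording is invented for illustration.
    prompt = (
        "The following code failed.\n"
        "Code:\n{{current_code}}\n"
        "Error: {{error_message}}\n"
        "Return a corrected version."
    )
    print(render(prompt, {
        "current_code": "print(1/0)",
        "error_message": "ZeroDivisionError: division by zero",
    }))
```

Leaving unknown placeholders in place, rather than replacing them with an empty string, makes a missing value visible in the rendered prompt.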

Spider Cloud 🕷️ @spider_rust

about 1 year ago

@gertjanwilde @MrAhmadAwais @firecrawl @gertjanwilde Are you still having issues logging in? Mind sending me a DM?

0

17

spider_rust retweeted

Ahmad Awais

@MrAhmadAwais

about 1 year ago

For the first time, you can vibe-code any AI agent. Meet https://t.co/iLXHy3iBgJ — Computer Human AI by Langbase ☕ 🔹Prompt: "make an agent that…" 🔹Sip: chai builds any AI agent 🔹Ship: every agent gets a UI 🤯 Like your on-demand AI Engineer. What will you s(h)ip today?

70

583

105

711

149K

Spider Cloud 🕷️ @spider_rust

about 1 year ago

@chrislaupama Dm’d ya

0

1

0

11

Spider Cloud 🕷️ @spider_rust

over 1 year ago

Use Spider with @julep_ai cookbook https://t.co/uvLuPsae0T

Tom Dörr

@tom_doerr

over 1 year ago

Julep AI: Agent-based chat and task automation

15

766

78

1K

76K

0

4

0

1

462

spider_rust retweeted

Jason Zhou

@jasonzhou1993

over 1 year ago

This is how I use LLMs to scrape 99% of websites. Many people don't realise you can build an agentic scraper to:

1. Handle authentication, human verification, and captchas
2. Handle pagination & complex UI interactions
3. Adapt as website structure changes
4. Scrape large sets of data

What used to take hours can now be automated in minutes. Here I showcase how to automate a scraping job on Upwork, where people are paying $50~$80 per hour:

0:00 Intro
1:54 Methods overview
4:52 Web Scraper agent using @firecrawl @spider_rust @JinaAI_
8:37 Handle website auth & captcha using @AgentQL
20:27 AI buy tickets @MultiOn_AI

If you have any further questions or want a deep dive into the code example in the video, you can join my community where I post tips weekly: https://t.co/V4aBYxciqR

29

1K

152

3K

128K

Spider Cloud 🕷️ @spider_rust

over 1 year ago

Another great video from @jasonzhou1993 "This is how I scrape 99% websites via LLM" https://t.co/RGbH5aYPdl

0

1

0

1

459

Spider Cloud 🕷️ @spider_rust

over 1 year ago

congrats on the raise!

João Moura

@joaomdmoura

over 1 year ago

Excited to share that @crewAIInc raised $18 million in funding, with our series A led by @insightpartners, with @Boldstartvc leading our inception round. We're also thrilled to welcome @BlitzVentures, Earl Grey Capital(@amitvasudev_), and top AI leaders like @AndrewYNg and @dharmesh on board. This investment is a major validation of our vision: CrewAI is delivering on the promise of generative AI for enterprise, transforming automation by harnessing the power of AI agents. Our open-source framework executes over 10 million agents each month and is already trusted by an estimate of nearly half of the Fortune 500 to achieve automations that were previously impossible. With the launch of CrewAI Enterprise, we're making it even easier for large organizations to design, test, and deploy complex AI agents at scale, with high-quality results. A huge thank you to our employees, customers, partners, and investors. We ship fast and are just getting started. ⚡⚡⚡

104

578

45

90

86K

0

3

0

434

spider_rust retweeted

Lukas

@lukasboehler

over 1 year ago

Super excited about @spider_rust! It’s going to significantly boost web crawling speed, making content gathering for #kai @GleapSDK faster and more accurate 🚀 #llm #aibot #aiagent #rag

0

1

0

408

Spider Cloud 🕷️ @spider_rust

over 1 year ago

@sergedoub @dify_ai Thanks for reaching out. I'll DM you to get more info

0

67

Spider Cloud 🕷️ @spider_rust

over 1 year ago

@1000_tools are you guys still taking in new tools? I just sent ya a DM

0

1

0

45

Spider Cloud 🕷️ @spider_rust

almost 2 years ago

@jasonzhou1993 If you're looking for a blazing fast crawler for scraping that is cost effective—then check us out https://t.co/DIcpElTROA

0

1

0

236

Spider Cloud 🕷️ @spider_rust

almost 2 years ago

Great tutorial on OpenAI Structured Output by @jasonzhou1993 — thanks for the Spider shoutout! https://t.co/3NDyP0JQqq

1

2

0

1

350

Spider Cloud 🕷️ @spider_rust

almost 2 years ago

@jasonzhou1993 Spider is mentioned here: https://t.co/3ZuAInkA4w

1

0

269

spider_rust retweeted

Langflow @langflow_ai

almost 2 years ago

4/ Spider Web Scraper & Crawler component: A great tool for scraping and crawling webpages using @spider_rust .

1

10

3

1K

Spider Cloud 🕷️ @spider_rust

almost 2 years ago

Great update from the Dify team! ICYMI: Spider @spider_rust can be used as a built-in tool in Workflow or as an LLM-callable tool in Agent

Dify

@dify_ai

almost 2 years ago

🔥 Dify v0.7.0 is out! We've launched Conversation Variables and Variable Assigner nodes in Dify v0.7.0, tackling LLM memory limitations. These features enable precise storage, retrieval, and updating of context information throughout the conversation flow. Supporting structured data types, they give chatflow-built LLM apps precise memory control, boosting LLMs' ability to handle complex scenarios in production. - Read the blog: https://t.co/H56xumtkXM - Docs: https://t.co/SQHUxjAB0q We've also added new models and tools, and improved workflow functionality to enhance your AI apps. See the full changelog: https://t.co/zNzoAxeyPv

4

107

19

29

41K

1

7

1

2

3K

Spider Cloud 🕷️ @spider_rust

almost 2 years ago

Next-generation web crawling and scraping that can handle thousands to millions of pages in seconds. The fastest and most affordable service built fully in Rust. https://t.co/sPC1kjU3K4

0

6

0

2

548

Spider Cloud 🕷️ @spider_rust

almost 2 years ago

Under a second to crawl over 100 pages? What just happened? https://t.co/DIcpElUpE8

1

4

0

1

632
