Is the U.S. office market recovering or repricing? Look at the deals.
83 commercial office transactions by real estate investment firms in the past 30 days — pulled from 8,773 pages, with property, deal size, currency, investor, and date already structured.
A CRE analyst's week of work in one query.
Query of the Week ↓ in the comments.
Enterprise risk software tracks known risks well.
But most risks start outside your organisation: in supplier news, court filings, breach reports, and local press.
A web search API adds the external layer risk teams are missing.
New post: how to build an early warning risk intelligence system with CatchAll → link in the comments.
Building a custom scraper seems like the cheap option.
Then you add proxies, CAPTCHA handling, headless browsers, parser fixes every time a site changes its layout.
The real cost shows up 6 months in🧵
Custom scrapers make sense when:
→ You need logins or session handling
→ Workflows are small + sites rarely change
→ You need full control over timing and logic
Managed APIs win when:
→ Scale grows and infra becomes the job
→ You need clean JSON for AI/analytics pipelines
→ You want to ship fast without debugging crawlers
♠️ Tracking new gambling and casino licenses worldwide is a compliance nightmare.
We ran the query anyway: 15,982 pages → 39 license approvals from the last 30 days, with the licensee, jurisdiction, regulatory authority, license type, and approval date pulled into clean fields.
Compliance, gaming investors, and payments/KYC teams — this is the kind of dataset that usually takes a small research team a week.
Grab the full dataset via the link in the comments. ↓
By the time a funding round hits Crunchbase, the window to act has usually closed.
We wrote a blog on how Investment teams are building web search APIs into their stack to close the information gap 🧵 1/3
2/3 Financial databases are built around disclosure events: filings, earnings, official press releases.
But the early signals? They live on regional news sites, niche trade blogs, and company IR pages. Scattered, fragmented, no aggregator catches all of it.
🤌 If you build with agents, you need to see this.
86 multi-agent framework releases in 30 days, pulled into clean columns: framework, version, org, date, key features.
The agent updates, structured.
Query of the Week ↓ link in the comment.
🤫 Pssst... do you want to be faster than your competitors? Then read us on @Substack!
May updates from NewsCatcher: reusable webhooks that route results anywhere, project containers to organize work by client or team, and a free query validator that catches issues before they cost you.
And *secret new feature* we're working on, that allows you to process the primary documents — PDFs, filings, legislative votes — before journalists (and your competitors) do.
→ Read the full May issue on Substack – link in the comments.
The full tutorial (working Python code, prompts, architecture) is live.
The retrieval layer is where most agents quietly fail. This one doesn't.
🔗 https://t.co/ruqTMGh5JV
We built it.
One open-ended question → orchestrator breaks it into focused sub-tasks → sub-agents each run a CatchAll search → cited markdown report comes out the other end.