Intentee @intentee - Twitter Profile

9 days ago

Our goal with Paddler is to make self-hosting models dead easy, so if you're considering migrating from proprietary vendors, check it out here: https://t.co/ZcjIB2esR3

0

3

Intentee @intentee

9 days ago

🚨 Paddler, our open-source platform for keeping AI on your own servers, just got a major update. 4.0.0 highlights:

1

0

5

Intentee @intentee

9 days ago

- Token classification and counting - Tool-call response parsing into one consistent format, with arguments validated against your tool's JSON Schema - Compatibility with the OpenAI Responses endpoint

1

0

7

Intentee @intentee

2 months ago

Stability of the user experience is a strong reason to run open source models on your own infra, and it doesn't come up as often as cost control or data privacy. More in the video.

0

7

Intentee @intentee

3 months ago

Paddler repo: https://t.co/ZcjIB2esR3

0

8

Intentee @intentee

3 months ago

Really insightful talk from Mateusz Charytoniuk on building a self-hosted LLM stack at @rustikonconf today. He broke down where open source LLMs are headed, how to self-host them at scale with Paddler, and how Rust helps to make this ecosystem more secure and maintainable.

intentee's tweet photo. Really insightful talk from Mateusz Charytoniuk on building a self-hosted LLM stack at @rustikonconf today. He broke down where open source LLMs are headed, how to self-host them at scale with Paddler, and how Rust helps to make this ecosystem more secure and maintainable. https://t.co/v5CVVOUMXS

1

0

9

Intentee @intentee

7 months ago

MCP servers aren't just technical projects. They can add real value by making your product accessible through conversational AI. Here's a short video essay on how to frame them as a business opportunity: https://t.co/fLpOB6CjOD

0

48

Intentee @intentee

8 months ago

We're organizing Post Software, an event where AI builders, designers, and professionals from other industries meet to explore how AI can create entirely new solutions, not just optimize what already exists. Want to speak or get involved? Visit https://t.co/az0coOTyV6

0

179

Intentee @intentee

9 months ago

It gives you real-time insights into how your capacity is used, how many requests are being buffered, and any issues that might have come up. It also provides a convenient way to manage your models (swap them dynamically, use custom chat templates, adjust inference parameters).

0

34

Intentee @intentee

9 months ago

Paddler is our open-source platform for self-hosting LLMs at scale, and it comes with a web UI you can use to understand your cluster at a glance. https://t.co/N6ShdCBSkd

1

0

46

Intentee @intentee

9 months ago

Self-hosting your models can solve this, and with tools like Paddler, you don’t need to hire an entire LLMOps team. Learn more https://t.co/XWdvAPvIpw and https://t.co/h5M6DjaP4v

0

28

Intentee @intentee

9 months ago

Per-token API pricing for LLM usage is convenient, but it comes with a huge cost unpredictability. Traffic spikes are one thing, but the way your users use your LLM-based features can differ widely

1

0

30

Intentee @intentee

9 months ago

Pre-alpha version (static site generation, open-source, https://t.co/IlxBE4kgF2) is out, with a custom syntax that gives full control over how content is understood structurally and will allow us to add AI-based content analysis features. More to come :)

0

28

Intentee @intentee

9 months ago

Based on the last StackOverflow Developer Survey, most developers prefer both interactive formats and long-form articles when learning new technologies. This means static site documentation should no longer be static.

1

0

1

0

31

Intentee @intentee

9 months ago

Technical products and developer tools need to be discoverable in AI platforms, let users talk to their docs, and ensure coding copilots can offer quality code suggestions for their technologies. This is what we’re aiming for with Poet https://t.co/6ee68ZU6Y5

1

0

51

Intentee @intentee

9 months ago

Paddler lets you download models directly from Hugging Face (or from a local file path), both via API and our web admin panel. You can also swap them dynamically without needing to restart the entire setup. More: https://t.co/wUc36Flpz7

0

21

Intentee @intentee

9 months ago

Not every LLM task requires a massive number of parameters. And it’s not only a matter of cost savings. Smaller, specialized models can offer a better experience to the end user (both in quality response and performance).

1

0

17

Intentee @intentee

10 months ago

If there's no available capacity, Paddler will buffer the incoming requests. Combined with autoscaling, it lets you handle traffic spikes without dropping requests. You can also scale from zero hosts to avoid overpaying for idle GPU capacity. https://t.co/v4QvEq6EMN

0

2

0

16

Intentee @intentee

10 months ago

Feature highlight: model swapping. Paddler lets you swap models dynamically, without the need to restart the balancer or the agents. Available in the web admin panel or the API. Learn more: https://t.co/wUc36FkRJz

0

2

0

13

intentee retweeted

Kevin Wittek

@Kiview

10 months ago

@ggerganov Really awesome and I am happy to see llama.cpp being features for non local-workstation use cases more :)

0

3

1

0

410

Intentee

@intentee

Last Seen Users on Sotwe

Trends for you

Most Popular Users