Michael Ortega

9 months ago

@eligos_games Yes please!

1

0

12

MichaelOrtegaSF retweeted

about 1 year ago

🚀 Serve and fine-tune #Qwen3 — in your cloud or ours with blazing fast #inference speeds! No need to share your data. 🚀 Qwen 3 is the latest #opensource LLM dominating the leaderboards. Don't get left behind! Now you can serve and customize the latest Qwen models instantly on our shared serverless endpoints or deploy securely in your own #VPC! ➡️ Try Qwen 3 with $25 in free Predibase credits: https://t.co/P7s8UZSxvs ➡️ Get access to high-end GPUs to deploy Qwen 3 in your cloud: https://t.co/ujZvGSyA10

predibase's tweet photo. 🚀 Serve and fine-tune #Qwen3 — in your cloud or ours with blazing fast #inference speeds! No need to share your data. 🚀

Qwen 3 is the latest #opensource LLM dominating the leaderboards. Don't get left behind!

Now you can serve and customize the latest Qwen models instantly on our shared serverless endpoints or deploy securely in your own #VPC!

➡️ Try Qwen 3 with $25 in free Predibase credits: https://t.co/P7s8UZSxvs

➡️ Get access to high-end GPUs to deploy Qwen 3 in your cloud: https://t.co/ujZvGSyA10

0

6

3

0

356

MichaelOrtegaSF retweeted

Founder CRO @ProjectHeena Business blogger, Startup Junkie & part of @headstarters. Passionately strives to apply common sense wherever possible!

about 1 year ago

🐳 AI teams are testing DeepSeek—but nobody agrees on when to use it In our recent survey of 500+ AI professionals, DeepSeek-R1 is getting serious attention—but it's far from mainstream. Here’s what we uncovered: 📊 57% of teams have experimented with DeepSeek-R1 ⚠️ Only 3% have deployed it in production 🤷‍♂️ Nearly half are unsure how it stacks up to other models And the demand for customization is clear: 🔧 46% want fine-tuning or distillation options 🧪 The takeaway? DeepSeek-R1 has potential—but teams are still figuring out how to unlock it. 👉 Ready to see if it fits your use case? Start experimenting on Predibase—free trial available. #AI #LLM #DeepSeek #MLOps #Predibase #GenAI #MachineLearning #opensourcellms

predibase's tweet photo. 🐳 AI teams are testing DeepSeek—but nobody agrees on when to use it

In our recent survey of 500+ AI professionals, DeepSeek-R1 is getting serious attention—but it's far from mainstream. Here’s what we uncovered:

📊 57% of teams have experimented with DeepSeek-R1
⚠️ Only 3% have deployed it in production
🤷‍♂️ Nearly half are unsure how it stacks up to other models

And the demand for customization is clear:
🔧 46% want fine-tuning or distillation options
🧪 The takeaway? DeepSeek-R1 has potential—but teams are still figuring out how to unlock it.
👉 Ready to see if it fits your use case? Start experimenting on Predibase—free trial available.

#AI #LLM #DeepSeek #MLOps #Predibase #GenAI #MachineLearning #opensourcellms

1

8

2

4

574

Who to follow

Himanshu Chanda

@himanshuchanda

Jared (JB) Balint

@jbalint88

Husband, Father, Hockey & Guitar Playing Sports Fan! Life is short Don't Fret. Co-Host of the 2 Many Men on the Mic Show, Weekdays 3 - 4 on https://t.co/OWfrDUld5H

@predibase @nvidia #Fresh

over 1 year ago

0

1

0

20

over 1 year ago

@predibase Amazing results!

0

1

0

149

over 1 year ago

Custom #AI without all the labeled data. RFT has arrived! Train your own #LLMs and outperform #DeepSeekR1 and #GPTo1 😎

over 1 year ago

Today we're thrilled to announce the first end-to-end platform for Reinforcement Fine-Tuning. With just a dozen labeled data points, you can outperform #OpenAI o1 and #DeepSeekR1 on complex tasks. Built on the #GRPO methodology that DeepSeek-R1 popularized, our platform delivers exceptional results. In our real-world PyTorch to Triton transpilation case study, we achieved 3x higher accuracy than OpenAI o1 and DeepSeek-R1 when writing GPU code. Check out the thread below to learn how you can adapt an #opensource #LLM to your use cases with unmatched efficiency. #rft

20

496

68

379

1M

0

3

0

85

over 1 year ago

@DataStax FTW at GTC!

0

1

0

380

MichaelOrtegaSF retweeted

over 1 year ago

#Reasoning models are 🔥 But #inference can be sooo slow due to massive token generation 🛑 Unless you know how to #turbo charge your LLMs 🚀 Last chance: Save your spot for our interactive 30 minute #AMA style demo on how to accelerate reasoning models like #DeepSeek-R1 by 2-3x. ➡️ Sign-up: https://t.co/Ou9TQMmiA8

0

4

1

0

189

MichaelOrtegaSF retweeted

Travis Addair @TravisAddair

over 1 year ago

🚀 #RFT vs. #SFT: When to Use Each for Maximum Impact #DeepSeek -R1 made #Reinforcement #FineTuning (RFT) the hot new thing—but is it better than #Supervised Fine-Tuning (SFT)? 🤔 Here’s when RFT wins: ✅ No labeled data? If you can verify correctness, RFT works. ✅ <100 labeled examples? RFT generalizes better. ✅ CoT helps? RFT fine-tunes reasoning beyond SFT. 📖 Read the full blog here 👉 https://t.co/7fbCGwwU71

predibase's tweet photo. 🚀 #RFT vs. #SFT: When to Use Each for Maximum Impact

#DeepSeek -R1 made #Reinforcement #FineTuning (RFT) the hot new thing—but is it better than #Supervised Fine-Tuning (SFT)? 🤔

Here’s when RFT wins:
✅ No labeled data? If you can verify correctness, RFT works.
✅ <100 labeled examples? RFT generalizes better.
✅ CoT helps? RFT fine-tunes reasoning beyond SFT.

📖 Read the full blog here 👉 https://t.co/7fbCGwwU71

0

9

2

3K

MichaelOrtegaSF retweeted

over 1 year ago

🚀 Supervised Fine-Tuning (SFT) has been the default for adapting LLMs—but it has a major limitation: it demands a large amount of high-quality labeled data. 🧵 Why Reinforcement Fine-Tuning (RFT) is a better approach when data is scarce: 👇

TravisAddair's tweet photo. 🚀 Supervised Fine-Tuning (SFT) has been the default for adapting LLMs—but it has a major limitation: it demands a large amount of high-quality labeled data.

🧵 Why Reinforcement Fine-Tuning (RFT) is a better approach when data is scarce: 👇 https://t.co/54gvGvUR3N

1

9

3

577

MichaelOrtegaSF retweeted

over 1 year ago

Big things come in #small packages! Are you ready for #SmallCon! 🎁 Just 3 weeks away and the speaker list is absolute 🔥 Save your spot for the first #virtual conference focused on how to unlock the full value of small models and build a modern #GenAI stack! And it's free! Check out our amazing speakers: ➡ @LoubnaBenAllal1 Ben Allal, SmolLM Lead, @huggingface ➡ Daniel Hunter, Prev. the Head of AI, @harvey__ai ➡ Manjeet Singh, Sr. Director of AI Platforms, @salesforce ➡ @echojuliett, Cofounder and CPO, @upstageai ➡ @mjmj1oo, Head of Product, @MistralAI ➡ @appliedml42, Sr. Staff ML Eng, @nubank ➡ Diego Guerra Orozco, GenAI Startup Lead, @Meta ➡ @ShreyaR, CEO and Cofounder, @guardrails_ai ➡ Giuseppe Romagnuolo, VP of AI, @convirza ➡ @atinsanyal, CTO and Cofounder, @rungalileo ➡ @devvret_rishi, CEO and Cofounder, @predibase ➡ @maartenvansegb, Head of Applied Science, @gretel_ai ➡ Kasey Roh, Head of US Biz, @upstageai ➡ @grg_arnav, Head of ML Eng Team, @predibase ...and more to come! Save your spot: https://t.co/3lpc3YJhXn

predibase's tweet photo. Big things come in #small packages! Are you ready for #SmallCon! 🎁

Just 3 weeks away and the speaker list is absolute 🔥

Save your spot for the first #virtual conference focused on how to unlock the full value of small models and build a modern #GenAI stack! And it's free!

Check out our amazing speakers:
➡ @LoubnaBenAllal1 Ben Allal, SmolLM Lead, @huggingface
➡ Daniel Hunter, Prev. the Head of AI, @harvey__ai
➡ Manjeet Singh, Sr. Director of AI Platforms, @salesforce
➡ @echojuliett, Cofounder and CPO, @upstageai
➡ @mjmj1oo, Head of Product, @MistralAI
➡ @appliedml42, Sr. Staff ML Eng, @nubank
➡ Diego Guerra Orozco, GenAI Startup Lead, @Meta
➡ @ShreyaR, CEO and Cofounder, @guardrails_ai
➡ Giuseppe Romagnuolo, VP of AI, @convirza
➡ @atinsanyal, CTO and Cofounder, @rungalileo
➡ @devvret_rishi, CEO and Cofounder, @predibase
➡ @maartenvansegb, Head of Applied Science, @gretel_ai
➡ Kasey Roh, Head of US Biz, @upstageai
➡ @grg_arnav, Head of ML Eng Team, @predibase

...and more to come!

Save your spot: https://t.co/3lpc3YJhXn

0

6

5

1

602

MichaelOrtegaSF retweeted

over 1 year ago

⭐ We're excited to announce the launch of #SmallCon: A free virtual conference for #GenAI teams looking to build big with small models! ⭐ We're bringing together leading minds in AI from @Meta, @MistralAI, @Salesforce and more for deep dive tech talks and panel discussions on what it takes to build the #GenAI stack of the future and put your #SLMs into production! Our amazing list of speakers include: ➡ Daniel Hunter, Prev. the Head of AI @ Harvey AI ➡ Margaret Jennings, Head of Product @ Mistral ➡ Manjeet Singh, Sr. Director of AI Platforms @ Salesforce ➡ Abhishek Patnia, St. Staff ML Eng @ Nubank ➡ Diego Guerra Orozco, GenAI Partnership Lead @ Meta ➡ Shreya Rajpal, CEO and Cofounder @ Guardrails AI ➡ Giuseppe Romagnuolo, Head of AI @ Convirza and much more! Check out the site for the full agenda and list of speakers: https://t.co/Wn7siXvb5l Make sure to save your spot! Thank you to our event cohosts @rungalileo, @gretel_ai and @upstageai !

predibase's tweet photo. ⭐ We're excited to announce the launch of #SmallCon: A free virtual conference for #GenAI teams looking to build big with small models! ⭐

We're bringing together leading minds in AI from @Meta, @MistralAI, @Salesforce and more for deep dive tech talks and panel discussions on what it takes to build the #GenAI stack of the future and put your #SLMs into production!

Our amazing list of speakers include:
➡ Daniel Hunter, Prev. the Head of AI @ Harvey AI
➡ Margaret Jennings, Head of Product @ Mistral
➡ Manjeet Singh, Sr. Director of AI Platforms @ Salesforce
➡ Abhishek Patnia, St. Staff ML Eng @ Nubank
➡ Diego Guerra Orozco, GenAI Partnership Lead @ Meta
➡ Shreya Rajpal, CEO and Cofounder @ Guardrails AI
➡ Giuseppe Romagnuolo, Head of AI @ Convirza
and much more!

Check out the site for the full agenda and list of speakers: https://t.co/Wn7siXvb5l

Make sure to save your spot!

Thank you to our event cohosts @rungalileo, @gretel_ai and @upstageai !

1

10

3

4

2K

MichaelOrtegaSF retweeted

Marktechpost AI

@Marktechpost

over 1 year ago

Revolutionizing Fine-Tuned Small Language Model Deployments: Introducing Predibase’s Next-Gen Inference Engine Predibase announces the Predibase Inference Engine, their new infrastructure offering designed to be the best platform for serving fine-tuned small language models (SLMs). The Predibase Inference Engine dramatically improves SLM deployments by making them faster, easily scalable, and more cost-effective for enterprises grappling with the complexities of productionizing AI. Built on Predibase’s innovations–Turbo LoRA and LoRA eXchange (LoRAX)–the Predibase Inference Engine is designed from the ground up to offer a best-in-class experience for serving fine-tuned SLMs. Technical Breakthroughs in the Predibase Inference Engine At the heart of the Predibase Inference Engine are a set of innovative features that collectively enhance the deployment of SLMs: ✅ LoRAX: LoRA eXchange (LoRAX) allows for the serving of hundreds of fine-tuned SLMs from a single GPU. This capability significantly reduces infrastructure costs by minimizing the number of GPUs needed for deployment. It’s particularly beneficial for businesses that need to deploy various specialized models without the overhead of dedicating a GPU to each model. ✅ Turbo LoRA: Turbo LoRA is our parameter-efficient fine-tuning method that accelerates throughput by 2-3 times while rivaling or exceeding GPT-4 in terms of response quality. These throughput improvements greatly reduce inference costs and latency, even for high-volume use cases. ✅ FP8 Quantization: Implementing FP8 quantization can reduce the memory footprint of deploying a fine-tuned SLM by 50%, leading to nearly 2x further improvements in throughput. This optimization not only improves performance but also enhances the cost-efficiency of deployments, allowing for up to 2x more simultaneous requests on the same number of GPUs. ✅ GPU Autoscaling: Predibase SaaS deployments can dynamically adjust GPU resources based on real-time demand. This flexibility ensures that resources are efficiently utilized, reducing waste and cost during periods of fluctuating demand. Read our full article here: https://t.co/ToP9mj9rYC @predibase

Marktechpost's tweet photo. Revolutionizing Fine-Tuned Small Language Model Deployments: Introducing Predibase’s Next-Gen Inference Engine

Predibase announces the Predibase Inference Engine, their new infrastructure offering designed to be the best platform for serving fine-tuned small language models (SLMs). The Predibase Inference Engine dramatically improves SLM deployments by making them faster, easily scalable, and more cost-effective for enterprises grappling with the complexities of productionizing AI. Built on Predibase’s innovations–Turbo LoRA and LoRA eXchange (LoRAX)–the Predibase Inference Engine is designed from the ground up to offer a best-in-class experience for serving fine-tuned SLMs.

Technical Breakthroughs in the Predibase Inference Engine

At the heart of the Predibase Inference Engine are a set of innovative features that collectively enhance the deployment of SLMs:

✅ LoRAX: LoRA eXchange (LoRAX) allows for the serving of hundreds of fine-tuned SLMs from a single GPU. This capability significantly reduces infrastructure costs by minimizing the number of GPUs needed for deployment. It’s particularly beneficial for businesses that need to deploy various specialized models without the overhead of dedicating a GPU to each model.

✅ Turbo LoRA: Turbo LoRA is our parameter-efficient fine-tuning method that accelerates throughput by 2-3 times while rivaling or exceeding GPT-4 in terms of response quality. These throughput improvements greatly reduce inference costs and latency, even for high-volume use cases.

✅ FP8 Quantization: Implementing FP8 quantization can reduce the memory footprint of deploying a fine-tuned SLM by 50%, leading to nearly 2x further improvements in throughput. This optimization not only improves performance but also enhances the cost-efficiency of deployments, allowing for up to 2x more simultaneous requests on the same number of GPUs.

✅ GPU Autoscaling: Predibase SaaS deployments can dynamically adjust GPU resources based on real-time demand. This flexibility ensures that resources are efficiently utilized, reducing waste and cost during periods of fluctuating demand.

Read our full article here: https://t.co/ToP9mj9rYC

@predibase

0

20

4

1

317

MichaelOrtegaSF retweeted

almost 2 years ago

What if you could have your own highly-optimized #LLMs running in your #private cloud without any hassle? 🙋‍♂️ 🙋‍♀️ Well now you can. No more choosing between #performance and #security — have your LLM cake and eat it too! 🍰 😎 Want to learn how? 💡 Save a spot for our webinar to learn how to easily deploy LLMs in your cloud < 30 minutes with Predibase #VPC. We'll even show you how to get those models to outperform #GPT4! 💰 https://t.co/cNQyUCmxiO

predibase's tweet photo. What if you could have your own highly-optimized #LLMs running in your #private cloud without any hassle? 🙋‍♂️ 🙋‍♀️

Well now you can. No more choosing between #performance and #security — have your LLM cake and eat it too! 🍰 😎

Want to learn how? 💡

Save a spot for our webinar to learn how to easily deploy LLMs in your cloud < 30 minutes with Predibase #VPC. We'll even show you how to get those models to outperform #GPT4! 💰

https://t.co/cNQyUCmxiO

1

7

5

1

367

MichaelOrtegaSF retweeted

almost 2 years ago

There's a new "best #SLM" in town! We fine-tuned Llama-3.1-8b-instruct on 25 tasks and it shows a huge improvement over #GPT-4, GPT-4o mini, fine-tuned #Phi-3, and fine-tuned #Mistral-7b. Small language models continue to set the standard for performance, cost, and privacy!

predibase's tweet photo. There's a new "best #SLM" in town!

We fine-tuned Llama-3.1-8b-instruct on 25 tasks and it shows a huge improvement over #GPT-4, GPT-4o mini, fine-tuned #Phi-3, and fine-tuned #Mistral-7b.

Small language models continue to set the standard for performance, cost, and privacy! https://t.co/v7uZXACiA3

5

10

4

3

715

MichaelOrtegaSF retweeted

Travis Addair @TravisAddair

about 2 years ago

We had a blast joining the Founded & Funded podcast with @vivekramaswami at @MadronaVentures to chat about all things LLMs, why small #finetuned models are the future, and what it takes to build a successful high growth startup. Full video: https://t.co/cXr2YnCGHP

1

9

3

2

508

MichaelOrtegaSF retweeted

about 2 years ago

What's the best model for fine-tuning? Can 8B param models really beat GPT-4 when fine-tuned to specific tasks? @predibase we've put together our most complete guide to fine-tuning to date, answering these questions and more: the Fine-Tuning Index: https://t.co/nFPXC66u85

TravisAddair's tweet photo. What's the best model for fine-tuning? Can 8B param models really beat GPT-4 when fine-tuned to specific tasks?

@predibase we've put together our most complete guide to fine-tuning to date, answering these questions and more: the Fine-Tuning Index:

https://t.co/nFPXC66u85 https://t.co/Sddw31Gi1I

6

169

38

191

25K

MichaelOrtegaSF retweeted

about 2 years ago

✨ Introducing the Fine-tuning Index! A comprehensive set of benchmarks for 13 fine-tuned #opensource #LLMs and leading models from #OpenAI across 31 diverse tasks. The index reports essential metrics, including: 📊 Performance ⚡ Speed 💰 Cost https://t.co/hPBL0AdLCq

predibase's tweet photo. ✨ Introducing the Fine-tuning Index! A comprehensive set of benchmarks for 13 fine-tuned #opensource #LLMs and leading models from #OpenAI across 31 diverse tasks. The index reports essential metrics, including:
📊 Performance
⚡ Speed
💰 Cost
https://t.co/hPBL0AdLCq https://t.co/S2zycEw3eD

0

14

9

4

2K

MichaelOrtegaSF retweeted