MLflow @mlflow - Twitter Profile

4 days ago

🔹 Full sessions grouped together, so long conversations stay debuggable 🔹 MLflow 3.12 tracing for Claude Code, Codex, Gemini CLI, OpenCode, Qwen Code, and OpenHands 🎥 Full webinar: https://t.co/JRbV5uQ4X0 #MLflow #CodingAgents

0

2

610

MLflow

@MLflow

4 days ago

MLflow 3.12 deep dive clip: why coding agents need tracing 👇 Yuki Watanabe walks through what shows up in the trace when you turn it on: 🔹 Every turn, tool call (Read, Bash, Edit), and sub-agent step 🔹 Token usage and latency per span, including cache breakdown

1

2

1

478

MLflow

@MLflow

4 days ago

Genie fixes from failed evals 👇 🔹 Traces + space config in 🔹 LLM suggests concrete edits 🔹 Shorten signal-to-patch time 📕 Read the cookbook: https://t.co/ZhYV7LpdFX #MLflow #GenAI #Genie

MLflow's tweet photo. Genie fixes from failed evals 👇
🔹 Traces + space config in
🔹 LLM suggests concrete edits
🔹 Shorten signal-to-patch time

📕 Read the cookbook: https://t.co/ZhYV7LpdFX

#MLflow #GenAI #Genie https://t.co/HqY8uqmTZr

0

7

0

5

420

MLflow

@MLflow

5 days ago

MLflow 3.13.0: RBAC + Admin UI for self-hosted servers 👇 🔐 Roles as reusable permission bundles 🖥️ Admin UI (no REST endpoints) 📦 Experiments, models, prompts, scorers, Gateway Release highlights: https://t.co/JNPgjyWXrU #MLflow #MLOps

MLflow's tweet photo. MLflow 3.13.0: RBAC + Admin UI for self-hosted servers 👇

🔐 Roles as reusable permission bundles
🖥️ Admin UI (no REST endpoints)
📦 Experiments, models, prompts, scorers, Gateway

Release highlights: https://t.co/JNPgjyWXrU

#MLflow #MLOps https://t.co/9MpyM9EnHn

2

9

0

2

501

Who to follow

Databricks

@databricks

Databricks is the Data and AI company, helping organizations build and scale data and AI apps, analytics and agents.

Apache Spark

@ApacheSpark

Lightning-fast unified analytics engine

Chip Huyen

@chipro

@aisysbooks @goodailist AI Engineering: https://t.co/94dv4uTU1H Designing MLSys: https://t.co/G81hL2dWmr Reading @chipslib

MLflow

@MLflow

5 days ago

MLflow 3.13.0 is a major update that runs AI observability at scale, focusing on access control, the lifecycle of your trace data, and richer support for agents. 🙌 🔗Check out the highlights of the release: https://t.co/JNPgjyWXrU #mlflow #opensource #linuxfoundation

MLflow's tweet photo. MLflow 3.13.0 is a major update that runs AI observability at scale, focusing on access control, the lifecycle of your trace data, and richer support for agents. 🙌

🔗Check out the highlights of the release: https://t.co/JNPgjyWXrU

#mlflow #opensource #linuxfoundation https://t.co/u2ZqxgI9gB

0

11

4

6

664

MLflow

@MLflow

6 days ago

LLM judges for Genie traces 👇 🔹 Built-in baseline judges 🔹 Custom SQL/semantics checks 🔹 Start on highest-risk traces 📕 Read the cookbook: https://t.co/DGmpxHU5xe #MLflow #Genie

MLflow's tweet photo. LLM judges for Genie traces 👇
🔹 Built-in baseline judges
🔹 Custom SQL/semantics checks
🔹 Start on highest-risk traces

📕 Read the cookbook: https://t.co/DGmpxHU5xe

#MLflow #Genie https://t.co/XcjfOPcq7Z

0

6

0

2

624

MLflow

@MLflow

7 days ago

Thousands of traces, no systematic way to spot bad agent runs. MLflow Automatic Issue Detection 👉 choose CLEARS categories, run analysis in three clicks, triage issues in the UI. 🔗 Learn more: https://t.co/a8gNos0vs7 #MLflow #LLMOps #GenAI

0

5

1

2

325

MLflow

@MLflow

11 days ago

Trace + eval Genie in MLflow 👇 🔹 Full Genie pipeline 🔹 MLflow traces + judges 🔹 Tighten one pilot space first 📕 Read the cookbook: https://t.co/kd9fQxwSof #MLflow #Genie

MLflow's tweet photo. Trace + eval Genie in MLflow 👇
🔹 Full Genie pipeline
🔹 MLflow traces + judges
🔹 Tighten one pilot space first

📕 Read the cookbook: https://t.co/kd9fQxwSof

#MLflow #Genie https://t.co/AtEWZ4s6g7

0

5

0

3

362

MLflow

@MLflow

12 days ago

Vibe-checking works until it doesn't. Change one prompt, break three behaviors—and you can't tell if you moved forward or backward. Eval-driven development in MLflow 👇 1️⃣ Trace — mlflow.openai.autolog() + @mlflow.trace spans (latency, tokens, cost) 2️⃣ Evaluate + prompts — mlflow.genai.evaluate(), make_judge(), Prompt Registry, optimize_prompts (GEPA) 3️⃣ Prod — same judges on live traces; agent dashboards for cost/latency/quality 🔗 Learn more: https://t.co/I4zS7unOrn #MLflow #LLMOps #GenAI

MLflow's tweet photo. Vibe-checking works until it doesn't. Change one prompt, break three behaviors—and you can't tell if you moved forward or backward.

Eval-driven development in MLflow 👇
1️⃣ Trace — mlflow.openai.autolog() + @mlflow.trace spans (latency, tokens, cost)
2️⃣ Evaluate + prompts — mlflow.genai.evaluate(), make_judge(), Prompt Registry, optimize_prompts (GEPA)
3️⃣ Prod — same judges on live traces; agent dashboards for cost/latency/quality

🔗 Learn more: https://t.co/I4zS7unOrn

#MLflow #LLMOps #GenAI

1

7

1

4

503

MLflow

@MLflow

13 days ago

Right answer, wrong trace? MLflow + TruLens Agent GPA scorers read the full span tree 👇 🔹 10 TruLens scorers: 6 Agent GPA + 4 RAG 🔹 95% agent errors on TRAIL vs 55% 🔹 mlflow.genai.evaluate() w/ RAG + Phoenix 🔗 Read more: https://t.co/7jIlkg3VRO #MLflow #TruLens #GenAI

MLflow's tweet photo. Right answer, wrong trace? MLflow + TruLens Agent GPA scorers read the full span tree 👇

🔹 10 TruLens scorers: 6 Agent GPA + 4 RAG
🔹 95% agent errors on TRAIL vs 55%
🔹 mlflow.genai.evaluate() w/ RAG + Phoenix

🔗 Read more: https://t.co/7jIlkg3VRO

#MLflow #TruLens #GenAI https://t.co/KunhuZBfs9

0

3

0

4

358

MLflow

@MLflow

13 days ago

Red-team LLM apps in MLflow 👇 🔹 Adversarial eval inputs 🔹 Safety scorers + guidelines 🔹 Rerun after model/prompt changes 📕 Read the cookbook: https://t.co/VffLiylPJZ #MLflow #GenAI

MLflow's tweet photo. Red-team LLM apps in MLflow 👇
🔹 Adversarial eval inputs
🔹 Safety scorers + guidelines
🔹 Rerun after model/prompt changes

📕 Read the cookbook: https://t.co/VffLiylPJZ

#MLflow #GenAI https://t.co/C1jpPM631v

0

14

1

7

697

MLflow

@MLflow

17 days ago

Claude Code can burn through dozens or hundreds of LLM calls in one session. MLflow 3.12.0+: route it through AI Gateway with two env vars for traces, budget alerts/limits, and guardrails. No SDK changes. 🛣️ Setup: mlflow server → Gateway endpoint → ANTHROPIC_BASE_URL to the claude-code proxy. Run claude as usual. Learn more 👉 https://t.co/2xSoVXuJZ2 #MLflow #AIGateway #ClaudeCode

1

22

5

19

2K

MLflow

@MLflow

17 days ago

RAG eval end-to-end in MLflow 👇 🔹 Trace retrieve + generate 🔹 Built-in retrieval/gen judges 🔹 Localize failure to a stage 📕 Read the cookbook: https://t.co/5JTKfuX3xt #MLflow #RAG

MLflow's tweet photo. RAG eval end-to-end in MLflow 👇
🔹 Trace retrieve + generate
🔹 Built-in retrieval/gen judges
🔹 Localize failure to a stage

📕 Read the cookbook: https://t.co/5JTKfuX3xt

#MLflow #RAG https://t.co/mCOLWerWlo

0

18

4

18

915

MLflow

@MLflow

18 days ago

Catch this session at Data + AI Summit (June 15-18, SF)! 🌟 Agent quality via vibe-checking breaks at scale. 🔁 MLflow self-evolving test harness 🧪 Bad-answer feedback → automated tests ✅ Coding-agent fixes vs. accumulated suite 🎤 Adam Gurary & Yuki Watanabe Session details: https://t.co/8AhnjLPmEP #MLflow #DataAISummit

MLflow's tweet photo. Catch this session at Data + AI Summit (June 15-18, SF)! 🌟

Agent quality via vibe-checking breaks at scale.
🔁 MLflow self-evolving test harness
🧪 Bad-answer feedback → automated tests
✅ Coding-agent fixes vs. accumulated suite

🎤 Adam Gurary & Yuki Watanabe

Session details: https://t.co/8AhnjLPmEP

#MLflow #DataAISummit

0

2

0

1

255

MLflow

@MLflow

18 days ago

Prompt lifecycle in MLflow 👇 🔹 Registry-backed versions 🔹 Eval-gated promotion 🔹 Rollbacks without guesswork 📕 Read the cookbook: https://t.co/omRMtW9UIt #MLflow #GenAI

MLflow's tweet photo. Prompt lifecycle in MLflow 👇
🔹 Registry-backed versions
🔹 Eval-gated promotion
🔹 Rollbacks without guesswork

📕 Read the cookbook: https://t.co/omRMtW9UIt

#MLflow #GenAI https://t.co/n0p98Gs39S

0

4

0

3

276

MLflow

@MLflow

19 days ago

.@OpenHandsDev agents edit files, run commands, and browse the web on their own—but there’s no structured record of what happened or whether the result was good. MLflow connects via @opentelemetry to trace every step, evaluate runs with built-in judges, and route model traffic through AI Gateway for budget and usage control. Learn more 👉 https://t.co/YGQlhIB7yK #MLflow #OpenHands

0

2

1

210

MLflow

@MLflow

19 days ago

@OpenHandsDev agents edit files, run commands, and browse the web on their own—but there’s no structured record of what happened or whether the result was good. MLflow connects via @opentelemetry to trace every step, evaluate runs with built-in judges, and route model traffic through AI Gateway for budget and usage control. Learn more 👉 https://t.co/YGQlhIAzJc #MLflow #OpenHands

MLflow's tweet photo. @OpenHandsDev agents edit files, run commands, and browse the web on their own—but there’s no structured record of what happened or whether the result was good.

MLflow connects via @opentelemetry to trace every step, evaluate runs with built-in judges, and route model traffic through AI Gateway for budget and usage control.

Learn more 👉 https://t.co/YGQlhIAzJc

#MLflow #OpenHands

0

6

0

1

128

MLflow retweeted

MLflow

@MLflow

20 days ago

New on the MLflow channel: evaluate a RAG agent end-to-end with Joana Mesquita, MLflow Ambassador 👇 📌 Prompt Registry + production aliases 🔍 Traces with SME ground truth ⚖️ Ragas, Phoenix + custom LLM judge Watch now: https://t.co/ryZbnFHONj Blog: https://t.co/HIMMCatGcv #MLflow #RAG

MLflow's tweet photo. New on the MLflow channel: evaluate a RAG agent end-to-end with Joana Mesquita, MLflow Ambassador 👇

📌 Prompt Registry + production aliases
🔍 Traces with SME ground truth
⚖️ Ragas, Phoenix + custom LLM judge

Watch now: https://t.co/ryZbnFHONj
Blog: https://t.co/HIMMCatGcv

#MLflow #RAG

1

12

1

4

537

MLflow

@MLflow

20 days ago

New on the MLflow channel: evaluate a RAG agent end-to-end with Joana Mesquita, MLflow Ambassador 👇 📌 Prompt Registry + production aliases 🔍 Traces with SME ground truth ⚖️ Ragas, Phoenix + custom LLM judge Watch now: https://t.co/ryZbnFHONj Blog: https://t.co/HIMMCatGcv #MLflow #RAG

1

12

1

4

537

MLflow

@MLflow

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users