ml eng

15 days ago

Super Study Guide: Transformers & Large Language Models: https://t.co/DY9teNQh5b by @afshinea & @shervinea 🌟 Beautifully presented, excellent content, timely, thorough, educational 🌟 #LLMs #MachineLearning #AI #GenAI #DataScience #DataScientist #GenerativeAI 🌟 𝒪𝒱𝐸𝑅��𝐼𝐸𝒲: 250 pages, ~600 intuitive & colored illustrations, practical examples to deeply understand concepts related to Transformers & Large Language Models

KirkDBorne's tweet photo. Super Study Guide: Transformers & Large Language Models: https://t.co/DY9teNQh5b by @afshinea & @shervinea
🌟
Beautifully presented, excellent content, timely, thorough, educational
🌟
#LLMs #MachineLearning #AI #GenAI #DataScience #DataScientist #GenerativeAI
🌟
𝒪𝒱𝐸𝑅𝒱𝐼𝐸𝒲: 250 pages, ~600 intuitive & colored illustrations, practical examples to deeply understand concepts related to Transformers & Large Language Models

mleng12 retweeted

Towards Data Science

@TDataScience

20 days ago

Writing at the intersection of math, data science, and operations research, Berend Markhorst presents an accessible and comprehensive guide to Benders' decomposition. https://t.co/BgXdHoJz11

mleng12 retweeted

Data Integration at Wells Fargo. #Tech #Gaming #Hiking #Tennis

19 days ago

Mathematical Foundations of Quantum Computing: https://t.co/wC7l2wkcX6

244

144

Who to follow

mleng12 retweeted

19 days ago

“Mastering NLP From Foundations to Agents” by Lior Gazit and Meysam Ghaffari, from @PacktPublishing @PacktDataML 👉➡️ https://t.co/fnOUOFRuyu ⬅️👈 Learn this: •Engineer NLP systems from ML foundations to LLM architectures •Implement RAG pipelines, routing layers, and agent workflows •Fine-tune and align LLMs using LoRA, RLHF, and DPO methods •Design production-grade AI systems with governance and safety

KirkDBorne's tweet photo. “Mastering NLP From Foundations to Agents” by Lior Gazit and Meysam Ghaffari, from @PacktPublishing @PacktDataML

👉➡️ https://t.co/fnOUOFRuyu ⬅️👈

Learn this:
•Engineer NLP systems from ML foundations to LLM architectures
•Implement RAG pipelines, routing layers, and agent workflows
•Fine-tune and align LLMs using LoRA, RLHF, and DPO methods
•Design production-grade AI systems with governance and safety

mleng12 retweeted

about 1 month ago

Download 242-page PDF “Introduction to Neural Networks” ➡️ https://t.co/0nNNi7ezRu ————— #DataScience #AI #Algorithms #ML #MachineLearning #DeepLearning #Mathematics #Calculus #DataScientist

KirkDBorne's tweet photo. Download 242-page PDF “Introduction to Neural Networks” ➡️ https://t.co/0nNNi7ezRu
—————
#DataScience #AI #Algorithms #ML #MachineLearning #DeepLearning #Mathematics #Calculus #DataScientist https://t.co/0H836gF1fh

310

236

13K

mleng12 retweeted

20 days ago

How CopilotKit Is Redefining the Agentic AI Stack in 2026 For years, AI inside software meant a chat widget bolted onto the corner of an application. You typed, the model responded with text, and you manually translated that output into whatever you actually needed it to do. It was useful the way a calculator is useful: functional, but fundamentally passive. CopilotKit, a Seattle-based startup co-founded by Atai Barkai and Uli Barkai, has spent the last two years arguing that the model is broken — and in 2026, the developer community is agreeing loudly. - AG-UI completes the agentic protocol stack by handling the agent-to-UI interaction layer that MCP and A2A leave unaddressed, with first-party SDKs across LangGraph, CrewAI, Mastra, Agno, and Pydantic AI, and community SDKs now live for Go, Kotlin, Dart, Java, Rust, Ruby, and C++. - AIMock ships one zero-dependency mock server for the entire agentic call chain — 11 LLM providers, MCP, A2A, vector DBs, search — with record-and-replay, daily drift detection, and chaos testing built in. - Pathfinder is a self-hosted MCP knowledge server that indexes docs, code, Notion pages, Slack, and Discord into hybrid vector-keyword search, with pluggable embeddings that need no external API key. - The three tools together target the three production blockers — knowledge retrieval, testing reliability, and runtime persistence — that demo-quality agents consistently fail to address. - CopilotKit's vendor-neutral, self-hostable design means teams can adopt any single layer without being locked into a proprietary runtime or forced to rebuild their existing stack. Full analysis: https://t.co/eOxovDdjtW GitHub repo: https://t.co/YDv9rhIu4T @CopilotKit #ai #aiagent #agenticai

Marktechpost's tweet photo. How CopilotKit Is Redefining the Agentic AI Stack in 2026

For years, AI inside software meant a chat widget bolted onto the corner of an application. You typed, the model responded with text, and you manually translated that output into whatever you actually needed it to do. It was useful the way a calculator is useful: functional, but fundamentally passive. CopilotKit, a Seattle-based startup co-founded by Atai Barkai and Uli Barkai, has spent the last two years arguing that the model is broken — and in 2026, the developer community is agreeing loudly.

- AG-UI completes the agentic protocol stack by handling the agent-to-UI interaction layer that MCP and A2A leave unaddressed, with first-party SDKs across LangGraph, CrewAI, Mastra, Agno, and Pydantic AI, and community SDKs now live for Go, Kotlin, Dart, Java, Rust, Ruby, and C++.

- AIMock ships one zero-dependency mock server for the entire agentic call chain — 11 LLM providers, MCP, A2A, vector DBs, search — with record-and-replay, daily drift detection, and chaos testing built in.

- Pathfinder is a self-hosted MCP knowledge server that indexes docs, code, Notion pages, Slack, and Discord into hybrid vector-keyword search, with pluggable embeddings that need no external API key.

- The three tools together target the three production blockers — knowledge retrieval, testing reliability, and runtime persistence — that demo-quality agents consistently fail to address.

- CopilotKit's vendor-neutral, self-hostable design means teams can adopt any single layer without being locked into a proprietary runtime or forced to rebuild their existing stack.

Full analysis: https://t.co/eOxovDdjtW

GitHub repo: https://t.co/YDv9rhIu4T

@CopilotKit #ai #aiagent #agenticai

863

mleng12 retweeted

22 days ago

50 ML projects to understand LLMs — Investigate transformer mechanisms through data analysis, visualization, and experimentation: https://t.co/J57FI0VsHs via @PacktPublishing @PacktDataML ————— #AI #GenAI #MachineLearning #DataScientist #DataScience

KirkDBorne's tweet photo. 50 ML projects to understand LLMs — Investigate transformer mechanisms through data analysis, visualization, and experimentation: https://t.co/J57FI0VsHs via @PacktPublishing @PacktDataML
—————
#AI #GenAI #MachineLearning #DataScientist #DataScience https://t.co/zXdOf1cs7C

203

159

mleng12 retweeted

20 days ago

Most agent frameworks today are stitching together reasoning models with external orchestration layers. Qwen3.7-Max takes a different position — train the agent capability into the model itself. Alibaba just introduced Qwen3.7-Max Here's what's actually interesting: → 1M-token context window — up from 256K on Qwen3.6 Max Preview → Extended-thinking mode with visible chain-of-thought reasoning trace → 1,000+ tool calls executed autonomously in an internal kernel optimization test → 35 hours of sustained autonomous execution on a single complex task → 56.6 on the Artificial Analysis Intelligence Index — #5 overall, ahead of Gemini 3.5 Flash → #13 in Text Arena (1,475 Elo), #7 in Math, #9 in Expert Prompts Full analysis: https://t.co/qSLp3fta9c Other technical details ⤵ @Alibaba_Qwen

556

mleng12 retweeted

MIT CSAIL

@MIT_CSAIL

22 days ago

MIT researchers developed “Insum,” a technique for speeding up computations on datasets replete w/zeros. It rewrites Einstein summation (“einsum”) operations to avoid inefficient handling of zeros, improving memory efficiency & performance: https://t.co/JXLLk3yuLe

MIT_CSAIL's tweet photo. MIT researchers developed “Insum,” a technique for speeding up computations on datasets replete w/zeros.

It rewrites Einstein summation (“einsum”) operations to avoid inefficient handling of zeros, improving memory efficiency & performance: https://t.co/JXLLk3yuLe https://t.co/EC1awQ1XMZ

128

14K

mleng12 retweeted

21 days ago

Most vector search libraries make you train a codebook before indexing anything. That's not a search tool — it's a data dependency. turbovec just removed it entirely. It's a Rust-built vector index with Python bindings, built on Google Research's TurboQuant algorithm — a data-oblivious quantizer that requires zero training and zero data passes. Here's what's actually interesting: → 10 million documents: 31 GB as float32, 4 GB with turbovec — 16x compression at 2-bit → Beats FAISS IndexPQFastScan by 12–20% on ARM across every configuration → On x86, wins every 4-bit config by 1–6% against FAISS → Zero codebook training — add vectors, they're indexed immediately → Fully local, no data egress — drop-in for LangChain, LlamaIndex, and Haystack The core idea: after applying a random rotation, every coordinate follows a known Beta distribution — regardless of input data. That makes the quantization boundaries computable from math alone, not from your dataset. Full analysis with Guide: https://t.co/RcUvsavLvi Repo: https://t.co/dmcGErIfbT #ai #python #aiinfrastructure #data #ml

Marktechpost's tweet photo. Most vector search libraries make you train a codebook before indexing anything.

That's not a search tool — it's a data dependency. turbovec just removed it entirely.

It's a Rust-built vector index with Python bindings, built on Google Research's TurboQuant algorithm — a data-oblivious quantizer that requires zero training and zero data passes.

Here's what's actually interesting:

→ 10 million documents: 31 GB as float32, 4 GB with turbovec — 16x compression at 2-bit
→ Beats FAISS IndexPQFastScan by 12–20% on ARM across every configuration
→ On x86, wins every 4-bit config by 1–6% against FAISS
→ Zero codebook training — add vectors, they're indexed immediately
→ Fully local, no data egress — drop-in for LangChain, LlamaIndex, and Haystack

The core idea: after applying a random rotation, every coordinate follows a known Beta distribution — regardless of input data. That makes the quantization boundaries computable from math alone, not from your dataset.

Full analysis with Guide: https://t.co/RcUvsavLvi

Repo: https://t.co/dmcGErIfbT

#ai #python #aiinfrastructure #data #ml

513

mleng12 retweeted

22 days ago

#1 best-seller in AI on Amazon... "Agentic Coding with Claude Code: The everyday developer's guide to agentic coding with Claude Code" 𝗚𝗲𝘁 𝗶𝘁 𝗵𝗲𝗿𝗲: https://t.co/jKP3cE9HgV v/ @PacktDataML 𝗪𝗵𝗮𝘁 𝘆𝗼𝘂 𝘄𝗶𝗹𝗹 𝗹𝗲𝗮𝗿𝗻: ❇️Design agentic coding workflows in the terminal and IDE using Claude Code 🔷Build custom automations with reusable slash commands and hooks ❇️Use Claude Code with a Next.js project to implement AI-driven workflows 🔷Create persistent AI memory using Claude Code memory files ❇️Apply MCP for structured context sharing across tools and agents 🔷Design multi-agent systems using subagents and orchestration patterns ❇️Enforce coding standards using project documentation and context control 🔷Scale AI pair programming while keeping code maintainable

KirkDBorne's tweet photo. #1 best-seller in AI on Amazon...

"Agentic Coding with Claude Code: The everyday developer's guide to agentic coding with Claude Code"

𝗚𝗲𝘁 𝗶𝘁 𝗵𝗲𝗿𝗲: https://t.co/jKP3cE9HgV v/ @PacktDataML

𝗪𝗵𝗮𝘁 𝘆𝗼𝘂 𝘄𝗶𝗹𝗹 𝗹𝗲𝗮𝗿𝗻:
❇️Design agentic coding workflows in the terminal and IDE using Claude Code
🔷Build custom automations with reusable slash commands and hooks
❇️Use Claude Code with a Next.js project to implement AI-driven workflows
🔷Create persistent AI memory using Claude Code memory files
❇️Apply MCP for structured context sharing across tools and agents
🔷Design multi-agent systems using subagents and orchestration patterns
❇️Enforce coding standards using project documentation and context control
🔷Scale AI pair programming while keeping code maintainable

mleng12 retweeted

22 days ago

Most LLM inference optimization forces a choice: fast drafting with a weak auxiliary model, or accurate generation with full Standard autoregressive (AR) decoding. NVIDIA Researchers just built a third option into the weights themselves. They released Nemotron-Labs-Diffusion — a 3B/8B/14B model family trained on a joint Autoregressive AR-diffusion objective that supports three decoding modes from one checkpoint: standard AR, parallel diffusion decoding, and self-speculation, where the same model drafts and verifies without any auxiliary head. Here's what's actually interesting: → Self-speculation achieves 5.99× tokens per forward over Qwen3-8B with comparable accuracy on a 10-task benchmark → Average acceptance length: 6.82 (with LoRA) vs. 2.75 for Eagle3 and 4.24 for Qwen3-9B-MTP — same draft length of 31 → AR and diffusion objectives peak at the same loss coefficient (α=0.3) and improve together — they don't compete for model capacity → Speed-of-light analysis shows a theoretical ceiling of 7.60× TPF at block length 32; current confidence-based sampling realizes only ~3×, leaving headroom for better samplers Full analysis: https://t.co/tJdGfHjCFr Paper: https://t.co/LdEz01hEQt Model weights: https://t.co/eP2MJs1GT8 Technical details: https://t.co/TQ84fmKFP5 @PavloMolchanov @NVIDIAAI @nvidia @YongganFu @xieenze_jr @MardaniMorteza @songhan_mit @jankautz

104K

mleng12 retweeted

22 days ago

Most translation models are audio pipelines with a TTS layer bolted on at the end. That's not simultaneous interpretation and Alibaba's Qwen team just built a clear technical case for the difference. They released Qwen3.5-LiveTranslate-Flash: a real-time multimodal translation model that processes audio and video frames simultaneously, clones the original speaker's voice in the output, and covers 60 input languages at 2.8 seconds of latency. No turn-detection. No generic synthesis voice replacing the speaker. Here's what's actually interesting: → Vision-enhanced comprehension reads lip movements, gestures, and on-screen text alongside audio — robust in noisy or degraded audio environments → Semantic unit prediction via "reading units" processing commits to output segments mid-sentence, enabling continuous streaming without waiting for full utterances → Real-time voice cloning replicates the original speaker's voice profile from a single spoken sentence → Dynamic keyword configuration lets you inject domain-specific glossaries at runtime — brand names, medical terms, legal vocabulary → FLEURS and CoVoST2 benchmarks: outperforms major commercial alternatives across multilingual speech translation tasks Full analysis: https://t.co/gVorchcSuU Technical details: https://t.co/R3QQurGlB9 @Alibaba_Qwen #tts #audioai #voiceai #ai @Ali_TongyiLab

386

mleng12 retweeted

23 days ago

Deep Learning with C++ — Design and deploy neural networks using CUDA for high-performance AI in C++ Get the book at https://t.co/RzMRhYihTE from @PacktPublishing @PacktDataML

KirkDBorne's tweet photo. Deep Learning with C++ — Design and deploy neural networks using CUDA for high-performance AI in C++

Get the book at https://t.co/RzMRhYihTE from @PacktPublishing @PacktDataML https://t.co/HngfHiHCZq

286

189

mleng12 retweeted

25 days ago

We at Marktechpost been building a GitHub repository of 300+ hands-on Jupyter notebooks covering the tools, models, and frameworks that actually matter for AI Agents and Agentic AI Here's what's inside: → LLM fine-tuning, RAG pipelines, and agentic workflows — end to end → Notebooks for open-source models: LLaMA, Mistral, Qwen, Gemma, and more → Covers LangChain, LlamaIndex, HuggingFace, vLLM, and the full modern stack → Every notebook is runnable — Google Colab links included → Updated continuously as new models and frameworks drop The goal was simple: if you read about something on Marktechpost, you should be able to run it the same day. 300+ notebooks. Zero paywalls. https://t.co/B8Z6nRou83

447K

mleng12 retweeted

24 days ago

3rd Edition! "Python Feature Engineering Cookbook", complete guidebook with recipes for crafting powerful features for #MachineLearning models: https://t.co/Yhzc8zV4mg by @Soledad_Galli ————— #AI #ML #DataLiteracy #DataScience #DataScientist

KirkDBorne's tweet photo. 3rd Edition! "Python Feature Engineering Cookbook", complete guidebook with recipes for crafting powerful features for #MachineLearning models: https://t.co/Yhzc8zV4mg by @Soledad_Galli
—————
#AI #ML #DataLiteracy #DataScience #DataScientist https://t.co/AGKaGwdBxR

mleng12 retweeted

23 days ago

Most "privacy-preserving" AI memory just masks sensitive values with ***. That breaks the task. The cloud can't draft your doctor's email if the blood pressure reading is gone. MemTensor just proposed a different approach — and it actually holds up under benchmarking. They introduced MemPrivacy, a framework that runs a lightweight on-device model to detect private spans, replaces them with semantically typed placeholders like <Health_Info_1> before anything leaves the device, and restores the original values locally after the cloud responds. The cloud reasons on structure. It never sees the actual data. Here's what's actually interesting: → Four-level privacy taxonomy (PL1–PL4) from general preferences to immediately exploitable credentials — user-configurable per session → MemPrivacy-4B-RL hits 85.97% F1 on MemPrivacy-Bench vs. 78.41% for Gemini-3.1-Pro and 68.99% for GPT-5.2 on privacy span extraction → Utility loss across LangMem, Mem0, and Memobase stays within 1.6% at PL2–PL4 protection — irreversible masking causes drops up to 41.87% → Models run at 0.6B, 1.7B, and 4B parameters with sub-2-second per-message latency on-device The core insight: privacy protection and semantic utility don't have to trade off — if you replace values with typed structure instead of blank masks. Full analysis: https://t.co/Zn62GYvv7G Paper: https://t.co/Y2NEV8Mam6 Model Weights: https://t.co/VC3Rn6Iap7 @ModelScope2022 #ai #data #privacy #model #llm

Marktechpost's tweet photo. Most "privacy-preserving" AI memory just masks sensitive values with ***. That breaks the task. The cloud can't draft your doctor's email if the blood pressure reading is gone.

MemTensor just proposed a different approach — and it actually holds up under benchmarking.

They introduced MemPrivacy, a framework that runs a lightweight on-device model to detect private spans, replaces them with semantically typed placeholders like <Health_Info_1> before anything leaves the device, and restores the original values locally after the cloud responds. The cloud reasons on structure. It never sees the actual data.

Here's what's actually interesting:

→ Four-level privacy taxonomy (PL1–PL4) from general preferences to immediately exploitable credentials — user-configurable per session

→ MemPrivacy-4B-RL hits 85.97% F1 on MemPrivacy-Bench vs. 78.41% for Gemini-3.1-Pro and 68.99% for GPT-5.2 on privacy span extraction

→ Utility loss across LangMem, Mem0, and Memobase stays within 1.6% at PL2–PL4 protection — irreversible masking causes drops up to 41.87%

→ Models run at 0.6B, 1.7B, and 4B parameters with sub-2-second per-message latency on-device

The core insight: privacy protection and semantic utility don't have to trade off — if you replace values with typed structure instead of blank masks.

Full analysis: https://t.co/Zn62GYvv7G

Paper: https://t.co/Y2NEV8Mam6

Model Weights: https://t.co/VC3Rn6Iap7

@ModelScope2022 #ai #data #privacy #model #llm

341

mleng12 retweeted

25 days ago

🌟🌟🌟🌟🌟 Machine Learning and Artificial Intelligence: Concepts, Algorithms and Models Available at https://t.co/cVQvIZEBSV 🌟🌟🌟🌟🌟

mleng12 retweeted