fab2s @flodl_dev - Twitter Profile

about 1 month ago

A declarative graph DSL for neural networks. Tag streams, build residuals, compose trained subgraphs as frozen Modules in larger architectures. Every level addressable by name. "The structure IS the text." Full post: https://t.co/RbWAkaLLPq

0

21

fab2s @flodl_dev

about 1 month ago

Phase one was loading. Phase two was fine-tuning, now plumbed end-to-end. Phase three is the demo on the hardware people actually own. Next: ModernBERT, LLaMA, LoRA, ViT, plus a flagship El Che fine-tune benchmark on heterogeneous consumer GPUs.

0

36

fab2s @flodl_dev

about 1 month ago

Three new families: ALBERT (factorised embeddings, cross-layer sharing) XLM-RoBERTa (multilingual SentencePiece, ~250k vocab) DeBERTa-v2/v3 (disentangled attention, mask-gated embeddings) MLM heads across all six families. fill_mask one-call for any checkpoint.

0

68

fab2s @flodl_dev

about 1 month ago

Universal Trainer for transparent fine-tuning. You write one closure (forward + loss); the framework owns the loop, backward, optimizer step, gradient sync. Same code on CPU, single GPU, or heterogeneous multi-GPU. El Che cadence auto-tunes the slow card.

0

26

fab2s @flodl_dev

about 1 month ago

Round-trip export is the flagship. fdl flodl-hf export --hub <repo> --out staged/ fdl flodl-hf verify-export staged/ Re-emits any flodl-hf checkpoint as an HF-canonical dir that loads back into HF Python's AutoModelFor* with bit-exact agreement on every head output.

0

21

fab2s @flodl_dev

about 1 month ago

🧵 flodl 0.5.3: HuggingFace, both ways. 30 head cells, bit-exact round-trip with HF Python (max abs diff = 0). 6 BERT-family architectures × 5 head shapes. In Rust. https://t.co/YsttCzWytF #rustlang #huggingface #deeplearning

flodl_dev's tweet photo. 🧵 flodl 0.5.3: HuggingFace, both ways.
30 head cells, bit-exact round-trip with HF Python (max abs diff = 0).
6 BERT-family architectures × 5 head shapes. In Rust.
https://t.co/YsttCzWytF
#rustlang #huggingface #deeplearning https://t.co/BXYB6zLhm1

0

1

0

36

fab2s @flodl_dev

about 1 month ago

flodl is two months old. Tensor + autograd, full nn parity, declarative graph DSL, transparent multi-GPU on heterogeneous hardware. The line that's stayed true: "With flodl I don't rewrite when I pivot. I add or remove a graph member." Full post: https://t.co/f81f5w7zJp

0

40

fab2s @flodl_dev

about 1 month ago

The bet: libtorch FFI through a thin C++ shim. Not pure Rust. Inherits libtorch's footprint. In exchange: CUDA parity today. NCCL today. Tensor Cores today. Mixed precision, CUDA Graphs, fused optimizers. Not in six months. Now.

0

32

fab2s @flodl_dev

about 1 month ago

Two false starts. Go first. The project was called goDL. GC and GPU memory don't compose. You end up with tensors the GC thinks are dead but the GPU is still using, or the inverse. Rust was the answer. Ownership instead of GC.

0

31

fab2s @flodl_dev

about 1 month ago

The shape: FBRL (Feedback Recursive Loops). Letters read by attention, classified, reproduced. Words compose frozen letters. Lines compose words. Each level frozen as an oracle for the next. Nested. Partially-frozen. Graph-shaped. Python did not enjoy this.

0

27

fab2s @flodl_dev

about 1 month ago

I'd never trained a deep learning model in my life. Then I built one in Python, pivoted three times, and watched the script bloat with freezing and recomposition boilerplate. So I wrote my own framework. In Rust. 🧵

flodl_dev's tweet photo. I'd never trained a deep learning model in my life.
Then I built one in Python, pivoted three times, and watched the script bloat with freezing and recomposition boilerplate.
So I wrote my own framework. In Rust.
🧵 https://t.co/hZAP1yRz5A

0

25

fab2s @flodl_dev

about 1 month ago

Write-up: https://t.co/u8DfN4Yl0B Tutorial: https://t.co/Y9bTGWB2Dk Crate: https://t.co/5pmkpY8HEz Repo: https://t.co/6FoabgCx1F Feedback and PRs welcome.

0

3

2

1

253

fab2s @flodl_dev

about 1 month ago

Foundation for the fine-tuning arc: these three families + AutoModel are the gateway to fine-tuning published checkpoints on heterogeneous consumer GPUs with ElChe. Next on the roadmap: ModernBERT, LLaMA, LoRA, ViT. Then the fine-tuning loop itself.

0

48

fab2s @flodl_dev

about 1 month ago

One command scaffolds a playground inside any flodl project: fdl add flodl-hf cd flodl-hf fdl classify Drops a side crate pinned to your flodl version with an AutoModel example ready to run. `fdl init my-model --with-hf` for fresh projects.

0

21

fab2s @flodl_dev

about 1 month ago

Numerical parity against the HuggingFace Python reference: 9 pinned checkpoints, max_abs_diff ≤ 1e-5 across all three families and three task heads. Observed on reference: bert-base-uncased pooler 9.835e-7, DistilBERT SeqCls 2.384e-7. Reproducible via `fdl test-live`.

0

19

fab2s @flodl_dev

about 1 month ago

One line, any family: AutoModelForSequenceClassification::from_pretrained(repo_id) inspects config.json's model_type and dispatches BERT/RoBERTa/DistilBERT automatically. Same three-line caller for bert-base-uncased, roberta-base, distilbert-base-uncased, or any fine-tune.

0

20

fab2s @flodl_dev

about 1 month ago

BERT in flodl is now `from_pretrained("bert-base-uncased")?` 🧵 flodl 0.5.2 ships flodl-hf: BERT, RoBERTa, DistilBERT + three task heads each (seqcls, NER, QA), AutoModel dispatch, PyTorch parity at max_abs_diff < 1e-5.

flodl_dev's tweet photo. BERT in flodl is now `from_pretrained("bert-base-uncased")?` 🧵

flodl 0.5.2 ships flodl-hf: BERT, RoBERTa, DistilBERT + three task heads each (seqcls, NER, QA), AutoModel dispatch, PyTorch parity at max_abs_diff < 1e-5. https://t.co/hH9BzPxaRT

0

15

fab2s @flodl_dev

about 2 months ago

Full changelog: https://t.co/ItlXC8pDgc Setup needs Docker + a matching libtorch variant; `fdl setup` walks through GPU detection and image builds. Rust + DL folks, feedback welcome, especially on `fdl bench` and the DDP surface.

0

20

fab2s @flodl_dev

about 2 months ago

`fdl init my-project` now asks: Docker, or native? Docker gets host-mounted or baked libtorch. Native skips the Dockerfiles, cargo builds on the host. Three self-consistent scaffolds, one interactive pick. The scaffolded Makefile is gone.

0

16

fab2s @flodl_dev

about 2 months ago

Rust binaries using `#[derive(FdlArgs)]` have always exposed `--fdl-schema` so `fdl` help + completion + validation come from one struct. 0.5.1 extends it to scripts. `benchmarks/run.sh` emits the same JSON via a heredoc at the top. `fdl` auto-probes and caches it.

0

11

fab2s

@flodl_dev

Last Seen Users on Sotwe

Trends for you

Most Popular Users