It’s been a tough few weeks. My 10yo daughter was diagnosed with a very rare, aggressive cancer called interdigitating dendritic cell sarcoma (IDCS). I’m reaching out to identify clinicians/patients who have encountered pediatric IDCS, indeterminate dendritic cell histiocytosis, or other (non-LCH) histiocytic sarcoma cases.
I'm trying to understand non-surgical chemo and targeted therapy options, new pathology markers to better diagnose subtypes/treatments, and any data on progression in pediatric patients. Please feel free to share – I’m trying to cast a wide net due to the rarity of this condition and how little is known.
People can contact me directly at my first name (as written in my profile) at https://t.co/ubo0zQRMn0.
I’m excited to share that I’m working on a new book about building applications with foundation models! AI Engineering builds upon Machine Learning Systems Design, but with a focus on large-scale, ready-made models.
The book covers:
- The new AI stack (e.g. how it differs from traditional ML engineering)
- Different approaches to evaluate open-ended systems
- Dataset engineering
- Prompt engineering, RAG, agents
- Finetuning
- Compute infrastructure, including how to mitigate latency and cost
AI Engineering is scheduled for late 2024. The first 3 chapters are available on the O'Reilly platform: https://t.co/8YrSmUH9qw
I’ve learned a lot during the research and writing process for this book. I hope you’ll find the learnings useful. Feedback is much appreciated!
The newest #tidymodels tune release includes all sorts of goodies—fairness assessment, survival analysis, modernized parallel processing support, percentile intervals for performance metrics, and more. Read more on the #rstats tidyverse blog:
https://t.co/rVJSBLpmX6
I have finally worked through all of @avehtari & Gelman's "Active Statistics". Advertised as a resource mainly for stats instructors, I think it's prob just as useful for students and self-learners. The case studies alone would be a worthwhile text. https://t.co/Enbl2GMhy4
I wrote a tutorial on diffusion models for undergrad and grad students. I tried my best to give intuitive explanations for complicated equations.
Your feedback is much appreciated
Thanks to those who suggested various reading materials to me
https://t.co/fWv111nG5M
Offering a 5-week machine learning course. It covers algorithm development and fundamental concepts. Focus is on genomics datasets. Lectures are in real-time, with discussion board, feedback on homework, and help showcasing your work on GitHub. Apply here: https://t.co/FcYJAGPyKg
We recently moved the Applied Predictive Modeling blog to a new url: https://t.co/pBFWInNRqa
The new url reflects the new book that Kjell Johnson and I are writing. You can see the work in progress at https://t.co/c8TiYaLQPG
#rstats #DataScience #machinelearning
I'm very excited about the new {tidychatmodels} package by @rappa753, which is a tidyverse-style #rstats interface to LLMs like OpenAI, Mistral, and even local models through Ollama
Check it out - it'd be great to get some community momentum around this project!
https://t.co/w9R1w2b5qi
Just uploaded the final solution set for my course for this year. There are nine (like the circles of hell) problem sets and elaborate solution guides available, ranging from intro probability theory to advanced multilevel modeling. Feast your mind freely: https://t.co/O7xFIbqqaH
Every couple years, the #rstats tidymodels team puts out a user survey to help us better prioritize what we'll work on next. The results of this survey led to the {agua}, {stacks}, and {spatialsample} pkgs, among others. Our newest survey is up--take it!
https://t.co/ddB2hhRPt4
I have made my entire "Introduction to ggplot2" tutorial available on my website!
The tutorial covers the basics of ggplot2, geometries, colors, themes, etc.
It includes code and comments, as well as references to Pokémon!
#RStats #ggplot2 #dataviz
https://t.co/8eV3Xjo4GX
Why do Random Forests perform so well off-the-shelf & appear essentially immune to overfitting?!?
I’ve found the text-book answer “it’s just variance reduction 🤷🏼‍♀️” to be a bit too unspecific, so in our new pre-print https://t.co/UXDO9ULnl6, @Jeffaresalan & I investigate.. 🕵🏼‍♀️ 1/n
A new version of the #rstats probably package is on CRAN. A minor update with a bug fix and under-the-hood changes for the upcoming tune version.
But there’s finally a hex logo (thanks to @theotheredgar) so we have that going for us. Which is nice.
https://t.co/gOCqUPDZOK
ggplot2 3.5.0 is on its way to CRAN 🎉🎉🎉
This is a big one and is in large part the work of @TeunvandenBrand. The new features will be spread out over several blog posts, starting with this:
https://t.co/w0zkrUs2Y4
If you want to learn how #LLMs work under the hood or just deepen your understanding, The GenAI Guide by @canyon289 is a great and intuitive resource.
Covers transformers, pre-training, fine-tuning, evaluation and a lot more, all with detailed code.
https://t.co/PQBoyUFwxu