Top Tweets for #DataSelection
๐ขExcited to present our work at #NeurIPS2025 in San Diego!
๐T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning
#SFT #DataSelection #EfficientPostTraining
โฐ Wednesday, Dec 3, 4:30โ7:30 PM PST
โก๏ธ Exhibit Hall C/D/E, #200

๐Two papers from our group got accepted to #NeurIPS2025 Congratulations to @FaisalHamman @yanjun_fu @pas4n More details soon!! #Efficiency #DataSelection #InstructionTuning #CounterfactualExplanations @NeurIPSConf

๐๐๐ Ruth @qphong @_Hu_Wenyang @ZCODE0
@ray_qiaorui @JingtanW @PangWeiKoh et al.
Accepted @icmlconf #ICML2025: #DataValuation #DataCentricAI #DataSelection #LLM #LLMs #BayesianOptimization #FederatedLearning

With compute scaling, is data selection now a bigger focus in pretraining/fine-tuning?๐
Or is the community more focused on data composition (e.g., balancing sources), synthetic/distilled data, or curation techniques?๐ง
๐ง ๐ #LLM #DataSelection #AI #GPT #DeepSeek
Visit us @icmlconf #ICML2024 Poster Session 6, 25th July 1:30PM Hall C 4-9 #709.
Code: https://t.co/FLKt7cw0wB
Paper: https://t.co/vq736f1ddV
#Interpretablility #DataAttribution #DataCentricAI #LLMs #DataSelection #LLM #ShapleyValue (n/n)
To boost scalability, we devise FreeShap, a fine-tuning-free approximation of #ShapleyValue which amortizes the fine-tuning cost using kernel regression on the precomputed #NeuralTangentKernel. We demonstrate FreeShap on #DataSelection #DataDeletion & wrong label detection. (2/n)
The @icmlconf #ICML2024 work of @RachaelSim2 @YvonneFan12 @snoidetx et al. presents DADS to select data for model training while anticipating #DataDeletions.
#DataSelection #ActiveLearning #DataValuation
https://t.co/M1XhwZb3IN
Poster Session 4 Wed 24 Jul 1:30PM Hall C 4-9 #2306
๐ข๐ทNext Tuesday (May 14th) at 5:30 pm, we'll host Danqi Chen from Princeton University (@Princeton):
"Data Selection for Pre-training and Instruction-tuning of LLMs"
zoom info: [email protected] or just DM! #kuisaitalks #LLMs #dataselection

Data selection involves exploring the entire dataset, identifying key features, and minimizing biases.
Learn how to choose wisely, train smartly, and continuously monitor your models to achieve production-grade results.
https://t.co/mTT3LL79De
#dataselection
Learn how to choose wisely, train smartly, and continuously monitor your models to achieve production-grade results.
https://t.co/mTT3LL79De
#dataselection #CV #datasets
How #Data #processing is influential for ML and AI #Algorithms? Read here - https://t.co/U1K2C79bMF & know about the essential steps involved in data processing and its importance on the dominating #technologies like #AI and #ML. #Datastorage #DataSelection #DataProcessing

Different Ways of Selecting Data inside Pandas: https://t.co/QUBdhfFsl7 #Pandas #PandasDataframe #DataSelection #PandasIlocMethod #PandasTutorial
How #Data #processing is influential for ML and AI #Algorithms? Read here - https://t.co/6tqaZVdvnH & know about the essential steps involved in data processing and its importance on the dominating #technologies like #AI and #ML. #Datastorage #DataSelection #DataProcessing

How #Data #processing is influential for ML and AI #Algorithms? Read here - https://t.co/U1K2C6R2yx & know about the essential steps involved in data processing and its importance on the dominating #technologies like #AI and #ML. #Datastorage #DataSelection #DataProcessing

World class #DataAnnotation is not enough to keep your data-driven development & validation running efficiently. With @dSPACEglobal and @INTEMPORA family we can provide #DataLogging, #DataSelection and scenario-based testing. Leave us a note at https://t.co/YtFQr7f9pR

How #Data #processing is influential for ML and AI #Algorithms? Read here - https://t.co/6tqaZVv6ff & know about the essential steps involved in data processing and its importance on the dominating #technologies like #AI and #ML. #Datastorage #DataSelection #DataProcessing

๐ Data Mining Fundamentals: With this tutorial learn about the data preprocessing technique of sampling for data selection.
https://t.co/nB1KkxiyBP
#DataScience #DataMining #DataSelection #DSDojo
How #Data #processing is influential for ML and AI #Algorithms? Read here - https://t.co/6tqaZVdvnH & know about the essential steps involved in data processing and its importance on the dominating #technologies like #AI and #ML. #Datastorage #DataSelection #DataProcessing

How #Data #processing is influential for ML and AI #Algorithms? Read here - https://t.co/6tqaZVdvnH & know about the essential steps involved in #dataprocessing and its importance on the dominating #technologies like #AI and #ML. #Datastorage #DataSelection #DataProcessing

Practical selection of representative sets of RNA-seq samples using a hierarchical approach. #RNAseq #DataSelection https://t.co/Cq881k1ZoY
Most Popular Users

Elon Musk 
@elonmusk
240.2M followers

Barack Obama 
@barackobama
119.3M followers

Donald J. Trump 
@realdonaldtrump
111.6M followers

Cristiano Ronaldo 
@cristiano
108.9M followers

Narendra Modi 
@narendramodi
107M followers

Rihanna 
@rihanna
97.3M followers

NASA 
@nasa
92.1M followers

Justin Bieber 
@justinbieber
90.6M followers

KATY PERRY 
@katyperry
86.8M followers

Taylor Swift 
@taylorswift13
80.6M followers

Lady Gaga 
@ladygaga
72.2M followers

Kim Kardashian 
@kimkardashian
69.4M followers

YouTube 
@youtube
68.6M followers

Virat Kohli 
@imvkohli
68.5M followers

Bill Gates 
@billgates
63.4M followers

The Ellen Show
@theellenshow
62.5M followers

CNN 
@cnn
61.9M followers

Neymar Jr 
@neymarjr
61.1M followers

X 
@x
60.9M followers

Selena Gomez 
@selenagomez
59.9M followers










