Top Tweets for #DataContamination
Huge thanks to all authors: @MiZawalski, @mellem_boo, @balazyklaudia, @besanushi, Pablo Ribalta, and all other people involved!
Read the full paper here: https://t.co/AdfMa11m1E
#LLM #AI #ML #DataContamination #Benchmark #Research #NVIDIA
@arian_ask Impressive work! Have you considered testing your approach on other, lesser-known benchmarks?
Zero-shot performance seems to be overestimated on well-known benchmarks. See how #DataContamination impacts #TextToSQL at https://t.co/frDgmKplxu
2/ 🚨 First, when you test closed-source #LMMs like #GPT4V using data from the Web (i.e., NEJM Image Challenge), you risk #DataContamination. For example, you can directly access the Figure 1 case online. 🛑 We need to stop testing these #LLMs and LMMs on data from the Web!

Our paper "Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL Translation" has been accepted at ACL Findings!
Good job, guys @HumanCentricArt
#ACL2024NLP #NLproc #DataContamination
https://t.co/BVrrufxB6O
"our findings indicate that GPT-4 is contaminated with AG News, WNLI, and XSum datasets."
arXiv 👉https://t.co/t8XhwecjIK
#DataContamination #LLMs
Thrilled to share that our "Time Travel in LLMs" paper has been accepted to #ICLR2024 as a Spotlight!
w/ my awesome advisor @msurd
#LLMs #DataContamination @iclr_conf
The Turing Tests of today are mistaken
https://t.co/MRTwFDiaW6
By @raphaelmilliere via @IAI_TV
#AI #LLMs #benchmarks #performance #TuringTest #DataContamination
cc @mvollmer1 @sonu_monika @JagersbergKnut @EstelaMandela @enilev @BetaMoroney @Shi4Tech @sallyeaves @sim010101 @PerBBerggreen @ahier @gerald_bader @richardkimphd @Corix_JC @jenstirrup @dinisguarda @data_nerd @theomitsa @NeiraOsci @Hana_ElSayyed @tlloydjones @TarakRindani @Nicochan33 @FinMKTG @Fabriziobustama @Analytics_699 @LouisColumbus @YvesMulkers @HolgerGelhausen @ChuckDBrooks @SusanHayes_ @FractaloidConvo @drsharwood @jeancayeux @amalmerzouk @AndrewinContact @CurieuxExplorer @FrRonconi @FernandaKellner @mikeflache @maponi @Khulood_Almani @sminaev2015 @Eli_Krumova @pchamard @MHcommunicate @RLDI_Lamy
@RSNA 4/ If the authors had tried more cases, I’m sure they would have found #GPT4 failing more often than not. I took one random example from the Internet (yes, I risked #DataContamination and made it potentially easier for GPT-4), yet you can see how it fails spectacularly.

@simoneballoccu @PSchmidtova @LangoMateusz @tuetschek Very substantial work!
Data contamination (DC) is an intriguing problem, not only for evaluation but also because it exposes the ambiguity between learning and memorization.
You'll probably find our work interesting; it focuses specifically on Text-to-SQL.
#datacontamination #text2sql
https://t.co/BVrrufxB6O
@Adhiguna_AIaaS Good thread.
Another advantage of this solution is how easy it is to interact with the model (just prompt!).
But can benchmarks on well-known test sets be affected by data contamination?
#datacontamination #overestimation
Take a look at our work https://t.co/BVrrufxB6O
Data contamination suggests LLMs have possibly seen test data from downstream tasks.
Our recent study introduces a novel method to replicate LLMs' training data, including downstream dataset instances, to aid in detecting data contamination.
Read more: https://t.co/3Ll3NcSJDr
The Hidden Influence of #Data Contamination on Large Language Models
https://t.co/eiK6EbOhOv
#DataContamination #LanguageModels #AI #MachineLearning #DataScience #Tech #BigData #NLP #ArtificialIntelligence #DataQuality

Hi @dbamman!
Interested in our way of detecting #datacontamination in #NLProc #LLMs?
Check out our PreCog
https://t.co/DMSYncvS03
w/ @esruzzetti @l__ranaldi
Work done at @HumanCentricArt