Yiming Cui

@KCrosner

NLP Researcher

Beijing, People's Republic of China

Joined August 2012

82 Following

663 Followers

180 Posts

Pinned Tweet

Yiming Cui @KCrosner

5 months ago

We are extremely overwhelmed and honored to receive the IEEE Signal Processing Society Best Paper Award (2025) for our paper "Pre-Training with Whole Word Masking for Chinese BERT", published in IEEE/ACM TASLP (2021). 🎉🎉🎉 @IEEEsps #IEEE https://t.co/ZLY4d8RFAG

0

3

0

0

137

Yiming Cui @KCrosner

29 days ago

Sharing my ARR SAC tool (based on my old Jupyter notebook). You can load an OpenReview venue, inspect paper status, read comments, view score distributions, and export commitment-stage papers to Excel for ranking. Runs on your own machine. https://t.co/uF3XDiI8jZ #nlproc #ARR #emnlp2026

0

1

0

1

170

Yiming Cui @KCrosner

3 months ago

@anton_reshetov @OpenAI same here. really frustrating. i am on an annual subscription, which is even worse. 🥲

0

0

0

0

23

Yiming Cui @KCrosner

3 months ago

Good News: I am enrolled in #Codex for Open Source. 🎉 Bad News: I cannot redeem this until my Plus subscription ends. What's even worse, I am on an annual subscription, which ends in 6+ months. 😭 @OpenAI please help.

0

3

0

0

127

Yiming Cui @KCrosner

6 months ago

AI vs. the Olympiad: Can Multimodal LLMs Truly 'See' Chemistry? https://t.co/UA3rvwe3ZO

0

2

0

0

43

Yiming Cui @KCrosner

6 months ago

(5/x) Occlusion-based saliency analysis. By masking image regions and measuring confidence drops, we find that baseline MLLMs often rely on spurious visual cues, while CoT shifts attention toward chemically meaningful structures, improving both grounding and confidence. https://t.co/SsQA6jwPuA

0

0

0

0

28
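The occlusion-based saliency idea in the (5/x) post can be illustrated with a minimal sketch: mask one image patch at a time and record how much the model's confidence drops. This is not the paper's implementation. The function names, the patch size, the fill value, and the toy confidence function standing in for an MLLM are all assumptions made for illustration.

```python
from typing import Callable, List

Image = List[List[float]]  # grayscale image as rows of pixel values


def occlusion_saliency(
    image: Image,
    confidence: Callable[[Image], float],  # hypothetical model scoring function
    patch: int,
    fill: float = 0.0,
) -> List[List[float]]:
    """Return a grid of confidence drops, one entry per occluded patch."""
    base = confidence(image)
    height, width = len(image), len(image[0])
    grid = []
    for top in range(0, height, patch):
        row = []
        for left in range(0, width, patch):
            # Copy the image and overwrite a single patch with the fill value.
            occluded = [r[:] for r in image]
            for y in range(top, min(top + patch, height)):
                for x in range(left, min(left + patch, width)):
                    occluded[y][x] = fill
            # A large drop means the model relied on this region.
            row.append(base - confidence(occluded))
        grid.append(row)
    return grid


def toy_confidence(img: Image) -> float:
    # Toy stand-in for a model (assumption): it only "looks at" the
    # top-left 2x2 region of the image.
    return sum(img[y][x] for y in range(2) for x in range(2)) / 4.0


toy_image = [[1.0] * 4 for _ in range(4)]
saliency = occlusion_saliency(toy_image, toy_confidence, patch=2)
print(saliency)
```

With the toy function, only the top-left patch produces a nonzero drop, which is the pattern one would inspect to see whether a model attends to meaningful regions or to spurious ones.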

Yiming Cui @KCrosner

6 months ago

Excited to share our first paper in @CommsChem (Nature Portfolio) 🎉🎉🎉 We systematically evaluate multimodal large language models on chemistry Olympiad-level problems, revealing where current models succeed and where they still struggle. #AI4Chemistry #LLM #MultimodalAI #NLP

Communications Chemistry @CommsChem

6 months ago

Evaluating large language models on multimodal chemistry olympiad exams https://t.co/OcXyBgKDSP

0

2

0

0

607

5

2

0

1

228

Yiming Cui @KCrosner

6 months ago

(4/x) When seeing hurts: visual input can degrade performance. Adding visual input sometimes reduces accuracy, especially in smaller models. Larger models tend to balance textual and visual signals better, which may be key to achieving strong performance. https://t.co/SsQA6jwPuA

0

1

0

0

29

Yiming Cui @KCrosner

6 months ago

(3/x) Task-type breakdown. Current multimodal LLMs perform well on tables and charts, but struggle with molecular structures and experimental apparatus, which require chemistry-specific visual understanding and domain knowledge. https://t.co/SsQA6jwPuA

0

1

0

0

15

Yiming Cui @KCrosner

6 months ago

(2/x) CoT generally improves chemical reasoning performance. Analysis shows that CoT is especially helpful for mid-tier models. For example, GPT-4.1-mini achieves a 20~26 accuracy improvement with CoT, while the gains are less significant for small- and large-scale models. https://t.co/SsQA6jwPuA

0

1

0

0

29

Yiming Cui @KCrosner

6 months ago

(1/x) We curate a chemistry benchmark based on USNCO exams spanning over two decades, consisting of 473 real multimodal QA problems. It covers a broad spectrum of chemistry topics, including general, physical, organic, inorganic, and analytical chemistry. https://t.co/SsQA6jwPuA

0

1

0

0

39

Yiming Cui @KCrosner

about 2 years ago

Our paper "Self-Evolving GPT: A Lifelong Autonomous Experiential Learner" is accepted at #ACL2024 main! We propose a framework for LLMs to autonomously learn and apply experience, boosting GPT-3.5 and GPT-4 performance. Stay tuned for the paper and code release! #NLP #LLM #GPT

0

5

0

2

890

Yiming Cui @KCrosner

about 2 years ago

Happy to introduce Chinese-LLaMA-Alpaca-3, our third open-source project in the #Llama series. We release Llama-3-Chinese-8B and Llama-3-Chinese-8B-Instruct with continual PT/SFT on Chinese corpora. Check our project: https://t.co/bx9Xly7SCd #nlproc #llama3

0

2

0

1

408

Yiming Cui @KCrosner

over 2 years ago

Through our empirical experiments on creating Chinese Mixtral, we find that extending vocabulary might NOT be a necessity for LLM language transfer. As usual, we open-source Chinese-Mixtral(-Instruct) at GitHub/HF: https://t.co/CdekiqY8Hs arXiv Paper: https://t.co/KHw7iHh4zU

0

7

1

0

855

Yiming Cui @KCrosner

almost 3 years ago

We release Chinese-LLaMA-2-7B and Chinese-Alpaca-2-7B based on #Llama-2, which achieve significant improvements over our first-gen Chinese-LLaMA/Alpaca, even surpassing 13B models on some metrics. Check our GitHub repo: https://t.co/Klpd0jOk74 #llm #NLProc

1

19

2

0

644

Yiming Cui @KCrosner

almost 3 years ago

@joemkwon Sorry for the late reply. Regarding your question, our main motivation is to add more trainable parameters (qkvo and mlp) within the LoRA scheme. The recent QLoRA work also shows that adapting qkvo/mlp is essential for better performance. Maybe you can check the QLoRA paper.

0

0

0

0

73
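To make the trainable-parameter point in the reply above concrete, here is a rough count of LoRA parameters when adapters are attached to attention projections only versus attention plus MLP. This is a sketch under stated assumptions, not the project's actual configuration: the dimensions are those of a LLaMA-7B-style model, the module names follow the Hugging Face LLaMA naming, and the rank of 8 is a hypothetical value chosen for illustration.

```python
# Assumed LLaMA-7B-style dimensions (for illustration only).
HIDDEN = 4096
INTERMEDIATE = 11008
LAYERS = 32
RANK = 8  # hypothetical LoRA rank

# (input dim, output dim) of each linear layer in one transformer block.
ATTENTION = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, HIDDEN),
    "v_proj": (HIDDEN, HIDDEN),
    "o_proj": (HIDDEN, HIDDEN),
}
MLP = {
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_params(shapes: dict, rank: int = RANK, layers: int = LAYERS) -> int:
    """Count LoRA parameters: each adapted layer adds A (rank x d_in) and B (d_out x rank)."""
    per_layer = sum(rank * (d_in + d_out) for d_in, d_out in shapes.values())
    return per_layer * layers


attention_only = lora_params(ATTENTION)
attention_and_mlp = lora_params({**ATTENTION, **MLP})

print(f"qkvo only:  {attention_only:,}")
print(f"qkvo + mlp: {attention_and_mlp:,}")
```

Under these assumptions, adding the MLP projections more than doubles the number of trainable LoRA parameters at the same rank.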

Yiming Cui @KCrosner

about 3 years ago

Excited to release our Chinese 🦙#LLaMA and #Alpaca LLMs (7B for now), extended with an additional 20k Chinese vocabulary, trained with alpaca-lora. Our model works seamlessly with the wonderful llama.cpp on CPU. Give it a try at https://t.co/rJiPdJfOuP #nlproc #llm #AI

1

12

0

2

2K

Yiming Cui @KCrosner

about 3 years ago

Update: 13B Chinese #LLaMA and #Alpaca. Better quality compared to 7B. GPT-4 rates the 13B model 71/100, versus 49 for the 7B version. We also provide a Colab notebook for fast conversion, and of course it is fully compatible with llama.cpp. Try: https://t.co/rJiPdJfOuP #nlproc #llm #ai

0

11

0

3

1K

Yiming Cui @KCrosner

over 3 years ago

Happy to release our multimodal pre-trained model VLE, which achieved top performance on VCR. We also set up a pipeline with a captioning model and an LLM to generate much more user-friendly answers for VQA. Resources, code, and demo are available through: https://t.co/MaavjgL4Jf 🎉🎉🎉

0

5

1

1

517

Yiming Cui @KCrosner

almost 4 years ago

@cryptexcode @SemEvalWorkshop @naacl Thank you. The live session (mainly for task organizers) is hosted via Zoom, and all system papers are presented as posters (no oral). I'm not sure if the video will be made public officially. If you are interested in the best paper list, it will be posted on the SemEval website soon.

1

0

0

0

0

Yiming Cui @KCrosner

almost 4 years ago

We are happy to announce that our SemEval-2022 system description paper received a "Best Paper Honorable Mention" award. 🎉🎉🎉 Paper and code: https://t.co/X4wV9ObLqQ @SemEvalWorkshop @NAACL #nlproc #naacl2022 #semeval

2

11

0

0

0
