Medical Sphere @MedicalSphereAI - Twitter Profile

Pinned Tweet

6 days ago

Early results for Claude Opus 4.8 and Gemini 3.5 Flash on @OpenAI's HealthBench Professional: Opus 4.8 looks essentially flat against 4.7 (within noise). Gemini 3.5 Flash is a step up from 3.1 Pro.

MedicalSphereAI's tweet photo. Early results for Claude Opus 4.8 and Gemini 3.5 Flash on @OpenAI's HealthBench Professional:

Opus 4.8 looks essentially flat against 4.7 (within noise). Gemini 3.5 Flash is a step up from 3.1 Pro. https://t.co/0f6ouoLjEx

1

35

6

8

3K

Medical Sphere

@MedicalSphereAI

about 8 hours ago

Tested a council of AI models on this case to see how they respond 👇 Overall, most models interpreted the lesion as a benign/low-grade vascular or calcified process rather than a true high-grade glioma—most commonly cavernous malformation/organizing hemorrhagic vascular lesion with reactive gliosis. The main disagreements were grok-4.3, which called it gliosarcoma (GBM grade 4), and gemini-3.5, which favored CAPNON; claude-opus-4-8 leaned toward subependymoma/other low-grade intraventricular tumor. 🔗 https://t.co/8KVMhnc5kh

1

0

86

Medical Sphere

@MedicalSphereAI

about 8 hours ago

@modernHealthMe @AskMedSphere

1

0

39

Medical Sphere

@MedicalSphereAI

about 8 hours ago

@AB_drmd @AskMedSphere

1

0

615

Medical Sphere

@MedicalSphereAI

about 8 hours ago

@ambrose074 @AskMedSphere

1

0

16

Medical Sphere

@MedicalSphereAI

about 9 hours ago

@Nate_path https://t.co/le7WeptBqc

Medical Sphere

@MedicalSphereAI

about 9 hours ago

We tested a few AI models out of curiosity to see how they interpret this case, this is what they said 🧐👇 Most models interpret the slides as showing acellular purple globular material with a benign colloid/proteinaceous appearance, with the strongest consensus favoring a benign thyroid colloid nodule (Bethesda II). The main outlier is Gemini, which instead interprets the globules as Actinomyces “sulfur granules”; Claude is noncommittal but notes the material could be colloid or other metachromatic globules. 🔗 Full case: https://t.co/BvMskRIj0z

0

1

0

100

0

23

Medical Sphere

@MedicalSphereAI

about 9 hours ago

We tested a few AI models out of curiosity to see how they interpret this case, this is what they said 🧐👇 Most models interpret the slides as showing acellular purple globular material with a benign colloid/proteinaceous appearance, with the strongest consensus favoring a benign thyroid colloid nodule (Bethesda II). The main outlier is Gemini, which instead interprets the globules as Actinomyces “sulfur granules”; Claude is noncommittal but notes the material could be colloid or other metachromatic globules. 🔗 Full case: https://t.co/BvMskRIj0z

0

1

0

100

Medical Sphere

@MedicalSphereAI

about 10 hours ago

We tested a few AI models to see what they say 👇 All models broadly agree that the findings most likely represent prominent hematogones/benign B-cell precursors in an infant marrow, not definitive B-ALL, and that CD79a/morphology alone are insufficient. The main difference is confidence level: Gemini leans more strongly toward hematogones, while gpt-5.5 and Claude are more cautious and stress that flow cytometry ± molecular testing (especially KMT2A in this age group) is needed to exclude infant B-ALL.

0

1

0

1

97

Medical Sphere

@MedicalSphereAI

about 11 hours ago

@JMGardnerMD @AskMedSphere

1

0

13

Medical Sphere

@MedicalSphereAI

about 11 hours ago

@RadMasterclass @AskMedSphere

1

0

26

Medical Sphere

@MedicalSphereAI

about 13 hours ago

@drkeithsiau @AskMedSphere

1

0

100

Medical Sphere

@MedicalSphereAI

about 13 hours ago

@MaddiWulfeckMD @AskMedSphere

0

24

Medical Sphere

@MedicalSphereAI

about 13 hours ago

@Sohayla_Yaseen We tested a few AI models on this case to see what they say, how did they do?! 🤔 🔗 Full case: https://t.co/DxBZOpGmJr

MedicalSphereAI's tweet photo. @Sohayla_Yaseen We tested a few AI models on this case to see what they say, how did they do?! 🤔

🔗 Full case: https://t.co/DxBZOpGmJr https://t.co/nJwjBpBo9l

0

1

0

178

Medical Sphere

@MedicalSphereAI

about 13 hours ago

This is what the council said this time 🧐 There is no clear consensus across the models: one favored caseating granulomatous lymphadenitis/tuberculosis, another called Hodgkin lymphoma, and a third interpreted it as metastatic mucinous adenocarcinoma. 🔗 https://t.co/1NkgVMrdeJ

0

2

0

184

Medical Sphere

@MedicalSphereAI

about 24 hours ago

@drsebheaven @traumaticum @DrBhavinJadav @InvictaOrtho @aschwartz45 @rkh_md @jointdocShields @orthotraumamd @AskMedSphere

1

0

135

Medical Sphere

@MedicalSphereAI

about 24 hours ago

We recorded a video to show you how to use the platform 👇 - Please go to https://t.co/NF4wrpOEEp - Create a free account. You only need an email address. - Follow the steps in the video to create a medical case, run AI models, and view their results. Let us know if you need any help. We’re here! 🙂

0

2

1

0

26

Medical Sphere

@MedicalSphereAI

2 days ago

@IhabFathiSulima @AskMedSphere

1

0

183

Medical Sphere

@MedicalSphereAI

2 days ago

@albertoortegana @AskMedSphere

1

0

48

Medical Sphere

@MedicalSphereAI

2 days ago

@LizMontgomeryMD We tested a few AI models on this case out of curiosity to see how they interpreted it. This is what they said 👇 🔗 Full case: https://t.co/XnngSzpivt