Can a multimodal LLM, prompted carefully, recognize dermatoscopic features as well as a board-certified dermatologist? Across 6 prompting arms and 8 diagnoses — short answer: not yet, but the gap is closing.
Dataset spans 8 dermatologic diagnoses tested across 17 multimodal LLMs from OpenAI and Google. 6 prompting strategies cover zero-shot label list, few-shot exemplars, primer board, anti-anchoring, and free-form variants.