Serina S. ApplebaumNikhil KhandekarQiao Jin 0001Guangzhi XiongSoren DunnSerina S. ApplebaumZain AnwarMaame Sarfo-GyamfiConrad W. SafranekAbid A AnwarAndrew ZhangAidan GilsonMaxwell B. SingerAmisha D. DaveAndrew TaylorAidong ZhangQingyu Chen 0001Zhiyong LuMedCalc-Bench: Evaluating Large Language Models for Medical Calculations.2024abs/2406.12036CoRRhttps://doi.org/10.48550/arXiv.2406.12036db/journals/corr/corr2406.html#abs-2406-12036streams/journals/corrAbid A AnwarZain AnwarQingyu Chen 0001Amisha D. DaveSoren DunnAidan GilsonQiao Jin 0001Nikhil KhandekarZhiyong LuConrad W. SafranekMaame Sarfo-GyamfiMaxwell B. SingerAndrew TaylorGuangzhi XiongAidong ZhangAndrew Zhang