The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets...
-
allenai/Olmo-3.1-32B-Think
Text Generation β’ 32B β’ Updated β’ 4.64k β’ β’ 66 -
allenai/Olmo-3.1-32B-Instruct-SFT
32B β’ Updated β’ 2.37k β’ 6 -
allenai/Olmo-3.1-32B-Instruct-DPO
Text Generation β’ 32B β’ Updated β’ 993 β’ 4 -
allenai/Olmo-3.1-32B-Instruct
Text Generation β’ 32B β’ Updated β’ 9.06k β’ β’ 45