Latest research

July 1, 2025

SciArena: A new platform for evaluating foundation models in scientific literature tasks

Discover how SciArena is being used to evaluate foundation models’ capabilities in scientific literature tasks through community-driven, literature-grounded, and multi-disciplinary reasoning.

Read post

June 24, 2025

OMEGA: Can LLMs reason outside the box in math?

Discover how OMEGA is being used to evaluate large language models' ability to generalize in math through exploratory, compositional, and transformative reasoning

Read post

June 13, 2025

New applications of the Ai2 Climate Emulator (ACE) by the international climate modeling community

Learn how ACE is being used for seasonal forecasts and understanding decadal variations in global warming.

Read post

June 3, 2025

Revisiting critical batch size for large-batch OLMo pretraining

We introduce a more reliable method to measure the critical batch size (CBS), analyze how CBS changes over training, and use this to train OLMo with fewer grad steps.

Read post

April 28, 2025

Introducing Atlantes: the first AI-powered GPS model for real-time global scale maritime intelligence

Atlantes: a system of transformers for real-time GPS modeling.

Read post

April 15, 2025

DataDecide: How to predict best pretraining data with small experiments

Explore the secrets of how language model developers make decisions with DataDecide.

Read post

April 9, 2025

Going beyond open data – increasing transparency and trust in language models with OLMoTrace

OLMoTrace lets you trace the outputs of language models back to their full, multi-trillion-token training data in real time.

Read post

March 31, 2025

Introducing CodeScientist: A step toward automated scientific discovery

Will there be a system that automatically identifies gaps in scientific knowledge and runs experiments?

Read post

March 26, 2025

Introducing Ai2 Paper Finder

Ai2 Paper Finder is an LLM-powered literature search system that mimics the iterative paper-finding process.

Read post

Previous37-45Next