Meta Llama4 performance degrades on long context

Building and Evaluating Q&A on long documents

Pew Research Center: How the U.S. Public and AI Experts View Artificial Intelligence