✦AI Papers Timeline Map Tracks Benchmarks Which Model?

[Scaling]·PAP-MJCGEH·2022·March 17, 2026

Emergent Abilities of Large Language Models

2022

Jason Wei, Yi Tay, Rishi Bommasani et al.

SCALING

4 min readScalingReasoning

Core Insight

Larger language models develop unexpected skills, challenging our predictions and scaling strategies.

In Plain English

The paper reveals that exhibit absent in smaller ones, defying performance predictions. These findings suggest that mere scaling introduces novel capabilities that smaller models can't achieve.

Knowledge Prerequisites

git blame for knowledge

To fully understand Emergent Abilities of Large Language Models, trace this dependency chain first. Papers in our library are linked — click to read them.

DIRECT PREREQIN LIBRARY

Attention Is All You Need

Understanding the attention mechanism is fundamental for grasping how large language models work since they rely heavily on transformer architectures introduced in this paper.

Attention mechanismTransformer architectureSelf-attention

DIRECT PREREQIN LIBRARY

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

This paper introduces Bidirectional Encoder Representations from Transformers, which is a foundational large language model that shows how pre-training on vast textual data can improve language understanding tasks.

Bidirectional transformersPre-trainingMasked language models

DIRECT PREREQIN LIBRARY

Training language models to follow instructions with human feedback

Understanding instruction following through human feedback is crucial for realizing how large language models can be fine-tuned to improve task performance based on human-provided feedback.

Fine-tuningHuman feedbackInstruction-following

DIRECT PREREQIN LIBRARY

Toolformer: Language Models Can Teach Themselves to Use Tools

This paper explores how large language models can utilize external tools to enhance their capabilities, a concept that is potentially linked to the emergent abilities described.

Tool use in language modelsSelf-improvementExternal API integration

DIRECT PREREQIN LIBRARY

Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Understanding structured thinking processes in language models will provide insights into how these models develop emergent problem-solving abilities.

Tree structuresDeliberate reasoningProblem-solving strategies

YOU ARE HERE

Emergent Abilities of Large Language Models

The Idea Graph

⚠Problem✦Insight⬡Method◎Result→Impact

11 nodes · 15 edges

Click a node to explore · Drag to pan · Scroll to zoom

476 words · 3 min read6 sections · 11 concepts

The Problem: Incomplete Predictive Models

86 words

Before this research, the expectations from language models were largely defined by existing . These laws suggested that merely increasing the size of models would enhance known capabilities. However, they did not account for the emergence of , which became evident with larger models. The limitations of these predictive models meant that researchers could not foresee the full range of abilities that scaling could unlock. This gap between expectation and reality highlighted a significant problem in understanding the true potential of large language models.

Key Insight: Emergent Abilities

85 words

The core insight of this paper is the discovery of that appear in large language models. These abilities are not simply enhanced versions of existing skills but are entirely new capabilities that arise from scaling. The presence of such abilities challenges the existing understanding and predictive models in the field, suggesting that there is more to model scaling than previously thought. This insight opens up new avenues for research and application, as it demonstrates the potential for language models to develop unforeseen skills.

Method: Model Scaling and Parameter Experiments

82 words

To explore the potential of large language models, the research involved scaling these models by increasing their parameters from millions to hundreds of billions. This process, known as , was central to the experiments conducted in the study. By varying the size of the models, the researchers were able to observe differences in performance and identify new abilities. The were designed to not only test known capabilities but also to uncover emergent abilities that only appeared in larger models.

Results: Linguistic and Reasoning Capabilities

74 words

The experiments revealed that large language models exhibit enhanced , including nuanced language understanding that smaller models could not achieve. Additionally, these models demonstrated that were surprising in their complexity and depth. One of the most striking findings was the models' ability to engage in creative problem-solving, which was not anticipated by existing scaling laws. These results underscore the idea that scaling introduces novel abilities rather than merely improving existing ones.

Impact: Enhanced Human-Computer Interaction

75 words

The discovery of in large language models has significant implications for the future of . By leveraging these newfound capabilities, companies like OpenAI and Google can create more natural and sophisticated AI-driven interfaces. These improvements could lead to better chatbots and virtual assistants, enhancing user experience by making interactions more seamless and intuitive. The potential for these models to redefine is substantial, offering new opportunities for innovation in the field.

Limitations & Open Questions: Managing Unpredictability

74 words

While the emergent abilities of large language models offer exciting opportunities, they also introduce . This poses challenges for developers aiming to ensure a consistent and reliable user experience. As these models continue to evolve, rigorous testing and careful management will be essential to harness their potential without compromising functionality. Open questions remain about how to best predict and control these emergent abilities, ensuring that advancements in AI are both beneficial and manageable.

Experience It

Live Experiment

Emergent Abilities

See Emergent Abilities in Action

You'll see how larger language models develop unexpected skills that smaller models lack, highlighting the impact of scaling.

Look for nuanced understanding and unexpected skills in the larger model's responses, as highlighted in the paper.

Try an example — see the difference instantly

Enter a reasoning problem — or try your own

⌘↵ to run

Read Original Paper on arXiv

Origin Story

arXiv preprint, May 2022Google ResearchJason Wei, Yi Tay et al.

The Room

Inside Google Research, a group of curious minds huddled together in a conference room. They were captivated by a lingering question: What uncharted territories lay beyond the known scaling laws of language models? The room buzzed with a mixture of excitement and skepticism as they grappled with the unknown, wondering if larger models could surprise them in unexpected ways.

The Bet

The team decided to push the limits further than ever, betting that by scaling models massively, they might stumble upon unforeseen abilities. It was a risky move, as the hypothesis sounded almost too optimistic. Doubts lingered, and there was a moment when submitting the paper felt like leaping into the dark, unsure if their hunch would hold any water.

The Blast Radius

Without this leap of faith, the landscape of AI would be starkly different. Models like PaLM and Claude might not exist, and the narrative around emergent properties would be less vibrant. Jason Wei and Yi Tay have continued to explore these themes, influencing new research directions and inspiring the next wave of AI breakthroughs.

↳PaLM↳Claude↳Gemini

Explained Through an Analogy

“

Imagine a plant that not only grows taller with water but blooms new, unforeseen flowers. Scaling models similarly reveals hidden algorithms.

The Full Story

~2 min · 224 words

The Context

What problem were they solving?

mergent abilities in large models challenge traditional performance extrapolation techniques used in AI development.

The Breakthrough

What did they actually do?

The study used scaling tests to observe at what sizes new abilities began to manifest in language models.

Under the Hood

How does it work?

Emergent language model abilities could expand the practical applications beyond what's currently imagined.

World & Industry Impact

Emergent abilities of large language models can redefine the functionality of chatbots, virtual assistants, and other AI-driven interfaces. Companies like OpenAI, Google, and Amazon might leverage these emergent features to enhance product offerings, leading to more natural and sophisticated human-computer interactions. As these abilities can be unpredictable, product testers must remain vigilant to ensure seamless user experience.

Interactive Diagram

Emergent Abilities in Language Models

Step 1 / 5

Scaling Limitations

✗Old Belief

·Known abilities improve

✓New Understanding

·New abilities emerge

Before this research, it was believed that increasing the size of language models would only enhance known capabilities, not introduce new ones.

Scaling Limitations → Surprising Emergence → Model Size vs. Skills → Performance Equation → New Capabilities Unlocked

TL;DR

This paper shows that larger language models develop surprising new skills that smaller models cannot, challenging existing scaling laws.

Key Terms

Emergent Abilities

New skills that appear in larger models but not in smaller ones.

Like hidden talents that only show up when you mature.

Scaling Laws

Previous predictions of model performance improvement with size.

Like expecting a car's speed to increase with engine size.

Parameters

The elements of a model that are adjusted during training to learn from data.

Like the knobs and switches on a machine you tune for optimal performance.

Language Model

A computational model designed to understand and generate human language.

Nuanced Understanding

Deep, subtle comprehension of language tasks.

Creative Problem-Solving

The ability to devise novel solutions to complex problems.

Like coming up with a new recipe from limited ingredients.

Compute

The computational resources needed to train models.

Architecture

The structural design of a model.

Like the blueprint of a building.

Core Ideas

1
Emergent Abilities
Demonstrates that scaling models can lead to unforeseen capabilities.
2
Challenge to Scaling Laws
Shows that current predictive models may need revising.
3
Model Complexity and Size
Highlights the relationship between model size and capability.
4
New Application Potential
Unlocks possibilities for more sophisticated AI applications.

Key Formula

Performance = Data × Compute × Architecture

Data

The amount and quality of training data.

Compute

The computational power used for training.

Architecture

The design and structure of the model.

Before vs After

Before

It was assumed that increasing model size would only enhance known abilities.

After

This paper shows that scaling can lead to new, unforeseen abilities in models.

Remember it as

"Like a child growing up to discover hidden talents."

How grounded is this content?

Metrics are computed from available source text only — abstract, summary, and impact fields ingested into this system. Full paper PDF is not ingested; numerical claims that originate from within the paper body will not appear in these scores.

Source Richness88%

7 of 8 content fields populated. More fields = better-grounded generation.

Source Depth~225 words

Total source text analyzed by the model. Includes extended deep-dive summary — high confidence.

Methodology: Number grounding uses regex digit extraction against source text. Quote traceability uses token set intersection on content words stripped of stop-words. Neither metric validates semantic correctness or factual accuracy against the original paper. For full verification, cross-reference with the original paper via the arXiv link above.

Evaluating Large Language Models Trained on Code Proximal Policy Optimization Algorithms

Table of Contents

The Problem: Incomplete Predictive Models

Key Insight: Emergent Abilities

Method: Model Scaling and Parameter Experiments

Results: Linguistic and Reasoning Capabilities

Impact: Enhanced Human-Computer Interaction

Limitations & Open Questions: Managing Unpredictability

See Emergent Abilities in Action

The Context

The Breakthrough

Under the Hood

The Problem

Scaling Limitations