SIGIR-Style Background: Retrieval-Augmented & Augmented Language Models
Paper summary from Sigir 2023
Although the following two papers are not both published at SIGIR 2023, they form the conceptual foundation behind the SIGIR 2023 research trend:
Search systems are shifting from document retrieval → reasoning systems powered by retrieval-augmented language models.
We briefly summarize them and explain their connection.
Paper 1 — Augmented Language Models: a Survey (2023)
Paper link: https://arxiv.org/abs/2302.07842
Core Idea
The paper defines Augmented Language Models (ALMs):
Language models enhanced with external computation and interaction modules.
Two capability axes:
| Axis | Meaning |
|---|---|
| Reasoning | internal multi-step thinking |
| Acting | interacting with tools / environment |
Instead of storing all knowledge in parameters, the model becomes a controller of computation.
Architectural View
[
\text{ALM} = \text{LM} + \text{Memory} + \text{Tools} + \text{Planning}
]
This shifts the role of LLMs from:
predicting text → solving tasks
Implication for Retrieval
Traditional IR pipeline:
query → retrieve → rank → return document
ALM pipeline:
query → plan → retrieve → reason → act → refine → answer
Retrieval is no longer the final product — it becomes a step inside cognition.
Paper 2 — REALM: Retrieval-Augmented Language Model Pre-Training (ICML 2020)
Paper link: https://dl.acm.org/doi/pdf/10.5555/3524938.3525306
Core Idea
REALM integrates retrieval directly into pretraining:
[
P(y|x) = \sum_{d \in \text{documents}} P(y|x,d)P(d|x)
]
Instead of memorizing facts, the model learns to retrieve evidence during training.
Key Contribution
REALM introduced:
Differentiable retrieval during pretraining.
The model jointly learns:
- language understanding
- search behavior
Impact on Information Retrieval
Before REALM:
Retrieval supports models.
After REALM:
Models learn retrieval as a skill.
This becomes the origin of modern RAG systems.
Conceptual Connection
| Stage | Retrieval role |
|---|---|
| Classic IR | final output |
| REALM | latent memory |
| ALM | reasoning tool |
Evolution:
[
\text{Search Engine} \rightarrow \text{Neural Retriever} \rightarrow \text{Cognitive Module}
]
Why This Became a SIGIR-Era Direction
Traditional SIGIR evaluation:
[
NDCG, MRR, Recall
]
LLM-era evaluation:
[
Task\ Success
]
Retrieval quality is no longer measured only by relevance —
but by whether it helps reasoning succeed.
Key Insight
REALM teaches models where knowledge is.
ALM teaches models when to use knowledge.
Together they establish the modern paradigm:
Retrieval is not answering questions.
Retrieval is enabling reasoning.
One-Sentence Takeaway
REALM turns retrieval into learnable memory, while ALM turns retrieval into a decision-making action — modern SIGIR research sits between them.