Research - papers we follow

May 13, 2026 #research #papers

Seminal and recent papers across world models, reasoning, agents, neuro-symbolic AI, and foundation models. Hand-picked from a working corpus of 1,000+ arXiv references.

Updated May 2026.

Foundations

The papers everything else rests on.

arxiv	Title	Byline	Tag
1706.03762	Attention Is All You Need	Vaswani et al. - 2017 - The Transformer.	Seminal
2005.11401	Retrieval-Augmented Generation for Knowledge-Intensive NLP	Lewis et al. - 2020	Seminal
2009.03300	Measuring Massive Multitask Language Understanding (MMLU)	Hendrycks et al. - 2020	Seminal
2302.13971	LLaMA: Open and Efficient Foundation Language Models	Touvron et al. - 2023	Seminal
2401.04088	Mixtral of Experts	Jiang et al. - 2024	Seminal

Reasoning & agents

From chain-of-thought prompting to ReAct, plus evaluation, retrieval, and recent work on agentic systems.

arxiv	Title	Byline	Tag
2201.11903	Chain-of-Thought Prompting Elicits Reasoning in LLMs	Wei et al. - 2022	Seminal
2203.11171	Self-Consistency Improves Chain of Thought Reasoning	Wang et al. - 2022	Seminal
2210.03629	ReAct: Synergizing Reasoning and Acting in Language Models	Yao et al. - 2022	Seminal
2305.10601	Tree of Thoughts: Deliberate Problem Solving with LLMs	Yao et al. - 2023	Seminal
2306.05685	Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena	Zheng et al. - 2023	Seminal
2312.10997	Retrieval-Augmented Generation for LLMs: A Survey	Gao et al. - 2023	Seminal
2603.20639	Agentic AI and the Next Intelligence Explosion	Google - 2026 - Institutional design principles for AI agents.	Recent
2603.12372	Efficient Reasoning with Balanced Thinking	2026	Recent
2603.06847	Characterizing Faults in Agentic AI: A Taxonomy	2026	Recent
2602.10479	From Prompt-Response to Goal-Directed Systems: The Evolution of Agentic AI Software Architecture	2026	Recent
2601.10025	Structured Personality Control and Adaptation for LLM Agents	2026	Recent

World models & JEPA

The track outside the pure-LLM mainstream - learning representations that predict, not generate.

arxiv	Title	Byline	Tag
2603.14482	V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning	Meta FAIR - 2026	Recent
2602.11389	Causal-JEPA: Learning World Models through Object-Level Latent Interventions	2026	Recent
2603.19312	LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels	2026	Recent
2603.22281	ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning	2026	Recent

Neuro-symbolic AI

Combining learned and symbolic methods.

arxiv	Title	Byline	Tag
2410.22077	Mapping the Neuro-Symbolic AI Landscape by Architectures	Feldstein et al. - 2024 - A handbook on augmenting deep learning through symbolic reasoning, and the most-referenced paper in our working notes.	Seminal

RL for LLMs

Reinforcement learning for training LLMs at scale.

arxiv	Title	Byline	Tag
2503.14476	DAPO: An Open-Source LLM Reinforcement Learning System at Scale	ByteDance / Tsinghua - 2025	Recent

Networked agents

Adjacent to Maibook’s design - how trust works in networks where agents (and humans) interact.

arxiv	Title	Byline	Tag
2603.11054	A Survey on Quantitative Modeling of Trust in Online Social Networks	Song, Barber - 2026	Recent

Non-arxiv essentials

Essays and research programs we follow.

Source	Title	Byline	Tag
Sutton	The Bitter Lesson	Rich Sutton - 2019 - The argument that general methods which scale with compute beat hand-crafted knowledge.	Essay
Numenta	Thousand Brains framework	Hawkins / Numenta - Cortical columns, voting-based representation.	Program
FAIR	V-JEPA & world-model program	Yann LeCun - Meta FAIR	Program