← All posts

Research - papers we follow

Seminal and recent papers across world models, reasoning, agents, neuro-symbolic AI, and foundation models. Hand-picked from a working corpus of 1000+ arxiv references.

Updated May 2026.

Foundations

The papers everything else rests on.

arxiv Title Byline Tag
1706.03762 Attention Is All You Need Vaswani et al. - 2017 - The Transformer. Seminal
2005.11401 Retrieval-Augmented Generation for Knowledge-Intensive NLP Lewis et al. - 2020 Seminal
2009.03300 Measuring Massive Multitask Language Understanding (MMLU) Hendrycks et al. - 2020 Seminal
2302.13971 LLaMA: Open and Efficient Foundation Language Models Touvron et al. - 2023 Seminal
2401.04088 Mixtral of Experts Jiang et al. - 2024 Seminal

Reasoning & agents

Chain-of-thought to ReAct.

arxiv Title Byline Tag
2201.11903 Chain-of-Thought Prompting Elicits Reasoning in LLMs Wei et al. - 2022 Seminal
2203.11171 Self-Consistency Improves Chain of Thought Reasoning Wang et al. - 2022 Seminal
2210.03629 ReAct: Synergizing Reasoning and Acting in Language Models Yao et al. - 2022 Seminal
2305.10601 Tree of Thoughts: Deliberate Problem Solving with LLMs Yao et al. - 2023 Seminal
2306.05685 Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena Zheng et al. - 2023 Seminal
2312.10997 Retrieval-Augmented Generation for LLMs: A Survey Gao et al. - 2023 Seminal
2603.20639 Agentic AI and the Next Intelligence Explosion Google - 2026 - Institutional design principles for AI agents. Recent
2603.12372 Efficient Reasoning with Balanced Thinking 2026 Recent
2603.06847 Characterizing Faults in Agentic AI: A Taxonomy 2026 Recent
2602.10479 From Prompt-Response to Goal-Directed Systems: The Evolution of Agentic AI Software Architecture 2026 Recent
2601.10025 Structured Personality Control and Adaptation for LLM Agents 2026 Recent

World models & JEPA

The track outside the pure-LLM mainstream - learning representations that predict, not generate.

arxiv Title Byline Tag
2603.14482 V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning Meta FAIR - 2026 Recent
2602.11389 Causal-JEPA: Learning World Models through Object-Level Latent Interventions 2026 Recent
2603.19312 LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels 2026 Recent
2603.22281 ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning 2026 Recent

Neuro-symbolic AI

Combining learned + symbolic.

arxiv Title Byline Tag
2410.22077 Mapping the Neuro-Symbolic AI Landscape by Architectures Hudson et al. - 2024 - The most-referenced paper in our working notes - a handbook on augmenting deep learning through symbolic reasoning. Seminal

RL for LLMs

Training at scale.

arxiv Title Byline Tag
2503.14476 DAPO: An Open-Source LLM Reinforcement Learning System at Scale ByteDance / Tsinghua - 2025 Recent

Trust & social models

Networked agents. Adjacent to Maibook’s design - how trust works in networks where agents (and humans) interact.

arxiv Title Byline Tag
2603.11054 A Survey on Quantitative Modeling of Trust in Online Social Networks Song, Barber - 2026 Recent

Non-arxiv essentials

Programs we follow.

Source Title Byline Tag
Sutton The Bitter Lesson Rich Sutton - The argument for scale over hand-crafted methods. Essay
Numenta Thousand Brains framework Hawkins / Numenta - Cortical columns, voting-based representation. Program
FAIR V-JEPA & world-model program Yann LeCun - Meta FAIR Program

← All posts