Margdarshak · The Guide
Where do you want to go?
Not a firehose, but a curriculum. Each track is a guided sequence of papers that builds understanding from first principles, with a reason for every paper in every position.
Start here
Essential
Go deeper
🧠
Intermediate · 5 papers · ~50 min
The Reasoning Revolution
Understand why o1 and DeepSeek-R1 represent a fundamentally new paradigm, and explain the research lineage that made them possible.
1. DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models
2. Learning When to Sample: Confidence-Aware Self-Consistency for Efficient LLM Chain-of-Thought Reasoning
3. Tree of Thoughts: Deliberate Problem Solving with Large Language Models
🤖
Intermediate · 5 papers · ~50 min
Build AI Agents
Understand the architecture behind any AI agent: how it reasons, uses tools, self-corrects, and operates in the real world.
1. ReAct: Synergizing Reasoning and Acting in Language Models
2. Toolformer: Language Models Can Teach Themselves to Use Tools
3. Reflexion: Language Agents with Verbal Reinforcement Learning
Applied
⚡
Applied · 4 papers · ~40 min
Ship AI Without the GPU Bill
Understand why Mistral can compete with GPT-4 at a fraction of the cost, and evaluate any AI efficiency claim your engineers make.
1. LoRA: Low-Rank Adaptation of Large Language Models
2. FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
3. Mixtral of Experts
🛡️
Applied · 4 papers · ~40 min
AI Safety for Product Teams
Evaluate any AI system's safety properties, explain alignment to stakeholders, and make informed decisions about what to ship.
1. Training language models to follow instructions with human feedback
2. Constitutional AI: Harmlessness from AI Feedback
3. Direct Preference Optimization: Your Language Model is Secretly a Reward Model