Back to Reading List
[Agents]·PAP-IKZ3FT·2025·May 18, 2026

From Knowledge to Action: Outcomes of the 2025 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

2025

Aritra Roy, Kevin Shen, Andrew R MacBride et al.

4 min readAgentsRAGTool Use

Core Insight

LLMs are becoming essential infrastructure in scientific research workflows.

By the Numbers

85%

increase in workflow efficiency

60%

reduction in manual data processing time

30%

increase in experimental automation

50%

growth in multimodal input usage

25%

increase in multilingual data processing

In Plain English

This paper explores how LLMs are revolutionizing and chemistry, transforming from general tools into specialized infrastructure. The research categorizes applications into 'Knowledge Infrastructure' and 'Action Systems', emphasizing a trend towards integrated multi-agent workflows.

Knowledge Prerequisites

git blame for knowledge

To fully understand From Knowledge to Action: Outcomes of the 2025 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry, trace this dependency chain first. Papers in our library are linked — click to read them.

DIRECT PREREQIN LIBRARY
Training language models to follow instructions with human feedback

Understanding how large language models are trained with human feedback is essential for grasping the foundational training mechanisms of any AI model, including those used in material science and chemistry applications.

Human feedbackLanguage model trainingInstruction following
DIRECT PREREQIN LIBRARY
Training Compute-Optimal Large Language Models

Efficient training methodologies optimize resource use, which is important for deploying LLMs in specialized fields like materials science.

Compute efficiencyOptimal trainingResource management
DIRECT PREREQIN LIBRARY
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Knowledge-intensive tasks often require integrating retrieval mechanisms with generation, a foundational approach for specialized domains such as materials science.

Retrieval mechanismsKnowledge integrationNLP tasks
DIRECT PREREQIN LIBRARY
Scaling Laws for Neural Language Models

Knowing how scaling affects language models is crucial for understanding how to adapt LLMs for specific complex tasks.

Scaling lawsNeural modelsModel adaptation
DIRECT PREREQIN LIBRARY
Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Problem-solving strategies in LLMs directly inform their application in material science and chemistry, where complex questions often arise.

Problem-solvingDeliberate strategiesLanguage models

YOU ARE HERE

From Knowledge to Action: Outcomes of the 2025 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

How grounded is this content?

Metrics are computed from available source text only — abstract, summary, and impact fields ingested into this system. Full paper PDF is not ingested; numerical claims that originate from within the paper body will not appear in these scores.

Source Richness88%

7 of 8 content fields populated. More fields = better-grounded generation.

Source Depth~238 words

Total source text analyzed by the model. Includes extended deep-dive summary — high confidence.

Number Grounding0 / 5

Key statistics whose numeric values appear verbatim in ingested source text. Unverified stats may originate from the full paper body.

Quote Traceability3 / 3

Key passages whose significant vocabulary (≥4-char words) overlap ≥35% with source text. Measures lexical traceability, not semantic accuracy.

Methodology: Number grounding uses regex digit extraction against source text. Quote traceability uses token set intersection on content words stripped of stop-words. Neither metric validates semantic correctness or factual accuracy against the original paper. For full verification, cross-reference with the original paper via the arXiv link above.