Checklist Reviewer Toolkit

Beyond Static Checklists

Hamed Hemati¹, Alicia Janz¹, Stefan Sandfeld¹

¹IAS-9, Forschungszentrum Jülich

Poster at the HMC Conference 2026 GitHub repo

Start a reproducibility review

ml_repro_paper.pdf

Drop PDF or click to upload

Uploading…

Checklist

Review process

Step 1 · Upload PDF, choose checklist and process, then run the review.

My ML Reproducibility Checklist

Does it provide the checkpoints of the trained models?

Does it provide theoretical proofs for all claims?

Is the space complexity of the proposed method O(N²)?

Step 2 · Each checklist item is verified in order with evidence-backed outcomes.

Run insights

Satisfied 2 of 3 criteria

Needs attention 1 flagged for follow-up

Reproducibility score 67% aligned with checklist

Step 3 · Summaries and aggregates roll up for reporting and human verification.

Toolkit Purpose

How do we verify research when publications are scaling faster than human reviewers?

Manual

- Expensive

LLM-Based

- Hallucination

Agentic Workflow- Powerful but difficult to implement

Toolkit Components

A complete, end-to-end pipeline for rigorous metadata assessment.

Data Collection

Review Process

Human Verification

Analysis

Dynamic Process Designer

Build complex review workflows visually.

Our toolkit features a node-based review process designer. Instead of hard-coding the logic, researchers can obtain various outputs for the same collection and checklist by composing tools as "agents".

Model-driven agentic process
Workflow transparency
Explainability for all outputs

Input (PDF)

Question Reviewer

Output (Markdown)

Click GitHub, Materials, or the agentic process card to open details.

Review Process

Input PDF

Question reviewer

Local DB

Output Checklist / MD

Key Capabilities

Freedom of Choice

Choose your backbone model for each component. Supports local execution (Ollama) and remote (Google GenAI, LiteLLM) with customized reasoning levels.

External Tools

Agents can dynamically call external tools for "claim verification", querying databases, fetching code repositories, or running scripts.

Modular Extension

New components can be added without hard rewiring. Embrace non-deterministic, evolving workflows built on a plug-and-play architecture.

Poster at the HMC Conference 2026

Presented at the HMC Conference 2026

View Full Poster

Download poster (PDF)

Toolkit Purpose

Manual

LLM-Based

Agentic Workflow

Toolkit Components

Data Collection

Review Process

Human Verification

Analysis

Dynamic Process Designer

Key Capabilities

Freedom of Choice

External Tools

Modular Extension

Poster at the HMC Conference 2026

Title