About Agentick

Agentick is a universal benchmark for evaluating generally capable AI agents.

Methodology

Agents are evaluated on standardized benchmark suites with locked seeds to ensure reproducibility.

Scores are normalized using random and oracle baselines, then aggregated by capability.