Show HN: FieldOps-Bench an open eval for physical-world AI agents
A boat captain has released FieldOps-Bench, an open evaluation benchmark for physical-world AI agents across 7 industries. The 157-case multimodal benchmark tests visual diagnostics, code citations, and industrial field knowledge. The creator's Camera Search agent outperformed Claude Opus 4.6 on 87% of cases in the evaluation.