翻訳言語

RFC: Artificial Contributors to Open Source

本ドキュメントは、オープンソースプロジェクトへの人工知能による貢献者（人工コントリビューター）の扱いに関するベストプラクティスを提案するものです。AIによるコード生成や自動修正が増加する中、貢献の品質管理、ライセンス遵守、透明性の確保といった課題に対処するためのガイドラインを示します。オープンソースコミュニティにおける人間とAIの協業のあり方を整理し、健全なエコシステム維持を目指します。

InferenceBench: A Benchmark for Open-Ended Inference Optimization by AI Agents
3.0
InferenceBench is a new benchmark designed to evaluate how well AI agents can optimize their own inference strategies for open-ended questions, moving beyond standard single-response accuracy tests by measuring cost-efficiency and iterative improvement.
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
2.5
MemEye is a visual-centric evaluation framework designed to assess the memory capabilities of multimodal AI agents by testing how well they retain and recall visual information across interactions. The framework provides a structured benchmark to measure an agent's ability to remember past visual inputs, supporting research into more persistent and context-aware multimodal AI systems.

出典 nesbitt.io原文を表示 ↗

翻訳言語

RFC: Artificial Contributors to Open Source

InferenceBench: A Benchmark for Open-Ended Inference Optimization by AI Agents
3.0
InferenceBench is a new benchmark designed to evaluate how well AI agents can optimize their own inference strategies for open-ended questions, moving beyond standard single-response accuracy tests by measuring cost-efficiency and iterative improvement.
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
2.5
MemEye is a visual-centric evaluation framework designed to assess the memory capabilities of multimodal AI agents by testing how well they retain and recall visual information across interactions. The framework provides a structured benchmark to measure an agent's ability to remember past visual inputs, supporting research into more persistent and context-aware multimodal AI systems.

RFC: Artificial Contributors to Open Source

関連記事

InferenceBench: A Benchmark for Open-Ended Inference Optimization by AI Agents

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

RFC: Artificial Contributors to Open Source

関連記事

InferenceBench: A Benchmark for Open-Ended Inference Optimization by AI Agents

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory