Translation

A Set Theoretic Approach to Detecting Logic Bugs in DBMS InnerJoin Optimizations

This paper presents a set-theoretic approach to detect logic bugs in DBMS inner-join optimizations, aiming to improve database reliability by identifying incorrect query optimizations.

Background

- Database management systems (DBMS) like MySQL, PostgreSQL, and SQLite use "query optimizers" to decide how to execute SQL queries efficiently. INNER JOIN is a fundamental SQL operation that combines rows from two tables based on a related column. - A "logic bug" in an optimizer means the DBMS returns the wrong results — not a crash or error, but silent data corruption. These are especially dangerous because users may trust incorrect output. - The paper introduces a new testing method based on "set theory" (a branch of math dealing with collections of objects) to systematically find such bugs in how DBMSs handle join optimizations. - This is academic research (an arXiv preprint), not a product announcement. The key contribution is a formal, principled approach to detecting optimizer bugs, which previously relied on more ad-hoc testing.