Which LLM is the best at finding real vulnerabilities?
A comparison of large language models (LLMs) is conducted to evaluate their effectiveness at identifying real-world software vulnerabilities. The study tests various LLMs on a benchmark of actual security flaws, analyzing their detection accuracy and false positive rates to determine which model performs best for vulnerability discovery.