We OCR'ed 30k papers using Codex, open OCR models and Jobs
Researchers used Codex, open OCR models, and Hugging Face Jobs to extract text from 30,000 academic papers. The project demonstrates scalable document processing with modern AI tools. The extracted data enables new research possibilities in scientific literature analysis.