We've identified multiple loopholes with SWE Bench Verified where agents may look at future repository state (by querying it directly or through a variety of methods), and cases in which future ...
What steps can reproduce the bug? When using Git Bash on a Windows computer to run Codex, only character input functions normally. However, when attempting to delete characters using the backspace key ...