We've identified multiple loopholes in SWE-bench Verified where agents may look at future repository state (by querying it directly or through a variety of methods), and cases in which future ...
What steps can reproduce the bug? When using Git Bash on a Windows computer to run Codex, only character input functions normally. However, when attempting to delete characters using the backspace key ...