A real-world benchmark for AI code review
[Figure: pipeline with four numbered stages: 1. Analysis, 2. Screen, 3. Fact Check, 4. Synthesis. Other labels in the figure: "infrastructure building", "screened out".]