Samples · swe_bench.swe_bench_lite
Run #49 · Adapter v1.0.0+patch-apply-detection · 10/10 samples shown · Score 0%
AI evaluation
No AI evaluation available.
Overview
10 samples
Distribution
Score histogram (score axis 0.0 to 1.0)
Latency (ms)
p50: 9847
p95: 17319
mean: 10852
Tokens/s
p50: 85.7
mean: 93.4
Top error patterns
- 7× patch_apply_failed
| Question ID | Status | Score | Prompt | Latency | Tokens/s | TTFT |
|---|---|---|---|---|---|---|
| astropy__astropy-12907 | error | — | | 7902 ms | 131.4 | — |
| astropy__astropy-14182 | failed | — | | 6488 ms | 130.2 | — |
| astropy__astropy-14365 | error | — | | 13857 ms | 86.1 | — |
| astropy__astropy-14995 | error | — | | 7708 ms | 82.1 | — |
| astropy__astropy-6938 | error | — | | 14360 ms | 86.9 | — |
| astropy__astropy-7746 | error | — | | 10887 ms | 85.9 | — |
| django__django-10914 | error | — | | 9875 ms | 84.4 | — |
| django__django-10924 | failed | — | | 19740 ms | 85.5 | — |
| django__django-11001 | failed | — | | 9819 ms | 84.8 | — |
| django__django-11019 | error | — | | 7882 ms | 76.6 | — |
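The overview figures can be cross-checked against the per-sample rows. The sketch below recomputes p50, p95, and mean from the latency and tokens/s values listed in the table. The `percentile` helper is hypothetical and assumes linear interpolation between order statistics. That assumption reproduces the reported numbers, but it is not confirmed to be how the dashboard computes them.

```python
import math
import statistics

# Per-sample values copied from the table, in row order.
latencies_ms = [7902, 6488, 13857, 7708, 14360, 10887, 9875, 19740, 9819, 7882]
tokens_per_s = [131.4, 130.2, 86.1, 82.1, 86.9, 85.9, 84.4, 85.5, 84.8, 76.6]


def percentile(values, p):
    """Percentile with linear interpolation (assumed method), p in [0, 1]."""
    ordered = sorted(values)
    rank = p * (len(ordered) - 1)            # fractional index into the sorted list
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


latency_summary = {
    "p50": round(percentile(latencies_ms, 0.50)),
    "p95": round(percentile(latencies_ms, 0.95)),
    "mean": round(statistics.mean(latencies_ms)),
}
throughput_summary = {
    "p50": round(percentile(tokens_per_s, 0.50), 1),
    "mean": round(statistics.mean(tokens_per_s), 1),
}

print(latency_summary)     # {'p50': 9847, 'p95': 17319, 'mean': 10852}
print(throughput_summary)  # {'p50': 85.7, 'mean': 93.4}
```

With only 10 samples, the p95 value is interpolated between the two slowest runs (14360 ms and 19740 ms), so it is sensitive to a single outlier.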