Samples · lm_eval_harness.humaneval
KI-Auswertung
Generiert 2026-05-13 03:59 · claude-sonnet-4-6Zusammenfassung
Das Modell `mlx-community/Qwen3-Coder-Next` erzielt auf dem HumanEval-Benchmark eine Pass-Rate von 0 % — keine einzige der 164 Aufgaben wird korrekt gelöst. Es treten dabei keinerlei Laufzeitfehler auf, was auf ein systematisches Ausgabeformat-Problem hindeutet.
Stärken
- Das Modell versteht die Aufgaben inhaltlich: Die Antworten enthalten korrekte Erklärungen, Lösungsansätze und Algorithmen.
- Keine Errors (0 Crashes), das Modell generiert konsistent Ausgaben.
Schwächen
- Jede Antwort endet mit einem unvollständigen Code-Block: Das Modell schreibt den Funktionskopf in den Prompt-Kontext und bricht dann genau dort ab, wo der eigentliche Funktionskörper beginnen müsste.
- Die Stop-Sequenzen (`\ndef`, `\nclass`, etc.) schneiden den generierten Code offensichtlich ab, bevor die Implementierung ausgegeben wird.
Auffälligkeiten
Alle 164 Failures zeigen dasselbe Muster: Das Modell produziert einen einleitenden Erklärungstext auf Englisch, öffnet einen Markdown-Codeblock mit ` ```python `, gibt dann den Import und ggf. den Funktionskopf aus — und dort greift die Stop-Sequenz `\ndef` und beendet die Generierung vorzeitig. Der eigentliche Funktionskörper wird nie ausgegeben. Dies ist kein Kompetenzproblem, sondern ein reines Konfigurationsproblem.
Empfehlung
Die Stop-Sequenz `\ndef` muss aus dem Harness-Konfiguration entfernt oder durch eine spezifischere Sequenz (z. B. `\n\ndef ` mit zwei Zeilenumbrüchen) ersetzt werden, da das Modell intern mit einem Reasoning/Thinking-Block oder Markdown-Codeblöcken arbeitet, in denen `def` legitim vorkommt. Alternativ sollte ein Chat-Template-Wrapper eingesetzt werden, der den Code nach dem schließenden ` ``` ` extrahiert.
Übersicht
164 Samples| Frage-ID | Status | Score | Prompt | Latenz | Tokens/s | TTFT | |
|---|---|---|---|---|---|---|---|
| 0 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 1 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 2 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef truncate… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 3 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 4 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 5 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 6 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 7 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 8 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 9 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 10 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef is_palin… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 11 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 12 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 13 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef greatest… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 14 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 15 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef string_s… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 16 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef count_di… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 17 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 18 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef how_many… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 19 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 20 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 21 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 22 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 23 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef strlen(s… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 24 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef largest_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 25 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 26 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 27 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef flip_cas… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 28 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 29 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"from typing import… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 30 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef get_posi… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 31 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef is_prime… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 32 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"import math\\n\\n\… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 33 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef sort_thi… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 34 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef unique(l… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 35 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef max_elem… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 36 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef fizz_buz… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 37 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef sort_eve… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 38 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef encode_c… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 39 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef prime_fi… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 40 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef triples_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 41 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef car_race… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 42 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef incr_lis… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 43 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef pairs_su… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 44 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef change_b… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 45 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef triangle… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 46 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef fib4(n: … | — | — | — | ||
|
Lade Detail …
|
|||||||
| 47 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef median(l… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 48 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef is_palin… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 49 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef modp(n: … | — | — | — | ||
|
Lade Detail …
|
|||||||
| 50 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef encode_s… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 51 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef remove_v… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 52 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef below_th… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 53 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef add(x: i… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 54 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef same_cha… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 55 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef fib(n: i… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 56 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef correct_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 57 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef monotoni… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 58 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef common(l… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 59 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef largest_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 60 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef sum_to_n… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 61 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef correct_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 62 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef derivati… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 63 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef fibfib(n… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 64 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\nFIX = \\\"\\\"\… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 65 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef circular_sh… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 66 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef digitSum(s)… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 67 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef fruit_distr… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 68 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef pluck(arr):… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 69 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef search(lst)… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 70 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef strange_sor… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 71 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef triangle_ar… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 72 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef will_it_fly… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 73 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef smallest_ch… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 74 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef total_match… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 75 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef is_multiply… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 76 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef is_simple_p… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 77 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef iscube(a):\… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 78 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef hex_key(num… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 79 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef decimal_to_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 80 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef is_happy(s)… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 81 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef numerical_l… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 82 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef prime_lengt… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 83 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef starts_one_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 84 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef solve(N):\\… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 85 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef add(lst):\\… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 86 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef anti_shuffl… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 87 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef get_row(lst… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 88 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef sort_array(… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 89 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef encrypt(s):… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 90 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef next_smalle… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 91 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef is_bored(S)… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 92 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef any_int(x, … | — | — | — | ||
|
Lade Detail …
|
|||||||
| 93 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef encode(mess… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 94 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef skjkasdk… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 95 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef check_dict_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 96 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef count_up_to… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 97 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef multiply(a,… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 98 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef count_upper… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 99 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef closest_int… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 100 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef make_a_pile… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 101 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef words_strin… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 102 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef choose_num(… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 103 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef rounded_avg… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 104 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef unique_digi… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 105 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef by_length(a… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 106 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef f(n):\\n … | — | — | — | ||
|
Lade Detail …
|
|||||||
| 107 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef even_odd_pa… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 108 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef count_nums(… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 109 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef move_one_ba… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 110 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef exchange(ls… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 111 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef histogram(t… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 112 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef reverse_del… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 113 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef odd_count(l… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 114 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef minSubArray… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 115 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef max_fill(gr… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 116 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef sort_array(… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 117 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef select_word… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 118 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef get_closest… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 119 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef match_paren… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 120 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef maximum(arr… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 121 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef solution(ls… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 122 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef add_element… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 123 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef get_odd_col… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 124 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef valid_date(… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 125 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef split_words… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 126 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef is_sorted(l… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 127 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef intersectio… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 128 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef prod_signs(… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 129 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef minPath(gri… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 130 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef tri(n):\\n … | — | — | — | ||
|
Lade Detail …
|
|||||||
| 131 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef digits(n):\… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 132 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef is_nested(s… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 133 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\ndef sum_squa… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 134 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef check_if_la… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 135 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef can_arrange… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 136 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef largest_sma… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 137 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef compare_one… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 138 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef is_equal_to… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 139 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef special_fac… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 140 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef fix_spaces(… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 141 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef file_name_c… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 142 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\n\\n\\ndef sum_s… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 143 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef words_in_se… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 144 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef simplify(x,… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 145 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef order_by_po… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 146 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef specialFilt… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 147 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef get_max_tri… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 148 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef bf(planet1,… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 149 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef sorted_list… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 150 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef x_or_y(n, x… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 151 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef double_the_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 152 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef compare(gam… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 153 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef Strongest_E… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 154 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef cycpattern_… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 155 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef even_odd_co… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 156 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef int_to_mini… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 157 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef right_angle… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 158 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef find_max(wo… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 159 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef eat(number,… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 160 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef do_algebra(… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 161 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef solve(s):\\… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 162 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef string_to_m… | — | — | — | ||
|
Lade Detail …
|
|||||||
| 163 | failed | {…} {"gen_args_0":{"arg_0":["[{\"role\": \"user\", \"content\": \"\\ndef generate_in… | — | — | — | ||
|
Lade Detail …
|
|||||||