Ich erstelle einen Benchmark, um ein Sprachmodell zu bewerten. Zuerst habe ich den Datensatz generiert, mit dem ich das Sprachmodell fordert. Anschließend versuchte ich, ein winziges Sprachmodell zu bewerten, um eine End-to-End-App-App zu erhalten. Ich folgte den Anweisungen zu den Dokumenten der umarmenden LightVal -Bibliotheksbibliotheks, aber ich erhalte immer wieder Fehler, wenn ich versuche, den Bewertungscode auszuführen. Angesichts der Tatsache, dass ich Python 3.11 /Versuch 3.10 verwende und das gleiche passiert.
main ()
Datei "C: \ Benutzer \ Mahmoud \ Desktop \ Codings \ Benchmark-EVAL-LLM \ Run_Evaluation.py", Zeile 21, in Main
Pipeline_params = PipelineParameters (
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ PipelineParameter. />
[*] Evaluation /< /li>
----Custom_csi_task.py
[*]csi_benchmark_advanced.jsonl
[*]run_evaluation.py
custom_csi_task.py
import lighteval
from lighteval.logging.evaluation_tracker import EvaluationTracker
from lighteval.models.vllm.vllm_model import VLLMModelConfig
from lighteval.pipeline import ParallelismManager, Pipeline, PipelineParameters
from lighteval.utils.imports import is_accelerate_available
if is_accelerate_available():
from datetime import timedelta
from accelerate import Accelerator, InitProcessGroupKwargs
accelerator = Accelerator(kwargs_handlers=[InitProcessGroupKwargs(timeout=timedelta(seconds=3000))])
else:
accelerator = None
def main():
evaluation_tracker = EvaluationTracker(
output_dir="./results",
save_details=True,
)
pipeline_params = PipelineParameters(
launcher_type=ParallelismManager.ACCELERATE,
custom_task_directory='evaluation', # if using a custom task
# Remove the 2 parameters below once your configuration is tested
)
model_config = VLLMModelConfig(
model_name="HuggingFaceH4/zephyr-7b-beta",
dtype="float16",
use_chat_template=True,
)
task = "helm|mmlu|5|1"
pipeline = Pipeline(
# tasks=task,
pipeline_parameters=pipeline_params,
evaluation_tracker=evaluation_tracker,
model_config=model_config,
)
pipeline.evaluate()
pipeline.save_and_push_results()
pipeline.show_results()
if __name__ == "__main__":
main()
< /code>
csi_benchmark_advanced.jsonl
{"task_id": "CSI-EN-DE-001", "task_type": "cross_lingual_synonym_identification", "source_word": "house", "source_lang": "EN", "target_lang": "DE", "question": "Which word has the same meaning as 'house' in German?", "choices": ["Torbau", "häusliches Arbeitszimmer", "Haus", "Dachboden"], "answer": "Haus"}
{"task_id": "CSI-EN-FR-002", "task_type": "cross_lingual_synonym_identification", "source_word": "house", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'house' in French?", "choices": ["porche", "galetas", "maison", "bibliothèque"], "answer": "maison"}
{"task_id": "CSI-EN-DE-003", "task_type": "cross_lingual_synonym_identification", "source_word": "water", "source_lang": "EN", "target_lang": "DE", "question": "Which word has the same meaning as 'water' in German?", "choices": ["Flaschenwasser", "Eiswasser", "Trinkwasser", "Sodawasser"], "answer": "Trinkwasser"}
{"task_id": "CSI-EN-FR-004", "task_type": "cross_lingual_synonym_identification", "source_word": "water", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'water' in French?", "choices": ["eau potable", "eau gazeuse", "eau en bouteille", "l'eau glacée"], "answer": "eau potable"}
{"task_id": "CSI-EN-DE-005", "task_type": "cross_lingual_synonym_identification", "source_word": "sun", "source_lang": "EN", "target_lang": "DE", "question": "Which word has the same meaning as 'sun' in German?", "choices": ["ruhetag", "Sonntag", "Eins", "Weekend"], "answer": "Sonntag"}
{"task_id": "CSI-EN-FR-006", "task_type": "cross_lingual_synonym_identification", "source_word": "sun", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'sun' in French?", "choices": ["fin de semaine", "un", "jour de repos", "dimanche"], "answer": "dimanche"}
{"task_id": "CSI-EN-DE-007", "task_type": "cross_lingual_synonym_identification", "source_word": "tree", "source_lang": "EN", "target_lang": "DE", "question": "Which word has the same meaning as 'tree' in German?", "choices": ["Kernholz", "Wald", "Splintholz", "Baum"], "answer": "Baum"}
{"task_id": "CSI-EN-FR-008", "task_type": "cross_lingual_synonym_identification", "source_word": "tree", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'tree' in French?", "choices": ["aubier", "bois", "arbre", "duramen"], "answer": "arbre"}
{"task_id": "CSI-EN-FR-009", "task_type": "cross_lingual_synonym_identification", "source_word": "eat", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'eat' in French?", "choices": ["essuyer", "consommer", "petit", "occuper"], "answer": "consommer"}
Ich erstelle einen Benchmark, um ein Sprachmodell zu bewerten. Zuerst habe ich den Datensatz generiert, mit dem ich das Sprachmodell fordert. Anschließend versuchte ich, ein winziges Sprachmodell zu bewerten, um eine End-to-End-App-App zu erhalten. Ich folgte den Anweisungen zu den Dokumenten der umarmenden LightVal -Bibliotheksbibliotheks, aber ich erhalte immer wieder Fehler, wenn ich versuche, den Bewertungscode auszuführen. Angesichts der Tatsache, dass ich Python 3.11 /Versuch 3.10 verwende und das gleiche passiert. main () Datei "C: \ Benutzer \ Mahmoud \ Desktop \ Codings \ Benchmark-EVAL-LLM \ Run_Evaluation.py", Zeile 21, in Main Pipeline_params = PipelineParameters ( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ PipelineParameter. /> [*] Evaluation /< /li> ----Custom_csi_task.py [*]csi_benchmark_advanced.jsonl [*]run_evaluation.py custom_csi_task.py [code] import numpy as np from lighteval.tasks.lighteval_task import LightevalTask from lighteval.tasks.requests import Doc
class CustomCSI(LightevalTask): def doc_to_text(self, doc: Doc) -> str: return doc["question"]
pipeline_params = PipelineParameters( launcher_type=ParallelismManager.ACCELERATE, custom_task_directory='evaluation', # if using a custom task # Remove the 2 parameters below once your configuration is tested
{"task_id": "CSI-EN-DE-001", "task_type": "cross_lingual_synonym_identification", "source_word": "house", "source_lang": "EN", "target_lang": "DE", "question": "Which word has the same meaning as 'house' in German?", "choices": ["Torbau", "häusliches Arbeitszimmer", "Haus", "Dachboden"], "answer": "Haus"} {"task_id": "CSI-EN-FR-002", "task_type": "cross_lingual_synonym_identification", "source_word": "house", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'house' in French?", "choices": ["porche", "galetas", "maison", "bibliothèque"], "answer": "maison"} {"task_id": "CSI-EN-DE-003", "task_type": "cross_lingual_synonym_identification", "source_word": "water", "source_lang": "EN", "target_lang": "DE", "question": "Which word has the same meaning as 'water' in German?", "choices": ["Flaschenwasser", "Eiswasser", "Trinkwasser", "Sodawasser"], "answer": "Trinkwasser"} {"task_id": "CSI-EN-FR-004", "task_type": "cross_lingual_synonym_identification", "source_word": "water", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'water' in French?", "choices": ["eau potable", "eau gazeuse", "eau en bouteille", "l'eau glacée"], "answer": "eau potable"} {"task_id": "CSI-EN-DE-005", "task_type": "cross_lingual_synonym_identification", "source_word": "sun", "source_lang": "EN", "target_lang": "DE", "question": "Which word has the same meaning as 'sun' in German?", "choices": ["ruhetag", "Sonntag", "Eins", "Weekend"], "answer": "Sonntag"} {"task_id": "CSI-EN-FR-006", "task_type": "cross_lingual_synonym_identification", "source_word": "sun", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'sun' in French?", "choices": ["fin de semaine", "un", "jour de repos", "dimanche"], "answer": "dimanche"} {"task_id": "CSI-EN-DE-007", "task_type": "cross_lingual_synonym_identification", "source_word": "tree", "source_lang": "EN", "target_lang": "DE", "question": "Which word has the same meaning as 'tree' in German?", "choices": ["Kernholz", "Wald", "Splintholz", "Baum"], "answer": "Baum"} {"task_id": "CSI-EN-FR-008", "task_type": "cross_lingual_synonym_identification", "source_word": "tree", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'tree' in French?", "choices": ["aubier", "bois", "arbre", "duramen"], "answer": "arbre"} {"task_id": "CSI-EN-FR-009", "task_type": "cross_lingual_synonym_identification", "source_word": "eat", "source_lang": "EN", "target_lang": "FR", "question": "Which word has the same meaning as 'eat' in French?", "choices": ["essuyer", "consommer", "petit", "occuper"], "answer": "consommer"}
Ich habe bereits versucht, das Paket für biometrische Anmeldungen zu reagieren. Aber ich konnte keine Lösung finden, um Zugriff auf FaceID in Android zu erhalten. oder Gesichtserkennung. Ich habe...
Schritt 1: Ich möchte den folgenden Code in Google Colab ausführen. (patients_query_sampled Ordner). Warum?
Ich weiß nicht, wie ich das heruntergeladen habe (nicht Github!from datasets import...
Ausgabe
Ich habe versucht, mich für das Umarmungsgesichts -CLI anzumelden, aber ich habe den folgenden Fehler immer wieder erhalten:
requests.exceptions.htttperror : 504 Server-Fehler: Gateway...
Ausgabe
Ich habe versucht, mich für das Umarmungsgesichts -CLI anzumelden, aber ich habe den folgenden Fehler immer wieder erhalten:
requests.exceptions.htttperror : 504 Server-Fehler: Gateway...
Basierend auf dem, was eine Aufgabe als Aufgabe in einer Aufgabe in einer Aufgabe in eine Aufgabe umrundet. ist ungefähr so:
async Task ConvertAsync(Task task, T result) {
await task;
return...