F.cross_entropy erhöht "RunTimeError: CUDA-Fehler: Geräteseitiges Assert ausgelöst. Kompilieren Sie mit` Torch_USE_CUDA_

F.cross_entropy erhöht "RunTimeError: CUDA-Fehler: Geräteseitiges Assert ausgelöst. Kompilieren Sie mit` Torch_USE_CUDA_ ⇐ Python

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Anonymous

F.cross_entropy erhöht "RunTimeError: CUDA-Fehler: Geräteseitiges Assert ausgelöst. Kompilieren Sie mit` Torch_USE_CUDA_

Report
Quote

Post by Anonymous » 17 Aug 2025, 19:04

Ich habe diesen Fehler in der Verlustfunktion erfüllt. Ein Beispiel unten: < /p>

Code: Select all

import torch
device = "cuda" if torch.cuda.is_available() else "cpu"

a = torch.Tensor([[-10.3353, -28.4371,   2.0768,   -4.2789,  -8.6644,  -6.0815],
[-10.3353, -28.4371,   2.0768,   -4.2789,  -8.6644,  -6.0815],
[-10.3353, -28.4371,   2.0768,   -4.2789,  -8.6644,  -6.0815],
[-10.3353, -28.4371,   2.0768,   -4.2789,  -8.6644,  -6.0815]]).to(device)
b = torch.Tensor([ -100,  -1,  -100,  2456]).long().to(device)

loss = torch.nn.functional.cross_entropy(a,b,ignore_index=-1)
print(loss)
< /code>
Dann ist der Fehler: < /p>
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
 in ()
----> 1 loss = torch.nn.functional.cross_entropy(a,b,ignore_index=-1).to(device)
2 loss

/usr/local/lib/python3.10/dist-packages/torch/nn/functional.py in cross_entropy(input, target, weight, size_average, ignore_index, reduce, reduction, label_smoothing)
3027     if size_average is not None or reduce is not None:
3028         reduction = _Reduction.legacy_get_string(size_average, reduce)
-> 3029     return torch._C._nn.cross_entropy_loss(input, target, weight, _Reduction.get_enum(reduction), ignore_index, label_smoothing)
3030
3031

RuntimeError: CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.

Ich habe versucht, Export -Torch_USE_CUDA_DSA = 1 festzulegen, aber immer noch der gleiche Fehler.

1755450240

Anonymous

Ich habe diesen Fehler in der Verlustfunktion erfüllt. Ein Beispiel unten: < /p>
[code]import torch
device = "cuda" if torch.cuda.is_available() else "cpu"

a = torch.Tensor([[-10.3353, -28.4371,   2.0768,   -4.2789,  -8.6644,  -6.0815],
[-10.3353, -28.4371,   2.0768,   -4.2789,  -8.6644,  -6.0815],
[-10.3353, -28.4371,   2.0768,   -4.2789,  -8.6644,  -6.0815],
[-10.3353, -28.4371,   2.0768,   -4.2789,  -8.6644,  -6.0815]]).to(device)
b = torch.Tensor([ -100,  -1,  -100,  2456]).long().to(device)

loss = torch.nn.functional.cross_entropy(a,b,ignore_index=-1)
print(loss)
< /code>
Dann ist der Fehler: < /p>
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
 in ()
----> 1 loss = torch.nn.functional.cross_entropy(a,b,ignore_index=-1).to(device)
2 loss

/usr/local/lib/python3.10/dist-packages/torch/nn/functional.py in cross_entropy(input, target, weight, size_average, ignore_index, reduce, reduction, label_smoothing)
3027     if size_average is not None or reduce is not None:
3028         reduction = _Reduction.legacy_get_string(size_average, reduce)
-> 3029     return torch._C._nn.cross_entropy_loss(input, target, weight, _Reduction.get_enum(reduction), ignore_index, label_smoothing)
3030
3031

RuntimeError: CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
[/code]
Ich habe versucht, Export -Torch_USE_CUDA_DSA = 1  festzulegen, aber immer noch der gleiche Fehler.

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Quick Reply

Subject:

Username:

Change Text Case:

Smilies

View more smilies

Similar Topics

Replies

Views

Last post

Fehler RunTimeError: CUDA -Fehler: Operation nicht unterstützt, wenn er versucht hat, etwas in CUDA zu finden

Last post by Anonymous « 11 Feb 2025, 15:03
Posted in Python

by Anonymous » 11 Feb 2025, 15:03 » in Python

Hier ist mein Code:
from transformers import AutoModelForCausalLM, AutoTokenizer, QuantoConfig
import torch
device = cuda:0
model_id = bigscience/bloom-560m
quantization_config =...

0 Replies

28 Views

Last post by Anonymous
11 Feb 2025, 15:03
Warum ist Torch.fft.rfft(x) schneller als Torch.fft.rfft(x, out=y)?

Last post by Guest « 05 Jan 2025, 10:38
Posted in Python

by Guest » 05 Jan 2025, 10:38 » in Python

Bei der Verwendung der Funktion Torch.fft.rfft von PyTorch habe ich festgestellt, dass die Angabe eines Ausgabetensors mithilfe des Parameters out langsamer ist, als die Ausgabe intern von PyTorch...

0 Replies

23 Views

Last post by Guest
05 Jan 2025, 10:38
Warum ist Torch.fft.rfft(x) schneller als Torch.fft.rfft(x, out=y)?

Last post by Guest « 08 Jan 2025, 08:36
Posted in Python

by Guest » 08 Jan 2025, 08:36 » in Python

Bei der Verwendung der Funktion Torch.fft.rfft von PyTorch habe ich festgestellt, dass die Angabe eines Ausgabetensors mithilfe des Parameters out langsamer ist, als die Ausgabe intern von PyTorch...

0 Replies

20 Views

Last post by Guest
08 Jan 2025, 08:36
Segmentierungsfehler beim Versuch, Speicher mithilfe von Torch::from_blob::clone() vom Gst-Puffer in den Torch-Tensor zu

Last post by Guest « 16 Jan 2025, 12:12
Posted in C++

by Guest » 16 Jan 2025, 12:12 » in C++

Ich erhalte einen Segmentierungsfehler, wenn ich versuche, Speicher vom GST-Puffer in den Torch-Tensor zu kopieren. Hier ist Code. Ich werde markieren, woher der Segfault stammt:
static...

0 Replies

28 Views

Last post by Guest
16 Jan 2025, 12:12
CUDA über Debian Trixie: Die CUDA -Treiberversion reicht für die CUDA -Laufzeitversion nicht aus

Last post by Anonymous « 20 Aug 2025, 17:06
Posted in Linux

by Anonymous » 20 Aug 2025, 17:06 » in Linux

Ich habe Nvidia-Driver-580 und cuda-tools-13 auf Debian Trixie installiert (beide sind die neueste Version, die ich finden kann): $ a p t l i s t - - i n s t a l l e d | a g n v i d i a f i r m w a r...

0 Replies

5 Views

Last post by Anonymous
20 Aug 2025, 17:06

Return to “Python”