Wie man FFT -basierte Faltung mit langen Doppel/Float128s schnell und genau durchführt

Wie man FFT -basierte Faltung mit langen Doppel/Float128s schnell und genau durchführt ⇐ Python

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Guest

Wie man FFT -basierte Faltung mit langen Doppel/Float128s schnell und genau durchführt

Post by Guest » 07 Feb 2025, 02:25

Auf meinem Linux -System habe ich: < /p>

np.finfo(np.float128)
info(resolution=1e-18, min=-1.189731495357231765e+4932, max=1.189731495357231765e+4932, dtype=float128)
< /code>
Das ist also ein 80-Bit-Doppel. Ich möchte eine Faltung zwischen zwei einigermaßen langen Arrays von NP.Float128

s durchführen. scipy.signal.convolve funktioniert mit methode = 'Direct' , gibt jedoch die falsche Antwort für method = 'fft' an. Hier ist ein Spielzeugbeispiel: < /p>

Code: Select all

a = np.array(['1.e+401', '1.e+000', '1.e+401', '1.e+000'], dtype=np.float128)

convolve(a, a, mode='full', method='direct')
array([1.e+802, 2.e+401, 2.e+802, 4.e+401, 1.e+802, 2.e+401, 1.e+000],
dtype=float128) # correct
convolve(a, a, mode='full', method='fft')
array([1.e+802, 0.e+000, 2.e+802, 0.e+000, 1.e+802, 0.e+000, 0.e+000],
dtype=float128) # wrong

Ich habe versucht, die Faltung mit Pyfftw von Grund auf neu zu implementieren, aber es gab immer noch die falsche Antwort. Um eine Flucht zu machen, die dem folgenden schnell und genau mit FFTS ähnelt: < /p>

Code: Select all

a = np.array([1e401, 1e000, 1e401, 1e000] * 10000, dtype=np.float128)
convolve(a, a)

Wie kann das geschehen?

1738891526

Guest

Auf meinem Linux -System habe ich: < /p>
[code]np.finfo(np.float128)
info(resolution=1e-18, min=-1.189731495357231765e+4932, max=1.189731495357231765e+4932, dtype=float128)
< /code>
Das ist also ein 80-Bit-Doppel. Ich möchte eine Faltung zwischen zwei einigermaßen langen Arrays von NP.Float128 [/code] s durchführen.  scipy.signal.convolve  funktioniert mit methode = 'Direct' , gibt jedoch die falsche Antwort für method = 'fft'  an. Hier ist ein Spielzeugbeispiel: < /p>
[code]a = np.array(['1.e+401', '1.e+000', '1.e+401', '1.e+000'], dtype=np.float128)

convolve(a, a, mode='full', method='direct')
array([1.e+802, 2.e+401, 2.e+802, 4.e+401, 1.e+802, 2.e+401, 1.e+000],
dtype=float128) # correct
convolve(a, a, mode='full', method='fft')
array([1.e+802, 0.e+000, 2.e+802, 0.e+000, 1.e+802, 0.e+000, 0.e+000],
dtype=float128) # wrong
[/code]
Ich habe versucht, die Faltung mit Pyfftw  von Grund auf neu zu implementieren, aber es gab immer noch die falsche Antwort. Um eine Flucht zu machen, die dem folgenden schnell und genau mit FFTS ähnelt: < /p>
[code]a = np.array([1e401, 1e000, 1e401, 1e000] * 10000, dtype=np.float128)
convolve(a, a)
[/code]
Wie kann das geschehen?

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Quick Reply

Username:

Change Text Case:

Smilies

View more smilies

Similar Topics

Replies

Views

Last post

Warum ist Torch.fft.rfft(x) schneller als Torch.fft.rfft(x, out=y)?

Last post by Guest « 05 Jan 2025, 10:38
Posted in Python

by Guest » 05 Jan 2025, 10:38 » in Python

Bei der Verwendung der Funktion Torch.fft.rfft von PyTorch habe ich festgestellt, dass die Angabe eines Ausgabetensors mithilfe des Parameters out langsamer ist, als die Ausgabe intern von PyTorch...

0 Replies

14 Views

Last post by Guest
05 Jan 2025, 10:38
Warum ist Torch.fft.rfft(x) schneller als Torch.fft.rfft(x, out=y)?

Last post by Guest « 08 Jan 2025, 08:36
Posted in Python

by Guest » 08 Jan 2025, 08:36 » in Python

Bei der Verwendung der Funktion Torch.fft.rfft von PyTorch habe ich festgestellt, dass die Angabe eines Ausgabetensors mithilfe des Parameters out langsamer ist, als die Ausgabe intern von PyTorch...

0 Replies

15 Views

Last post by Guest
08 Jan 2025, 08:36
Cupy ndimage Faltung in einer verschachtelten Schließverschlüsselung scheint schnell, aber die nächste Ausführung ist in

Last post by Anonymous « 13 Feb 2025, 21:13
Posted in Python

by Anonymous » 13 Feb 2025, 21:13 » in Python

Ich versuche, Code zu schreiben, das ein 3D -Bild mit einem 3D -Wavelet -Kernel verteilt, das mit drei unabhängigen Parametern beschrieben werden kann. Ich möchte die Ergebnisse der Faltung für alle...

0 Replies

10 Views

Last post by Anonymous
13 Feb 2025, 21:13
Cupy ndimage Faltung in einer verschachtelten Schließverschlüsselung scheint schnell, aber die nächste Ausführung ist in

Last post by Anonymous « 13 Feb 2025, 21:34
Posted in Python

by Anonymous » 13 Feb 2025, 21:34 » in Python

Ich versuche, Code zu schreiben, das ein 3D -Bild mit einem 3D -Wavelet -Kernel verteilt, das mit drei unabhängigen Parametern beschrieben werden kann. Ich möchte die Ergebnisse der Faltung für alle...

0 Replies

7 Views

Last post by Anonymous
13 Feb 2025, 21:34
Cupy ndimage Faltung in einer verschachtelten Schließverschlüsselung scheint schnell, aber die nächste Ausführung ist in

Last post by Guest « 14 Feb 2025, 04:53
Posted in Python

by Guest » 14 Feb 2025, 04:53 » in Python

Ich versuche, Code zu schreiben, das ein 3D -Bild mit einem 3D -Wavelet -Kernel verteilt, das mit drei unabhängigen Parametern beschrieben werden kann. Ich möchte die Ergebnisse der Faltung für alle...

0 Replies

6 Views

Last post by Guest
14 Feb 2025, 04:53

Return to “Python”