So erstellen Sie eine Einbettung für ein 4-Bit-quantisiertes Lama3-Modell mithilfe von Huggingface und Langchain

So erstellen Sie eine Einbettung für ein 4-Bit-quantisiertes Lama3-Modell mithilfe von Huggingface und Langchain ⇐ Python

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Anonymous

So erstellen Sie eine Einbettung für ein 4-Bit-quantisiertes Lama3-Modell mithilfe von Huggingface und Langchain

Report
Quote

Post by Anonymous » 12 Nov 2025, 12:05

Ich versuche, einen Rag mit Longchain und Huggingface zu machen,

Code: Select all

from langchain_huggingface import HuggingFaceEmbeddings

model_name = "unsloth/llama-3-8b-Instruct-bnb-4bit"
model_kwargs = {'device': device}
encode_kwargs = {'normalize_embeddings': False}
hf = HuggingFaceEmbeddings(
model_name=model_name,
model_kwargs=model_kwargs,
encode_kwargs=encode_kwargs
)
...
vectorstore = Chroma.from_documents(documents=splits, embedding=hf)

Allerdings erhalte ich beim Erstellen der HF die Fehlermeldung „ValueError: Supplied state dict for Layers.0.mlp.down_proj.weight does not contains bitsandbytes__* and möglicherweise other quantized_stats Components“.
Wie soll ich das korrigieren? Vielen Dank

1762945536

Anonymous

Ich versuche, einen Rag mit Longchain und Huggingface zu machen,
[code]from langchain_huggingface import HuggingFaceEmbeddings

model_name = "unsloth/llama-3-8b-Instruct-bnb-4bit"
model_kwargs = {'device': device}
encode_kwargs = {'normalize_embeddings': False}
hf = HuggingFaceEmbeddings(
model_name=model_name,
model_kwargs=model_kwargs,
encode_kwargs=encode_kwargs
)
...
vectorstore = Chroma.from_documents(documents=splits, embedding=hf)
[/code]
Allerdings erhalte ich beim Erstellen der HF die Fehlermeldung „ValueError: Supplied state dict for Layers.0.mlp.down_proj.weight does not contains bitsandbytes__* and möglicherweise other quantized_stats Components“.
Wie soll ich das korrigieren? Vielen Dank

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Quick Reply

Subject:

Username:

Change Text Case:

Smilies

View more smilies

Similar Topics

Replies

Views

Last post

NEU 64-BIT DEV (Old 32-Bit Dev): Warum ist meine 64-Bit-ausführbare Datei so riesig?

Last post by Anonymous « 12 Jul 2025, 20:00
Posted in C++

by Anonymous » 12 Jul 2025, 20:00 » in C++

Ich habe mich seit V3 im C ++ - Builder entwickelt. Der größte Teil meiner Arbeit wurde in V5 und V6 erledigt. Ich bin gerade jetzt nach ein paar Jahren wieder darauf zurück und probiere die...

0 Replies

58 Views

Last post by Anonymous
12 Jul 2025, 20:00
Erhalten Sie ein 32-Bit-Programm für eine 64-Bit

Last post by Anonymous « 23 Sep 2025, 11:36
Posted in Linux

by Anonymous » 23 Sep 2025, 11:36 » in Linux

Ich versuche, eine ausführbare Datei auszuführen, die für 32-Bit-Linux kompiliert wurde, auf zwei Maschinen, die 64-Bit-Linux ausführen. Auf dieser Maschine läuft das Programm gut. Strace Ausgabe:...

0 Replies

34 Views

Last post by Anonymous
23 Sep 2025, 11:36
Das CLIP-Modell aus dem Modul „open_clip“ gibt eine einzelne Einbettung für 77 Token zurück

Last post by Anonymous « 14 Jan 2026, 20:37
Posted in Python

by Anonymous » 14 Jan 2026, 20:37 » in Python

Ich verwende das Modul open_clip, um Texteinbettungen aus dem CLIP-Modell zu erhalten. Wenn ich eine Liste einer einzelnen Textsequenz tokenisiere und sie an die Methode encode_text des Modells...

0 Replies

1 Views

Last post by Anonymous
14 Jan 2026, 20:37
Best Practices zum Erstellen und Dekonstruktion von 32-Bit-komplexen Werten in C ++? (2 x 16-Bit-Schwimmer)

Last post by Anonymous « 23 Apr 2025, 10:44
Posted in C++

by Anonymous » 23 Apr 2025, 10:44 » in C++

Ich beginne zum ersten Mal in C ++ mit 16-Bit-Floats und insbesondere mit 32-Bit-komplexen Werten, die mit _float16 _complex deklariert sind. Dabei wundere ich mich über Best Practices, um neue Werte...

0 Replies

32 Views

Last post by Anonymous
23 Apr 2025, 10:44
Fehler beim Importieren von Langchain-Modulen: Kein Modul mit dem Namen „langchain.chains“

Last post by Anonymous « 13 Nov 2025, 14:23
Posted in Python

by Anonymous » 13 Nov 2025, 14:23 » in Python

Ich versuche, eine FastAPI RAG-Anwendung mit Google Cloud und Vertex AI mit Python3.12 unter Verwendung der LangChain-Bibliothek zu programmieren.
Wenn ich jedoch versuche, die LangChain-Bibliothek...

0 Replies

17 Views

Last post by Anonymous
13 Nov 2025, 14:23

Return to “Python”