How do I use an AWQ-quantized classification model?


Post by Guest »

If I have a classification model trained on top of Qwen2.5-0.5B:

Code: Select all

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B")
model = AutoModelForSequenceClassification.from_pretrained(
    "Qwen/Qwen2.5-0.5B",
    device_map="auto",
    num_labels=2,
    torch_dtype=torch.bfloat16,
)
How do I quantize it with AWQ and calibrate it? First I save the fine-tuned model:

Code: Select all

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B")
model = AutoModelForSequenceClassification.from_pretrained(
    "Qwen/Qwen2.5-0.5B",
    device_map="auto",
    num_labels=2,
    torch_dtype=torch.bfloat16,
)

# model_path: directory where the fine-tuned classifier is saved
model.save_pretrained(model_path)
tokenizer.save_pretrained(model_path)
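
For the calibration set, AutoAWQ's quantize() accepts either a built-in dataset name or a plain list of raw text strings as calib_data. Here is a minimal sketch for the data variable used in the next snippet, assuming your own in-domain classification texts live in a hypothetical file calib.txt (one example per line); a few hundred samples is typical:

Code: Select all

# Hypothetical calibration file: one raw task text per line.
# AutoAWQ accepts a plain list of strings as calib_data.
with open("calib.txt") as f:
    data = [line.strip() for line in f if line.strip()]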

I then performed the conversion with AutoAWQ, but found that after quantization the head layer had changed from score to lm_head (the model architecture was changed):

Code: Select all
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
# Note: AutoAWQForCausalLM loads the checkpoint as a causal LM, not as a classifier
model = AutoAWQForCausalLM.from_pretrained(model_path, device_map="auto", safetensors=True)

# data: the list of calibration texts prepared above
model.quantize(tokenizer, quant_config=quant_config, calib_data=data)
model.save_quantized(quant_path, safetensors=True, shard_size="4GB")
tokenizer.save_pretrained(quant_path)
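
One possible way to still use the quantized weights for classification (a sketch only, not verified for this exact setup): load the quantized backbone with AutoAWQ, take the fine-tuned score head from the original bf16 checkpoint, and apply it to the backbone's last hidden state yourself. Qwen2ForSequenceClassification pools the hidden state of the last non-padding token, so for a single unpadded input that is just the last position:

Code: Select all

# Sketch: classification on top of the AWQ backbone. model_path / quant_path
# are the directories from above; dtypes and pooling are the main caveats.
import torch
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained(quant_path)
awq_model = AutoAWQForCausalLM.from_quantized(
    quant_path, fuse_layers=False, device_map="auto"
)

# Recover the fine-tuned score head from the original bf16 classifier.
orig = AutoModelForSequenceClassification.from_pretrained(
    model_path, num_labels=2, torch_dtype=torch.bfloat16
)
score = orig.score.to(device="cuda", dtype=torch.float16)

inputs = tokenizer("example text", return_tensors="pt").to("cuda")
with torch.no_grad():
    # awq_model.model is the HF causal LM; its .model is the bare backbone.
    hidden = awq_model.model.model(**inputs).last_hidden_state
    logits = score(hidden[:, -1, :])  # last-token pooling, then classify

Keeping the head outside the quantized graph in fp16 also sidesteps the question of how AWQ should treat the score layer in the first place.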
