ใช้ Qiskit Code Assistant ในโหมด local

เรียนรู้วิธีติดตั้ง กำหนดค่า และใช้โมเดล Qiskit Code Assistant ใดก็ได้บนเครื่องของคุณ

หมายเหตุ

Qiskit Code Assistant อยู่ในสถานะ preview release และอาจมีการเปลี่ยนแปลง
หากคุณมีความคิดเห็นหรือต้องการติดต่อทีมนักพัฒนา ใช้ ช่อง Qiskit Slack Workspace หรือ public GitHub repositories ที่เกี่ยวข้อง

เริ่มต้นใช้งานด่วน (แนะนำ)

วิธีที่ง่ายที่สุดในการเริ่มต้นใช้ Qiskit Code Assistant ในโหมด local คือการใช้ script ตั้งค่าอัตโนมัติสำหรับ VS Code หรือ JupyterLab extension script เหล่านี้จะติดตั้ง Ollama เพื่อรัน LLM โดยอัตโนมัติ ดาวน์โหลดโมเดลที่แนะนำ และกำหนดค่า extension ให้คุณ

การตั้งค่า VS Code extension

รันคำสั่งต่อไปนี้ใน terminal ของคุณ:

bash <(curl -fsSL https://raw.githubusercontent.com/Qiskit/qiskit-code-assistant-vscode/main/setup_local.sh)

script นี้ดำเนินการขั้นตอนต่อไปนี้:

ติดตั้ง Ollama (หากยังไม่ได้ติดตั้ง)
ดาวน์โหลดและกำหนดค่าโมเดล Qiskit Code Assistant ที่แนะนำ
ตั้งค่า VS Code extension ให้ทำงานกับ local deployment ของคุณ

การตั้งค่า JupyterLab extension

รันคำสั่งต่อไปนี้ใน terminal ของคุณ:

bash <(curl -fsSL https://raw.githubusercontent.com/Qiskit/qiskit-code-assistant-jupyterlab/main/setup_local.sh)

script นี้จะ:

ติดตั้ง Ollama (หากยังไม่ได้ติดตั้ง)
ดาวน์โหลดและกำหนดค่าโมเดล Qiskit Code Assistant ที่แนะนำ
ตั้งค่า JupyterLab extension ให้ทำงานกับ local deployment ของคุณ

โมเดลที่ใช้ได้

โมเดลปัจจุบัน

เหล่านี้คือโมเดลล่าสุดที่แนะนำสำหรับใช้กับ Qiskit Code Assistant:

Qiskit/mistral-small-3.2-24b-qiskit - เปิดตัว October 2025
qiskit/qwen2.5-coder-14b-qiskit - เปิดตัว June 2025
qiskit/granite-3.3-8b-qiskit - เปิดตัว June 2025
qiskit/granite-3.2-8b-qiskit - เปิดตัว June 2025

โมเดล GGUF (แนะนำสำหรับ personal environments/laptops)

โมเดลรูปแบบ GGUF ได้รับการปรับแต่งสำหรับการใช้งาน local และต้องใช้ทรัพยากรการประมวลผลน้อยกว่า:

mistral-small-3.2-24b-qiskit-GGUF – เปิดตัว October 2025 ฝึกด้วยข้อมูล Qiskit จนถึงเวอร์ชัน 2.1
qiskit/qwen2.5-coder-14b-qiskit-GGUF – เปิดตัว June 2025 ฝึกด้วยข้อมูล Qiskit จนถึงเวอร์ชัน 2.0
qiskit/granite-3.3-8b-qiskit-GGUF – เปิดตัว June 2025 ฝึกด้วยข้อมูล Qiskit จนถึงเวอร์ชัน 2.0
qiskit/granite-3.2-8b-qiskit-GGUF – เปิดตัว June 2025 ฝึกด้วยข้อมูล Qiskit จนถึงเวอร์ชัน 2.0

โมเดล Qiskit Code Assistant แบบ Open Source มีให้ในรูปแบบ safetensors หรือ GGUF file format และสามารถดาวน์โหลดได้จาก Hugging Face ตามที่อธิบายด้านล่าง

เวอร์ชัน Qiskit ที่ใช้ฝึกโมเดล

โมเดล						Benchmark Metrics					วันที่เปิดตัว	ฝึกบน Qiskit เวอร์ชัน
	QiskitHumanEval-Hard	QiskitHumanEval	HumanEval	ASDiv	MathQA	SciQ	MBPP	IFEval	CrowsPairs (English)	TruthfulQA (MC1 acc)
mistral-small-3.2-24b-qiskit	32.45	47.02	77.49	3.77	49.68	97.50	64.00	48.44	67.08	39.41	January 2026	2.2
qwen2.5-coder-14b-qiskit	25.17	49.01	91.46	4.21	53.90	97.00	77.60	49.64	65.18	37.82	June 2025	2.0
granite-3.3-8b-qiskit	14.57	27.15	62.80	0.48	38.66	93.30	52.40	59.71	59.75	39.05	June 2025	2.0
granite-3.2-8b-qiskit	9.93	24.50	57.32	0.09	41.41	96.30	51.80	60.79	66.79	40.51	June 2025	2.0
granite-8b-qiskit-rc-0.10	15.89	38.41	59.76	—	—	—	—	—	—	—	February 2025	1.3
granite-8b-qiskit	17.88	44.37	53.66	—	—	—	—	—	—	—	November 2024	1.2

หมายเหตุ: โมเดลทั้งหมดที่แสดงในตาราง benchmark ได้รับการประเมินโดยใช้ system prompt ของตน ซึ่งกำหนดไว้ในโมเดล Hugging Face

โมเดลที่ deprecated แล้ว

โมเดลเหล่านี้ไม่ได้รับการดูแลอย่างต่อเนื่องอีกต่อไป แต่ยังคงใช้งานได้:

qiskit/granite-8b-qiskit-rc-0.10 - เปิดตัว February 2025 (deprecated)
qiskit/granite-8b-qiskit - เปิดตัว November 2024 (deprecated)

การตั้งค่าขั้นสูง

หากคุณต้องการกำหนดค่า local setup ด้วยตนเองหรือต้องการควบคุมกระบวนการติดตั้งมากขึ้น ให้ขยาย section ด้านล่าง

ดาวน์โหลดจากเว็บไซต์ Hugging Face

ทำตามขั้นตอนเหล่านี้เพื่อดาวน์โหลดโมเดลที่เกี่ยวข้องกับ Qiskit Code Assistant จากเว็บไซต์ Hugging Face:

ไปที่หน้าโมเดล Qiskit ที่ต้องการบน Hugging Face
ไปที่แท็บ Files and Versions และดาวน์โหลดไฟล์โมเดล safetensors หรือ GGUF

ดาวน์โหลดโดยใช้ Hugging Face CLI

ในการดาวน์โหลดโมเดล Qiskit Code Assistant ที่ใช้ได้โดยใช้ Hugging Face CLI ให้ทำตามขั้นตอนเหล่านี้:

ติดตั้ง Hugging Face CLI
เข้าสู่ระบบ Hugging Face account ของคุณ
```
huggingface-cli login
```
ดาวน์โหลดโมเดลที่คุณต้องการจากรายการก่อนหน้า
```
huggingface-cli download <HF REPO NAME> <MODEL PATH> --local-dir <LOCAL PATH>
```

Deploy โมเดล Qiskit Code Assistant ใน local ด้วย Ollama ด้วยตนเอง

มีหลายวิธีในการ deploy และโต้ตอบกับโมเดล Qiskit Code Assistant ที่ดาวน์โหลดมา คู่มือนี้แสดงการใช้ Ollama ดังนี้: ด้วย Ollama application โดยใช้ Hugging Face Hub integration หรือโมเดล local หรือด้วย แพ็กเกจ llama-cpp-python

การใช้ Ollama application

Ollama application ให้วิธีแก้ปัญหาที่ง่ายในการรัน LLM ใน local ง่ายต่อการใช้งาน พร้อม CLI ที่ทำให้กระบวนการตั้งค่าทั้งหมด การจัดการโมเดล และการโต้ตอบค่อนข้างตรงไปตรงมา เหมาะสำหรับการทดลองอย่างรวดเร็วและสำหรับผู้ใช้ที่ต้องการรายละเอียดทางเทคนิคน้อยกว่า

ติดตั้ง Ollama

ดาวน์โหลด Ollama application
ติดตั้งไฟล์ที่ดาวน์โหลด
เปิดใช้งาน Ollama application ที่ติดตั้ง

ข้อมูล
application ทำงานสำเร็จเมื่อไอคอน Ollama ปรากฏใน desktop menu bar คุณยังสามารถตรวจสอบว่า service ทำงานอยู่โดยไปที่ http://localhost:11434/
ลอง Ollama ใน terminal ของคุณและเริ่มรันโมเดล ตัวอย่าง:
```
ollama run hf.co/Qiskit/Qwen2.5-Coder-14B-Qiskit
```

ตั้งค่า Ollama โดยใช้ Hugging Face Hub integration

Ollama/Hugging Face Hub integration ให้วิธีโต้ตอบกับโมเดลที่โฮสต์บน Hugging Face Hub โดยไม่จำเป็นต้องสร้าง modelfile ใหม่หรือดาวน์โหลดไฟล์ GGUF หรือ safetensors ด้วยตนเอง ไฟล์ template และ params เริ่มต้นรวมอยู่ในโมเดลบน Hugging Face Hub แล้ว

ตรวจสอบให้แน่ใจว่า Ollama application ทำงานอยู่
ไปที่หน้าโมเดลที่ต้องการ และคัดลอก URL ตัวอย่าง https://huggingface.co/Qiskit/Qwen2.5-Coder-14B-Qiskit-GGUF
จาก terminal ของคุณ รันคำสั่ง:
```
ollama run hf.co/Qiskit/Qwen2.5-Coder-14B-Qiskit
```

คุณสามารถใช้โมเดล hf.co/Qiskit/Qwen2.5-Coder-14B-Qiskit หรือโมเดล GGUF อย่างเป็นทางการที่แนะนำอื่น ๆ hf.co/Qiskit/mistral-small-3.2-24b-qiskit-GGUF หรือ hf.co/Qiskit/granite-3.3-8b-qiskit-GGUF

ตั้งค่า Ollama ด้วยโมเดล Qiskit Code Assistant GGUF ที่ดาวน์โหลดด้วยตนเอง

หากคุณดาวน์โหลดโมเดล GGUF ด้วยตนเอง เช่น https://huggingface.co/Qiskit/Qwen2.5-Coder-14B-Qiskit-GGUF และต้องการทดลองด้วย template และ parameter ต่างกัน คุณสามารถทำตามขั้นตอนเหล่านี้เพื่อโหลดเข้า Ollama application ใน local ของคุณ

สร้าง Modelfile โดยป้อนเนื้อหาต่อไปนี้ และอย่าลืมอัปเดต <PATH-TO-GGUF-FILE> เป็นเส้นทางจริงของโมเดลที่ดาวน์โหลด

FROM <PATH-TO-GGUF-FILE>
TEMPLATE """{{ if .System }}
System:
{{ .System }}

{{ end }}{{ if .Prompt }}Question:
{{ .Prompt }}

{{ end }}Answer:
```python{{ .Response }}
"""

PARAMETER stop "Question:"
PARAMETER stop "Answer:"
PARAMETER stop "System:"
PARAMETER stop "```"

PARAMETER temperature 0
PARAMETER top_k 1

Run the following command to create a custom model instance based on the Modelfile.
```
ollama create Qwen2.5-Coder-14B-Qiskit -f ./path-to-model-file
```
หมายเหตุ
This process may take some time for Ollama to read the model file, initialize the model instance, and configure it according to the specifications provided.

Run the Qiskit Code Assistant model manually downloaded in Ollama

After the Qwen2.5-Coder-14B-Qiskit model has been set up in Ollama, run the following command to launch the model and interact with it in the terminal (in chat mode).

ollama run Qwen2.5-Coder-14B-Qiskit

Some useful commands:

ollama list - List models on your computer
ollama rm Qwen2.5-Coder-14B-Qiskit - Delete the model
ollama show Qwen2.5-Coder-14B-Qiskit - Show model information
ollama stop Qwen2.5-Coder-14B-Qiskit - Stop a model that is currently running
ollama ps - List which models are currently loaded

Manually deploy the Qiskit Code Assistant models in local through the llama-cpp-python package

An alternative to the Ollama application is the llama-cpp-python package, which is a Python binding for llama.cpp. It gives you more control and flexibility to run the GGUF model locally, and is ideal for users who wish to integrate the local model in their workflows and Python applications.

Install llama-cpp-python
Interact with the model from within your application using llama_cpp. For example:

from llama_cpp import Llama

model_path = <PATH-TO-GGUF-FILE>

model = Llama(
        model_path,
        seed=17,
        n_ctx=10000,
        n_gpu_layers=37, # to offload in gpu, but put 0 if all in cpu
    )

input = 'Generate a quantum circuit with 2 qubits'
raw_pred = model(input)["choices"][0]["text"]

You can also add text generation parameters to the model to customize the inference:

generation_kwargs = {
        "max_tokens": 512,
        "echo": False, # Echo the prompt in the output
        "top_k": 1
    }

raw_pred = model(input, **generation_kwargs)["choices"][0]["text"]

Manually deploy the Qiskit Code Assistant models in local through llama.cpp

Use the `llama.cpp` library

Another alternative is to use llama.cpp, an open-source library for performing LLM inference on a CPU with minimal setup. It provides low-level control over the model execution and is typically run from the command line, pointing to a local GGUF model file.

There are several ways to install llama.cpp on your machine:

Install llama.cpp using brew, nix, or winget
Run with Docker: See out the Docker documentation by llama.cpp team
Download pre-built binaries from the releases page
Build from source by cloning this repository

Once installed, you can use llama.cpp to interact with GGUF models in conversation mode as follows:

# Use a local model file
llama-cli -m my_model.gguf -cnv

# Or download and run a model directly from Hugging Face
llama-cli -hf Qiskit/Qwen2.5-Coder-14B-Qiskit-GGUF -cnv

You can also launch an OpenAI-compatible API server for the model in the following way:

llama-server -hf Qiskit/Qwen2.5-Coder-14B-Qiskit-GGUF

Advanced parameters

With the llama-cli program, you can control the model generation using command-line options. For example, you can provide an initial “system” prompt using the -p/--prompt flag. In conversation mode (-cnv), this initial prompt acts as the system message. Otherwise, you can simply prepend any desired instruction to your prompt text. You can also adjust sampling parameters - for instance: temperature (--temp), top-k (--top-k), top-p (--top-p), repetition penalty (--repeat-penalty), and the seed to use (--seed). The following is an example invocation using these options:

llama-cli -hf Qiskit/Qwen2.5-Coder-14B-Qiskit-GGUF \
  -p "You are a friendly assistant." -cnv \
  --temp 0.7 \
  --top-k 50 \
  --top-p 0.95 \
  --repeat-penalty 1.1 \
  --seed 42

เพื่อให้โมเดล Qiskit ของเราทำงานได้อย่างถูกต้อง เราแนะนำให้ใช้ system prompt ที่ระบุใน HF GGUF repositories ของเรา: system prompt สำหรับ mistral-small-3.2-24b-qiskit-GGUF, Qwen2.5-Coder-14B-Qiskit-GGUF, granite-3.3-8b-qiskit-GGUF, และ granite-3.2-8b-qiskit-GGUF

เชื่อมต่อ extension กับ local deployment ด้วยตนเอง

ใช้ VS Code extension และ JupyterLab extension สำหรับ Qiskit Code Assistant เพื่อ prompt โมเดล Qiskit Code Assistant ที่ deploy ใน local เมื่อคุณตั้งค่า Ollama application กับโมเดล แล้ว คุณสามารถกำหนดค่า extension เพื่อเชื่อมต่อกับ local service ได้

เชื่อมต่อด้วย Qiskit Code Assistant VS Code extension

ด้วย Qiskit Code Assistant VS Code extension คุณสามารถโต้ตอบกับโมเดลและทำ code completion ขณะเขียนโค้ดได้ ซึ่งเหมาะสำหรับผู้ใช้ที่ต้องการความช่วยเหลือในการเขียนโค้ด Qiskit สำหรับ Python application ของตน

ติดตั้ง Qiskit Code Assistant VS Code extension
ใน VS Code ไปที่ User Settings และตั้งค่า Qiskit Code Assistant: Url เป็น URL ของ local Ollama deployment (ตัวอย่าง http://localhost:11434)
โหลด VS Code ใหม่โดยไปที่ View > Command Palette... และเลือก Developer: Reload Window

โมเดล Qiskit Code Assistant ที่กำหนดค่าใน Ollama จะปรากฏใน status bar และพร้อมใช้งาน

เชื่อมต่อด้วย Qiskit Code Assistant JupyterLab extension

ด้วย Qiskit Code Assistant JupyterLab extension คุณสามารถโต้ตอบกับโมเดลและทำ code completion โดยตรงใน Jupyter Notebook ได้ ผู้ใช้ที่ทำงานกับ Jupyter Notebook เป็นหลักสามารถใช้ประโยชน์จาก extension นี้เพื่อเพิ่มประสิทธิภาพในการเขียนโค้ด Qiskit

ติดตั้ง Qiskit Code Assistant JupyterLab extension
ใน JupyterLab ไปที่ Settings Editor และตั้งค่า Qiskit Code Assistant Service API เป็น URL ของ local Ollama deployment (ตัวอย่าง http://localhost:11434)

โมเดล Qiskit Code Assistant ที่กำหนดค่าใน Ollama จะปรากฏใน status bar และพร้อมใช้งาน

Source: IBM Quantum docs — updated 24 เม.ย. 2569

English version on doQumentation — updated 7 พ.ค. 2569

This translation based on the English version of 11 มี.ค. 2569

เริ่มต้นใช้งานด่วน (แนะนำ)​

การตั้งค่า VS Code extension​

การตั้งค่า JupyterLab extension​

โมเดลที่ใช้ได้​

โมเดลปัจจุบัน​

โมเดล GGUF (แนะนำสำหรับ personal environments/laptops)​

เวอร์ชัน Qiskit ที่ใช้ฝึกโมเดล​

โมเดลที่ deprecated แล้ว​

การตั้งค่าขั้นสูง​

การใช้ Ollama application​

ติดตั้ง Ollama​

ตั้งค่า Ollama โดยใช้ Hugging Face Hub integration​

ตั้งค่า Ollama ด้วยโมเดล Qiskit Code Assistant GGUF ที่ดาวน์โหลดด้วยตนเอง​

Run the Qiskit Code Assistant model manually downloaded in Ollama​

Use the llama.cpp library​

Advanced parameters​

เชื่อมต่อด้วย Qiskit Code Assistant VS Code extension​

เชื่อมต่อด้วย Qiskit Code Assistant JupyterLab extension​