Skip to content

How to convert safetensors into sbs? #533

@zeerd

Description

@zeerd

I know it's an old question, but I still have no idea how to finish it.

I got some files after fine-tune and merge_and_unload, like below:

added_tokens.json  config.json  generation_config.json  model.safetensors  special_tokens_map.json  
tokenizer.json  tokenizer.model  tokenizer_config.json

I only find the script python/convert_from_safetensors.py.
It seems to be created for PaliGemma only, but I still tried it with the error below:

python3 python/convert_from_safetensors.py --load_path /model/gemma-3-1b-finetune/merged_gemma/
I0401 15:06:34.645604 140119943138176 convert_from_safetensors.py:423] Restoring pure python logging
I0401 15:06:34.645855 140119943138176 convert_from_safetensors.py:429]
====
Reading from /model/gemma-3-1b-finetune/merged_gemma/ and writing to /tmp/gemmacpp.sbs
====
Traceback (most recent call last):
  File "/model/gemma-3-1b-finetune/gemma.cpp/python/convert_from_safetensors.py", line 436, in <module>
    app.run(main)
  File "/usr/local/lib/python3.11/site-packages/absl/app.py", line 316, in run
    _run_main(main, args)
  File "/usr/local/lib/python3.11/site-packages/absl/app.py", line 261, in _run_main
    sys.exit(main(argv))
             ^^^^^^^^^^
  File "/model/gemma-3-1b-finetune/gemma.cpp/python/convert_from_safetensors.py", line 432, in main
    export_paligemma_sbs(load_path, metadata_file, sbs_file)
  File "/model/gemma-3-1b-finetune/gemma.cpp/python/convert_from_safetensors.py", line 212, in export_paligemma_sbs
    with safetensors.safe_open(file, framework="pt") as f:
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
OSError: No such device (os error 19)

Any suggest for the convert scripts?
Or maybe there is another fine-tune guide?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions