美文网首页
如果模型太大无法加载怎么办?

如果模型太大无法加载怎么办?

作者: 英文名字叫dawntown | 来源:发表于2021-09-23 21:40 被阅读0次

There are several solutions to use The XXL versions of ProtT5:

  1. Use a GPU with a big memory like NVIDIA Quadro RTX-8000 or NVIDIA A100, with/without half-precision.
  2. Use a GPU with less memory after you quantize the model, which will make the model size 3x-4x smaller:
    https://pytorch.org/docs/stable/quantization.html
  3. Convert the model to onnx, then Quantize the model, and use the CPU rather than GPU for inference:
    https://github.com/agemagician/ProtTrans/tree/master/Embedding/Onnx
  4. Parallelize the model across multiple small GPUs:
    https://huggingface.co/transformers/model_doc/t5.html#transformers.T5Model.parallelize

You can, of course, combine one or more of the above points.

相关文章

网友评论

      本文标题:如果模型太大无法加载怎么办?

      本文链接:https://www.haomeiwen.com/subject/notagltx.html