I have 2 GPUs, each with 24 GB of memory.

When I run the code below:

```python
model = SentenceTransformerSpecb(
    "bigscience/sgpt-bloom-7b1-msmarco",
    cache_folder="/mnt/storage/agtech/modelCache",
)
query_embeddings = model.encode(queries, is_query=True)
```

I get an `OutOfMemoryError`; it only uses the first GPU. Can the model be loaded across two GPUs?
```
OutOfMemoryError: CUDA out of memory. Tried to allocate 64.00 MiB (GPU 0; 22.03 GiB total capacity; 21.27 GiB
already allocated; 50.94 MiB free; 21.27 GiB reserved in total by PyTorch) If reserved memory is >> allocated
memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and
PYTORCH_CUDA_ALLOC_CONF
```
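As the error message suggests, allocator fragmentation can sometimes be reduced by setting `max_split_size_mb` via the `PYTORCH_CUDA_ALLOC_CONF` environment variable. A minimal sketch (the value `128` is an illustrative assumption, not a recommendation, and this only helps with fragmentation, not with a model that is simply too large for one GPU):

```python
import os

# Must be set before torch initializes CUDA, e.g. at the very top of the
# script, before `import torch`. 128 MB is an illustrative value to tune.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"
```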
@Muennighoff Thank you very much for your reply.

I tried code like the following:

```python
model = SentenceTransformerSpecb(
    "bigscience/sgpt-bloom-7b1-msmarco",
    cache_folder="/mnt/storage/agtech/modelCache",
)
accelerator = Accelerator()
model = accelerator.prepare(model)
```

When running `model = accelerator.prepare(model)` I got CUDA out of memory; it still only uses the first GPU. Any suggestions?
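Note that `Accelerator.prepare` is aimed at data parallelism (a full model copy per GPU), which does not help when one copy already exceeds a single GPU's memory. What is needed here is model sharding, as done by `device_map="auto"` in `transformers`/`accelerate`, which assigns layers to devices according to their memory budgets. A toy, pure-Python sketch of that idea (the layer count and memory figures are hypothetical, not BLOOM's actual values):

```python
def build_device_map(num_layers: int, gpu_memory_gb: list) -> dict:
    """Assign transformer layers to GPUs proportionally to each GPU's memory.

    Returns a mapping layer_index -> gpu_index. This only illustrates the
    idea behind device_map="auto"; the real implementation also accounts
    for embeddings, tied weights, and per-layer parameter sizes.
    """
    total = sum(gpu_memory_gb)
    device_map = {}
    layer = 0
    for gpu, mem in enumerate(gpu_memory_gb):
        if gpu < len(gpu_memory_gb) - 1:
            # Layer count proportional to this GPU's share of total memory.
            share = round(num_layers * mem / total)
        else:
            # The last GPU takes whatever layers remain.
            share = num_layers - layer
        for _ in range(share):
            if layer < num_layers:
                device_map[layer] = gpu
                layer += 1
    return device_map

# Two 24 GB GPUs and 30 layers: an even 15/15 split.
dm = build_device_map(30, [24.0, 24.0])
```

In practice, a hedged starting point would be loading the underlying model with `AutoModel.from_pretrained(..., device_map="auto")`, though whether `SentenceTransformerSpecb` exposes a way to pass that through is an open question for this issue.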