Hello, so far, I have taken benchmarks for the cuQuantum's cuStateVec using a 4-card and 8-card configuration of V100, specifically for quantum volume calculations at depth 10. This time, I took benchmarks for an 8-card configuration of A100 and would like to introduce it. The problem I tackled remains the same. I performed quantum circuit calculations on the GPU using Qiskit+cuStateVec.
The computational results are pretty much as expected. This time, since the VRAM has increased from 32GB in the V100 to 40GB, I was able to calculate one more qubit. Overall, the 8 cards of A100 are faster.
However, to be honest, as of 2023, obtaining eight A100s for quantum calculations is extremely difficult. It's virtually impossible to get them, so it would be wise to give up on that idea. You probably have no choice but to use the V100 or H100. Next, if I can get my hands on an H100 environment, I'll try taking some benchmarks. That's all.