
Plot expectation value benchmarks #168

Open · wants to merge 20 commits into main

Conversation

@natestemen (Member) commented Jan 14, 2025

Description

This PR refactors the expectation value benchmarking script to ensure it works with the run_benchmarks.sh script. It also introduces new circuits to broaden the coverage of the expectation value tests. Scripts to visualize relative and absolute errors across different compilers over time are added, with one plot added to the README.

@jordandsullivan we have the option to plot relative or absolute error, but the relative error is much higher for ucc, and I don't understand why.

@Misty-W linked an issue Jan 14, 2025 that may be closed by this pull request
@natestemen marked this pull request as ready for review January 15, 2025 05:33
@jordandsullivan (Collaborator) left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your work on this! Great getting to hack in person together.

I'm wondering why the relative errors are in the hundreds in the first place. What are the actual expectation values? I'd say we want to report errors as percentages.

@natestemen (Member, Author)

> why are the relative errors in the hundreds

I think it's because the ideal values are coming out so close to 0 that the relative errors are blowing up. You can find all the most recent results in benchmarks/results/expval_2025-01-14_20.csv. E.g. for QFT the ideal expectation value (last column) is $\mathcal{O}(10^{-21})$ so it's easy to be $>100\%$ off.

compiler,circuit,observable,simulated,absolute_error,relative_error,ideal
ucc,qft,ZZZZZZZZZZ,4.7704895589362195e-18,4.771491994589455e-18,4759.8985323285315,-1.002435653235501e-21
qiskit,qft,ZZZZZZZZZZ,8.673617379884035e-19,8.68364173641639e-19,866.2542786051877,-1.002435653235501e-21
pytket,qft,ZZZZZZZZZZ,-3.2526065174565133e-18,3.2516040818032778e-18,3243.703544769454,-1.002435653235501e-21
cirq,qft,ZZZZZZZZZZ,2.168404344971009e-19,2.178428701503364e-19,217.31356965129692,-1.002435653235501e-21

Not sure what the best course of action here is.
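For concreteness, here is a minimal plain-Python sketch (numbers copied from the ucc/qft row above) of how a near-zero ideal value inflates the relative error even when the absolute error is tiny:

```python
# Values taken from the ucc/qft row of expval_2025-01-14_20.csv.
simulated = 4.7704895589362195e-18
ideal = -1.002435653235501e-21

absolute_error = abs(simulated - ideal)       # ~4.77e-18: tiny
relative_error = absolute_error / abs(ideal)  # ~4759.9: blown up

print(f"absolute error: {absolute_error:.3e}")  # absolute error: 4.771e-18
print(f"relative error: {relative_error:.1f}")  # relative error: 4759.9
```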

@jordandsullivan (Collaborator) commented Jan 15, 2025

Okay, just as a sanity check, can you plot the simulated and ideal expectation values and standard deviation, similar to what I did for #58 (where I was running on real hardware)?
[screenshot: expectation value plot from #58]

This reminds me, we can also simply measure an array of observables in addition to ZZZZZZ, like I did there. Maybe just add in some that measure XIIIIIII or XXXXXZZZZZ, etc.
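Something like this hypothetical pandas/matplotlib sketch would do for the sanity check (the column names are assumed from the CSV rows quoted above; the results file doesn't yet record a standard deviation, so error bars are omitted here):

```python
import matplotlib.pyplot as plt
import pandas as pd

# Column names assumed from the CSV rows quoted earlier in this thread.
cols = ["compiler", "circuit", "observable",
        "simulated", "abs_error", "rel_error", "ideal"]
df = pd.read_csv("benchmarks/results/expval_2025-01-14_20.csv", names=cols)

fig, ax = plt.subplots()
for compiler, group in df.groupby("compiler"):
    ax.scatter(group["circuit"], group["simulated"], label=compiler)
# Overlay the ideal values for comparison.
ax.scatter(df["circuit"], df["ideal"], marker="x", color="black", label="ideal")
ax.set_xlabel("circuit")
ax.set_ylabel("expectation value")
ax.legend()
fig.savefig("expval_sanity_check.png")
```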

@jordandsullivan (Collaborator) left a comment

Looking good, a few suggestions.

@jordandsullivan (Collaborator) left a review comment

Trying to understand why the relative errors are in the hundreds. What are the actual expectation values we're getting? Are they what we'd expect?

@jordandsullivan (Collaborator) left a review comment

Perhaps we shouldn't use the same observable on all circuits if it is giving answers that don't seem meaningful.

@jordandsullivan (Collaborator) left a review comment

Per @willzeng, we want to split off the customization of specific observables for the different benchmarks into a separate issue. We can complete this issue with the current all-Z observable.
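For that follow-up issue, the per-benchmark customization could be as small as a lookup table. A hypothetical sketch, using the observable strings floated above purely as placeholder values:

```python
# Hypothetical per-benchmark observable table for the follow-up issue.
# "ZZZZZZZZZZ" is the current default; the others are examples from this
# thread, not settled choices.
OBSERVABLES: dict[str, list[str]] = {
    "qft": ["ZZZZZZZZZZ", "XIIIIIIIII"],
    "qaoa": ["ZZZZZZZZZZ", "XXXXXZZZZZ"],
}

def observables_for(circuit_name: str) -> list[str]:
    """Fall back to the all-Z observable for unlisted benchmarks."""
    return OBSERVABLES.get(circuit_name, ["ZZZZZZZZZZ"])
```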

Review comments on benchmarks/average_relative_error_over_time.png, benchmarks/latest_expval_benchmark_by_compiler.png, and benchmarks/latest_relative_absolute_errors_by_circuit.png (outdated, resolved).
@natestemen (Member, Author) left a review comment

It's a little suspicious how the QAOA, QV, and QCNN circuit results are all basically identical. We should make sure this is real and not an artifact of how we perform simulation (or something else)!

@jordandsullivan (Collaborator) left a review comment

Did you plot the expectation values themselves and standard deviations as suggested above?

@jordandsullivan (Collaborator) left a review comment

Let's compare the compiled gate counts between the different compilers in case they are returning approximately the same circuits.
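A rough sketch of that comparison, assuming we have handles to the compiled circuit objects each frontend returns (the variable names here are hypothetical):

```python
# compiled_qiskit, compiled_cirq, and compiled_tket are hypothetical handles
# to the circuits produced by each compiler in the benchmark run.
qiskit_counts = compiled_qiskit.count_ops()  # dict mapping gate name -> count
cirq_count = sum(1 for _ in compiled_cirq.all_operations())
tket_count = compiled_tket.n_gates

print("qiskit:", sum(qiskit_counts.values()), dict(qiskit_counts))
print("cirq:  ", cirq_count)
print("pytket:", tket_count)
```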

Successfully merging this pull request may close these issues:

Add expectation value plot to GH benchmarks pipeline