Core GPU Benchmark
- The linear solver is MUMPS for all experiments.
- Below you can find Dolan–Moré performance profiles comparing solver–model combinations on the set of optimal control problems and grid sizes. For a detailed explanation of how to read these profiles, see the Performance Profiles page.
Moonshot
This benchmark suite evaluates optimal control problems on GPU-accelerated hardware, focusing on large-scale problems.
⚙️ Configuration
Problems: beam, chain, double_oscillator, electric_vehicle, glider, jackson, robbins, rocket, vanderpol
Solvers: madnlp
Models: exa, exa_gpu
Grid sizes: 1000, 5000, 10000, 20000 discretization points
Discretization: midpoint method
Tolerance: 1.0e-8
Ipopt strategy: adaptive barrier parameter
Limits: 1000 iterations max, 2000.0s wall time
🖥️ Environment
📅 Timestamp : 2025-12-09 16:46:43 UTC
🔧 Julia version : 1.11.7
💻 OS : Linux
🖥️ Machine : moonshotYou can download the exact environment used for this benchmark:
📦 Project.toml - Package dependencies
📋 Manifest.toml - Complete dependency tree with versions
📜 Benchmark script - Julia script to run the benchmark
These files allow you to reproduce the benchmark environment and results exactly.
Julia Version 1.11.7
Commit f2b3dbda30a (2025-09-08 12:10 UTC)
Build Info:
Official https://julialang.org/ release
Platform Info:
OS: Linux (x86_64-linux-gnu)
CPU: 144 × Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz
WORD_SIZE: 64
LLVM: libLLVM-16.0.6 (ORCJIT, skylake-avx512)
Threads: 16 default, 0 interactive, 8 GC (on 144 virtual cores)
Environment:
JULIA_PKG_SERVER_REGISTRY_PREFERENCE = eager
JULIA_DEPOT_PATH = /scratch/github-actions/julia_depot
LD_LIBRARY_PATH = /home/mschanen/local/lib:/home/mschanen/local/lib:
JULIA_NUM_THREADS = 16 Project CTBenchmarks v0.3.1
Status `/scratch/github-actions/actions_runner_control_toolbox/_work/CTBenchmarks.jl/CTBenchmarks.jl/Project.toml`
[6e4b80f9] BenchmarkTools v1.6.3
⌅ [54762871] CTBase v0.16.2
[052768ef] CUDA v5.9.5
[a93c6f00] DataFrames v1.8.1
[ffbed154] DocStringExtensions v0.9.5
[b6b21f68] Ipopt v1.13.0
[682c06a0] JSON v1.3.0
[4076af6c] JuMP v1.29.3
[d72a61cc] MadNLPGPU v0.7.16
[3b83494e] MadNLPMumps v0.5.1
[f4238b75] NLPModelsIpopt v0.11.0
[5f98b655] OptimalControl v1.1.6
[59046045] OptimalControlProblems v0.4.0
[91a5bcdd] Plots v1.41.2
[bd369af6] Tables v1.12.1
[ade2ca70] Dates v1.11.0
[b77e0a4c] InteractiveUtils v1.11.0
[44cfe95a] Pkg v1.11.0
[de0858da] Printf v1.11.0
[6462fe0b] Sockets v1.11.0
Info Packages marked with ⌅ have new versions available but compatibility constraints restrict them from upgrading. To see why use `status --outdated` Project CTBenchmarks v0.3.1
Status `/scratch/github-actions/actions_runner_control_toolbox/_work/CTBenchmarks.jl/CTBenchmarks.jl/Manifest.toml`
[54578032] ADNLPModels v0.8.13
[47edcb42] ADTypes v1.20.0
[14f7f29c] AMD v0.5.3
[621f4979] AbstractFFTs v1.5.0
[79e6a3ab] Adapt v4.4.0
[66dad0bd] AliasTables v1.1.3
[a9b6321e] Atomix v1.1.2
[13072b0f] AxisAlgorithms v1.1.0
[ab4f0b2a] BFloat16s v0.6.0
[6e4b80f9] BenchmarkTools v1.6.3
[d1d4a3ce] BitFlags v0.1.9
[fa961155] CEnum v0.5.0
⌅ [54762871] CTBase v0.16.2
[790bbbee] CTDirect v0.17.4
[1c39547c] CTFlows v0.8.9
⌅ [34c4fa32] CTModels v0.6.9
[32681960] CTParser v0.7.2
[052768ef] CUDA v5.9.5
[1af6417a] CUDA_Runtime_Discovery v1.0.0
[45b445bb] CUDSS v0.6.3
[d360d2e6] ChainRulesCore v1.26.0
[523fee87] CodecBzip2 v0.8.5
[944b1d66] CodecZlib v0.7.8
[35d6a980] ColorSchemes v3.31.0
[3da002f7] ColorTypes v0.12.1
[c3611d14] ColorVectorSpace v0.11.0
[5ae59095] Colors v0.13.1
[38540f10] CommonSolve v0.2.4
[bbf7d656] CommonSubexpressions v0.3.1
[34da2185] Compat v4.18.1
[f0e56b4a] ConcurrentUtilities v2.5.0
[d38c429a] Contour v0.6.3
[a8cc5b0e] Crayons v4.1.1
[9a962f9c] DataAPI v1.16.0
[a93c6f00] DataFrames v1.8.1
[864edb3b] DataStructures v0.19.3
[e2d170a0] DataValueInterfaces v1.0.0
[8bb1440f] DelimitedFiles v1.9.1
[163ba53b] DiffResults v1.1.0
[b552c78f] DiffRules v1.15.1
[ffbed154] DocStringExtensions v0.9.5
[1037b233] ExaModels v0.9.2
[460bff9d] ExceptionUnwrapping v0.1.11
[e2ba6199] ExprTools v0.1.10
[c87230d0] FFMPEG v0.4.5
[9aa1b823] FastClosures v0.3.2
[1a297f60] FillArrays v1.15.0
[53c48c17] FixedPointNumbers v0.8.5
[1fa38f19] Format v1.3.7
[f6369f11] ForwardDiff v1.3.0
[069b7b12] FunctionWrappers v1.1.3
[0c68f7d7] GPUArrays v11.3.1
[46192b85] GPUArraysCore v0.2.0
[61eb1bfa] GPUCompiler v1.7.5
[096a3bc2] GPUToolbox v1.0.0
[28b8d3ca] GR v0.73.19
[42e2da0e] Grisu v1.0.2
[34c5aeac] HSL v0.5.2
[cd3eb016] HTTP v1.10.19
[076d061b] HashArrayMappedTries v0.2.0
[842dd82b] InlineStrings v1.4.5
[a98d9a8b] Interpolations v0.16.2
[41ab1584] InvertedIndices v1.3.1
[b6b21f68] Ipopt v1.13.0
[92d709cd] IrrationalConstants v0.2.6
[82899510] IteratorInterfaceExtensions v1.0.0
[1019f520] JLFzf v0.1.11
[692b3bcd] JLLWrappers v1.7.1
[682c06a0] JSON v1.3.0
[0f8b85d8] JSON3 v1.14.3
[4076af6c] JuMP v1.29.3
[63c18a36] KernelAbstractions v0.9.39
[40e66cde] LDLFactorizations v0.10.1
[929cbde3] LLVM v9.4.4
[8b046642] LLVMLoopInfo v1.0.0
[b964fa9f] LaTeXStrings v1.4.0
[23fbe1c1] Latexify v0.16.10
[5c8ed15e] LinearOperators v2.11.0
[2ab3a3ac] LogExpFunctions v0.3.29
[e6f89c97] LoggingExtras v1.2.0
[33e6dc65] MKL v0.9.0
[d8e11817] MLStyle v0.4.17
[1914dd2f] MacroTools v0.5.16
[2621e9c9] MadNLP v0.8.12
[d72a61cc] MadNLPGPU v0.7.16
[3b83494e] MadNLPMumps v0.5.1
[b8f27783] MathOptInterface v1.47.0
[739be429] MbedTLS v1.1.9
[442fdcdd] Measures v0.3.3
[2679e427] Metis v1.5.0
[e1d29d7a] Missings v1.2.0
[d8a4904e] MutableArithmetics v1.6.7
⌅ [a4795742] NLPModels v0.21.5
[f4238b75] NLPModelsIpopt v0.11.0
[e01155f1] NLPModelsModifiers v0.7.2
[5da4648a] NVTX v1.0.1
[77ba4419] NaNMath v1.1.3
[6fe1bfb0] OffsetArrays v1.17.0
[4d8831e6] OpenSSL v1.6.1
[5f98b655] OptimalControl v1.1.6
[59046045] OptimalControlProblems v0.4.0
[bac558e1] OrderedCollections v1.8.1
[d96e819e] Parameters v0.12.3
[69de0a69] Parsers v2.8.3
[ccf2f8ad] PlotThemes v3.3.0
[995b91a9] PlotUtils v1.4.4
[91a5bcdd] Plots v1.41.2
[2dfb63ee] PooledArrays v1.4.3
⌅ [aea7be01] PrecompileTools v1.2.1
[21216c6a] Preferences v1.5.0
[08abe8d2] PrettyTables v3.1.2
[43287f4e] PtrArrays v1.3.0
[be4d8f0f] Quadmath v0.5.13
[74087812] Random123 v1.7.1
[e6cf234a] RandomNumbers v1.6.0
[c84ed2f1] Ratios v0.4.5
[3cdcf5f2] RecipesBase v1.3.4
[01d81517] RecipesPipeline v0.6.12
[189a3867] Reexport v1.2.2
[05181044] RelocatableFolders v1.0.1
[ae029012] Requires v1.3.1
[37e2e3b7] ReverseDiff v1.16.1
[7e506255] ScopedValues v1.5.0
[6c6a2e73] Scratch v1.3.0
[91c51154] SentinelArrays v1.4.8
[992d4aef] Showoff v1.0.3
[777ac1f9] SimpleBufferStream v1.2.0
[ff4d7338] SolverCore v0.3.9
[a2af1166] SortingAlgorithms v1.2.2
[9f842d2f] SparseConnectivityTracer v1.1.3
[0a514795] SparseMatrixColorings v0.4.23
[276daf66] SpecialFunctions v2.6.1
[860ef19b] StableRNGs v1.0.4
[90137ffa] StaticArrays v1.9.15
[1e83bf80] StaticArraysCore v1.4.4
[10745b16] Statistics v1.11.1
[82ae8749] StatsAPI v1.8.0
[2913bbd2] StatsBase v0.34.9
[892a3eda] StringManipulation v0.4.2
[856f2bd8] StructTypes v1.11.0
[ec057cc2] StructUtils v2.6.0
[3783bdb8] TableTraits v1.0.1
[bd369af6] Tables v1.12.1
[62fd8b95] TensorCore v0.1.1
[a759f4b9] TimerOutputs v0.5.29
[e689c965] Tracy v0.1.6
[3bb67fe8] TranscodingStreams v0.11.3
[5c2747f8] URIs v1.6.1
[3a884ed6] UnPack v1.0.2
[1cfade01] UnicodeFun v0.4.1
[013be700] UnsafeAtomics v0.3.0
[41fe7b60] Unzip v0.2.0
[efce3f68] WoodburyMatrices v1.0.0
[ae81ac8f] ASL_jll v0.1.3+0
[6e34b625] Bzip2_jll v1.0.9+0
[d1e2174e] CUDA_Compiler_jll v0.3.0+0
[4ee394cb] CUDA_Driver_jll v13.0.2+0
[76a88914] CUDA_Runtime_jll v0.19.2+0
[4889d778] CUDSS_jll v0.7.1+0
[83423d85] Cairo_jll v1.18.5+0
[ee1fde0b] Dbus_jll v1.16.2+0
[2702e6a9] EpollShim_jll v0.0.20230411+1
[2e619515] Expat_jll v2.7.3+0
[b22a6f82] FFMPEG_jll v8.0.0+0
[a3f928ae] Fontconfig_jll v2.17.1+0
[d7e528f0] FreeType2_jll v2.13.4+0
[559328eb] FriBidi_jll v1.0.17+0
[0656b61e] GLFW_jll v3.4.1+0
[d2c73de3] GR_jll v0.73.19+1
[b0724c58] GettextRuntime_jll v0.22.4+0
[61579ee1] Ghostscript_jll v9.55.1+0
[7746bdde] Glib_jll v2.86.2+0
[3b182d85] Graphite2_jll v1.3.15+0
[017b0a0e] HSL_jll v4.0.4+0
[2e76f6c2] HarfBuzz_jll v8.5.1+0
[e33a78d0] Hwloc_jll v2.12.2+0
[1d5cc7b8] IntelOpenMP_jll v2025.2.0+0
[9cc047cb] Ipopt_jll v300.1400.1900+0
[aacddb02] JpegTurbo_jll v3.1.3+0
[9c1d0b0a] JuliaNVTXCallbacks_jll v0.2.1+0
[c1c5ebd0] LAME_jll v3.100.3+0
[88015f11] LERC_jll v4.0.1+0
[dad2f222] LLVMExtra_jll v0.0.38+0
[1d63c593] LLVMOpenMP_jll v18.1.8+0
[dd4b983a] LZO_jll v2.10.3+0
[ad6e5548] LibTracyClient_jll v0.9.1+6
⌅ [e9f186c6] Libffi_jll v3.4.7+0
[7e76a0d4] Libglvnd_jll v1.7.1+1
[94ce4f54] Libiconv_jll v1.18.0+0
[4b2f31a3] Libmount_jll v2.41.2+0
[89763e89] Libtiff_jll v4.7.2+0
[38a345b3] Libuuid_jll v2.41.2+0
[d00139f3] METIS_jll v5.1.3+0
[856f044c] MKL_jll v2025.2.0+0
[d7ed1dd3] MUMPS_seq_jll v500.800.100+0
[e98f9f5b] NVTX_jll v3.2.2+0
[e7412a2a] Ogg_jll v1.3.6+0
[656ef2d0] OpenBLAS32_jll v0.3.29+0
[458c3c95] OpenSSL_jll v3.5.4+0
[efe28fd5] OpenSpecFun_jll v0.5.6+0
[91d4177d] Opus_jll v1.5.2+0
[36c8627f] Pango_jll v1.57.0+0
⌅ [30392449] Pixman_jll v0.44.2+0
[c0090381] Qt6Base_jll v6.8.2+2
[629bc702] Qt6Declarative_jll v6.8.2+1
[ce943373] Qt6ShaderTools_jll v6.8.2+1
[e99dba38] Qt6Wayland_jll v6.8.2+2
⌅ [319450e9] SPRAL_jll v2025.5.20+0
[a44049a8] Vulkan_Loader_jll v1.3.243+0
[a2964d1f] Wayland_jll v1.24.0+0
⌅ [02c8fc9c] XML2_jll v2.13.9+0
[ffd25f8a] XZ_jll v5.8.1+0
[f67eecfb] Xorg_libICE_jll v1.1.2+0
[c834827a] Xorg_libSM_jll v1.2.6+0
[4f6342f7] Xorg_libX11_jll v1.8.12+0
[0c0b7dd1] Xorg_libXau_jll v1.0.13+0
[935fb764] Xorg_libXcursor_jll v1.2.4+0
[a3789734] Xorg_libXdmcp_jll v1.1.6+0
[1082639a] Xorg_libXext_jll v1.3.7+0
[d091e8ba] Xorg_libXfixes_jll v6.0.2+0
[a51aa0fd] Xorg_libXi_jll v1.8.3+0
[d1454406] Xorg_libXinerama_jll v1.1.6+0
[ec84b674] Xorg_libXrandr_jll v1.5.5+0
[ea2f1a96] Xorg_libXrender_jll v0.9.12+0
[a65dc6b1] Xorg_libpciaccess_jll v0.18.1+0
[c7cfdc94] Xorg_libxcb_jll v1.17.1+0
[cc61e674] Xorg_libxkbfile_jll v1.1.3+0
[e920d4aa] Xorg_xcb_util_cursor_jll v0.1.6+0
[12413925] Xorg_xcb_util_image_jll v0.4.1+0
[2def613f] Xorg_xcb_util_jll v0.4.1+0
[975044d2] Xorg_xcb_util_keysyms_jll v0.4.1+0
[0d47668e] Xorg_xcb_util_renderutil_jll v0.3.10+0
[c22f9ab0] Xorg_xcb_util_wm_jll v0.4.2+0
[35661453] Xorg_xkbcomp_jll v1.4.7+0
[33bec58e] Xorg_xkeyboard_config_jll v2.44.0+0
[c5fb5394] Xorg_xtrans_jll v1.6.0+0
[3161d3a3] Zstd_jll v1.5.7+1
[1e29f10c] demumble_jll v1.3.0+0
[35ca27e7] eudev_jll v3.2.14+0
[214eeab7] fzf_jll v0.61.1+0
[a4ae2306] libaom_jll v3.13.1+0
[0ac62f75] libass_jll v0.17.4+0
[1183f4f0] libdecor_jll v0.2.2+0
[2db6ffa8] libevdev_jll v1.13.4+0
[f638f0a6] libfdk_aac_jll v2.0.4+0
[36db933b] libinput_jll v1.28.1+0
[b53b4c65] libpng_jll v1.6.53+0
[f27f6e37] libvorbis_jll v1.3.8+0
[009596ad] mtdev_jll v1.1.7+0
[1317d2d5] oneTBB_jll v2022.0.0+1
[1270edf5] x264_jll v10164.0.1+0
[dfaa095f] x265_jll v4.1.0+0
[d8fb68d0] xkbcommon_jll v1.13.0+0
[0dad84c5] ArgTools v1.1.2
[56f22d72] Artifacts v1.11.0
[2a0f44e3] Base64 v1.11.0
[ade2ca70] Dates v1.11.0
[8ba89e20] Distributed v1.11.0
[f43a241f] Downloads v1.6.0
[7b1f6079] FileWatching v1.11.0
[9fa8497b] Future v1.11.0
[b77e0a4c] InteractiveUtils v1.11.0
[4af54fe1] LazyArtifacts v1.11.0
[b27032c2] LibCURL v0.6.4
[76f85450] LibGit2 v1.11.0
[8f399da3] Libdl v1.11.0
[37e2e46d] LinearAlgebra v1.11.0
[56ddb016] Logging v1.11.0
[d6f4376e] Markdown v1.11.0
[a63ad114] Mmap v1.11.0
[ca575930] NetworkOptions v1.2.0
[44cfe95a] Pkg v1.11.0
[de0858da] Printf v1.11.0
[9abbd945] Profile v1.11.0
[3fa0cd96] REPL v1.11.0
[9a3f8284] Random v1.11.0
[ea8e919c] SHA v0.7.0
[9e88b42a] Serialization v1.11.0
[1a1011a3] SharedArrays v1.11.0
[6462fe0b] Sockets v1.11.0
[2f01184e] SparseArrays v1.11.0
[f489334b] StyledStrings v1.11.0
[4607b0f0] SuiteSparse
[fa267f1f] TOML v1.0.3
[a4e569a6] Tar v1.10.0
[8dfed614] Test v1.11.0
[cf7118a7] UUIDs v1.11.0
[4ec0a83e] Unicode v1.11.0
[e66e0078] CompilerSupportLibraries_jll v1.1.1+0
[deac9b47] LibCURL_jll v8.6.0+0
[e37daf67] LibGit2_jll v1.7.2+0
[29816b5a] LibSSH2_jll v1.11.0+1
[c8ffd9c3] MbedTLS_jll v2.28.6+0
[14a3606d] MozillaCACerts_jll v2023.12.12
[4536629a] OpenBLAS_jll v0.3.27+1
[05823500] OpenLibm_jll v0.8.5+0
[efcefdf7] PCRE2_jll v10.42.0+1
[bea87d4a] SuiteSparse_jll v7.7.0+0
[83775a58] Zlib_jll v1.2.13+1
[8e850b90] libblastrampoline_jll v5.11.0+0
[8e850ede] nghttp2_jll v1.59.0+0
[3f19e933] p7zip_jll v17.4.0+2
Info Packages marked with ⌅ have new versions available but compatibility constraints restrict them from upgrading. To see why use `status --outdated -m` 📈 Performance Profile GPU Time
Dataset overview for core-moonshot-gpu:
- Problems: 9 unique optimal control problems
- Instances: 36
- Solver combos: 2
Profile configuration:
- Instance definition: (problem, grid_size)
- Solver combos definition: (model, solver)
- Criterion: CPU time
- Successful runs: 67/72 (93.1%)
- Successful instances: 35/36 (97.2%)
- Unsuccessful instances (no solver converged):
glider, 5000
Robustness (% of instances solved):
(exa, madnlp): 97.2%(exa_gpu, madnlp): 88.9%
Efficiency (% of instances where fastest):
(exa, madnlp): 33.3%(exa_gpu, madnlp): 63.9%
Most robust: (exa, madnlp) solved 97.2% of instances.
Most efficient: (exa_gpu, madnlp) was fastest on 63.9% of instances.
📈 Performance Profile Iterations
Dataset overview for core-moonshot-gpu:
- Problems: 9 unique optimal control problems
- Instances: 36
- Solver combos: 2
Profile configuration:
- Instance definition: (problem, grid_size)
- Solver combos definition: (model, solver)
- Criterion: Iterations
- Successful runs: 67/72 (93.1%)
- Successful instances: 35/36 (97.2%)
- Unsuccessful instances (no solver converged):
glider, 5000
Robustness (% of instances solved):
(exa, madnlp): 97.2%(exa_gpu, madnlp): 88.9%
Efficiency (% of instances where fastest):
(exa, madnlp): 91.7%(exa_gpu, madnlp): 19.4%
Most robust: (exa, madnlp) solved 97.2% of instances.
Most efficient: (exa, madnlp) was fastest on 91.7% of instances.
📊 Tables of Results
| Success | N | Model | Solver | Time (ms) | Iters | Objective | Criterion | Best |
|---|---|---|---|---|---|---|---|---|
| ✓ | 1000 | exa | madnlp | 74.455 | 26 | 8.888914 | min | ✓ |
| ✓ | 1000 | exa_gpu | madnlp | 277.577 | 48 | 8.888302 | min |
| Success | N | Model | Solver | Time (ms) | Iters | Objective | Criterion | Best |
|---|---|---|---|---|---|---|---|---|
| ✓ | 5000 | exa | madnlp | 1195.754 | 79 | 8.888892 | min | |
| ✓ | 5000 | exa_gpu | madnlp | 947.418 | 138 | 8.885839 | min | ✓ |
| Success | N | Model | Solver | Time (ms) | Iters | Objective | Criterion | Best |
|---|---|---|---|---|---|---|---|---|
| ✓ | 10000 | exa | madnlp | 5897.597 | 175 | 8.888893 | min | |
| ✓ | 10000 | exa_gpu | madnlp | 1879.474 | 234 | 8.882791 | min | ✓ |
| Success | N | Model | Solver | Time (ms) | Iters | Objective | Criterion | Best |
|---|---|---|---|---|---|---|---|---|
| ✓ | 20000 | exa | madnlp | 13832.823 | 189 | 8.888898 | min | |
| ✓ | 20000 | exa_gpu | madnlp | 2628.246 | 381 | 8.876698 | min | ✓ |
Benchmarks results:
┌─ Problem: beam
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 74.455 ms | iters: 26 | obj: 8.888914e+00 (min) | CPU: 8.89 MiB
│ │ ✓ | exa_gpu | time: 277.577 ms | iters: 48 | obj: 8.888302e+00 (min) | CPU: 16.613 MiB | GPU: 12.101 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 1.196 s | iters: 79 | obj: 8.888892e+00 (min) | CPU: 97.02 MiB
│ │ ✓ | exa_gpu | time: 947.418 ms | iters: 138 | obj: 8.885839e+00 (min) | CPU: 47.271 MiB | GPU: 94.855 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 5.898 s | iters: 175 | obj: 8.888893e+00 (min) | CPU: 391.56 MiB
│ │ ✓ | exa_gpu | time: 1.879 s | iters: 234 | obj: 8.882791e+00 (min) | CPU: 91.258 MiB | GPU: 262.464 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 13.833 s | iters: 189 | obj: 8.888898e+00 (min) | CPU: 841.47 MiB
│ │ ✓ | exa_gpu | time: 2.628 s | iters: 381 | obj: 8.876698e+00 (min) | CPU: 125.664 MiB | GPU: 747.316 MiB
│ └─
└─
┌─ Problem: chain
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 60.927 ms | iters: 14 | obj: 5.068480e+00 (min) | CPU: 6.74 MiB
│ │ ✓ | exa_gpu | time: 125.344 ms | iters: 15 | obj: 5.068452e+00 (min) | CPU: 7.756 MiB | GPU: 14.348 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 476.996 ms | iters: 13 | obj: 5.068480e+00 (min) | CPU: 30.89 MiB
│ │ ✓ | exa_gpu | time: 175.897 ms | iters: 16 | obj: 5.068339e+00 (min) | CPU: 10.799 MiB | GPU: 72.133 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 1.049 s | iters: 13 | obj: 5.068480e+00 (min) | CPU: 61.26 MiB
│ │ ✓ | exa_gpu | time: 29.423 s | iters: 439 | obj: 5.068201e+00 (min) | CPU: 1.633 GiB | GPU: 576.439 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 3.589 s | iters: 14 | obj: 5.068480e+00 (min) | CPU: 125.07 MiB
│ │ ✓ | exa_gpu | time: 302.681 ms | iters: 15 | obj: 5.067922e+00 (min) | CPU: 19.882 MiB | GPU: 286.336 MiB
│ └─
└─
┌─ Problem: double_oscillator
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 41.658 ms | iters: 6 | obj: 9.110011e-04 (min) | CPU: 10.65 MiB
│ │ ✓ | exa_gpu | time: 90.534 ms | iters: 6 | obj: 9.106227e-04 (min) | CPU: 6.238 MiB | GPU: 29.634 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 468.704 ms | iters: 6 | obj: 9.110335e-04 (min) | CPU: 51.32 MiB
│ │ ✓ | exa_gpu | time: 218.805 ms | iters: 6 | obj: 9.091470e-04 (min) | CPU: 14.877 MiB | GPU: 148.459 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 1.000 s | iters: 6 | obj: 9.110345e-04 (min) | CPU: 102.16 MiB
│ │ ✓ | exa_gpu | time: 244.341 ms | iters: 6 | obj: 9.072690e-04 (min) | CPU: 20.132 MiB | GPU: 296.003 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 2.358 s | iters: 6 | obj: 9.110348e-04 (min) | CPU: 203.85 MiB
│ │ ✓ | exa_gpu | time: 354.128 ms | iters: 6 | obj: 9.035310e-04 (min) | CPU: 36.184 MiB | GPU: 591.891 MiB
│ └─
└─
┌─ Problem: electric_vehicle
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 18.819 ms | iters: 4 | obj: 1.228583e+03 (min) | CPU: 4.95 MiB
│ │ ✓ | exa_gpu | time: 92.246 ms | iters: 11 | obj: 1.228577e+03 (min) | CPU: 6.185 MiB | GPU: 13.167 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 171.911 ms | iters: 5 | obj: 1.228580e+03 (min) | CPU: 23.54 MiB
│ │ ✓ | exa_gpu | time: 129.296 ms | iters: 11 | obj: 1.228551e+03 (min) | CPU: 9.499 MiB | GPU: 65.806 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 380.696 ms | iters: 5 | obj: 1.228580e+03 (min) | CPU: 46.63 MiB
│ │ ✓ | exa_gpu | time: 137.367 ms | iters: 10 | obj: 1.228521e+03 (min) | CPU: 12.157 MiB | GPU: 130.933 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 852.533 ms | iters: 5 | obj: 1.228580e+03 (min) | CPU: 92.80 MiB
│ │ ✓ | exa_gpu | time: 191.228 ms | iters: 10 | obj: 1.228463e+03 (min) | CPU: 19.469 MiB | GPU: 261.839 MiB
│ └─
└─
┌─ Problem: glider
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 10.738 s | iters: 729 | obj: -1.247985e+03 (min) | CPU: 314.44 MiB
│ │ ✗ | exa_gpu | time: 13.531 s | iters: 1000 | obj: -2.314292e+02 (min) | CPU: 854.804 MiB | GPU: 224.246 MiB
│ │
│ │ N = 5000
│ │ ✗ | exa | time: 114.709 s | iters: 1000 | obj: -1.215926e+03 (min) | CPU: 2.10 GiB
│ │ ✗ | exa_gpu | time: 6.879 s | iters: 128 | obj: -1.013713e+02 (min) | CPU: 143.923 MiB | GPU: 373.914 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 171.780 s | iters: 763 | obj: -1.247988e+03 (min) | CPU: 3.22 GiB
│ │ ✗ | exa_gpu | time: 10.744 s | iters: 115 | obj: -1.083643e+02 (min) | CPU: 138.935 MiB | GPU: 725.767 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 230.370 s | iters: 527 | obj: -1.247988e+03 (min) | CPU: 5.10 GiB
│ │ ✗ | exa_gpu | time: 174.403 s | iters: 1000 | obj: -3.695779e+02 (min) | CPU: 865.837 MiB | GPU: 4.378 GiB
│ └─
└─
┌─ Problem: jackson
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 100.998 ms | iters: 23 | obj: -1.918150e-01 (min) | CPU: 20.88 MiB
│ │ ✓ | exa_gpu | time: 175.808 ms | iters: 22 | obj: -1.918374e-01 (min) | CPU: 10.130 MiB | GPU: 26.396 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 697.461 ms | iters: 21 | obj: -1.918128e-01 (min) | CPU: 96.91 MiB
│ │ ✓ | exa_gpu | time: 266.524 ms | iters: 25 | obj: -1.919247e-01 (min) | CPU: 16.844 MiB | GPU: 134.356 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 1.465 s | iters: 21 | obj: -1.918111e-01 (min) | CPU: 193.26 MiB
│ │ ✓ | exa_gpu | time: 319.805 ms | iters: 24 | obj: -1.920350e-01 (min) | CPU: 23.312 MiB | GPU: 266.923 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 3.335 s | iters: 20 | obj: -1.918079e-01 (min) | CPU: 375.57 MiB
│ │ ✓ | exa_gpu | time: 549.393 ms | iters: 21 | obj: -1.922558e-01 (min) | CPU: 35.987 MiB | GPU: 523.752 MiB
│ └─
└─
┌─ Problem: robbins
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 299.890 ms | iters: 44 | obj: 1.943317e+01 (min) | CPU: 14.32 MiB
│ │ ✓ | exa_gpu | time: 368.758 ms | iters: 44 | obj: 1.943298e+01 (min) | CPU: 18.419 MiB | GPU: 18.398 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 2.517 s | iters: 75 | obj: 1.943184e+01 (min) | CPU: 100.74 MiB
│ │ ✓ | exa_gpu | time: 492.927 ms | iters: 48 | obj: 1.943093e+01 (min) | CPU: 21.902 MiB | GPU: 93.877 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 4.205 s | iters: 63 | obj: 1.943181e+01 (min) | CPU: 176.54 MiB
│ │ ✓ | exa_gpu | time: 1.067 s | iters: 95 | obj: 1.942999e+01 (min) | CPU: 42.458 MiB | GPU: 236.838 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 11.831 s | iters: 71 | obj: 1.943181e+01 (min) | CPU: 386.90 MiB
│ │ ✓ | exa_gpu | time: 1.255 s | iters: 91 | obj: 1.942819e+01 (min) | CPU: 49.573 MiB | GPU: 464.901 MiB
│ └─
└─
┌─ Problem: rocket
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 144.374 ms | iters: 23 | obj: -1.012833e+00 (min) | CPU: 21.07 MiB
│ │ ✓ | exa_gpu | time: 346.080 ms | iters: 24 | obj: -1.012870e+00 (min) | CPU: 12.503 MiB | GPU: 34.870 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 941.435 ms | iters: 21 | obj: -1.012820e+00 (min) | CPU: 98.57 MiB
│ │ ✓ | exa_gpu | time: 846.136 ms | iters: 27 | obj: -1.013000e+00 (min) | CPU: 21.260 MiB | GPU: 176.718 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 2.362 s | iters: 24 | obj: -1.012824e+00 (min) | CPU: 210.32 MiB
│ │ ✓ | exa_gpu | time: 1.535 s | iters: 27 | obj: -1.013162e+00 (min) | CPU: 30.619 MiB | GPU: 353.259 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 4.141 s | iters: 21 | obj: -1.012767e+00 (min) | CPU: 392.57 MiB
│ │ ✓ | exa_gpu | time: 2.656 s | iters: 29 | obj: -1.013484e+00 (min) | CPU: 50.085 MiB | GPU: 712.702 MiB
│ └─
└─
┌─ Problem: vanderpol
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 23.107 ms | iters: 4 | obj: 1.047807e+00 (min) | CPU: 5.41 MiB
│ │ ✓ | exa_gpu | time: 116.645 ms | iters: 7 | obj: 1.047787e+00 (min) | CPU: 6.183 MiB | GPU: 13.701 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 149.856 ms | iters: 4 | obj: 1.047807e+00 (min) | CPU: 25.29 MiB
│ │ ✓ | exa_gpu | time: 141.503 ms | iters: 7 | obj: 1.047710e+00 (min) | CPU: 9.642 MiB | GPU: 68.475 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 320.819 ms | iters: 4 | obj: 1.047807e+00 (min) | CPU: 50.14 MiB
│ │ ✓ | exa_gpu | time: 191.049 ms | iters: 8 | obj: 1.047613e+00 (min) | CPU: 14.309 MiB | GPU: 137.547 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 774.128 ms | iters: 5 | obj: 1.047807e+00 (min) | CPU: 102.12 MiB
│ │ ✓ | exa_gpu | time: 231.373 ms | iters: 7 | obj: 1.047420e+00 (min) | CPU: 21.355 MiB | GPU: 273.683 MiB
│ └─
└─ KKT
This benchmark suite evaluates optimal control problems on the KKT runner.
⚙️ Configuration
Problems: beam, chain, double_oscillator, electric_vehicle, glider, insurance, jackson, robbins, robot, rocket, space_shuttle, steering, vanderpol, brachistochrone, balanced_field, bryson_denham, mountain_car
Solvers: madnlp
Models: exa, exa_gpu
Grid sizes: 1000, 5000, 10000, 20000 discretization points
Discretization: midpoint method
Tolerance: 1.0e-8
Ipopt strategy: adaptive barrier parameter
Limits: 1000 iterations max, 2000.0s wall time
🖥️ Environment
📅 Timestamp : 2026-03-05 09:50:16 UTC
🔧 Julia version : 1.11.9
💻 OS : Linux
🖥️ Machine : kkt.mcs.anl.govYou can download the exact environment used for this benchmark:
📦 Project.toml - Package dependencies
📋 Manifest.toml - Complete dependency tree with versions
📜 Benchmark script - Julia script to run the benchmark
These files allow you to reproduce the benchmark environment and results exactly.
Julia Version 1.11.9
Commit 53a02c0720c (2026-02-06 00:27 UTC)
Build Info:
Official https://julialang.org/ release
Platform Info:
OS: Linux (x86_64-linux-gnu)
CPU: 192 × INTEL(R) XEON(R) PLATINUM 8568Y+
WORD_SIZE: 64
LLVM: libLLVM-16.0.6 (ORCJIT, sapphirerapids)
Threads: 1 default, 0 interactive, 1 GC (on 192 virtual cores)
Environment:
JULIA_CUDSS_LIBRARY_PATH = /software/libcudss/libcudss-linux-x86_64-0.7.1.4_cuda13-archive/lib
JULIA_LOAD_PATH = @:@v#.#:@stdlib:/software/julia/environments/v1.12
JULIA_PKG_SERVER_REGISTRY_PREFERENCE = eager
JULIA_DEPOT_PATH = /storage/mschanen/github-actions/julia_depot
LD_LIBRARY_PATH = /software/julia/julia_binaries/julia-1.12/lib:/software/mpich-ofi/lib:/software/libcudss/libcudss-linux-x86_64-0.7.1.4_cuda13-archive/lib:/usr/local/cuda/lib Project CTBenchmarks v0.3.1
Status `/storage/mschanen/github-actions/actions_runner_ct/_work/CTBenchmarks.jl/CTBenchmarks.jl/Project.toml`
[6e4b80f9] BenchmarkTools v1.6.3
⌅ [54762871] CTBase v0.16.2
[052768ef] CUDA v5.9.7
[a93c6f00] DataFrames v1.8.1
[ffbed154] DocStringExtensions v0.9.5
[b6b21f68] Ipopt v1.14.1
[682c06a0] JSON v1.4.0
[4076af6c] JuMP v1.30.0
⌅ [d72a61cc] MadNLPGPU v0.7.18
[3b83494e] MadNLPMumps v0.5.1
[f4238b75] NLPModelsIpopt v0.11.2
⌃ [5f98b655] OptimalControl v1.1.8-beta.3
[59046045] OptimalControlProblems v0.4.0 `https://github.com/control-toolbox/OptimalControlProblems.jl#206-dev-test-all-new-problems-with-gpu`
[91a5bcdd] Plots v1.41.6
[10745b16] Statistics v1.11.1
[bd369af6] Tables v1.12.1
[ade2ca70] Dates v1.11.0
[b77e0a4c] InteractiveUtils v1.11.0
[44cfe95a] Pkg v1.11.0
[de0858da] Printf v1.11.0
[6462fe0b] Sockets v1.11.0
Info Packages marked with ⌃ and ⌅ have new versions available. Those with ⌃ may be upgradable, but those with ⌅ are restricted by compatibility constraints from upgrading. To see why use `status --outdated` Project CTBenchmarks v0.3.1
Status `/storage/mschanen/github-actions/actions_runner_ct/_work/CTBenchmarks.jl/CTBenchmarks.jl/Manifest.toml`
[54578032] ADNLPModels v0.8.13
[47edcb42] ADTypes v1.21.0
[14f7f29c] AMD v0.5.3
[621f4979] AbstractFFTs v1.5.0
[79e6a3ab] Adapt v4.5.0
[66dad0bd] AliasTables v1.1.3
[a9b6321e] Atomix v1.1.2
[13072b0f] AxisAlgorithms v1.1.0
[ab4f0b2a] BFloat16s v0.6.1
[6e4b80f9] BenchmarkTools v1.6.3
[d1d4a3ce] BitFlags v0.1.9
[fa961155] CEnum v0.5.0
⌅ [54762871] CTBase v0.16.2
⌅ [790bbbee] CTDirect v0.17.5-beta
⌃ [1c39547c] CTFlows v0.8.12-beta
⌅ [34c4fa32] CTModels v0.6.10-beta.2
⌃ [32681960] CTParser v0.8.2-beta.6
[052768ef] CUDA v5.9.7
[1af6417a] CUDA_Runtime_Discovery v1.0.0
[45b445bb] CUDSS v0.6.7
[d360d2e6] ChainRulesCore v1.26.0
[523fee87] CodecBzip2 v0.8.5
[944b1d66] CodecZlib v0.7.8
[35d6a980] ColorSchemes v3.31.0
[3da002f7] ColorTypes v0.12.1
[c3611d14] ColorVectorSpace v0.11.0
[5ae59095] Colors v0.13.1
[38540f10] CommonSolve v0.2.6
[bbf7d656] CommonSubexpressions v0.3.1
[34da2185] Compat v4.18.1
[f0e56b4a] ConcurrentUtilities v2.5.1
[d38c429a] Contour v0.6.3
[a8cc5b0e] Crayons v4.1.1
[9a962f9c] DataAPI v1.16.0
[a93c6f00] DataFrames v1.8.1
[864edb3b] DataStructures v0.19.3
[e2d170a0] DataValueInterfaces v1.0.0
[8bb1440f] DelimitedFiles v1.9.1
[163ba53b] DiffResults v1.1.0
[b552c78f] DiffRules v1.15.1
[ffbed154] DocStringExtensions v0.9.5
⌃ [1037b233] ExaModels v0.9.3
[460bff9d] ExceptionUnwrapping v0.1.11
[e2ba6199] ExprTools v0.1.10
[c87230d0] FFMPEG v0.4.5
[9aa1b823] FastClosures v0.3.2
[1a297f60] FillArrays v1.16.0
[53c48c17] FixedPointNumbers v0.8.5
[1fa38f19] Format v1.3.7
[f6369f11] ForwardDiff v1.3.2
[069b7b12] FunctionWrappers v1.1.3
[0c68f7d7] GPUArrays v11.4.1
[46192b85] GPUArraysCore v0.2.0
[61eb1bfa] GPUCompiler v1.8.2
[096a3bc2] GPUToolbox v1.0.0
[28b8d3ca] GR v0.73.24
[42e2da0e] Grisu v1.0.2
[34c5aeac] HSL v0.5.2
[cd3eb016] HTTP v1.10.19
[076d061b] HashArrayMappedTries v0.2.0
[842dd82b] InlineStrings v1.4.5
[a98d9a8b] Interpolations v0.16.2
[41ab1584] InvertedIndices v1.3.1
[b6b21f68] Ipopt v1.14.1
[92d709cd] IrrationalConstants v0.2.6
[82899510] IteratorInterfaceExtensions v1.0.0
[1019f520] JLFzf v0.1.11
[692b3bcd] JLLWrappers v1.7.1
[682c06a0] JSON v1.4.0
[4076af6c] JuMP v1.30.0
[63c18a36] KernelAbstractions v0.9.40
[40e66cde] LDLFactorizations v0.10.1
[929cbde3] LLVM v9.4.6
[8b046642] LLVMLoopInfo v1.0.0
[b964fa9f] LaTeXStrings v1.4.0
[23fbe1c1] Latexify v0.16.10
[5c8ed15e] LinearOperators v2.13.0
[2ab3a3ac] LogExpFunctions v0.3.29
[e6f89c97] LoggingExtras v1.2.0
[33e6dc65] MKL v0.9.1
[d8e11817] MLStyle v0.4.17
[1914dd2f] MacroTools v0.5.16
⌅ [2621e9c9] MadNLP v0.8.12
⌅ [d72a61cc] MadNLPGPU v0.7.18
[3b83494e] MadNLPMumps v0.5.1
[b8f27783] MathOptInterface v1.49.0
[739be429] MbedTLS v1.1.10
[442fdcdd] Measures v0.3.3
[2679e427] Metis v1.5.0
[e1d29d7a] Missings v1.2.0
[d8a4904e] MutableArithmetics v1.6.7
[a4795742] NLPModels v0.21.11
[f4238b75] NLPModelsIpopt v0.11.2
[e01155f1] NLPModelsModifiers v0.7.4
[5da4648a] NVTX v1.0.3
[77ba4419] NaNMath v1.1.3
[6fe1bfb0] OffsetArrays v1.17.0
[4d8831e6] OpenSSL v1.6.1
⌃ [5f98b655] OptimalControl v1.1.8-beta.3
[59046045] OptimalControlProblems v0.4.0 `https://github.com/control-toolbox/OptimalControlProblems.jl#206-dev-test-all-new-problems-with-gpu`
[bac558e1] OrderedCollections v1.8.1
[d96e819e] Parameters v0.12.3
[69de0a69] Parsers v2.8.3
[ccf2f8ad] PlotThemes v3.3.0
[995b91a9] PlotUtils v1.4.4
[91a5bcdd] Plots v1.41.6
[2dfb63ee] PooledArrays v1.4.3
⌅ [aea7be01] PrecompileTools v1.2.1
[21216c6a] Preferences v1.5.2
[08abe8d2] PrettyTables v3.2.3
[43287f4e] PtrArrays v1.4.0
[be4d8f0f] Quadmath v0.5.13
[74087812] Random123 v1.7.1
[e6cf234a] RandomNumbers v1.6.0
[c84ed2f1] Ratios v0.4.5
[3cdcf5f2] RecipesBase v1.3.4
[01d81517] RecipesPipeline v0.6.12
[189a3867] Reexport v1.2.2
[05181044] RelocatableFolders v1.0.1
[ae029012] Requires v1.3.1
[37e2e3b7] ReverseDiff v1.16.2
[7e506255] ScopedValues v1.5.0
[6c6a2e73] Scratch v1.3.0
[91c51154] SentinelArrays v1.4.9
[992d4aef] Showoff v1.0.3
[777ac1f9] SimpleBufferStream v1.2.0
[ff4d7338] SolverCore v0.3.10
[a2af1166] SortingAlgorithms v1.2.2
[9f842d2f] SparseConnectivityTracer v1.2.1
[0a514795] SparseMatrixColorings v0.4.24
[276daf66] SpecialFunctions v2.7.1
[860ef19b] StableRNGs v1.0.4
[90137ffa] StaticArrays v1.9.17
[1e83bf80] StaticArraysCore v1.4.4
[10745b16] Statistics v1.11.1
[82ae8749] StatsAPI v1.8.0
[2913bbd2] StatsBase v0.34.10
[892a3eda] StringManipulation v0.4.4
[ec057cc2] StructUtils v2.6.3
[3783bdb8] TableTraits v1.0.1
[bd369af6] Tables v1.12.1
[62fd8b95] TensorCore v0.1.1
[a759f4b9] TimerOutputs v0.5.29
[e689c965] Tracy v0.1.6
[3bb67fe8] TranscodingStreams v0.11.3
[5c2747f8] URIs v1.6.1
[3a884ed6] UnPack v1.0.2
[1cfade01] UnicodeFun v0.4.1
[013be700] UnsafeAtomics v0.3.0
[41fe7b60] Unzip v0.2.0
[efce3f68] WoodburyMatrices v1.1.0
[ae81ac8f] ASL_jll v0.1.3+0
[6e34b625] Bzip2_jll v1.0.9+0
[d1e2174e] CUDA_Compiler_jll v0.4.1+1
[4ee394cb] CUDA_Driver_jll v13.1.0+2
⌅ [76a88914] CUDA_Runtime_jll v0.19.2+0
[4889d778] CUDSS_jll v0.7.1+0
[83423d85] Cairo_jll v1.18.5+1
[ee1fde0b] Dbus_jll v1.16.2+0
[2702e6a9] EpollShim_jll v0.0.20230411+1
[2e619515] Expat_jll v2.7.3+0
[b22a6f82] FFMPEG_jll v8.0.1+0
[a3f928ae] Fontconfig_jll v2.17.1+0
[d7e528f0] FreeType2_jll v2.13.4+0
[559328eb] FriBidi_jll v1.0.17+0
[0656b61e] GLFW_jll v3.4.1+0
[d2c73de3] GR_jll v0.73.24+0
[b0724c58] GettextRuntime_jll v0.22.4+0
[61579ee1] Ghostscript_jll v9.55.1+0
[7746bdde] Glib_jll v2.86.3+0
[3b182d85] Graphite2_jll v1.3.15+0
[017b0a0e] HSL_jll v4.0.4+0
[2e76f6c2] HarfBuzz_jll v8.5.1+0
[e33a78d0] Hwloc_jll v2.13.0+0
[1d5cc7b8] IntelOpenMP_jll v2025.2.0+0
[9cc047cb] Ipopt_jll v300.1400.1901+0
[aacddb02] JpegTurbo_jll v3.1.4+0
[9c1d0b0a] JuliaNVTXCallbacks_jll v0.2.1+0
[c1c5ebd0] LAME_jll v3.100.3+0
[88015f11] LERC_jll v4.0.1+0
[dad2f222] LLVMExtra_jll v0.0.38+0
[1d63c593] LLVMOpenMP_jll v18.1.8+0
[dd4b983a] LZO_jll v2.10.3+0
[ad6e5548] LibTracyClient_jll v0.13.1+0
⌅ [e9f186c6] Libffi_jll v3.4.7+0
[7e76a0d4] Libglvnd_jll v1.7.1+1
[94ce4f54] Libiconv_jll v1.18.0+0
[4b2f31a3] Libmount_jll v2.41.3+0
[89763e89] Libtiff_jll v4.7.2+0
[38a345b3] Libuuid_jll v2.41.3+0
[d00139f3] METIS_jll v5.1.3+0
[856f044c] MKL_jll v2025.2.0+0
[d7ed1dd3] MUMPS_seq_jll v500.800.200+0
[e98f9f5b] NVTX_jll v3.2.2+0
[e7412a2a] Ogg_jll v1.3.6+0
[656ef2d0] OpenBLAS32_jll v0.3.30+0
[458c3c95] OpenSSL_jll v3.5.5+0
[efe28fd5] OpenSpecFun_jll v0.5.6+0
[91d4177d] Opus_jll v1.6.1+0
[36c8627f] Pango_jll v1.57.0+0
⌅ [30392449] Pixman_jll v0.44.2+0
[c0090381] Qt6Base_jll v6.10.2+1
[629bc702] Qt6Declarative_jll v6.10.2+1
[ce943373] Qt6ShaderTools_jll v6.10.2+1
[6de9746b] Qt6Svg_jll v6.10.2+0
[e99dba38] Qt6Wayland_jll v6.10.2+1
[319450e9] SPRAL_jll v2025.9.18+0
[a44049a8] Vulkan_Loader_jll v1.3.243+0
[a2964d1f] Wayland_jll v1.24.0+0
⌅ [02c8fc9c] XML2_jll v2.13.9+0
[ffd25f8a] XZ_jll v5.8.2+0
[f67eecfb] Xorg_libICE_jll v1.1.2+0
[c834827a] Xorg_libSM_jll v1.2.6+0
[4f6342f7] Xorg_libX11_jll v1.8.13+0
[0c0b7dd1] Xorg_libXau_jll v1.0.13+0
[935fb764] Xorg_libXcursor_jll v1.2.4+0
[a3789734] Xorg_libXdmcp_jll v1.1.6+0
[1082639a] Xorg_libXext_jll v1.3.8+0
[d091e8ba] Xorg_libXfixes_jll v6.0.2+0
[a51aa0fd] Xorg_libXi_jll v1.8.3+0
[d1454406] Xorg_libXinerama_jll v1.1.7+0
[ec84b674] Xorg_libXrandr_jll v1.5.6+0
[ea2f1a96] Xorg_libXrender_jll v0.9.12+0
[a65dc6b1] Xorg_libpciaccess_jll v0.18.1+0
[c7cfdc94] Xorg_libxcb_jll v1.17.1+0
[cc61e674] Xorg_libxkbfile_jll v1.2.0+0
[e920d4aa] Xorg_xcb_util_cursor_jll v0.1.6+0
[12413925] Xorg_xcb_util_image_jll v0.4.1+0
[2def613f] Xorg_xcb_util_jll v0.4.1+0
[975044d2] Xorg_xcb_util_keysyms_jll v0.4.1+0
[0d47668e] Xorg_xcb_util_renderutil_jll v0.3.10+0
[c22f9ab0] Xorg_xcb_util_wm_jll v0.4.2+0
[35661453] Xorg_xkbcomp_jll v1.4.7+0
[33bec58e] Xorg_xkeyboard_config_jll v2.44.0+0
[c5fb5394] Xorg_xtrans_jll v1.6.0+0
[3161d3a3] Zstd_jll v1.5.7+1
[1e29f10c] demumble_jll v1.3.0+0
[35ca27e7] eudev_jll v3.2.14+0
[214eeab7] fzf_jll v0.61.1+0
[a4ae2306] libaom_jll v3.13.1+0
[0ac62f75] libass_jll v0.17.4+0
[1183f4f0] libdecor_jll v0.2.2+0
[2db6ffa8] libevdev_jll v1.13.4+0
[f638f0a6] libfdk_aac_jll v2.0.4+0
[36db933b] libinput_jll v1.28.1+0
[b53b4c65] libpng_jll v1.6.55+0
[f27f6e37] libvorbis_jll v1.3.8+0
[009596ad] mtdev_jll v1.1.7+0
[1317d2d5] oneTBB_jll v2022.0.0+1
⌅ [1270edf5] x264_jll v10164.0.1+0
[dfaa095f] x265_jll v4.1.0+0
[d8fb68d0] xkbcommon_jll v1.13.0+0
[0dad84c5] ArgTools v1.1.2
[56f22d72] Artifacts v1.11.0
[2a0f44e3] Base64 v1.11.0
[ade2ca70] Dates v1.11.0
[8ba89e20] Distributed v1.11.0
[f43a241f] Downloads v1.6.0
[7b1f6079] FileWatching v1.11.0
[9fa8497b] Future v1.11.0
[b77e0a4c] InteractiveUtils v1.11.0
[4af54fe1] LazyArtifacts v1.11.0
[b27032c2] LibCURL v0.6.4
[76f85450] LibGit2 v1.11.0
[8f399da3] Libdl v1.11.0
[37e2e46d] LinearAlgebra v1.11.0
[56ddb016] Logging v1.11.0
[d6f4376e] Markdown v1.11.0
[a63ad114] Mmap v1.11.0
[ca575930] NetworkOptions v1.2.0
[44cfe95a] Pkg v1.11.0
[de0858da] Printf v1.11.0
[9abbd945] Profile v1.11.0
[3fa0cd96] REPL v1.11.0
[9a3f8284] Random v1.11.0
[ea8e919c] SHA v0.7.0
[9e88b42a] Serialization v1.11.0
[1a1011a3] SharedArrays v1.11.0
[6462fe0b] Sockets v1.11.0
[2f01184e] SparseArrays v1.11.0
[f489334b] StyledStrings v1.11.0
[4607b0f0] SuiteSparse
[fa267f1f] TOML v1.0.3
[a4e569a6] Tar v1.10.0
[8dfed614] Test v1.11.0
[cf7118a7] UUIDs v1.11.0
[4ec0a83e] Unicode v1.11.0
[e66e0078] CompilerSupportLibraries_jll v1.1.1+0
[deac9b47] LibCURL_jll v8.6.0+0
[e37daf67] LibGit2_jll v1.7.2+0
[29816b5a] LibSSH2_jll v1.11.0+1
[c8ffd9c3] MbedTLS_jll v2.28.6+0
[14a3606d] MozillaCACerts_jll v2023.12.12
[4536629a] OpenBLAS_jll v0.3.27+1
[05823500] OpenLibm_jll v0.8.5+0
[efcefdf7] PCRE2_jll v10.42.0+1
[bea87d4a] SuiteSparse_jll v7.7.0+0
[83775a58] Zlib_jll v1.2.13+1
[8e850b90] libblastrampoline_jll v5.11.0+0
[8e850ede] nghttp2_jll v1.59.0+0
[3f19e933] p7zip_jll v17.4.0+2
Info Packages marked with ⌃ and ⌅ have new versions available. Those with ⌃ may be upgradable, but those with ⌅ are restricted by compatibility constraints from upgrading. To see why use `status --outdated -m` 📈 Performance Profile GPU Time
Dataset overview for core-kkt:
- Problems: 17 unique optimal control problems
- Instances: 68
- Solver combos: 2
Profile configuration:
- Instance definition: (problem, grid_size)
- Solver combos definition: (model, solver)
- Criterion: CPU time
- Successful runs: 134/136 (98.5%)
- Successful instances: 68/68 (100.0%)
- Unsuccessful instances: none (every instance had at least one successful run)
Robustness (% of instances solved):
(exa, madnlp): 98.5%(exa_gpu, madnlp): 98.5%
Efficiency (% of instances where fastest):
(exa, madnlp): 27.9%(exa_gpu, madnlp): 72.1%
Most robust: 2 combinations tied at 98.5%.
Most efficient: (exa_gpu, madnlp) was fastest on 72.1% of instances.
📈 Performance Profile Iterations
Dataset overview for core-kkt:
- Problems: 17 unique optimal control problems
- Instances: 68
- Solver combos: 2
Profile configuration:
- Instance definition: (problem, grid_size)
- Solver combos definition: (model, solver)
- Criterion: Iterations
- Successful runs: 134/136 (98.5%)
- Successful instances: 68/68 (100.0%)
- Unsuccessful instances: none (every instance had at least one successful run)
Robustness (% of instances solved):
(exa, madnlp): 98.5%(exa_gpu, madnlp): 98.5%
Efficiency (% of instances where fastest):
(exa, madnlp): 67.6%(exa_gpu, madnlp): 39.7%
Most robust: 2 combinations tied at 98.5%.
Most efficient: (exa, madnlp) was fastest on 67.6% of instances.
📊 Tables of Results
| Success | N | Model | Solver | Time (ms) | Iters | Objective | Criterion | Best |
|---|---|---|---|---|---|---|---|---|
| ✓ | 1000 | exa | madnlp | 129.901 | 25 | 771.007821 | min | ✓ |
| ✓ | 1000 | exa_gpu | madnlp | 289.063 | 30 | 770.995683 | min |
| Success | N | Model | Solver | Time (ms) | Iters | Objective | Criterion | Best |
|---|---|---|---|---|---|---|---|---|
| ✗ | 5000 | exa | madnlp | 31579.800 | 1000 | 883.592382 | min | |
| ✓ | 5000 | exa_gpu | madnlp | 1524.028 | 40 | 770.947303 | min | ✓ |
| Success | N | Model | Solver | Time (ms) | Iters | Objective | Criterion | Best |
|---|---|---|---|---|---|---|---|---|
| ✓ | 10000 | exa | madnlp | 4480.590 | 52 | 771.007836 | min | |
| ✓ | 10000 | exa_gpu | madnlp | 2226.689 | 36 | 770.886786 | min | ✓ |
| Success | N | Model | Solver | Time (ms) | Iters | Objective | Criterion | Best |
|---|---|---|---|---|---|---|---|---|
| ✓ | 20000 | exa | madnlp | 9378.732 | 50 | 771.007861 | min | |
| ✓ | 20000 | exa_gpu | madnlp | 5374.607 | 49 | 770.766066 | min | ✓ |
Benchmarks results:
┌─ Problem: beam
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 44.575 ms | iters: 26 | obj: 8.888914e+00 (min) | CPU: 8.89 MiB
│ │ ✓ | exa_gpu | time: 191.085 ms | iters: 48 | obj: 8.888302e+00 (min) | CPU: 16.619 MiB | GPU: 12.065 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 731.085 ms | iters: 79 | obj: 8.888892e+00 (min) | CPU: 97.02 MiB
│ │ ✓ | exa_gpu | time: 585.251 ms | iters: 139 | obj: 8.885839e+00 (min) | CPU: 48.059 MiB | GPU: 94.741 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 3.116 s | iters: 175 | obj: 8.888893e+00 (min) | CPU: 391.50 MiB
│ │ ✓ | exa_gpu | time: 1.362 s | iters: 266 | obj: 8.882791e+00 (min) | CPU: 99.561 MiB | GPU: 285.392 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 7.164 s | iters: 189 | obj: 8.888898e+00 (min) | CPU: 841.46 MiB
│ │ ✓ | exa_gpu | time: 1.899 s | iters: 414 | obj: 8.876698e+00 (min) | CPU: 134.679 MiB | GPU: 793.996 MiB
│ └─
└─
┌─ Problem: chain
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 36.757 ms | iters: 14 | obj: 5.068480e+00 (min) | CPU: 6.74 MiB
│ │ ✓ | exa_gpu | time: 88.766 ms | iters: 15 | obj: 5.068452e+00 (min) | CPU: 7.761 MiB | GPU: 14.326 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 298.970 ms | iters: 13 | obj: 5.068480e+00 (min) | CPU: 30.89 MiB
│ │ ✓ | exa_gpu | time: 126.469 ms | iters: 16 | obj: 5.068339e+00 (min) | CPU: 10.925 MiB | GPU: 72.023 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 611.178 ms | iters: 13 | obj: 5.068480e+00 (min) | CPU: 61.26 MiB
│ │ ✓ | exa_gpu | time: 422.304 ms | iters: 51 | obj: 5.068197e+00 (min) | CPU: 33.574 MiB | GPU: 176.652 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 2.089 s | iters: 14 | obj: 5.068480e+00 (min) | CPU: 125.07 MiB
│ │ ✓ | exa_gpu | time: 204.581 ms | iters: 15 | obj: 5.067922e+00 (min) | CPU: 19.887 MiB | GPU: 286.056 MiB
│ └─
└─
┌─ Problem: double_oscillator
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 24.124 ms | iters: 6 | obj: 9.110011e-04 (min) | CPU: 10.65 MiB
│ │ ✓ | exa_gpu | time: 115.200 ms | iters: 6 | obj: 9.106227e-04 (min) | CPU: 5.805 MiB | GPU: 29.597 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 295.077 ms | iters: 6 | obj: 9.110335e-04 (min) | CPU: 51.32 MiB
│ │ ✓ | exa_gpu | time: 134.800 ms | iters: 6 | obj: 9.091471e-04 (min) | CPU: 14.889 MiB | GPU: 148.157 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 613.176 ms | iters: 6 | obj: 9.110345e-04 (min) | CPU: 102.16 MiB
│ │ ✓ | exa_gpu | time: 146.024 ms | iters: 6 | obj: 9.072690e-04 (min) | CPU: 20.201 MiB | GPU: 295.844 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 1.417 s | iters: 6 | obj: 9.110348e-04 (min) | CPU: 203.85 MiB
│ │ ✓ | exa_gpu | time: 242.082 ms | iters: 6 | obj: 9.035309e-04 (min) | CPU: 36.195 MiB | GPU: 591.571 MiB
│ └─
└─
┌─ Problem: electric_vehicle
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 10.914 ms | iters: 4 | obj: 1.228583e+03 (min) | CPU: 4.95 MiB
│ │ ✓ | exa_gpu | time: 62.663 ms | iters: 11 | obj: 1.228577e+03 (min) | CPU: 6.128 MiB | GPU: 13.152 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 113.830 ms | iters: 5 | obj: 1.228580e+03 (min) | CPU: 23.54 MiB
│ │ ✓ | exa_gpu | time: 83.919 ms | iters: 11 | obj: 1.228551e+03 (min) | CPU: 9.564 MiB | GPU: 65.721 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 242.698 ms | iters: 5 | obj: 1.228580e+03 (min) | CPU: 46.63 MiB
│ │ ✓ | exa_gpu | time: 93.614 ms | iters: 10 | obj: 1.228521e+03 (min) | CPU: 12.344 MiB | GPU: 130.823 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 518.973 ms | iters: 5 | obj: 1.228580e+03 (min) | CPU: 92.81 MiB
│ │ ✓ | exa_gpu | time: 316.410 ms | iters: 10 | obj: 1.228463e+03 (min) | CPU: 19.292 MiB | GPU: 261.610 MiB
│ └─
└─
┌─ Problem: glider
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 863.147 ms | iters: 102 | obj: -1.247985e+03 (min) | CPU: 62.93 MiB
│ │ ✓ | exa_gpu | time: 595.462 ms | iters: 30 | obj: -1.247986e+03 (min) | CPU: 38.206 MiB | GPU: 55.249 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 772.646 ms | iters: 16 | obj: -1.247987e+03 (min) | CPU: 122.65 MiB
│ │ ✓ | exa_gpu | time: 3.619 s | iters: 92 | obj: -1.247990e+03 (min) | CPU: 98.110 MiB | GPU: 328.963 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 1.703 s | iters: 16 | obj: -1.247988e+03 (min) | CPU: 244.58 MiB
│ │ ✓ | exa_gpu | time: 1.697 s | iters: 22 | obj: -1.247993e+03 (min) | CPU: 58.597 MiB | GPU: 537.162 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 3.730 s | iters: 17 | obj: -1.247988e+03 (min) | CPU: 496.23 MiB
│ │ ✓ | exa_gpu | time: 2.706 s | iters: 20 | obj: -1.247998e+03 (min) | CPU: 92.262 MiB | GPU: 1.042 GiB
│ └─
└─
┌─ Problem: insurance
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 3.447 s | iters: 313 | obj: -2.058233e+00 (min) | CPU: 282.43 MiB
│ │ ✓ | exa_gpu | time: 2.944 s | iters: 448 | obj: -2.058242e+00 (min) | CPU: 193.963 MiB | GPU: 180.049 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 33.535 s | iters: 757 | obj: -2.059098e+00 (min) | CPU: 3.07 GiB
│ │ ✓ | exa_gpu | time: 6.901 s | iters: 442 | obj: -2.059144e+00 (min) | CPU: 248.964 MiB | GPU: 894.567 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 68.372 s | iters: 573 | obj: -2.059342e+00 (min) | CPU: 4.64 GiB
│ │ ✓ | exa_gpu | time: 13.416 s | iters: 504 | obj: -2.059436e+00 (min) | CPU: 293.081 MiB | GPU: 1.945 GiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 187.350 s | iters: 705 | obj: -2.059516e+00 (min) | CPU: 11.86 GiB
│ │ ✓ | exa_gpu | time: 28.060 s | iters: 716 | obj: -2.059776e+00 (min) | CPU: 418.306 MiB | GPU: 5.267 GiB
│ └─
└─
┌─ Problem: jackson
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 62.070 ms | iters: 23 | obj: -1.918150e-01 (min) | CPU: 20.88 MiB
│ │ ✓ | exa_gpu | time: 101.388 ms | iters: 22 | obj: -1.918374e-01 (min) | CPU: 10.195 MiB | GPU: 26.360 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 410.546 ms | iters: 21 | obj: -1.918128e-01 (min) | CPU: 96.91 MiB
│ │ ✓ | exa_gpu | time: 167.446 ms | iters: 25 | obj: -1.919247e-01 (min) | CPU: 16.848 MiB | GPU: 134.146 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 847.248 ms | iters: 21 | obj: -1.918111e-01 (min) | CPU: 193.26 MiB
│ │ ✓ | exa_gpu | time: 202.525 ms | iters: 24 | obj: -1.920350e-01 (min) | CPU: 23.377 MiB | GPU: 266.615 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 1.797 s | iters: 20 | obj: -1.918079e-01 (min) | CPU: 375.57 MiB
│ │ ✓ | exa_gpu | time: 358.025 ms | iters: 21 | obj: -1.922558e-01 (min) | CPU: 35.993 MiB | GPU: 523.307 MiB
│ └─
└─
┌─ Problem: robbins
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 183.457 ms | iters: 44 | obj: 1.943317e+01 (min) | CPU: 14.32 MiB
│ │ ✓ | exa_gpu | time: 195.002 ms | iters: 44 | obj: 1.943298e+01 (min) | CPU: 18.421 MiB | GPU: 18.336 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 1.219 s | iters: 75 | obj: 1.943184e+01 (min) | CPU: 100.74 MiB
│ │ ✓ | exa_gpu | time: 246.295 ms | iters: 47 | obj: 1.943093e+01 (min) | CPU: 22.073 MiB | GPU: 93.070 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 2.147 s | iters: 63 | obj: 1.943181e+01 (min) | CPU: 176.54 MiB
│ │ ✓ | exa_gpu | time: 559.073 ms | iters: 97 | obj: 1.942999e+01 (min) | CPU: 42.863 MiB | GPU: 238.016 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 5.562 s | iters: 71 | obj: 1.943181e+01 (min) | CPU: 386.90 MiB
│ │ ✓ | exa_gpu | time: 602.137 ms | iters: 88 | obj: 1.942819e+01 (min) | CPU: 47.758 MiB | GPU: 456.831 MiB
│ └─
└─
┌─ Problem: robot
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 312.799 ms | iters: 23 | obj: 9.140917e+00 (min) | CPU: 37.94 MiB
│ │ ✓ | exa_gpu | time: 387.340 ms | iters: 29 | obj: 9.140745e+00 (min) | CPU: 18.835 MiB | GPU: 58.809 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 4.047 s | iters: 50 | obj: 9.140949e+00 (min) | CPU: 311.78 MiB
│ │ ✓ | exa_gpu | time: 2.643 s | iters: 55 | obj: 9.140129e+00 (min) | CPU: 44.905 MiB | GPU: 337.881 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 5.629 s | iters: 34 | obj: 9.140939e+00 (min) | CPU: 465.04 MiB
│ │ ✓ | exa_gpu | time: 4.003 s | iters: 54 | obj: 9.139277e+00 (min) | CPU: 55.292 MiB | GPU: 658.198 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 25.862 s | iters: 41 | obj: 9.140966e+00 (min) | CPU: 1.12 GiB
│ │ ✓ | exa_gpu | time: 4.980 s | iters: 31 | obj: 9.137650e+00 (min) | CPU: 71.859 MiB | GPU: 1.157 GiB
│ └─
└─
┌─ Problem: rocket
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 78.904 ms | iters: 23 | obj: -1.012833e+00 (min) | CPU: 21.07 MiB
│ │ ✓ | exa_gpu | time: 174.721 ms | iters: 24 | obj: -1.012870e+00 (min) | CPU: 12.509 MiB | GPU: 34.815 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 479.342 ms | iters: 21 | obj: -1.012820e+00 (min) | CPU: 98.57 MiB
│ │ ✓ | exa_gpu | time: 645.685 ms | iters: 27 | obj: -1.013000e+00 (min) | CPU: 21.266 MiB | GPU: 176.399 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 1.190 s | iters: 24 | obj: -1.012824e+00 (min) | CPU: 210.32 MiB
│ │ ✓ | exa_gpu | time: 1.076 s | iters: 27 | obj: -1.013162e+00 (min) | CPU: 30.624 MiB | GPU: 352.749 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 2.223 s | iters: 21 | obj: -1.012767e+00 (min) | CPU: 392.57 MiB
│ │ ✓ | exa_gpu | time: 1.892 s | iters: 29 | obj: -1.013484e+00 (min) | CPU: 50.153 MiB | GPU: 711.680 MiB
│ └─
└─
┌─ Problem: space_shuttle
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 2.137 s | iters: 110 | obj: -5.958761e-01 (min) | CPU: 164.85 MiB
│ │ ✓ | exa_gpu | time: 2.020 s | iters: 118 | obj: -5.959073e-01 (min) | CPU: 86.037 MiB | GPU: 185.746 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 53.657 s | iters: 451 | obj: -5.958761e-01 (min) | CPU: 3.44 GiB
│ │ ✓ | exa_gpu | time: 7.136 s | iters: 116 | obj: -5.960318e-01 (min) | CPU: 127.264 MiB | GPU: 925.316 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 28.759 s | iters: 114 | obj: -5.958761e-01 (min) | CPU: 1.62 GiB
│ │ ✓ | exa_gpu | time: 11.703 s | iters: 104 | obj: -5.961874e-01 (min) | CPU: 171.268 MiB | GPU: 1.769 GiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 66.261 s | iters: 132 | obj: -5.958760e-01 (min) | CPU: 3.51 GiB
│ │ ✓ | exa_gpu | time: 29.395 s | iters: 145 | obj: -5.964987e-01 (min) | CPU: 307.333 MiB | GPU: 3.800 GiB
│ └─
└─
┌─ Problem: steering
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 75.630 ms | iters: 14 | obj: 5.545709e-01 (min) | CPU: 11.37 MiB
│ │ ✓ | exa_gpu | time: 131.640 ms | iters: 11 | obj: 5.545709e-01 (min) | CPU: 8.181 MiB | GPU: 23.636 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 544.788 ms | iters: 14 | obj: 5.545709e-01 (min) | CPU: 54.72 MiB
│ │ ✓ | exa_gpu | time: 479.458 ms | iters: 12 | obj: 5.545705e-01 (min) | CPU: 13.628 MiB | GPU: 118.770 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 1.197 s | iters: 15 | obj: 5.545709e-01 (min) | CPU: 111.75 MiB
│ │ ✓ | exa_gpu | time: 893.465 ms | iters: 12 | obj: 5.545706e-01 (min) | CPU: 18.395 MiB | GPU: 237.352 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 2.567 s | iters: 15 | obj: 5.545709e-01 (min) | CPU: 222.96 MiB
│ │ ✓ | exa_gpu | time: 1.666 s | iters: 13 | obj: 5.545694e-01 (min) | CPU: 31.252 MiB | GPU: 477.246 MiB
│ └─
└─
┌─ Problem: vanderpol
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 13.949 ms | iters: 4 | obj: 1.047807e+00 (min) | CPU: 5.41 MiB
│ │ ✓ | exa_gpu | time: 57.441 ms | iters: 8 | obj: 1.047785e+00 (min) | CPU: 5.187 MiB | GPU: 13.727 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 90.208 ms | iters: 4 | obj: 1.047807e+00 (min) | CPU: 25.29 MiB
│ │ ✓ | exa_gpu | time: 80.451 ms | iters: 7 | obj: 1.047710e+00 (min) | CPU: 9.707 MiB | GPU: 68.353 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 203.157 ms | iters: 4 | obj: 1.047807e+00 (min) | CPU: 50.14 MiB
│ │ ✓ | exa_gpu | time: 95.827 ms | iters: 8 | obj: 1.047590e+00 (min) | CPU: 13.363 MiB | GPU: 137.229 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 475.441 ms | iters: 5 | obj: 1.047807e+00 (min) | CPU: 102.12 MiB
│ │ ✓ | exa_gpu | time: 147.936 ms | iters: 7 | obj: 1.047420e+00 (min) | CPU: 21.650 MiB | GPU: 273.360 MiB
│ └─
└─
┌─ Problem: brachistochrone
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 78.617 ms | iters: 23 | obj: 1.802932e+00 (min) | CPU: 12.69 MiB
│ │ ✓ | exa_gpu | time: 161.781 ms | iters: 20 | obj: 1.802931e+00 (min) | CPU: 11.205 MiB | GPU: 23.607 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 688.369 ms | iters: 26 | obj: 1.802932e+00 (min) | CPU: 64.92 MiB
│ │ ✓ | exa_gpu | time: 1.834 s | iters: 79 | obj: 1.802923e+00 (min) | CPU: 38.189 MiB | GPU: 151.987 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 1.670 s | iters: 31 | obj: 1.802935e+00 (min) | CPU: 141.30 MiB
│ │ ✓ | exa_gpu | time: 1.135 s | iters: 26 | obj: 1.802923e+00 (min) | CPU: 24.237 MiB | GPU: 242.057 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 2.798 s | iters: 26 | obj: 1.802934e+00 (min) | CPU: 257.80 MiB
│ │ ✓ | exa_gpu | time: 1.690 s | iters: 21 | obj: 1.802914e+00 (min) | CPU: 33.815 MiB | GPU: 473.430 MiB
│ └─
└─
┌─ Problem: balanced_field
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 129.901 ms | iters: 25 | obj: 7.710078e+02 (min) | CPU: 26.53 MiB
│ │ ✓ | exa_gpu | time: 289.063 ms | iters: 30 | obj: 7.709957e+02 (min) | CPU: 22.780 MiB | GPU: 54.850 MiB
│ │
│ │ N = 5000
│ │ ✗ | exa | time: 31.580 s | iters: 1000 | obj: 8.835924e+02 (min) | CPU: 1.99 GiB
│ │ ✓ | exa_gpu | time: 1.524 s | iters: 40 | obj: 7.709473e+02 (min) | CPU: 39.904 MiB | GPU: 282.896 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 4.481 s | iters: 52 | obj: 7.710078e+02 (min) | CPU: 378.73 MiB
│ │ ✓ | exa_gpu | time: 2.227 s | iters: 36 | obj: 7.708868e+02 (min) | CPU: 53.616 MiB | GPU: 558.280 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 9.379 s | iters: 50 | obj: 7.710079e+02 (min) | CPU: 719.05 MiB
│ │ ✓ | exa_gpu | time: 5.375 s | iters: 49 | obj: 7.707661e+02 (min) | CPU: 96.038 MiB | GPU: 1.134 GiB
│ └─
└─
┌─ Problem: bryson_denham
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 34.745 ms | iters: 20 | obj: 4.000009e+00 (min) | CPU: 6.72 MiB
│ │ ✓ | exa_gpu | time: 518.799 ms | iters: 99 | obj: 3.999729e+00 (min) | CPU: 34.144 MiB | GPU: 15.870 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 881.260 ms | iters: 86 | obj: 4.000002e+00 (min) | CPU: 87.26 MiB
│ │ ✓ | exa_gpu | time: 313.598 ms | iters: 70 | obj: 3.998618e+00 (min) | CPU: 26.107 MiB | GPU: 68.138 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 3.125 s | iters: 158 | obj: 4.000004e+00 (min) | CPU: 294.86 MiB
│ │ ✓ | exa_gpu | time: 342.614 ms | iters: 63 | obj: 3.997237e+00 (min) | CPU: 27.557 MiB | GPU: 131.014 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 14.172 s | iters: 361 | obj: 4.000008e+00 (min) | CPU: 1.24 GiB
│ │ ✓ | exa_gpu | time: 1.613 s | iters: 293 | obj: 3.994479e+00 (min) | CPU: 112.981 MiB | GPU: 610.002 MiB
│ └─
└─
┌─ Problem: mountain_car
│
├──┬ Solver: madnlp, Discretization: midpoint
│ │
│ │ N = 1000
│ │ ✓ | exa | time: 206.963 ms | iters: 71 | obj: 1.023686e+02 (min) | CPU: 36.51 MiB
│ │ ✓ | exa_gpu | time: 1.124 s | iters: 184 | obj: 1.023511e+02 (min) | CPU: 77.396 MiB | GPU: 36.856 MiB
│ │
│ │ N = 5000
│ │ ✓ | exa | time: 2.269 s | iters: 141 | obj: 1.023676e+02 (min) | CPU: 338.09 MiB
│ │ ✗ | exa_gpu | time: 23.338 s | iters: 1000 | obj: 1.136136e+02 (min) | CPU: 387.983 MiB | GPU: 662.096 MiB
│ │
│ │ N = 10000
│ │ ✓ | exa | time: 10.889 s | iters: 402 | obj: 1.023676e+02 (min) | CPU: 1.67 GiB
│ │ ✓ | exa_gpu | time: 5.706 s | iters: 166 | obj: 1.021974e+02 (min) | CPU: 83.368 MiB | GPU: 346.825 MiB
│ │
│ │ N = 20000
│ │ ✓ | exa | time: 11.079 s | iters: 188 | obj: 1.023676e+02 (min) | CPU: 1.63 GiB
│ │ ✓ | exa_gpu | time: 22.810 s | iters: 446 | obj: 1.020270e+02 (min) | CPU: 209.499 MiB | GPU: 1.264 GiB
│ └─
└─