Merge master branch into merge_fkpl_collisions #162
Conversation
When reloading data for a restart, it is necessary to create `coordinate` objects using the parallelisation of the new simulation, not the parallelisation that was used to write the data.
Only the coordinate ranges actually differ between the `parallel_io=true` and `parallel_io=false` branches, so a lot of code can be moved outside the `if parallel_io` block.
This allows a simulation to restart from a run with different resolution in any or all dimensions, and also to restart from a run with different moment-kinetic settings (`evolve_density`, `evolve_upar` and `evolve_ppar`).
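A minimal sketch of the reload-and-interpolate idea, using Interpolations.jl as a stand-in for the code's own interpolation routines (the function name and call pattern here are illustrative assumptions, not the actual implementation):

```julia
using Interpolations

# Reloaded data lives on the old run's grid; the new run's coordinate objects
# define the grid (and parallelisation) actually wanted, so each variable is
# mapped from the old grid onto the new one.
function reinterpolate_reloaded_variable(z_old::Vector{Float64},
                                         f_old::Vector{Float64},
                                         z_new::Vector{Float64})
    itp = linear_interpolation(z_old, f_old; extrapolation_bc=Flat())
    return itp.(z_new)
end
```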
This is needed to get the `global_io_range` correct, which is required when reloading.
This will allow them to be reused in other tests.
`discretization_info` is the abstract type inherited by all implementations of a discretization (i.e. `chebyshev_info` and `finite_difference_info`). Also adds a `finite_difference_info` that is used to dispatch to the finite difference methods, instead of using `Bool` for that.
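A minimal sketch of this dispatch pattern (the field contents are illustrative only; the real types carry each discretization's setup data):

```julia
# Abstract supertype for all discretization implementations.
abstract type discretization_info end

# Each implementation defines its concrete subtype alongside its methods.
struct chebyshev_info <: discretization_info
    # ...precomputed Chebyshev transform data would live here...
end

struct finite_difference_info <: discretization_info end

# Dispatch on the concrete type replaces checking a Bool flag.
describe(::chebyshev_info) = "Chebyshev pseudospectral methods"
describe(::finite_difference_info) = "finite difference methods"
```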
Length-1 dimensions have to be treated a bit specially. Derivatives in those directions should be zero. 'Interpolation' to a grid with more than one point: for spatial dimensions, assume the variable is constant; for velocity dimensions, use a Maxwellian whose peak value is the value on the length-1 grid, and whose width is 1 (in units of the reference speed) - this means that the integral of the variable over velocity space, after accounting for factors of pi^0.5 or pi^1.5 in the normalization of distribution functions, is the same (up to discretization error) after 'interpolating' as it was before.
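As a concrete illustration of the velocity-dimension case (a sketch with an assumed function name, not the code in this PR): a length-1 velocity grid holding the single value `f0` is expanded onto a new grid as a unit-width Maxwellian peaking at `f0`, so the normalised velocity integral is unchanged.

```julia
# 'Interpolate' a variable from a length-1 velocity grid onto a new grid `v`
# (velocities in units of the reference speed). The peak equals the single stored
# value and the width is 1, so ∫ f dv / sqrt(pi) ≈ f0 up to discretization error.
expand_length1_velocity_dim(f0, v::AbstractVector) = f0 .* exp.(-v .^ 2)
```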
The special "chebyshev_pseudospectral_vperp" is now handled by having a special case for "vperp" in the "chebyshev_pseudospectral" setup, so the default for "vperp_discretization" should just be "chebyshev_pseudospectral".
The case when vpa and vz are not the same direction needs some special handling (for the neutrals) to convert between 1V and 3V. This is not implemented yet, so an error is raised in this case when `bzeta != 0`.
Have added tests for Krook collision operator and restart interpolation.
When not using parallel I/O, the `irank`/`nrank` loaded from the original output files should not be changed. Also check whether the current distributed parallelisation is the same as the original distributed parallelisation (changing this is only supported when using parallel I/O).
Now that the `discretization_info` abstract type is defined, the specific types for each implementation can be defined in the same module as the implementation. This makes more sense and simplifies the dependencies between different modules.
The method for setting up MPI-capable HDF5 has changed in recent versions of HDF5.jl. Update the docs to reflect the new method.
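For context, a sketch of the kind of setup recent HDF5.jl versions use (paths are placeholders and the exact steps depend on the installed versions; see the HDF5.jl and MPI.jl documentation for the authoritative instructions):

```julia
# Point MPI.jl and HDF5.jl at system libraries built with MPI support
# (run once; the choices are stored as package preferences).
using MPIPreferences
MPIPreferences.use_system_binary()

using HDF5
HDF5.API.set_libraries!("/path/to/libhdf5.so", "/path/to/libhdf5_hl.so")
```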
This makes it easier to read and to refer to, as the amount of information in the file has increased.
Allow interpolation between grids when restarting
Add an input option `recycling_fraction` that can be set to a value between 0 and 1 to control the fraction of the incoming ion flux at the wall boundaries that is recycled as neutrals.
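A sketch of the relation this option controls (names are illustrative, not code from this PR):

```julia
# Flux of neutrals re-injected at a wall, given the incident ion particle flux
# and the recycling fraction (0 <= recycling_fraction <= 1).
recycled_neutral_flux(ion_flux, recycling_fraction) = recycling_fraction * ion_flux
```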
Add source terms with a Maxwellian velocity profile (whose temperature is an input option) and various options for the spatial profile (currently constant, or Gaussian with a tunable width, in each of the r- and z-directions). Includes an option for a PI controller for the density: the controller sets the amplitude of the external source term to push the density (either the full profile or the midpoint value) towards a fixed value (a sketch of the controller idea is given below).
Used when `controller_type = "recycling"` in the `neutral_source` section.
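A minimal sketch of the density PI-controller step described above (names, signs, and the clipping are assumptions, not the implementation in this PR):

```julia
# One controller step: adjust the external source amplitude so the measured
# density is pushed towards the target. `P` and `I` are the proportional and
# integral gains; `integral` is the accumulated integral term carried between steps.
function pi_controller_step(density, density_target, integral, P, I, dt)
    err = density_target - density
    integral += I * err * dt
    amplitude = max(P * err + integral, 0.0)  # keep the source non-negative
    return amplitude, integral
end
```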
Otherwise, if the initial temperature was decreased, the notch could be about as wide as the distribution function, which would significantly distort the initial distribution.
...instead of `constant_ionization_source`. Also add moment-kinetic versions of the test.
The earlier-initialised values are only used during initialisation to calculate some advection speeds whose sign is then checked to set boundary conditions, so the differences caused by this error were small. Previously, however (for evolving-upar and evolving-ppar cases), uninitialised arrays were used, which could change randomly.
I guess the bug shouldn't make much difference in well-resolved examples, because the derivative both at …
@mrhardman should I just update the expected data for the …
I don't think so yet. To fully convince ourselves that all is well we should run the example case fokker_planck_relaxation.toml to a time t = 1000 / nuii and see that the Maxwellian is stable.
The only place these routines were used for the vperp dimension would have been in the calculation of the boundary data in the Rosenbluth potentials. It is conceivable that the mistake could have made a small difference. The only way to check is to change the boundary condition in my PR #149 and see if this breaks the test. I suspect it will not. Let me report back.
Then I am confused: I thought you were arguing that the boundary conditions were imposed correctly with vperp.bc = "zero" in the current revision, and that the error came from the application of the derivative! function. It sounds like you have localised the error to the enforce_vperp_boundary_condition!() function?
No, I'm saying that the updated vperp boundary condition is fine (I only reverted to the old one because the new one would throw an error saying …
While trying to do this investigation, I hit a post-processing error: `julia --project -O3 run_post_processing.jl runs/fokker-planck-relaxation`
Fixed now. |
I have done the checks described above and I am happy for you to revise the test with the new vperp.bc = "zero" default boundary condition. I think this should permit the merge with PR #149. |
Previous output was affected by a default `vperp_bc = "periodic"` which had unintended effects in `reconcile_element_boundaries_centered!()`, which is used by `derivative!()`, etc.
Only useful for debugging.
I've added a case with the collision operator to the automated debug checks. Hopefully this should help catch any shared-memory errors that we introduce in future!
This debug function is meant to emulate `get_best_ranges()`, but it only actually splits one region type. Previously, it treated all the other region types as serial regions, which meant that only the rank-0 process in each shared-memory block actually entered any `@loop_*` macro. In `get_best_ranges()`, however, only the parallelised dimensions get an empty range (`1:0`) on processes that have no work to do; for the other (non-parallelised) dimensions every process should loop over all the points. Treating the other region types as serial regions therefore meant that some code that was correct in non-debug mode failed in debug mode. This commit fixes the problem by making every process loop over all points in the non-parallelised dimensions.
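A sketch of the corrected range logic (names are illustrative): only the dimension actually being split gets the empty `1:0` range on processes with no work, while non-parallelised dimensions always cover all points.

```julia
# Return the loop range for one dimension on one process in the debug splitting.
function debug_loop_range(is_split_dim::Bool, this_proc_has_work::Bool, n::Int)
    if is_split_dim && !this_proc_has_work
        return 1:0   # empty range: another process handles these points
    else
        return 1:n   # non-parallelised dimension: every process loops over all points
    end
end
```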
...in `explicit_fokker_planck_collisions_weak_form!()`. Apparently it is necessary to have `looping.loop_ranges` in the correct state when wrapping around the end of the `@loop_s_r_z`.
The Fokker-Planck collisions do not do any spatially-dependent operations, so having just `nelement=1`, `ngrid=2` in the r- and z-directions should be enough to check for shared-memory errors. Reducing the number of spatial grid points from 5*5=25 to 2*2=4 massively speeds up the debug test, making it more practical to run (especially on the CI servers).
If no neutral species is present, then the debug checks for combinations of neutral dimensions (anything including `:sn`) would be no different to running in serial, so they can be safely skipped to save time.
Github.com's Ubuntu CI servers seem to be slower than they used to be, and are making the debug checks job time out.
09b8e84 to c1e6c9a
Nice work! This would be super useful for looking at improving the parallelisation. It looks like this test simulates a "sheath scenario". Do we want to develop a cheap check-in test along these lines (there's an example in the discussion of PR #149)? My experiments running the sheath simulations suggest that the "low hanging fruit" of parallelising over z, r, s in the collision operator loop (#140) would give a speedup of up to z.ngrid * r.ngrid compared to a distributed-memory run with z_nelement_local = 1 and r_nelement_local = 1, depending on the number of cores available.
The input was a copy of … I'm always in favour of more tests; is the 1D2V simulation quick enough to run as an automated test?
I think yes, if we can cut down the velocity resolution and still get a physical-looking solution. If we can use the same velocity resolutions as the current relaxation test, but just add a z domain, the runtime will scale roughly with the number of z points compared to the runtime of the current test. I'll run an experiment and post the plots here. EDIT: It looks like using very reduced resolutions, comparable to those in the relaxation test, leads to unstable sheath simulations in the present commit (and probably earlier ones too). Further work required.
ngrid=2 results in an out-of-bounds index error in Chebyshev derivatives.
The debug checks are taking a super long time now. I've got a 'fix' for that (it at least gets them back down to about 2 hrs...), but I'll add that in a separate PR. I think this might as well merge into PR #149 now. @mrhardman if there are any last tweaks to add, we can put them on that branch.