-
Notifications
You must be signed in to change notification settings - Fork 25
/
CHANGELOG
428 lines (301 loc) · 16.7 KB
/
CHANGELOG
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
----------------------------
Version 3.1.0, 3/2023
----------------------------
This version contains some major new features, including a new capability for
chaotic problems (Delta correction), more detailed timers for performance
analysis, and changes to more easily allow for distributed control variables
during optimization. Specifically, this release includes
- New Delta correction feature to accelerate MGRIT for nonlinear problems,
especially chaotic ones. See user manual and examples/ex-07 for more details,
including on (i) the reduced rank (storage) option and (ii) deferring Delta
correction to coarse levels for better efficiency. This new feature
builds a basis for the unstable Lyapunov manifold to accurately capture
the chaotic dynamics.
- Option for user-allocated MPI buffers. This is important if you need to
allocate buffers on a device/accelerator (GPU). See braid_SetBufAllocFree().
- Optional user sync function is called at the end (top) of each V-cycle, but
now with access to the temporal communicator for easier global analyses, e.g.,
of error estimates over the time dimension.
- Option to solve coarsest grid with relaxation only, which is particularly useful
for parabolic problems, see braid_SetRelaxOnlyCG().
- Option for more detailed timings. See braid_SetTimings(), where option 2
times all user functions and certain important parts of the MGRIT algorithm
like the coarse-grid solve and MPI Wait.
- Enhancements for easier distributed storage of control variables during while doing
optimization with MGRIT.
-> New ability to automatically always store last time point (whether F or C),
using ulast, see _braid_UGetLast()
-> New FinalFCRelax feature, that allows for a final FCRelax after Braid
cycling is finished. The done flag is True during this FCRelax, and thus
allows for easy computation of gradient information after Braid cycling.
If this feature is not used, the user still gets access to the solution
after Braid halts, but the coarse-grid correction from the final MGRIT
iteration has not been fully integrated on the finest grid.
-> New option for easily reverting processor ranks, to allow for backward sweeps
in time, see braid_SetRevertedRanks()
-> Adjoint variables are initialized with a non-physical time value of -1.0, to aid
in programming
-> braid_defs.h have commented out lines to allow for switch to single precision
- Reduced sensitivity of regression tests to numerical noise.
- Updated documentation to discuss new features, especially Delta correction.
----------------------------
Version 3.0.0, 8/2020
----------------------------
This version contains some major new features, a Python interface, and a new
file structure.
- Added a new periodic feature (activiated through BraidSetPeriodic) that
allows for solving time-periodic problems efficiently. The periodic
structure is replicated on all Braid levels. See examples/ directory
for example codes.
- Added a new Richardson-based extrapolation option (activated through
BraidSetRichardsonEstimation). This option assumes sufficient continuity
in time and then uses Richardson extrapolation to improve the accuracy
order in time on the finest grid. See examples/ for example codes.
- Added a new Richardson-based error estimation option (activated through
BraidSetRichardsonEstimation). This option also assumes sufficent
continuity in time and then uses Richardson-based techniques to give an
error estimate in time on the finest grid. See examples/ for example codes.
- Added a new Cython (Python) interface. See braid.pyx and the examples/
directory for how this works. In short, Braid can now be called and run
through Python, including mpi4py, while using other existing Python
libraries.
- Added weighted relaxation, which allows for weighting the C-relaxation as
one would for standard weighted-Jacobi (activated through braid_SetCRelaxWt).
Experiments show that this weight (when properly chosen) can reduce iterations
by 10%-30%, for the tested problems. Common weights are in the range of
1.0 to 2.0.
- Added an optional new user function called "Sync" that is called only once
per MPI task at certain points in the code, i.e., at the top of a Braid cycle,
and after FRefine. This is different from "Access" which is called for every
time-point that an MPI task "owns". Sync is designed to make it easier to
complete global operations, such as a reduce over the temporal communicator to
compute some quantity, e.g., a maximum error estimate.
- Split very large files, primarily _braid.c, into many smaller files. This
file had become too unwieldy and large. Also, renamed many files that began
with "_braid...". The goal here is to make the filenames easier to grok.
- Added more flexibility for FRefine (which implements FMG/Nested Iteration),
so that the user can more adaptively control options like the max number of
levels, the final number of levels, and when to initialize with a sequential
solution for debugging purposes.
- Updated Make system so that a default make doesn't include MFEM and that an
error message is printed if hypre isn't found. Hypre only affects some
example codes, not Braid itself.
Also,
- Incorporated various bug fixes, including some memory bugs in FRefine
- Updated documentation to discuss new features (weighted-relaxation,
Richardson-based error estimation and extrapolation, periodic
functionality, Cython interface, and Sync function.)
----------------------------
Version 2.3.0, 7/11/2018
----------------------------
- New adjoint feature added by Stefanie Guenther, allowing for
the solution of PDE-constrained optimization problems. This
feature allows XBraid to compute sensitivities (i.e., gradients)
to be used in an outer optimization loop. See the new examples
examples/ex-01-adjoint.c
examples/ex-01-optimization.c
examples/ex-04.c
- Cleaned up root source directory
- Moved source files into braid/braid directory
- Moved user_utils directory into `misc` folder
- Added regression test for MacBooks (especially Jacob's MacBook)
test/machine-mac.sh
- Updated documentation, and moved code hosting to GitHub
- Fixed some small bugs in the examples and drivers
- Removed limitations on the temporal refinement factor
----------------------------
Version 2.2.0, 10/02/2017
----------------------------
- New example examples/ex-01-refinement which highlights using
temporal refinement based on an error estimator. The "RDF Hack"
that put refinement in ex-03 has been removed.
- Removed restriction on the user-returned temporal refinement factor from
Step(). Previously, this value was limited to be less than the coarsening
factor. The temporal refinement factor can now be arbitrary.
New regression tests were added to verify this feature.
- Updated documentation to reflect new features and new debugging advice.
- Several bug fixes in drivers regarding CFL number computations and PFMG
iteration tracking
----------------------------
Version 2.1.0, 10/18/2016
----------------------------
- Updated documentation to reflect new features
- Added user-defined initial time-grids. Allows for non-uniform
initial grids, and for bit-wise equivalent time-grids between codes.
- New 1D heat equation example, which is an excellent teaching
tool for learning XBraid. examples/ex-02.c
- New linear elasticity MFEM driver, drivers/drive-lin-elasticity.cpp
- New scalar ODE example showing how to use the shell-vector option
with BDF2, ex-01-expanded-bdf2.c
- Renamed and reorganized examples and drivers directories.
So that all "research and development" files are in drivers,
and examples will be a stable set of files for instruction and
introducation to XBraid.
Also, reworked the examples for consistency in appearance, functionality and
style.
- Changed make system so that the sequential option is now enabled with
make sequential=yes
similar to the other two options
make debug=yes|no
make valgrind=yes|no
- Added a braid test to test the compatibility of Residual()
and Step(), i.e., that
f - Residual( Step(u_{i-1}, f_i), u_{i-1}) \approx 0
- Updates and bug fixes to the adaptive refinement in time capabilities
- Unified the status structure implementation
- Updated Fortran interface to support missing functions
----------------------------
Version 2.0.0, 05/25/2016
----------------------------
- Updated documentation to reflect new features
- Changed the name of "Phi" to "Step", because this is more
descriptive of the function's purpose, especially as it
relates to the new residual function.
- Added major new features
Flexible storage options, allowing for storage at only C
or at C and F points. Extra storage gives you better initial
guesses for implicit solves. Storage=-2 is compatability mode
with the original XBraid. Storage=-1 (storage only at C-points)
is the default.
Support for adaptive refinement in time (see FRefine function)
This was not an easy implementation!
Support for user-defined residual computations (see User's Manual
for description). This can result in computational savings.
Support for the new Shell feature which allows for storing meta
data at all time points (even if storage is turned off at F-points).
This is useful, for instance, for BDF methods. See examples.
Support for skipping work during the first XBraid down-cycle
Ability to visualize cycling with cycleplot.py (see user_utils)
Ability to compile without MPI (see makefile.inc and make sequential)
Ability to compute residual at all points, F and C, as opposed to the
default with only computes the residual at C-points.
Added Fortran interface, see ex-01-f.f90. Currently, it is turned on
manually in braid.h with 3 pound-defines at the top of that file.
- New examples: Burger's equation in 1D and the Lorenz attractors
- Set size_ptr prior to calling bufpack such that: size_ptr = 0 at entry
to bufpack indicates the message will be used for completing a time
step on the next processor. At entry size_ptr = 1 indicates the message
will be used for load balenceing in FRefine.
Reason: Far less information is needed when completing a time step
than when moving an entire vector to a new processor. In my
case each vector saves its entire spatial heirachy of meshes.
To complete a time step on the next processor the message only
needs to contain the current function and the current mesh. To
move an entire vector to a new processor the message must contain
the fine grid solution and mesh + all other coarse grid messages.
- Updates to the MFEM C++ interface and drivers,
drive-05.cpp is a new DG advection diffusion driver, supporting higher-order
artificial dissipation
drive-06 is a nonlinear diffusion (p-Laplacian) example
drive-10 is a nonlinear elasticity driver
1D heat equation with moving mesh driver
- Moved the additional "hypre" functions from drive-04.cpp to hypre_extra.hpp, so
they can be used in other drivers.
- Modification of the access_level. Now, we have:
- access_level == 0 : do nothing
- access_level == 1 : call at the end of the simulation (fine grid)
- access_level == 2 : call at the end of each iteration and end of simulation (fine grid only)
- access_level == 3 : same as other, plus call in FRestrict, FInterp and FRefine (on all levels).
- New regression tests, for among other things,
braid_GetSpatialAccuracy
braid_GetRNorms
braid_StepStatusGetTol
braid_StepStatusGetIter
braid_StepStatusGetRNorms
braid_StepStatusGetOldFineTolx
braid_StepStatusSetOldFineTolx
braid_StepStatusSetTightFineTolx
Note that the Get|Set.*Tolx routines
are tested implicitly with the tests on
braid_GetSpatialAccuracy
- Changed util.h|c to _util.h|c These files only contain Braid internal
routines and were as such improperly named.
- Simplified greatly the cycling logic in braid.c::Drive()
- Allows for interactive printing of cycle information with
_braid_DriveUpdateCycle() to the cycle.outfile which is
by default with 'braid.out.cycle'
- The cycling information in this file can be plotted with
cycleplot.gnuplot file
- There is a mpistubs.h|.c for purely serial compilations
- Fixed some bugs in _braid.c and especially in _braid_GetUInit() regarding
the various storage options, where storage=-2 is compatability mode with
old braid, and -1 and 0 use more current information. There is a long comment
at the top of FRestrict() that outline some of the reasoning behind the issues
addressed in this commit.
To test the bug fixes a new option was added braid_SetSeqSoln() which allows
the user to initialize the problem with the solution from sequential time stepping.
This is there just for debugging, but can be quite useful.
- Added a number of new features in drive-06 for the nonlinear parabolic
paper. These integrate some changes from Ben over the summer, like
tracking the number of Newton iterations, varying the Newton tolerance
and capping the number of Newton and BoomerAMG iterations. These options
are all available from the command line.
- Added new information to status structures
- Switched "isnan()" to our own in-house "braid_isnan()" macro. This is for
better portability.
- Added support for cfactor=1. I believe this one line change is required in
_braid.c I also added some regression tests to test cfactor=1 on the finest
grid.
- Cleaned up how the core->rnorms array is accessed with three new accessor
functions
_braid_AppendResidual
_braid_GetRNorm
_braid_DeleteLasResidual
The number of residual norms available depends on the skip option and the
iteration number. Sometimes there are resdiual norms available, sometimes not.
And the number available does not equal the number of iterations. So, these
functions make all that easier.
- Fixed a memory problem with the rnorms array when MaxIter is changed by the user.
- Added NaN check for braid residuals to stop iterating.
- Updated drive-02 so that it uses the storage and new residual options.
- A tolerance <= 0.0 now means that iterations will proceed until max_iter is
reached.
- Updated accuracy handle
----------------------------
Version 1.1.b, 03/10/2015
----------------------------
- Updated documentation to reflect new features
- Split examples directory into "examples" and "drivers"
examples holds simple user-friendly examples. Currently that's
ex-01 and ex-02, a scalar ODE and heat equation example, respectively.
ex-02 is a simplified example that only does backward Euler and corresponds
to ex-02-serial which is a mock-up of a simple user application that we
parallelize in time with ex-02.
drivers holds more sophisticated drivers that we use for research
This is where all the old drivers (drive-02, drive-03, drive-04,
drive-05 and hyperdrive) ended up.
The old drive-02 was eliminated, and replaced by the old drive-05.
These drivers duplicated functionality.
Associated regression tests were added for the new examples
- New information available through the Status structures in Phi,
like the current XBraid level.
- Drive-05 now supports spatial parallelism and spatial coarsening on
regular grids.
Also, changed the drive-05 interface to more easily allow playing around
with varying coarse grid CFL numbers. Also added a couple comments to
_braid.c that better explain the grid structure's data elements.
----------------------------
Version 1.0.b, 10/27/2014
----------------------------
- Implements basic MGRIT algorithm, see newly written
User's and Developer's Manuals for overview
- Decided on Git repository for collaboration and development of XBraid
- Developed logo
- Decided on name "XBraid" after too many conflicts with "Warp",
our first choice
- Developed C++ interface, see braid.hpp which implements abstract
base classes for wrapping Phi(), Clone(), ..., and then see
drivers/drive-04.cpp which uses the interface
- Basic driver and example functionality for the heat equation
(drive-04.cpp, drive-03.c, ...)
- Added wrapper test functions to test things like Sum(), Access(), ...
- Developed suite of regression tests (see test directory)
- Added Copyright and License statements for LGPL 2.1
- Allow for 1-norm and inf-norm halting options
- Implemented Status structure interface which allows for us to
- Added some debug level printing to allow for tracking the residual
value over each interval, see braid_SetPrintLevel
- Added ability to do enhanced F-cycles (i.e. mutliple V-cycles at each level
in an FMG cycle)
- Made design decisions for the user to include PDE forcing inside of Phi().
XBraid implements FAS, so this should be fine.