New Example CI + Fixes to some broken example! #525

okBrian · 2024-07-17T19:45:24Z

Description

Added New Example CI, - Average runtime - 30 minutes

Why MacOS runner?
From my initial testing its much faster than the ubuntu images. Some examples ran over two times faster on the MacOS runner

Why 15625, (125, 125), or (25, 25, 25) for cell boundaries?
Some 3D examples break if m < 25, and this seems like a good spot to put examples so they don't take too long

Fixes #474

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)

Scope

This PR comprises a set of related changes with a common goal

How Has This Been Tested?

Github Actions Run all example with CI
Run all examples locally

Test Configuration:
gcc 14, Ubuntu 22,04.4 LTS
Github Action Macos Runner

Checklist

I have added comments for the new code
I ran ./mfc.sh format before committing my code
This PR does not introduce any repeated code (it follows the DRY principle)
I cannot think of a way to condense this code and reduce any introduced additional line count

…nto exampleCI2

…pleCI2

okBrian · 2024-07-17T19:59:16Z

I'm not exactly sure whats wrong with the test suite, but I think if I add the --generate flag they should be okay. Do we want the test suite to even test for examples because that would add another 30 minutes or more to them. Also I need to remove the -r flag from the example suites but it looks like it worked fine.

sbryngelson · 2024-07-17T20:49:40Z

@okBrian we can add this as a separate GitHub workflow (Examples-test.yml, Example Smoke Test).

sbryngelson · 2024-07-17T20:49:59Z

the -r --generate things I don't understand at the moment

sbryngelson · 2024-07-17T20:55:50Z

@okBrian It's important that these cases run and produce output that isn't nonsense. So you can run the "example suite" with ./mfc.sh test —a (+ your flags, I guess?), where -a already exists. ' This runs the files through post_process and then checks for NaNs/Infinity.

If parallel_IO is not true then I'm not sure if -a works or not. You may need to turn it on via your toolchain tricks.

henryleberre · 2024-07-17T21:02:50Z

I'm not exactly sure whats wrong with the test suite, but I think if I add the --generate flag they should be okay. Do we want the test suite to even test for examples because that would add another 30 minutes or more to them. Also I need to remove the -r flag from the example suites but it looks like it worked fine.

If you specify --generate it will compare the pre-process and simulation results against itself (not with the reference values from this PR). So you shouldn't pass --generate.

Whether we should or should not use the -r (--relentless) flag is debatable but we have currently opted not to use it so we should probably keep it that way. The main advantage is that we don't spend CI resources running more tests if one already failed and the workflow is marked as failed right away when the first test case fails.

…1 on examples, fixed m_patches typo. todo: fix 1D bubble cases

sbryngelson · 2024-07-27T02:36:04Z

toolchain/mfc/test/test.py

@@ -180,7 +181,7 @@ def _handle_case(case: TestCase, devices: typing.Set[int]):
                raise MFCException(f"Test {case}: {msg}")

    if ARG("test_all"):
-        case.delete_output()
+        # case.delete_output()


might want to put this back?

It breaks the 1D bubble cases. Not sure why maybe its deleting the D folder I'm creating.

perhaps associated with current bug in CI that if it has to 'retry' a simulation, it will try to create a directory that already exists

okBrian · 2024-07-27T02:44:54Z

I am expecting the code to not work with --no-mpi so thats another thing to debug... I think everything with mpi should pass hopefully.

sbryngelson · 2024-09-21T03:43:45Z

@okBrian you should be able to login to Frontier to see why your CI is failing. It is failing at build (link) time. It might be due to a change you made in the src/, or it could be due to the flags you added to CMake. It might be easier to debug Frontier by changing the flags and rerunning CI or just doing it on your own after login into Frontier and building there.

sbryngelson · 2024-09-21T03:45:57Z

@okBrian, I also noticed that it failed in some cases with GNU and Intel compilers. This suggests that the flags might not be doing what they should (or you didn't add enough). One way to be sure you "have enough" is just using -O0 for all --strict builds and ensuring that passes. If that doesn't work, then there's something else awry.

sbryngelson · 2024-09-25T13:06:02Z

CMakeLists.txt

@@ -186,13 +207,15 @@ elseif ((CMAKE_Fortran_COMPILER_ID STREQUAL "NVHPC") OR (CMAKE_Fortran_COMPILER_
    if (CMAKE_BUILD_TYPE STREQUAL "Debug")
        add_compile_options(
            $<$<COMPILE_LANGUAGE:Fortran>:-O0>
+            -C 


i'm not sure this syntax works. you could double check by looking at the cmake output of a specific run. i think you need to add
$<$<COMPILE_LANGUAGE:Fortran>:-<flag>> on separate lines.

I will fix this in my next commit. Thanks

okBrian · 2024-09-25T16:12:05Z

With the most recent commits these are the problematic examples
MacOS, MPI, Debug, False
2D -> Example Viscous Error: 2.51E-02
1D -> Example -> sodHypo Error: 1.01E-04

Ubuntu MPI. no-Debug, True
1D -> Example -> hypo_2materials Error: 1.74E-04
2D -> Example -> ibm_cfl_dt Error: 1.51E-04
1D -> Example -> sodHypo: Error: 1.01E-04
2D -> Example -> viscous Error: 2.51E-02
2D -> 1 Fluid(s) -> IBM Error: 1.33E-10
2D -> 1 Fluid(s) -> Viscous -> IBM Error: 2.21E-10
2D -> 1 Fluid(s) -> Viscous -> IBM -> model_eqns=3 Error: 2.21E-10
2D -> 2 Fluid(s) -> IBM Error: 5.95E-10

Ubuntu Mpi, Debug, False
2D -> Example -> ibm_cfl_dt Error: 1.65E-04

GT CPU Pheonix
2D -> Example -> laplace_pressure_jump -> NaN's detected in Case

sbryngelson · 2024-09-25T16:20:54Z

@okBrian can you visualize the output of a few of these cases (compared to the cases that are 'working')? I suspect that, given how diverse they are, some of them are just numerically unstable and eventually produce very large numbers that different compilers handle differently.

okBrian · 2024-10-28T17:44:31Z

Remaking PR

henryleberre and others added 15 commits July 11, 2024 01:14

MFlowCode#474: Kickstart

22abc46

Merge branch 'henry/474' of https://github.com/henryleberre/ChemMFC i…

f1bd842

…nto exampleCI2

test example workflow

6c0af87

fix example

5bd78e9

test

e0e3cdc

test 2

264fede

yay

7298bf8

Merge branch 'MFlowCode:master' into exampleCI2

c3af570

final changes

ff4290d

Merge branch 'exampleCI2' of https://github.com/okBrian/MFC into exam…

fd7f68e

…pleCI2

final v2

3e85cf2

fianl v3

8e302e9

final v4

01c069c

final v5

9bb0796

added a comment

9fb6279

okBrian requested review from sbryngelson and henryleberre as code owners July 17, 2024 19:45

henryleberre and others added 3 commits July 18, 2024 00:41

Switch to fastjsonschema

112bb89

fixed tol, reduced TestSize, fixed parralel_io w/o mpi, fixed format …

86d5db2

…1 on examples, fixed m_patches typo. todo: fix 1D bubble cases

fixed remaining 4 broken example, todo fix no-mpi possibly

2936a95

sbryngelson reviewed Jul 27, 2024

View reviewed changes

merge w/ master

b940337

fixed new broken examples & temp fix D error

8ee7ca2

okBrian marked this pull request as draft July 28, 2024 01:48

sbryngelson linked an issue Sep 21, 2024 that may be closed by this pull request

Building with strict floating point operations for testing purposes #622

Open

regenerate all files and fix frontier

98f9b3d

sbryngelson reviewed Sep 25, 2024

View reviewed changes

okBrian added 3 commits September 30, 2024 02:23

Merge remote-tracking branch 'origin' into exampleCI2

99465ee

fixed time_step_save issue

b4e958f

mimi pr for exampleCI

fdd5162

This was referenced Oct 9, 2024

WENO7 #638

Merged

Valgrind verrou to check for sketchy float operations #650

Open

okBrian force-pushed the exampleCI2 branch from 7bfe57d to bfceed7 Compare October 18, 2024 03:15

test lower tol + regenerating with debug

e1ac8f4

okBrian force-pushed the exampleCI2 branch from bfceed7 to e1ac8f4 Compare October 18, 2024 03:19

okBrian and others added 13 commits October 18, 2024 00:32

regenerate hypo_2materials?

f34a911

--regenerate with strict flag

7d90327

add strict back to workflow

aad1300

Merge branch 'MFlowCode:master' into exampleCI2

480c76d

inital viscous edit

f27d0ce

regen viscous

1d47274

test 2525 on exampleCI2

4c9636d

remove cases for merge

04a901a

remove strict flag & retest with default flags

c4db9d6

w/ correct weno_eps for testing

527a859

fix removal of strict in workflow

0e4fe57

close to done?

33ab525

remove strict on frontier

1c99f12

okBrian closed this Oct 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New Example CI + Fixes to some broken example! #525

New Example CI + Fixes to some broken example! #525

okBrian commented Jul 17, 2024

okBrian commented Jul 17, 2024

sbryngelson commented Jul 17, 2024

sbryngelson commented Jul 17, 2024

sbryngelson commented Jul 17, 2024

henryleberre commented Jul 17, 2024

sbryngelson Jul 27, 2024

okBrian Jul 27, 2024

sbryngelson Jul 27, 2024

okBrian commented Jul 27, 2024

sbryngelson commented Sep 21, 2024

sbryngelson commented Sep 21, 2024

sbryngelson Sep 25, 2024

okBrian Sep 25, 2024

okBrian commented Sep 25, 2024

sbryngelson commented Sep 25, 2024

okBrian commented Oct 28, 2024

New Example CI + Fixes to some broken example! #525

New Example CI + Fixes to some broken example! #525

Conversation

okBrian commented Jul 17, 2024

Description

Type of change

Scope

How Has This Been Tested?

Checklist

okBrian commented Jul 17, 2024

sbryngelson commented Jul 17, 2024

sbryngelson commented Jul 17, 2024

sbryngelson commented Jul 17, 2024

henryleberre commented Jul 17, 2024

sbryngelson Jul 27, 2024

Choose a reason for hiding this comment

okBrian Jul 27, 2024

Choose a reason for hiding this comment

sbryngelson Jul 27, 2024

Choose a reason for hiding this comment

okBrian commented Jul 27, 2024

sbryngelson commented Sep 21, 2024

sbryngelson commented Sep 21, 2024

sbryngelson Sep 25, 2024

Choose a reason for hiding this comment

okBrian Sep 25, 2024

Choose a reason for hiding this comment

okBrian commented Sep 25, 2024

sbryngelson commented Sep 25, 2024

okBrian commented Oct 28, 2024