Prepare to remove actions hooks #1038

danielrbradley · 2024-07-12T12:56:46Z

This will allow us to remove all of our preTest hooks in provider ci-mgmt configs - which we'd like to get rid of to avoid arbitary code injection into workflows which make them fragile and hard to refactor. See also #1037

Related to #936

Add standalone option for running provider integration tests

Set integrationTestProvider: true to run go test [...] -tags=${{ matrix.language }} in the provider directory.

Example of existing preTest usage: https://github.com/pulumi/pulumi-gcp/blob/92b64b51bfb05fc0d2d7b9c1cbcafe7de8a6f7f3/.ci-mgmt.yaml#L43

Always prepare upstream before testing

This step is safe to run, as we already do before the Go build steps elsewhere. The upstream target is always there but might be a no-op.

Add SSH setup option

Set sshPrivateKey: ${{ secrets.PRIVATE_SSH_KEY_FOR_DIGITALOCEAN }} to set up SSH for testing.

Example of docker use of preTest: https://github.com/pulumi/pulumi-docker/blob/fc1d68f823c34cef72e34e060b093230e21fc636/.ci-mgmt.yaml#L35

Migration

Merge this PR with intent-focused config options
Open PRs to the ~28 providers to replace the preTest hook with alternative settings
- Set integrationTestProvider: true on all providers which run integration tests via the provider module
- Use setup-script for arbitrary bash commands before tests
- Set sshPrivateKey: ${{ secrets.PRIVATE_SSH_KEY_FOR_DIGITALOCEAN }} for docker
Remove the actions config completely

Existing usage of actions hooks (there is no use of preBuild - only preTest): https://github.com/search?q=org%3Apulumi+path%3A.ci-mgmt.yaml+%22actions%3A%22&type=code

- Prepare upstream if we're running tests via the provider module. - Run `make upstream` before running any extra `setup-script`. - Only run provider integration tests on PRs if we're in "local" mode and not "pulumiExamples". - Format main test command to avoid confusing line break.

This addresses the use of the preTest hook in the docker provider.

blampe · 2024-07-12T16:26:48Z

pulumi/pulumi-docker-build#82 is another example where we need some similar extensibility for a native provider. I didn't want to just shoehorn an if provider === "docker-build" ... exception -- I'd much rather have a real extension point and use that, so I'm happy you're thinking about this!

which we'd like to get rid of to avoid arbitary code injection into workflows which make them fragile and hard to refactor. See also #1037

I wonder if there's a better way to strike a balance between provider repos still having some control over their prerequisites while not being so tightly coupled to ci-mgmt. We can certainly add specific options like sshPrivateKey:, injectEnvVarsForDockerBuild:, etc., but in the limit we'll end up with a ton of one-off special cases. That adds up to a lot of accidental complexity -- ci-mgmt needs to know more about provider internals than it really should, and changing those internals becomes a coordination problem between the provider and ci-mgmt. Additionally, any time ci-mgmt needs a new option added it represents some friction that will nudge people towards hacky workarounds, inaction (as I've done with docker-build), or forks.

I think ideally provider authors should be able to "own" some of their CI setup without needing to know or care about how ci-mgmt orchestrates things. This is a big problem area, but one simple strategy we could consider is to treat make targets as more of a contract between the provider and ci-mgmt. Concretely:

ci-mgmt ships a default Makefile to your repo, with all the build targets it expects to be defined, along with reasonable defaults.
The Makefile can do something like -include Makefile.local to add local overrides from the repo, if they exist.
The provider can customize a make ci-setup target in Makefile.local with their own logic if they need it.

You wouldn't be able to add arbitrary GH actions with that approach, but I agree that should probably be a non-goal anyway. Often times the GH actions we're talking about can be replicated with a few shell commands or a simple script. For example the docker-build provider has the same SSH key requirement as the docker provider, but that's addressable with a little bit of code that works equally well locally as in CI.

blampe · 2024-07-12T16:39:23Z

provider-ci/internal/pkg/templates/bridged-provider/.github/workflows/main.yml

+    #{{- if .Config.integrationTestProvider }}#
+    - name: Run provider tests
+      working-directory: provider
+      run: go test -v -json -count=1 -cover -timeout 2h -tags=${{ matrix.language }} -parallel 4 . 2>&1 | tee /tmp/gotest.log | gotestfmt
+    #{{- end }}#


This is unit testing the provider, no?

It's natural for providers to have slightly different ways to test themselves (for example I think it's a mistake to assume the provider subdiretory). Simplifying this to something like make test_provider_ci (or whatever) could accomplish the same while still leaving the door open for some repo-specific behavior.

This is running the language-specific tests within the provider. I think these are basically all tests which depend on either the SDKs or having access to deploy test infrastructure. These are specified by tags such as nodejs similar to the examples. These tests are excluded from running by default when running the normal provider unit tests.

Oh man I missed the language tag, that's weird! Is this maybe just an instance of someone dropping E2E tests under provider instead of examples?

Correct - which is what we already have in a few providers. I think all the replay tests are in the provider folder, for example. This just formalises what's already being done into a simple option rather than leaning on providers to inject their own custom steps and matrix options.

danielrbradley · 2024-07-12T17:00:10Z

pulumi/pulumi-docker-build#82 is another example where we need some similar extensibility for a native provider. I didn't want to just shoehorn an if provider === "docker-build" ... exception -- I'd much rather have a real extension point and use that, so I'm happy you're thinking about this!

Indeed, provider-specific switches should be an absolute last resort! I think there's normally a way to generalise what we're trying to achive with a specific provider to describe the intent better. My leaning is that more smaller, more focused options are easier to reason against compared to the arbitrary hooks which then have to be carefully audited before each refactor.

I wonder if there's a better way to strike a balance between provider repos still having some control over their prerequisites while not being so tightly coupled to ci-mgmt. We can certainly add specific options like sshPrivateKey:, injectEnvVarsForDockerBuild:, etc., but in the limit we'll end up with a ton of one-off special cases. That adds up to a lot of accidental complexity -- ci-mgmt needs to know more about provider internals than it really should, and changing those internals becomes a coordination problem between the provider and ci-mgmt. Additionally, any time ci-mgmt needs a new option added it represents some friction that will nudge people towards hacky workarounds, inaction (as I've done with docker-build), or forks.

I agree we don't want too much sprawl, though right now I think we only need to introduce these 2 new options (integrationTestProvider and sshPrivateKey) in addition to the existing setup-script option.

I think ideally provider authors should be able to "own" some of their CI setup without needing to know or care about how ci-mgmt orchestrates things. This is a big problem area, but one simple strategy we could consider is to treat make targets as more of a contract between the provider and ci-mgmt. Concretely:

ci-mgmt ships a default Makefile to your repo, with all the build targets it expects to be defined, along with reasonable defaults.

The Makefile can do something like -include Makefile.local to add local overrides from the repo, if they exist.

The provider can customize a make ci-setup target in Makefile.local with their own logic if they need it.

I like this idea - adding the more custom hooks via the makefile - it's somewhat easier to reason about the impact compared to changing CI workflows.

I discussed trying to formalise the interface between local builds and CI a while back in the provider build systems design.

Another simple way of implementing this could be via having the option to add scripts to call as part of different steps. E.g. scripts/provider_prebuild.sh or scripts/test_setup.sh. Ideally though I think we still want to lean towards implementing these cases for broader consumption. If a feature is needed in one provider, it's pretty likely that another will need something similar too at some point. Another option too is for some providers just to opt out of using the makefile generation and opt to manage that themselves, though this would be useful for bringing in new providers it would be quite a step backwards for bridged providers.

Leaning towards injecting commands rather than injecting actions is where I'm pushing towards for the near-future.

thomas11

I'm a bit concerned about this as well.

I wonder if there's a better way to strike a balance between provider repos still having some control over their prerequisites while not being so tightly coupled to ci-mgmt. We can certainly add specific options like sshPrivateKey:, injectEnvVarsForDockerBuild:, etc., but in the limit we'll end up with a ton of one-off special cases. That adds up to a lot of accidental complexity -- ci-mgmt needs to know more about provider internals than it really should, and changing those internals becomes a coordination problem between the provider and ci-mgmt. Additionally, any time ci-mgmt needs a new option added it represents some friction that will nudge people towards hacky workarounds, inaction (as I've done with docker-build), or forks.

Especially for SSH which is only used by one provider, correct?

I wouldn't block on this PR but maybe there are other options?

danielrbradley · 2024-07-23T15:29:15Z

Especially for SSH which is only used by one provider, correct?

I wouldn't block on this PR but maybe there are other options?

It is only currently used by one provider, though will probably need it for at least one more as we onboard the other providers.

There's already the .setup-scriptconfig option we can use here which allows a custom set of bash commands to be run before executing the tests in CI. I don't love the name, but this option for injecting arbitary command is easier to reason about than injecting GHA steps as it limits access to only the local file system and not for uploading/downloading assets or adding action dependencies etc.

SSH feels like a generic enough option to be generally useful.

thomas11

LGTM, discussion point about extensibility addressed in sync review

danielrbradley requested a review from a team July 12, 2024 12:56

danielrbradley self-assigned this Jul 12, 2024

danielrbradley force-pushed the test-prepare-upstream branch from b157648 to 13f6190 Compare July 12, 2024 12:59

danielrbradley changed the title ~~Always prepare upstream before testing~~ Prepare to remove actions hooks Jul 12, 2024

danielrbradley removed the request for review from a team July 12, 2024 13:37

danielrbradley marked this pull request as draft July 12, 2024 13:43

danielrbradley force-pushed the test-prepare-upstream branch from 13f6190 to 4beba4e Compare July 12, 2024 14:31

Add sskPrivateKey option for test setup

ea2b0d9

This addresses the use of the preTest hook in the docker provider.

danielrbradley marked this pull request as ready for review July 12, 2024 15:04

danielrbradley requested a review from a team July 12, 2024 15:05

Apply to nightly-test workflow

a8d1fff

This was referenced Jul 12, 2024

Remove use of actions hooks #1039

Draft

Extract shared test workflows and action #1037

Draft

blampe reviewed Jul 12, 2024

View reviewed changes

danielrbradley requested review from a team and blampe July 23, 2024 13:34

thomas11 reviewed Jul 23, 2024

View reviewed changes

thomas11 approved these changes Jul 24, 2024

View reviewed changes

mjeffryes modified the milestone: 0.107 Jul 24, 2024

danielrbradley merged commit fba1faf into master Jul 24, 2024
5 checks passed

danielrbradley deleted the test-prepare-upstream branch July 24, 2024 19:48

mjeffryes added this to the 0.108 milestone Aug 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prepare to remove actions hooks #1038

Prepare to remove actions hooks #1038

danielrbradley commented Jul 12, 2024 •

edited

Loading

blampe commented Jul 12, 2024

blampe Jul 12, 2024

danielrbradley Jul 12, 2024

blampe Jul 12, 2024

danielrbradley Jul 22, 2024

danielrbradley commented Jul 12, 2024

thomas11 left a comment

danielrbradley commented Jul 23, 2024 •

edited

Loading

thomas11 left a comment

Prepare to remove actions hooks #1038

Prepare to remove actions hooks #1038

Conversation

danielrbradley commented Jul 12, 2024 • edited Loading

Add standalone option for running provider integration tests

Always prepare upstream before testing

Add SSH setup option

Migration

blampe commented Jul 12, 2024

blampe Jul 12, 2024

Choose a reason for hiding this comment

danielrbradley Jul 12, 2024

Choose a reason for hiding this comment

blampe Jul 12, 2024

Choose a reason for hiding this comment

danielrbradley Jul 22, 2024

Choose a reason for hiding this comment

danielrbradley commented Jul 12, 2024

thomas11 left a comment

Choose a reason for hiding this comment

danielrbradley commented Jul 23, 2024 • edited Loading

thomas11 left a comment

Choose a reason for hiding this comment

danielrbradley commented Jul 12, 2024 •

edited

Loading

danielrbradley commented Jul 23, 2024 •

edited

Loading