[DO NOT MERGE] Add one-pager for Terraform registry scrapers and registry metadata #2956

Closed

ulucinar wants to merge 9 commits into crossplane:master from ulucinar:fix-terrajet-203

Contributor

ulucinar commented Mar 10, 2022

Description of your changes

Fixes crossplane/terrajet/issues/203

With this one-pager, we would like to discuss how metadata about native Terraform resources can be scraped from the Terraform registry, processed, and used in Terrajet codegen pipelines to produce example manifests, CRD documentation, and other artifacts.

We believe such metadata has great potential to improve both the user experience and the quality of Crossplane providers generated with Terrajet (e.g., through auto-generated CRD documentation).

We aim to gather community feedback and start a discussion to discover new use cases for such metadata in Terrajet-based providers.

I have:

Read and followed Crossplane's contribution process.
Run make reviewable to ensure this PR is ready for review.
Added backport release-x.y labels to auto-backport this PR if necessary.

How has this code been tested

N.A.

NOTE: We are considering moving this proposal to https://github.com/crossplane/terrajet after the review process is completed.


          Add one-pager for Terraform registry scrapers and registry metadata

99a70b1

enhanced Terrajet codegen pipelines

- Fixes crossplane/terrajet/issues/203

Signed-off-by: Alper Rifat Ulucinar <[email protected]>

ulucinar force-pushed the fix-terrajet-203 branch from 7c9f0a5 to 99a70b1 Compare

March 10, 2022 01:05

ulucinar mentioned this pull request

One Pager - Metadata Extraction from Terraform Registry for Terrajet-based providers crossplane/terrajet#203

Open

turkenh requested review from turkenh, ezgidemirel, muvaf, sergenyalcin and AaronME

March 25, 2022 09:17

ulucinar mentioned this pull request

Figure out how to get CRD documentation from Terraform crossplane/terrajet#92

Open

muvaf reviewed

design/one-pager-terrajet-metadata-extraction.md Outdated

+                implementations to be able to fetch metadata from different sources but the
+                Terrajet pipelines will always be working on a well defined format regardless
+                of how those metadata are scraped.
+              - We would like to have the scrapers run as needed, produce their output in the

Member

muvaf Apr 6, 2022

I think this specific item leaks proposal details into the goals section. IMO, the goals section as a whole should be a bit higher level and allow alternative proposals to achieve the same goals.

Contributor Author

ulucinar Apr 22, 2022

Thanks @muvaf for the detailed review.
I have restructured the Goals section and removed the proposal hints from it.

design/one-pager-terrajet-metadata-extraction.md Outdated

+                run each time with a `make generate`, just like the existing codegen pipelines
+                we have. This would allow us to separate the lifecycles of metadata-scraping
+                and code generation.

Member

muvaf Apr 6, 2022

A Proposal section could be helpful for readers here to understand the whole solution before jumping to the metadata format which is one of the implementation details of the solution.

Member

muvaf Apr 6, 2022

Listing the metadata that is available in the TF registry would also be very valuable, as it helps readers think about future expansions.

Contributor Author

ulucinar Apr 22, 2022

I have added a top-level proposal section giving a high-level overview and made the existing proposal sections subsections of this top-level section.
I have also extended the discussion on available metadata in the Terraform registry.

design/one-pager-terrajet-metadata-extraction.md Outdated

+              ### Metadata Format
+              The proposed syntax for scraped metadata documents is YAML as we would also like
+              the metadata to be human readable, searchable and maintainable, if needed. A

Member

muvaf Apr 6, 2022

What is meant by maintainable?

Contributor Author

ulucinar Apr 22, 2022

I have added example maintenance tasks for the scraped metadata in the document.

design/one-pager-terrajet-metadata-extraction.md

+              `azurerm_analysis_services_server` of the native Terraform provider
+              [terraform-provider-azurerm] could be as follows:
+              ```yaml

Member

muvaf Apr 6, 2022

Having an example definitely helps to understand what goes where but I wonder if we can have an API reference before the examples to show the schema of the YAML.

Contributor Author

ulucinar Apr 22, 2022

I think a full example (one that contains all keys) with comments serves our purposes better here than a formal notation (like BNF or an OpenAPI schema). This is not intended to be a formal specification; the purpose is rather to convey the idea.

design/one-pager-terrajet-metadata-extraction.md

+                      importStatements:
+                          - terraform import azurerm_analysis_services_server.server /subscriptions/00000000-0000-0000-0000-000000000000/resourceGroups/resourcegroup1/providers/Microsoft.AnalysisServices/servers/server1
+              ```

Member

muvaf Apr 6, 2022

Could you elaborate on why the chosen method is better than these alternatives? This would help us reason about them more deliberately when we want to make changes to the pipeline.

Contributor Author

ulucinar Apr 22, 2022

Added clarifications on the choices we made here.
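
For readers who want to see how a pipeline might consume the `importStatements` entry quoted in the diff above, here is a minimal Python sketch. It assumes each entry is a plain `terraform import ADDRESS ID` line with no extra flags and a simple `type.name` address; `ImportStatement` and `parse_import_statement` are hypothetical names used for illustration and are not part of Terrajet.

```python
import shlex
from typing import NamedTuple


class ImportStatement(NamedTuple):
    """Parts of a plain `terraform import ADDRESS ID` line (hypothetical type)."""
    resource_type: str  # e.g. "azurerm_analysis_services_server"
    resource_name: str  # local name used in the example, e.g. "server"
    external_id: str    # provider-specific ID given to `terraform import`


def parse_import_statement(statement: str) -> ImportStatement:
    """Split an import statement into resource type, local name and ID.

    Assumption: the statement has exactly four tokens and the address is
    of the form `type.name` (no module paths, indexes or CLI flags).
    """
    tokens = shlex.split(statement)
    if len(tokens) != 4 or tokens[:2] != ["terraform", "import"]:
        raise ValueError(f"unsupported import statement: {statement!r}")
    address, external_id = tokens[2], tokens[3]
    resource_type, sep, resource_name = address.partition(".")
    if not sep or not resource_name:
        raise ValueError(f"unsupported resource address: {address!r}")
    return ImportStatement(resource_type, resource_name, external_id)


# The statement quoted in the diff excerpt above.
EXAMPLE = (
    "terraform import azurerm_analysis_services_server.server "
    "/subscriptions/00000000-0000-0000-0000-000000000000/resourceGroups/"
    "resourcegroup1/providers/Microsoft.AnalysisServices/servers/server1"
)
parsed = parse_import_statement(EXAMPLE)
```

A pipeline could use the extracted ID as a sample external identifier in generated documentation or examples; statements that do not match the assumed shape are rejected rather than guessed at.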

design/one-pager-terrajet-metadata-extraction.md

+              from a pointed directory in the local filesystem, which is specified as a
+              command-line argument, for instance.
+              As already indicated, if it turns out that a common registry scraper

Member

muvaf Apr 6, 2022

I love that optionality.
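
The option discussed in the excerpt above, a pipeline that reads previously scraped metadata from a local directory given on the command line, could look roughly like the Python sketch below. The `--metadata-dir` flag, the one-file-per-resource `<resource>.yaml` layout, and the function names are assumptions made for illustration; the one-pager does not prescribe them, and the sketch only indexes the files without parsing their YAML content.

```python
import argparse
from pathlib import Path


def index_metadata_files(metadata_dir: Path) -> dict[str, Path]:
    """Map resource name to metadata file.

    Assumption: one `<resource>.yaml` file per Terraform resource,
    stored directly in `metadata_dir`.
    """
    if not metadata_dir.is_dir():
        raise NotADirectoryError(str(metadata_dir))
    return {path.stem: path for path in sorted(metadata_dir.glob("*.yaml"))}


def main(argv=None) -> int:
    """Print the resources for which scraped metadata is available."""
    parser = argparse.ArgumentParser(
        description="List scraped metadata documents (illustrative sketch)."
    )
    parser.add_argument(
        "--metadata-dir",  # hypothetical flag name
        type=Path,
        required=True,
        help="directory containing per-resource metadata files",
    )
    args = parser.parse_args(argv)
    for resource, path in index_metadata_files(args.metadata_dir).items():
        print(f"{resource}\t{path}")
    return 0
```

Calling `main(["--metadata-dir", "path/to/metadata"])` lists the available documents. Because the code generator only sees files on disk, scraping and code generation can keep separate lifecycles, and metadata can also be supplied or adjusted by hand.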

design/one-pager-terrajet-metadata-extraction.md

+              none or some are not available in the corresponding metadata document.
+              Metadata is valuable; the scrapers should capture as much metadata as possible
+              and store them in the common format, even for future use cases we do not yet

Member

muvaf Apr 6, 2022

I'm a bit on the fence about storing everything we can. We would need a format for each piece of information we don't yet use, and once we do use it, we may want to change that format, i.e. where it lives in the YAML. Also, since we allow manual input, the format becomes one of our public APIs. So I wonder whether we should, for example, keep just the examples, since that's all we use today, and add other information once we actually use it.

Contributor Author

ulucinar Apr 22, 2022

I have added example use cases for the proposed fields. In fact (apart from documentation) most have been utilized in provider-jet-azure. I think this also serves as an opportunity to show the potential we have here.

design/one-pager-terrajet-metadata-extraction.md

+              Metadata is valuable; the scrapers should capture as much metadata as possible
+              and store them in the common format, even for future use cases we do not yet
+              envision. New Terrajet pipelines can be added, or existing ones can be enhanced
+              to support advanced use cases. One such proposal could be to extend the CRD

Member

muvaf Apr 6, 2022

A Future Considerations section could work well to list all the use cases you have in mind at the moment.

Contributor Author

ulucinar Apr 22, 2022

Added a Future Considerations section.

design/one-pager-terrajet-metadata-extraction.md Outdated

+              providers `provider-jet-aws`, `provider-jet-gcp` and `provider-jet-azure` in the
+              context of the corresponding [Terrajet issue #48], we have seen utility in
+              extracting such metadata from the Terraform registry and use it to generate
+              example manifests and documentation. In this document, we would like to propose:

Member

muvaf Apr 6, 2022

I think Background section could be limited to the problem statement in general and the actual proposal could be kept for its own section.

Contributor Author

ulucinar Apr 22, 2022

Removed proposal hints from the Background section as suggested.

design/one-pager-terrajet-metadata-extraction.md

		@@ -0,0 +1,272 @@
		# Metadata Extraction from Terraform Registry for Terrajet-based providers

Member

muvaf Apr 6, 2022

Suggested change

      
- # Metadata Extraction from Terraform Registry for Terrajet-based providers
+ # Metadata Extraction from Terraform Registry in Terrajet

Since we're removing the difference between Terrajet and SDK calls at the repository level, I think dropping terms like "Terrajet-based provider" or "Terrajet-based repository" would be more future-proof for readers.

Member

negz commented Apr 16, 2022

Would it make sense to move this design (and perhaps the Terrajet design doc too?) to a new design/ directory under https://github.com/crossplane/terrajet?

ulucinar added 8 commits

April 22, 2022 12:22


          Move scraper discussion out of the "Goals" section

5583f04

Signed-off-by: Alper Rifat Ulucinar <[email protected]>


          Add a top-level Proposal section that gives a high-level overview and

a729acf

move existing proposal sections as subsections of this top-level section.

Signed-off-by: Alper Rifat Ulucinar <[email protected]>


          Make clarifications on scraped metadata maintenance tasks

f705638

Signed-off-by: Alper Rifat Ulucinar <[email protected]>


          Discuss advantages of using per-resource metadata files instead of a

d4a9438

monolithic one.

Signed-off-by: Alper Rifat Ulucinar <[email protected]>


          Add a discussion on how we can prevent manual metadata overrides

b3c4df1

from getting lost as scrapers are rerun, and extend the discussion
on when the proposed scrapers are run.

Signed-off-by: Alper Rifat Ulucinar <[email protected]>


          Discuss pros & cons of the transport alternatives for the scrapers

1188f6e

Signed-off-by: Alper Rifat Ulucinar <[email protected]>


          Add "Alternatives Considered" and "Future Considerations" sections

e09b040

Signed-off-by: Alper Rifat Ulucinar <[email protected]>


          Remove proposal hints from the Background section

36a824d

Signed-off-by: Alper Rifat Ulucinar <[email protected]>

Contributor Author

ulucinar commented Apr 22, 2022

Hi @negz,
I think what you propose makes sense. Let me have the PR approved here (to preserve the existing context) and then I can reproduce it in https://github.com/crossplane/terrajet referring to the review here. I will also mark this PR with do-not-merge. Thank you!

ulucinar changed the title ~~Add one-pager for Terraform registry scrapers and registry metadata~~ [DO NOT MERGE] Add one-pager for Terraform registry scrapers and registry metadata

Member

muvaf commented Apr 25, 2022

> Would it make sense to move this design (and perhaps the Terrajet design doc too?) to a new design/ directory under https://github.com/crossplane/terrajet?

I think having all design docs under crossplane/crossplane helps with discoverability of those docs for the community. Maybe something like crossplane/enhancements or crossplane/design-docs (similar to kubernetes) that could contain all design docs of Crossplane community? It could be a place for non-crossplane org stuff as well, like provider or other extension designs.

Member

negz commented May 2, 2022

> I think having all design docs under crossplane/crossplane helps with discoverability of those docs for the community. Maybe something like crossplane/enhancements or crossplane/design-docs (similar to kubernetes) that could contain all design docs of Crossplane community? It could be a place for non-crossplane org stuff as well, like provider or other extension designs.

I don't feel super strongly, but it does seem a little odd to me that a design pertaining completely to Terrajet is not in the Terrajet repo. We can just stick with everything going in the design folder here if folks feel differently.

Contributor Author

ulucinar commented May 30, 2022

Closing this PR, as we would like to keep code generation in Terrajet scoped to generating CRDs only.

ulucinar closed this

Reviewers

muvaf muvaf left review comments

turkenh Awaiting requested review from turkenh

ezgidemirel Awaiting requested review from ezgidemirel

sergenyalcin Awaiting requested review from sergenyalcin

AaronME Awaiting requested review from AaronME
