-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for abfss://[email protected]/path URL syntax #72
Conversation
Could you please consider supporting
|
@sugibuchi I'm happy to try adding that, but I would like to do it as a separate PR to limit the scope of what's affected here. The |
@samansmink Could you have a look at this PR and merge if all looks ok? Apologies if it's not you that I should ping. I just looked at the other open PR. Maybe there should be a CODEOWNERS file in the repo? |
Hello, From my point a view I do not think that a good idea but maybe I a particular user that use too many storage account. A solution for this can also be to change the way secrets score is compute and allow wildcard. Regards |
@quentingodeau I see what you mean. But given how important support for this syntax is, would it be ok to accept this change and document the limitations with regards to the scopes? This change should not affect existing functionality, only add some more functionality (which is slightly limited with regards to the scopes, but better than nothing, IMHO). |
Yes all abfss path are in fact mostly manage this way. I was just to point out the limitations of the syntax. I really think that we should adapt the way score are compute in the secret management. |
OK. I understand and agree to this. |
If we should keep only one syntax, please consider keeping the Hadoop compatible syntax. For historical reasons, each ABFS filesystem implementation supports a different set of ABFS URL syntax. Apache Hadoop's
Python
Rust
Apache Arrow link
DuckDB
This inconsistency of supported ABFS URL syntax is a source of headaches, particularly when we combine different implementations of the ABFS file system (for example, extracting data from a massive table using Spark, then interactively analysing it with Polars, etc.). Fortunately, the ABFS URL syntax defined by Hadoop's To maximize interoperability with other frameworks/libraries, the support of the Hadoop-compatible syntax should be a high priority. |
@sugibuchi Thank you for the summary of the various options. It is quite a mess! I agree with the conclusion that the ABFS syntax should be the main priority. Hopefully we can merge this PR now, and then any decisions made about futher changes can be handled in a different one? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, I will fix CI on main then merge that into this PR so that CI can run properly here
all green 🟢 |
@sugibuchi your suggestion is now implemented thanks @gdubya! |
While playing with the uc_catalog and delta extensions I encountered an error while trying to read from an ADLS gen2 account using the abfss syntax in this format:
abfss://[email protected]/path
.This PR corrects the URL parser to account for this additional variation.
Linked issues: