Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Move most VARBINARY columns to VARCHAR in examples #15997

Open
1 of 3 tasks
mattlord opened this issue May 22, 2024 · 12 comments · May be fixed by #17318
Open
1 of 3 tasks

Feature Request: Move most VARBINARY columns to VARCHAR in examples #15997

mattlord opened this issue May 22, 2024 · 12 comments · May be fixed by #17318
Labels

Comments

@mattlord
Copy link
Contributor

mattlord commented May 22, 2024

Feature Description

The Vitess local examples (source) are a common first touch experience for users. These users are regularly confused or curious as to why many of the columns in the examples show up as HEX in the query results. The reason for this is that we use VARBINARY types throughout — in some rare cases like keyspace_id it makes sense, but most like customer.email are pure text. This makes understanding what's happening and quickly confirming at-a-glance results as expected more difficult and poses a minor hurdle when it's already quite a lot to get started with Vitess.

We should move most of our VARBINARY usage under ./examples to VARCHAR. This offers a more intuitive/expected experience while also more closely matching what users will be using for similar types of data.

Note: if any of those columns are used in Vindexes then we may also need to change the type used.

Tasks:

  • Change non-binary column types from VARBINARY to VARCHAR
  • Change the vindex type used for those columns in related vschema definitions from hash to xxhash
  • Get all of the example CI tests passing again

Use Case(s)

An easier on-ramp to Vitess.

@EraKin575
Copy link

I would like to work on this issue

@lata-11
Copy link

lata-11 commented Jul 8, 2024

Hey! I would be happy to raise a PR for this.

@mattlord
Copy link
Contributor Author

mattlord commented Jul 8, 2024

Anyone is welcome to open a PR. I don't want to assign it to anyone before there's a PR because history has shown that the assignee often doesn't end up opening a PR, nor do they unassign themselves, so others think someone is working on it when in reality nobody is. Then maintainers have to come back and unassign it after N months and around we go. 🙂

@anshikavashistha
Copy link
Contributor

@mattlord May I work on this issue if this is still open ?

@mattlord
Copy link
Contributor Author

@mattlord May I work on this issue if this is still open ?

@anshikavashistha sure, nobody else is working on this AFAIK. If you open a preliminary PR then I'll assign it to you. Assigning things like this to people has historically caused more work for maintainers and potentially caused others not to work on it when in the end nothing happened. Thanks!

@anshikavashistha
Copy link
Contributor

@mattlord May I work on this issue if this is still open ?

@anshikavashistha sure, nobody else is working on this AFAIK. If you open a preliminary PR then I'll assign it to you. Assigning things like this to people has historically caused more work for maintainers and potentially caused others not to work on it when in the end nothing happened. Thanks!

Sure @mattlord
Thank you

@anshikavashistha
Copy link
Contributor

image
@mattlord @deepthi ,I have to change VARBINARY columns to VARCHAR where appropriate or there might not be no need to move VARBINARY columns as per this relevant PR

@deepthi
Copy link
Member

deepthi commented Sep 3, 2024

The question is more for @mattlord. Given the change you made to the alias, many of the columns in the examples show up as HEX in the query results does not happen any more. Do we still want to change the data types?
As it turns out, none of the VARBINARY columns are being used as sharding columns, so it's not as if we are missing out on providing an example of how to shard using VARCHAR columns.

@mattlord
Copy link
Contributor Author

mattlord commented Sep 4, 2024

I think it's still worth doing as the type usage then aligns with what users would do, we move to xxhash, etc.

@deepthi
Copy link
Member

deepthi commented Sep 4, 2024

In that case, can you edit the issue description with the task list? Including moving from hash -> xxhash.

@anshikavashistha
Copy link
Contributor

@mattlord Is there any more relevant info that needs to be mentioned in the issue description?

@mattlord
Copy link
Contributor Author

@anshikavashistha I don't think so. I did add a task list — hopefully that makes sense.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants