Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for subqueries in order_bys and group_bys #607

Merged
merged 9 commits into from
May 28, 2024

Conversation

zachdaniel
Copy link
Contributor

@zachdaniel zachdaniel commented May 16, 2024

These are the corresponding ecto_sql changes for functionality added in a separate pull request: elixir-ecto/ecto#4417

@zachdaniel zachdaniel changed the title Test support for subqueries in order_bys Support for subqueries in order_bys and group_bys May 16, 2024
zachdaniel added a commit to zachdaniel/ecto_sql that referenced this pull request May 20, 2024
zachdaniel added a commit to zachdaniel/ecto_sql that referenced this pull request May 23, 2024
@zachdaniel
Copy link
Contributor Author

Can't say for sure, but that build error for the mysql integration test does not appear related to my changes/may be transient. I can't retry failed steps though.

@greg-rychlewski
Copy link
Member

something is happening with the MySQL CI...it seems like the MySQL server is not starting...will try to figure it out

@greg-rychlewski
Copy link
Member

it's not your change though. looks like it's been happening for 4-5 commits

@warmwaffles
Copy link
Member

looks like it's been happening for 4-5 commits

Flapping integration tests, say it isn't so. 😭

@greg-rychlewski
Copy link
Member

i don't believe it's any code change though because the commit it started happening doesn't alter this. so probably the docker stuff we are pulling is not fixed and we got an update that is broken with our methods

@josevalim
Copy link
Member

We should probably add a unit test for the other databases.

@zachdaniel
Copy link
Contributor Author

zachdaniel commented May 25, 2024

Okay, so, I've added tests for the other databases. Was (of course) a great idea and highlights some issues.

Two main points

  1. I haven't set up SQL server locally before, so I can't actually test that these queries work. is there a different test I'm not seeing that will actually run the tests?

  2. ecto_sql is actually (currently) building subqueries incorrectly when used in this context. The question here is: should I instead just make mysql raise an error and we can fix the query generation later? Or should I attempt to fix the subquery building as a part of this PR? There are failing tests indicating the incorrect query building currently.

@josevalim
Copy link
Member

What do you mean by “in this context”? Does it generate broken queries today? Or you mean in the additions of this PR? If it is in this PR, we should fix it now. If broken today, we should probably fix that before. Thanks :)

@zachdaniel
Copy link
Contributor Author

Sorry for being unclear. Building subqueries in the context of ORDER BY and DISTINCT must be done slightly differently (apparently). I'm not a mysql expert, but, I guess it has a problem with doubly nested parenthesis.

i.e

  1) test windows window with subquery (Ecto.Adapters.MyXQLTest)
     test/ecto/adapters/myxql_test.exs:1075
     Assertion with == failed
     code:  assert all(query) ==
              ~s"SELECT s0.`x` FROM `schema` AS s0 WINDOW `w` AS (ORDER BY exists(SELECT ss0.`x` AS `result` FROM `schema` AS ss0 WHERE (ss0.`x` = s0.`x`)))"
     left:  "SELECT s0.`x` FROM `schema` AS s0 WINDOW `w` AS (ORDER BY exists((SELECT ss0.`x` AS `result` FROM `schema` AS ss0 WHERE (ss0.`x` = s0.`x`))))"
     right: "SELECT s0.`x` FROM `schema` AS s0 WINDOW `w` AS (ORDER BY exists(SELECT ss0.`x` AS `result` FROM `schema` AS ss0 WHERE (ss0.`x` = s0.`x`)))"

@zachdaniel
Copy link
Contributor Author

I originally thought the problem was more serious but this may actually be easily fixable so I will take a look right now :)

@zachdaniel
Copy link
Contributor Author

Okay, I've updated it and now it produces valid subqueries, but not wrapping "top level" subqueries in parenthesis

@zachdaniel
Copy link
Contributor Author

I actually can't find the code that handles exists queries for mysql? But the queries it generates now work for me.

@josevalim
Copy link
Member

I dont think it needs to handle it explicitly? Exists is a regular function call and then just make sure to handle sub queries as their appear in the AST.

@zachdaniel
Copy link
Contributor Author

Ah, I see, thats this code. With my {:top_level, ...} modification, that resolves the issue of exists((subquery))

    defp expr({fun, _, args}, sources, query) when is_atom(fun) and is_list(args) do
      {modifier, args} =
        case args do
          [rest, :distinct] -> {"DISTINCT ", [rest]}
          _ -> {[], args}
        end

      case handle_call(fun, length(args)) do
        {:binary_op, op} ->
          [left, right] = args
          [op_to_binary(left, sources, query), op | op_to_binary(right, sources, query)]

        {:fun, fun} ->
          [fun, ?(, modifier, Enum.map_intersperse(args, ", ", &expr({:top_level, &1}, sources, query)), ?)]
      end
    end

mix.exs Outdated Show resolved Hide resolved
@zachdaniel
Copy link
Contributor Author

All done 👍

@zachdaniel
Copy link
Contributor Author

@greg-rychlewski trying to figure out the best place for that kind of test. Where should I put it?

@greg-rychlewski
Copy link
Member

we could put it here https://github.com/elixir-ecto/ecto_sql/blob/master/integration_test/sql/subquery.exs.

if there is an adapter that doesn't allow a certain thing we are doing here (like doesn't allow subqueries in group by or something like that) we could add a tag.

then the tag would be filtered out here https://github.com/elixir-ecto/ecto_sql/blob/master/integration_test/pg/test_helper.exs#L106 (and a similar place for mysql/tds)

@zachdaniel
Copy link
Contributor Author

Well, adding the integration tests has helped flesh out the fact that you just can't do this with mssql :) subqueries are apparently only supported in the where clause. Where should I handle this? Is it kosher to raise an error in Ecto.Adapters.Tds.Connection?

@greg-rychlewski
Copy link
Member

Yeah that would be best since the error from TDS might not be too clear. Thanks!

@zachdaniel
Copy link
Contributor Author

It is indeed not clear 😆

@zachdaniel
Copy link
Contributor Author

Okay, I think this should be ready now :)

@greg-rychlewski
Copy link
Member

Sorry to be annoying. For the last TDS change, since we know we only added subqueries for a few new things, could we not catch them in places like this and raise there:

defp group_by(%{group_bys: group_bys} = query, sources) do
      [
        " GROUP BY "
        | Enum.map_intersperse(group_bys, ", ", fn %ByExpr{expr: expr} ->
            Enum.map_intersperse(expr, ", ", &top_level_expr(&1, sources, query))
          end)
      ]
    end

That would be a smaller change than tagging where the expression came from all the way down the recursive calls. And I believe we just need it for group_by and order_by because distinct is already raising

@zachdaniel
Copy link
Contributor Author

The subqueries could be detected, but they could also be anywhere in an expression. So I'd have to crawl the expression to check for subqueries. i.e a and exists(...) or (b and exists(...))

@greg-rychlewski
Copy link
Member

Oh I see what you mean. TBH I'm not sure if raising in the adapter is worth keeping track of all the locations as the code evolves. It might not be so bad to just let the database raise because it should at least show you the bad query, similar to if you were using a CLI tool to interact with it.

@josevalim what do you think.

@zachdaniel
Copy link
Contributor Author

The error is pretty tough, you basically get something like syntax error at or near exists(

@greg-rychlewski
Copy link
Member

greg-rychlewski commented May 28, 2024

I see what you mean. My opinion might be too unforgiving so we should definitely see what Jose thinks too.

In general I think if you are shown the query Ecto generated and the error from the database then it's pretty fair. It's what you would get if you tried to write the query by hand in the CLI client.

And I think it's important people understand how to make the query by hand in their database before trying to do something in Ecto. Ecto should really just be a translator and not like an excuse not to understand the database.

Databases also evolve and I'm not sure it's feasible for Ecto to keep up with all of those rules instead of letting the database do its thing, given there were no sins committed before the query string was made.

@zachdaniel
Copy link
Contributor Author

I'm all good undoing that change, just wanted to make sure the options were clear. I leave it up to you two to decide :)

@josevalim
Copy link
Member

I agree with @greg-rychlewski! I appreciate the extra mile you went here @zachdaniel but I think we can keep the code simpler and we don't have to check if the subquery is valid. If the user put it there, we assume it is valid, and if it is not, the database will let them know it isn't. :)

@zachdaniel
Copy link
Contributor Author

No worries! Only took a few minutes :) Will back it out.

@zachdaniel
Copy link
Contributor Author

Alright, made that change. One thing I also did, however, was remove the tests for that particular thing as well. I can add them back in, but I think they kind of send the wrong message, since they are testing known-to-be-not-valid expressions. I'm happy to add them back in though :D

@zachdaniel
Copy link
Contributor Author

I think the test failure there is unrelated to my changes.

@greg-rychlewski greg-rychlewski merged commit a3c15a2 into elixir-ecto:master May 28, 2024
10 checks passed
@greg-rychlewski
Copy link
Member

Thanks for the changes! Nice work

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants