RFE: enhancements for the future of the project #1

ankostis · 2019-10-19T20:27:14Z

(Cloned from yahoo#22)
Collecting here enhancements that would nice to have on this project going forward:

Language & Build

2. Functionality

3. diagrams:

3.1 basalt: https://www.anishathalye.com/2019/12/12/constraint-based-graphic-design/
3.2 g9.js: http://omrelli.ug/g9/#installation
3.3 vis.js: https://visjs.github.io/vis-network/examples/

The text was updated successfully, but these errors were encountered:

syamajala · 2019-10-25T18:11:54Z

As far as colors go, I was actually thinking about this, what if it were possible to attach arbitrary metadata to nodes and pass a predicate function which looks at the metadata and returns true or false which tells you whether to evaluate the node or not?

ankostis · 2019-10-30T13:09:12Z

I agree, this is a flexible solution, a superset of "colors", since it is more "dynamic" - it can decide the nodes to consider after the DAG has formed.

Implementation wise, the operation_predicate() function is pretty simple, but i would place it in the execution_config ContextVar.
An alternative for the node-predicate function would be to expose the DAG from a new "hook" before_compute(), and allow arbitrary calls to networkx.subgraph() - that would filter nodes in a single step, fully under User control. A drawback is that the user may destroy the DAG (remove data-nodes needed by the underlying functions).
I like exposing the power of networkx to the users.

Finally we need the metadata to be in the DAG - graphtik has this mostly implemented.

A remark on performance,
if this predicates run right before every compute() call, the effect would be (in big-O notation)
$O\left ( \sum m_i \times n_i \right )$
where is the number of operations in the network-operation node (assuming utilizes predicates). For simple graph-operation, m = <total-number of operation nodes>, n=1.

e.g. assign "colors" to nodes, and solve a subset each time.

ankostis · 2019-12-11T19:15:50Z

@syamajala the just released 4.0.0 supports node-attributes and and a predicate function to prune arbitrary nodes before computing results (check CHANGES and NetworkOperation.pruned() method).

syamajala · 2020-12-09T09:55:35Z

Are there any plans to implement control flow nodes? Things like If, ElseIf, Else, and maybe For?

ankostis · 2020-12-09T16:21:39Z

I had thought that initially, and had precluded it, because of all the complication such a UX would imply, to drive conditional logic in the DAG level.
But since i also needed if-then-else functionality I decided that all "conditional" code should reside inside regular python operations, as can be seen in the plotted diagram of the landing page.

Specifically, if some function wants to cancel its downstream execution, or produce a subset of its outputs (and implement if-then-else logic), can do that with endured operations or partial outputs. A reschedule point kicks-in after such operations.

I cannot think of a way to workaround also for-loops with the same machinery, because of the following architectural reasons:

The conceptual model of this project cannot have a goto equivalent, all scheduling decisions are based on the existence of data (there are no direct operation-dependencies). I don't want this to change.
The execution DAGs cannot have loops (they are acyclic after all), planning resolves any loops in the pipeline-graph before execution (see for eg. sideffects that are used to bypass any data-loops). It would be rather difficult to change this.
The solution expects each operation to execute only once. That may change, but it is not enough for having loops.

syamajala · 2020-12-09T18:00:10Z

Based on what you said I have some vague idea of how to do something like this outside of graphtik. Note: I havent tried this out yet.

What you do is have some higher level functions/nodes.

graph = graph_composer(name='graph')(
If(name='val_greater_than_10', condition_needs=['val'], condition=lambda val: val > 10, needs=['a'], provides=['c'], group='if1')(
    operation(name='a_plus_10', needs=['a'], provides=['b'])(lambda a: a+10),
    operation(name='b_divide_5', needs['b'], provides=['c'])(lambda b: b/10)),
ElseIf(name='val_equal_10', condition_needs=['val'], condition=lambda val: val == 10, needs=['a'], provides=['c'], group='if1')(
    operation(name='a_minus_10', needs=['a'], provides=['c'])(lambda a: a-10))
Else(name='val_less_than_10', condition_needs=['val'], needs=['a'], provides=['c'], group='if1')(
    operation(name='a_divide_10', condition_needs=['val'], needs=['a'], provides=['c'])(lambda a: a/10)))

What graph_composer would do then is walk the list of nodes and return an operation that looks something like this:

def if1(val, a, b):
   if val > 10:
       return val_greater_than_10_graph(a ,b)
   elif val == 10:
        return val_equal_10_graph(a, b)
   else:
       return val_less_than_10_graph(a, b)

operation(name='if1', needs=['val', 'a', 'b'], provides=['c'])(if1)

You could probably clean this up by generating a class that holds the graphs as attributes and implements the if1 function as __call__

I think instead of For what I would really like is a Map where maybe what it does is loop over lists and just applies the subgraph to each entry.

graph = graph_composer(name='graph')(
Map(name='map', needs=['a', 'b'], provides=['c'])(
    operation(name='a_times_b', needs=['a', 'b'], provides=['c'])(lambda a, b: a*b)))

graph({'i': 0, 'a': [1, 2, 3], 'b': [11, 12, 13]}) -> {'c': [11, 24, 39]}

It would work in a similar method.

Also I should say my use case for something like this is that I provide users with libraries of operations and they build the graph themselves, so putting the logic inside of the operation itself is not really feasible for me because I dont know the conditions they want to evaluate on.

ankostis · 2020-12-09T20:54:06Z

If you auto-generate a function that returns_dict, you can forego the x3 sub-pipelines, and put all operations-nodes in the same graph:

def if1(val, a, b):
   if val > 10:
       return {"if1_if"': True}
   elif val == 10:
        return {"if1_elif"': True}
   else:
       return {"if1_else"': True}

operation(if1,
          needs="val", 
          provides=["if1_if", "if1_elif", "if1_else"], 
          returns_dict=True)

The returns_dict functions facilitate composing operations with partial-outputs.

You also need to auto-generate 3 branch-operations downstream, each one receiving one of the 3 dependencies provided above "if1_if", "if1_elif", "if1_else" as their needs.

Does that makes sense?

ankostis added a commit that referenced this issue Dec 11, 2019

FEAT(#1,net,netop): PRUNE by node-PROPS

d18b84a

e.g. assign "colors" to nodes, and solve a subset each time.

ankostis added a commit that referenced this issue Dec 11, 2019

FEAT(#1,net,netop): PRUNE by node-PROPS

3c5523f

e.g. assign "colors" to nodes, and solve a subset each time.

ankostis pinned this issue Jan 10, 2020

ankostis mentioned this issue Jun 10, 2021

WIP: Pre-review for v1.3.0 release yahoo/graphkit#31

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFE: enhancements for the future of the project #1

RFE: enhancements for the future of the project #1

ankostis commented Oct 19, 2019 •

edited

Loading

syamajala commented Oct 25, 2019

ankostis commented Oct 30, 2019

ankostis commented Dec 11, 2019

syamajala commented Dec 9, 2020

ankostis commented Dec 9, 2020 •

edited

Loading

syamajala commented Dec 9, 2020 •

edited

Loading

ankostis commented Dec 9, 2020 •

edited

Loading

RFE: enhancements for the future of the project #1

RFE: enhancements for the future of the project #1

Comments

ankostis commented Oct 19, 2019 • edited Loading

Language & Build

2. Functionality

3. diagrams:

syamajala commented Oct 25, 2019

ankostis commented Oct 30, 2019

ankostis commented Dec 11, 2019

syamajala commented Dec 9, 2020

ankostis commented Dec 9, 2020 • edited Loading

syamajala commented Dec 9, 2020 • edited Loading

ankostis commented Dec 9, 2020 • edited Loading

ankostis commented Oct 19, 2019 •

edited

Loading

ankostis commented Dec 9, 2020 •

edited

Loading

syamajala commented Dec 9, 2020 •

edited

Loading

ankostis commented Dec 9, 2020 •

edited

Loading