Fix pooled_hazard_task bug #333

Open
wants to merge 10 commits into
base: devel

Larsvanderlaan
Contributor

@Larsvanderlaan Larsvanderlaan commented Mar 17, 2021

The pooled hazards task does not work as intended (or errors out) when it is built from a task whose internal row_index variable is non-null. This happens because passing a new dataset to next_in_chain does not reset the internal row_index, so the stale indices are applied to the new data. As a result, CV learners that use pooled hazards tasks internally (e.g. Lrnr_cv$new(Lrnr_pooled_hazards$new())) break down. In tmle3 survival, the pooled hazards task was created externally as the main task, so hazard estimation with CV was not affected by this bug.
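
To make the failure mode concrete, here is a toy sketch in base R. It is not the sl3 implementation: `make_task`, `task_rows`, `next_in_chain_buggy` and `next_in_chain_fixed` are hypothetical stand-ins, and the "task" is just a list holding a data frame and an optional row_index. It only illustrates what goes wrong when a stale row_index is carried over to a new (pooled, long-format) dataset.

```r
# Toy stand-in for a task: a data frame plus an optional row_index that
# selects a subset of rows (as a CV fold would). Hypothetical, not sl3 code.
make_task <- function(data, row_index = NULL) {
  list(data = data, row_index = row_index)
}

# Rows the task actually exposes after applying row_index (if any).
task_rows <- function(task) {
  if (is.null(task$row_index)) {
    task$data
  } else {
    task$data[task$row_index, , drop = FALSE]
  }
}

# Buggy chaining: swaps in the new data but keeps the stale row_index.
next_in_chain_buggy <- function(task, new_data) {
  make_task(new_data, row_index = task$row_index)
}

# Fixed chaining: a new dataset starts with a fresh (NULL) row_index.
next_in_chain_fixed <- function(task, new_data) {
  make_task(new_data, row_index = NULL)
}

original <- data.frame(id = 1:4, time = c(2, 1, 3, 2))
fold_task <- make_task(original, row_index = c(2, 4)) # e.g. a training fold

# Pooled (long-format) data built from the fold's rows:
# one row per id per time point up to that id's observed time.
fold <- task_rows(fold_task)
pooled <- do.call(rbind, lapply(seq_len(nrow(fold)), function(i) {
  data.frame(id = fold$id[i], t = seq_len(fold$time[i]))
}))

# The stale indices c(2, 4) refer to the original data, not to `pooled`.
buggy <- task_rows(next_in_chain_buggy(fold_task, pooled))
fixed <- task_rows(next_in_chain_fixed(fold_task, pooled))
```

In this sketch `fixed` exposes every row of the pooled data, while `buggy` silently drops rows and picks up an all-NA row because one stale index points past the end of the new dataset.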

It might also be worth changing sl3_Task so that row_index is reset whenever a new dataset is passed in. Alternatively, we could simply disallow passing datasets through next_in_chain.

Also changed Lrnr_hal9001 so that all parameters are passed through to fit_hal.

Also added a squash option to Lrnr_hal9001 to reduce computation time.

Cheap fix for Lrnr_hal to work with latest version of hal9001/devel
@Larsvanderlaan
Contributor Author

I think the issue is that squash_hal_fit is pretty broken in the master version of hal9001. I will work on getting the checks to pass.

@nhejazi nhejazi added the bug label Mar 23, 2021
@nhejazi nhejazi changed the title Fix pooled_hazard_task bug Fix pooled_hazard_task bug Apr 10, 2021
Base automatically changed from devel to master December 23, 2021 19:16
@nhejazi nhejazi changed the base branch from master to devel December 24, 2021 02:26