-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No variables selected, on high dimensional data. #26
Comments
The problem is two-fold:
stabsel_parameters(p = 6248, cutoff =0.75,PFER =0.2)
# Stability selection with unimodality assumption
#
# Cutoff: 0.75; q: 34; PFER (*): 0.189
# (*) or expected number of low selection probability variables
# PFER (specified upper bound): 0.2
# PFER corresponds to signif. level 3.02e-05 (without multiplicity adjustment) Please have a look at the literature, e.g. citation("stabs")[[2]] i.e.,
where we give some advice on the choice of PFER (by relating it to the usual type 1 error rate) and on the best way to set parameters (i.e., fix ?stabsel.stabsel With respect to your issue with |
I used stab package for high dimensional gene type of data, which, regardless how I tried, (lars.lasso would not work on input, but glmnet.lasso works just fine), it shows that ,everytime I run, 'no variables selected '. for example,
stab_lasso<-stabsel(x =train_dat, y =tr_target, fitfun =glmnet.lasso, cutoff =0.75,PFER =0.2)
my data had extremely high dimensionality 6248 columns and low observations with 224 rows.
If I use glmnet.lasso_maxCoef, the algorithm would just return first 45 genes as being selected, this is clearly not right.
What am I doing wrong? has anyone, except the low dimensional data, tried on real world high dimensional data?
The text was updated successfully, but these errors were encountered: