SC: check robustness of results (frequentist) #45

drbenvincent · 2022-11-02T15:08:17Z

I've seen clearly sub-optimal weightings when running the WeightedProportion custom scikit-learn model. This is likely due to poor optimisation, perhaps getting stuck in local optima. So we need to explore how the results depend on w_start.

CausalPy/causalpy/skl_models.py

Lines 22 to 33 in 815c14c

    
    def fit(self, X, y):
        w_start = [1 / X.shape[1]] * X.shape[1]
        coef_ = fmin_slsqp(
            partial(self.loss, X=X, y=y),
            np.array(w_start),
            f_eqcons=lambda w: np.sum(w) - 1,
            bounds=[(0.0, 1.0)] * len(w_start),
            disp=False,
        )
        self.coef_ = np.atleast_2d(coef_)  # return as column vector
        self.mse = self.loss(W=self.coef_, X=X, y=y)
        return self

One way to make the results more reliable (more likely to represent the global minimum) is a multi-start approach, similar in spirit to particle swarm methods: run the optimisation multiple times, each with a different w_start, and keep the solution with the lowest loss.
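A minimal sketch of what running from several starting points might look like, written as a standalone function rather than a change to `WeightedProportion`. Everything here beyond the `fmin_slsqp` call copied from `fit` is an assumption for illustration: the name `fit_multistart`, the root-mean-square `loss` (the real `self.loss` may differ), and drawing the random starts from a flat Dirichlet so that each start already satisfies the sum-to-one constraint.

```python
from functools import partial

import numpy as np
from scipy.optimize import fmin_slsqp


def loss(W, X, y):
    """Assumed loss: root-mean-square error of the weighted combination."""
    return np.sqrt(np.mean((y - X @ W) ** 2))


def fit_multistart(X, y, n_starts=20, seed=0):
    """Hypothetical helper: run SLSQP from several w_start values, keep the best."""
    rng = np.random.default_rng(seed)
    n = X.shape[1]
    # The first start is the uniform weighting used by the current fit();
    # the remaining starts are random points on the simplex (flat Dirichlet).
    starts = [np.full(n, 1 / n)]
    starts += list(rng.dirichlet(np.ones(n), size=n_starts - 1))

    results = []
    for w_start in starts:
        w = fmin_slsqp(
            partial(loss, X=X, y=y),
            w_start,
            f_eqcons=lambda w: np.sum(w) - 1,
            bounds=[(0.0, 1.0)] * n,
            disp=False,
        )
        results.append((loss(w, X, y), w))

    # Final loss reached from each start, in the same order as `starts`.
    losses = np.array([r[0] for r in results])
    best_loss, best_w = min(results, key=lambda r: r[0])
    return best_w, best_loss, losses
```

The returned `losses` array doubles as a robustness check: if the final losses vary noticeably across starts, the fit is sensitive to w_start, and comparing `losses[0]` (the uniform start) with `best_loss` shows how much the current default leaves on the table.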

Look into the relevant fitting procedures in scikit-learn.


drbenvincent mentioned this issue Nov 2, 2022

SC: check sensitivity to prior #46

Open

drbenvincent added this to the Stabilise the feature set milestone Dec 5, 2022
