Skip to content

Latest commit

 

History

History
78 lines (45 loc) · 6.21 KB

README.md

File metadata and controls

78 lines (45 loc) · 6.21 KB

Numerical Experiments Data from the paper entitled "Distributionally robust stochastic programs with side information based on trimmings"

Goals

The purpose of this repository is to provide the details of the numerical experiments coded in the Python programming language used in the paper [1]. This paper has been developed by some members of the OASYS group through funding from the project Flexanalytics. We encourage you visit the related links to know more about our research.

Contents

This repository includes the data about the two test problems used in the paper:

  • Single-item Newsvendor problem with side information:

    • [singlenewsvendor_generation_data]: This folder contains the data used in the numerical experiment corresponding to the single-item Newsvendor problem with side information considered in [1]. The file "indices+str(N)".csv contains the 400 samples' index of size N; that is, each row (total=400) of the .csv file represents a sample of size N. The data points of the bivariate distribution are in the file entitled "muestra_dist_real.csv". The rest of the parameters of the numerical experiments are described in [1].
  • Portfolio allocation problem with side information.

    • [portfolio_generation_data]: This folder contains the data used in the numerical experiment corresponding to the portfolio allocation problem with side information used in [1]. The file "indices+str(N)".csv contains the 200 samples' index of size N; that is, each row (total=200) of the .csv file represents a sample of size N. The data points of the joint distribution are in the file entitled "muestra_real_conjunta.csv". The rest of the parameters of the numerical experiment are described in [1]. The main test case deals with the case in which alpha=0 and the data points of the true conditional distribution of this case are in the file entitled "muestra_dist_real.csv". Some crucial remarks to reproduce the numerical experiments in the paper in the case alpha=0: The y-data coordinates are not scaled, so they must be scaled to reproduce reasonably the experiments as follows:

          Side/contextual information (scaled):  [(1000-1000)/50,(0.01-0.02)/0.01,(5-math.exp(1/2))/(math.sqrt((math.exp(1)-1)*math.exp(1)))]
          
          The y-data coordinates of the samples from the joint distribution are scaled as follows:
           puntos_yconjunta_real=muestra_real_conjunta[:,[3,4,5,6,7,8]]  #The y-data coordinates of the samples from the joint distribution, muestra_real_conjunta[:,[0,1,2]]              represents th feature coordinates of the samples from the joint distribution.
      
      
            cov_yrealconjunta=np.cov(puntos_yconjunta_real,rowvar=False)
      
      
            std_yrealconjunta=np.diag(np.sqrt(np.diag(cov_yrealconjunta)))  
            
            
            media_yrealconjunta=puntos_yconjunta_real.mean(axis=0) #mean vector
            #SCALE THE POINTS OF THE JOINT DISTRIBUTION:
            np.dot(inv(std_yrealconjunta), muestra_conjunta_sample[j,[3,4,5,6,7,8]]-media_yrealconjunta)
            media_yrealconjunta=puntos_yconjunta_real.mean(axis=0) #mean vector
            #SCALE THE POINTS OF THE JOINT DISTRIBUTION:
            np.dot(inv(std_yrealconjunta), muestra_conjunta_sample[j,[3,4,5,6,7,8]]-media_yrealconjunta)
      
    • Other related and complementary numerical tests :

    • Case varying the side information: In this case, the sample corresponding to muestra_real_conjunta[indices[0]] ( for each value of N considered, indices[0] contains the indices of the sample for size N ) is chosen. Note that, in this case the sample is fixed and we vary the side information (we consider 200 samples of the z-distribution which are located in the file "zdatasamples_variando.csv"). The file entitled "muestra_dist_realindex.csv" in the folder "conditional_distributions" is the conditional distribution corresponding to the index from 0 to 199 of the features as the side information used to compute the performance metrics.

    • Case alpha>0, additional numerical experiments at the Supplemental Material: In this case, the file "muestra_real_conjunta_alfapos.csv" contains the data of the joint probability distribution. The file "indices_alphaposN.csv" contains the indices of the sample of size N used in the numerical experiments.

The folder Codes contains the codes used in the numerical experiments.

References 📚

[1] Esteban-Pérez, A., & Morales, J. M. (2020). Distributionally robust stochastic programs with side information based on trimmings--Extended version. arXiv preprint arXiv:2009.10592.

Do you want to contribute? 👨🏾‍🔬

Any feedback is welcome ✨ so please feel free to ask about or comment on anything you want via a Pull Request in this repo. If you need more help, you can ask Adrián Esteban-Pérez ([email protected]) 📧.

Contributors

Developed by 👨🏾‍💻

License 📝

Copyright 2021 Optimization and Analytics for Sustainable energY Systems (OASYS)

Licensed under the GNU General Public License, Version 3 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

   http://www.gnu.org/licenses/gpl-3.0.en.html

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.