Satellite extractor: Access Satellite Imagery with Geographical and Temporal Control
Satellite Extractor provides users with the capability to retrieve satellite imagery for specific geographical coordinates and predefined timestamps. While it offers a powerful tool for accessing remote sensing data, it's important to note that the quality of the images may not always be perfect due to potential cloud interference, which users may need to filter or process as necessary..
- We created full open-opensource datasets in HuggingFace and Physionet.
- Find a short overview of this project (Satellite extractor)
Sentinelhub grant: Sponsoring request ID 1c081a: Towards a Smart Eco-epidemiological Model of Dengue in Colombia using Satellite in Collaboration with MIT Critical Data Colombia.
Project supported by ESA Network of Resources Initiative
- [2024] Colombian dataset v1 using this repository is posted in Physionet.
- [2024] DengueNet: Dengue Prediction using Spatiotemporal Satellite Imagery for Resource-Limited Countries. paper
In this work, we introduce a data collection and processing framework to extract Sentinel-2 satellite 1 based on modified Copernicus Sentinel data. Satellite images of the ten cities are extracted from sentinel 2 through the SentinelHub API using a scalable dockerized framework that orchestrates Google Earth Engine (GEE) to generate regions of interest (ROI), which are afterwards used to download images. The framework also proposes a recursive noise artifact removal algorithm that reduces camera capturing noise generated by Sentinel 2-L1C orbit transit per week.
pip install satellite-extractor==0.7
Find in Pypi: https://pypi.org/project/satellite-extractor/
import satellite_extractor as sat
sat.run(TIMESTAPS, CLIENT_ID, CLIENT_SECRET, IMAGE_FORMAT, COORDINATES_PATH )
- see the
download_images.ipynb
notebook for learning how to initialize these variables.
-
Credentials for GCP: Please follow the instructions as explained here and add update your credentials as follows.
- Inside of
src/satellite-extractor-dockerized/config.py
, updateservice_account
with your new service account. - Store the GCP json key inside of
src/satellite-extractor-dockerized/data_config
- Inside of
-
Credentials for SentinelHub: The best of this project is that you can download data for free using the Free Tier from SentinelHub. Follow the instructions to create your username and password here. Please remember it is a free trial, access is limited!
If you want to download data using docker, please update the config file as desired and follow the next commands.
docker build -f Dockerfile -t docker .
Run with syncronized volume:
docker run -v /home/sebasmos/Desktop/satellite.extractor/src/satellite-extractor-dockerized:/Dengue -ti docker /bin/bash
Finally, run python satellite.extractor.py
to download the satellites as customized.
-
Clone repository and install dependencies in
pip install -r requirements.txt
-
Create account on Sentinelhub to obtain credentials:
-
Install sentinelhub API, follow up the official documentation.
v0.7: optimized satellite-extractor functions
v.0.6: including dockerized version and adding cloud2cloudless scripts
methods_improvements: adding uint16 java-scripts
- Read GCP bucket to Google Colab
- Download satellite data for N cities
- PART_1_satellite_imagery_augmentation: exploring forward artifact removal
- PART_2_satellite_imagery_augmentation: exploring forward-backward artifcat removal
- satellite_images_hashing: evaluate dataset quality by quantifying the number of duplicates on your data
- viz_many_folders: explore the visualization of satellite images
- create_Cloud2CloudlesDataset: create cloud to cloudless paired image dataset
- update dockerized api
- include Super-resolution into pipelines
- include Cloud removal algorithms pipeline
Feel free to contact me if want to collaborate on this project.
If you use this code for your research, please cite our papers.
@article{cajasmulti,
title={A Multi-Modal Satellite Imagery Dataset for Public Health Analysis in Colombia},
author={Cajas, Sebastian A and Restrepo, David and Moukheiber, Dana and Kuo, Kuan Ting and Wu, Chenwei and Chicangana, David Santiago Garcia and Paddo, Atika Rahman and Moukheiber, Mira and Moukheiber, Lama and Moukheiber, Sulaiman and others}
}
@article{kuo2024denguenet,
title={DengueNet: Dengue Prediction using Spatiotemporal Satellite Imagery for Resource-Limited Countries},
author={Kuo, Kuan-Ting and Moukheiber, Dana and Ordonez, Sebastian Cajas and Restrepo, David and Paddo, Atika Rahman and Chen, Tsung-Yu and Moukheiber, Lama and Moukheiber, Mira and Moukheiber, Sulaiman and Purkayastha, Saptarshi and others},
journal={arXiv preprint arXiv:2401.11114},
year={2024}
}
@article{moukheiber2024multimodal,
title={A multimodal framework for extraction and fusion of satellite images and public health data},
author={Moukheiber, Dana and Restrepo, David and Cajas, Sebasti{\'a}n Andr{\'e}s and Montoya, Mar{\'\i}a Patricia Arbel{\'a}ez and Celi, Leo Anthony and Kuo, Kuan-Ting and L{\'o}pez, Diego M and Moukheiber, Lama and Moukheiber, Mira and Moukheiber, Sulaiman and others},
journal={Scientific Data},
volume={11},
number={1},
pages={634},
year={2024},
publisher={Nature Publishing Group UK London}
}