CS611 ML Engineering: Airbus Ship Detection with U-Net and Vertex AI Pipeline

This is the git repo for CS611 ML Engineering project for airbus ship detection.

Group 14 Members:

Wong Songhan
Koh Enyong
Arnold Ng
Gabriel Quek

Problem Statement

In this project, we tackle the problem of identifying ships in satellite images. We recognize 3 main applications for this problem:

Maritime Traffic Management – Improves general situational awareness, especially for small vessels not covered by AIS
Maritime Surveillance & Policing – For detection and tracking of vessels with AIS turned off, which may be engaged in illegal activity
Naval Warfare – An additional source of intelligence for detecting enemy locations

Dataset

The dataset was retrieved from Kaggle based on the Attributes of the dataset:

192,556 images from Airbus Ship Detection Challenge
Each image may have multiple ships
Labels are run-length encoded (RLE), for data compression, need to be converted to single channel image

Visit this Kaggle page for more info

https://www.kaggle.com/c/airbus-ship-detection

Pipeline

Below are the components of our entire pipeline:

1. EDA / Experimentation

We interactively approach the model building and exploration based on the input dataset. Understanding the dataset and problem well before training and building of our model and their respective components.

2. Data Ingest

Due to the complexity of the input dataset and problem itself, preprocessing of the input data is essential to provide good input data for our pipeline.

3. Data Statistics Generation

In this section, we create a component that computes the data statistics.

4. Model Training

Building of model training component that is used by the overall pipeline to be deployed and part of the CI/CD process that retrains the model based on certain triggers.

5. Model Evaluation

Component building of evaluation. Evaluation of the output trained model is conducted. Metrics will be output.

6. Model Deployment

Model is deployed to Vertex AI that is used to serve endpoint.

7. Pipeline Deployment

Stringing together of the pipeline, alongside test components that ensures every component in the pipeline is in order before pushing it to the Vertex AI platform.

8. Model Monitoring

Using the data statistics generated from Step (3), this notebook is used to aassess new data for train-serve drift.

9. Model Serving

This notebook provides a demo of calling RESTful api from Endpoint which returns a model prediction result given an input image.

Overall Pipeline (deployed on Vertex AI)

Project Organization

├── LICENSE
├── README.md          <- The top-level README
├── build 
├── config             <- config file for GCP resource
├── provision          <- terraform config for GCP resource startup  
├── Dockerfile         <- Docker file for custom model trainer
├── saved_models       <- Trained and serialized model data (for exploratory)
│
├── references         <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports            <- Generated analysis as HTML, PDF, LaTeX, etc.
│   └── figures        <- Generated graphics and figures to be used in reporting
│
├── src                <- Source code for use in this project.
│   ├── __init__.py    <- Makes src a Python module
│   │
│   ├── evaluation     <- Scripts to generate model evaluation component
│   │   └── eval_component.py
│   │
│   │
│   ├── model_training  <- Scripts for custom model training
│   │   
│   │── models  <- Preprocessing scripts  
│   │   
│   └── utils  <- Common util scripts for data ingest and pre-processing
│       └── common.py
│       └── dataset.py
│
└── tox.ini            <- tox file with settings for running tox; see tox.readthedocs.io

Project based on the cookiecutter data science project template. #cookiecutterdatascience

Name		Name	Last commit message	Last commit date
Latest commit History 138 Commits
__MACOSX		__MACOSX
build		build
config		config
docs		docs
models		models
provision		provision
references		references
reports		reports
saved_models/segm_full_200_20220626-143859		saved_models/segm_full_200_20220626-143859
src		src
.gitignore		.gitignore
01-EDA and Experimentation.ipynb		01-EDA and Experimentation.ipynb
02-Data Ingest.ipynb		02-Data Ingest.ipynb
03-Data Statistics Gen.ipynb		03-Data Statistics Gen.ipynb
04-Model Training.ipynb		04-Model Training.ipynb
05-Model Evaluation.ipynb		05-Model Evaluation.ipynb
06-Model Deployment.ipynb		06-Model Deployment.ipynb
07-Pipeline Deployment.ipynb		07-Pipeline Deployment.ipynb
08-Model Monitoring.ipynb		08-Model Monitoring.ipynb
09-Model Serving Demo.ipynb		09-Model Serving Demo.ipynb
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
airbusmle_pipeline.json		airbusmle_pipeline.json
requirements.txt		requirements.txt
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS611 ML Engineering: Airbus Ship Detection with U-Net and Vertex AI Pipeline

Group 14 Members:

Problem Statement

Dataset

Pipeline

1. EDA / Experimentation

2. Data Ingest

3. Data Statistics Generation

4. Model Training

5. Model Evaluation

6. Model Deployment

7. Pipeline Deployment

8. Model Monitoring

9. Model Serving

Overall Pipeline (deployed on Vertex AI)

Project Organization

About

Releases

Packages

Contributors 3

Languages

License

songhan89/mle-airbus-ship-detection

Folders and files

Latest commit

History

Repository files navigation

CS611 ML Engineering: Airbus Ship Detection with U-Net and Vertex AI Pipeline

Group 14 Members:

Problem Statement

Dataset

Pipeline

1. EDA / Experimentation

2. Data Ingest

3. Data Statistics Generation

4. Model Training

5. Model Evaluation

6. Model Deployment

7. Pipeline Deployment

8. Model Monitoring

9. Model Serving

Overall Pipeline (deployed on Vertex AI)

Project Organization

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages