This repo contains the Jupyter notebooks and Flask app that document the ETL (extract-transform-load) process for the Nieves Observer application.
New catalogues are continuously seeded into the app based on educators' and students' interests and requests.
The app currently uses data from two catalogues:
- Messier
- Caldwell
The ETL process for each catalogue can be found in the `etl_<catalogue>.ipynb` files (e.g., `etl_messier.ipynb` for the Messier catalogue).
All exoplanet data are derived from NASA's exoplanet database, with observatory-specific constraints applied during extraction: e.g., objects must be brighter than magnitude 12, transits must not last longer than 8 hours, etc.
Coming soon.
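As an illustration only, here is a minimal sketch of how such constraints might be applied with pandas. The column names (`sy_vmag`, `pl_trandur`) follow NASA Exoplanet Archive conventions but are assumptions here, as is the input file name:

```python
import pandas as pd

# Hypothetical CSV export from the exoplanet database
df = pd.read_csv("exoplanets.csv")

# Apply the observatory-specific constraints described above
observable = df[
    (df["sy_vmag"] < 12)       # host star brighter than magnitude 12
    & (df["pl_trandur"] <= 8)  # transit lasts no longer than 8 hours
]
print(f"{len(observable)} of {len(df)} targets are observable")
```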
Work on the files inside a Python virtual environment managed by a tool such as pipenv.
Clone the project:

```bash
git clone [email protected]:nieves-observatory/observarium-etl.git
cd observarium-etl
```
Install all requisite libraries:

```bash
pipenv shell
pipenv install -r requirements.txt
```
The ETL process was developed in JupyterLab, which is therefore already included as a dependency in this repository. Launch it with:

```bash
jupyter lab
```
This ETL process is database agnostic: you can load the data into any database of your choice.
Observarium will be using MongoDB, which can easily be seeded with CSV data via the MongoDB Compass GUI.
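If you prefer to seed programmatically rather than through the GUI, a minimal sketch with pymongo follows; the database, collection, and CSV file names are assumptions, not part of this repo:

```python
import csv
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
collection = client["nieves_observer"]["messier"]  # hypothetical names

# Read a hypothetical CSV produced by the ETL notebooks
with open("messier.csv", newline="") as f:
    rows = list(csv.DictReader(f))

if rows:
    collection.insert_many(rows)
print(f"Inserted {len(rows)} documents")
```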
An earlier version of Observarium used a Postgres database; to load into Postgres instead, follow the documentation below.
To load into Postgres, first create a Postgres database (e.g., we give the database the name nieves_observer):

```bash
createdb -U postgres nieves_observer
```
Update the database name in the load Python files. For example, in `load_dso.py`:

```python
# SQLAlchemy connection URI used by the Flask app
app.config['SQLALCHEMY_DATABASE_URI'] = "postgresql://postgres@localhost/nieves_observer"
...
# Direct psycopg2 connection used by the load script
db_conn = psycopg2.connect(host="localhost", port="5432",
                           dbname="nieves_observer", user="postgres")
```
Once you're ready to load, run the load file:

```bash
python load_dso.py
```
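To sanity check the load, you can count rows from a Python shell. The table name `dso` below is an assumption; substitute whatever table `load_dso.py` actually creates:

```python
import psycopg2

conn = psycopg2.connect(host="localhost", port="5432",
                        dbname="nieves_observer", user="postgres")
with conn, conn.cursor() as cur:
    cur.execute("SELECT COUNT(*) FROM dso")  # hypothetical table name
    print(f"Loaded {cur.fetchone()[0]} rows")
conn.close()
```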