Commit

Removed output datasets and updated license file
scarnecchia committed Mar 27, 2022
1 parent 67447ef commit fd08038
Showing 14 changed files with 1 addition and 7,604 deletions.
File renamed without changes.
4 changes: 1 addition & 3 deletions README.md
@@ -2,6 +2,4 @@
 ## About
 This is a simple R script designed to scrape data from Oryx' excellent post detailing materiel lost by all sides in the [Russian invasion of Ukraine](https://www.oryxspioenkop.com/2022/02/attack-on-europe-documenting-equipment.html) and to output it into csv format.

-## To-Do
-- [x] Explore XPATH's ability to extract data between HTML tags: The Country column is currently blank, making attribution impossible. I'm not super familiar with xpath, so I've yet to work out how it to extract elements under each country header and conditionally assign the country name.
-- [ ] Data Quality Assurance using the Totals provided at the start of each list item.
+Up-to-date output datasets can be found at in the [oryx_data](https://github.com/scarnecchia/oryx_data) repo. This repo is updated automatically daily at 22:30 EDT/EST.
226 changes: 0 additions & 226 deletions outputfiles/daily_count.csv

This file was deleted.
