MWYang/InternationalNames

International Names

Providing a cleaned dataset of international names based on the EU Science Hub's JRC-Names dataset

Prerequisites

Code tested on macOS 10.14.6. You need:

  • gzip (ships with macOS)
  • Python 3
  • pandas
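
If it helps, the prerequisites listed above can be checked before running anything. The snippet below is only a sketch: the function name `missing_prerequisites` is hypothetical and not part of this repository, and it only looks for the `gzip` executable on the `PATH` and an importable `pandas` package.

```python
import importlib.util
import shutil
import sys


def missing_prerequisites():
    """Return the names of prerequisites that cannot be found (hypothetical helper)."""
    missing = []
    # The scripts are expected to run under Python 3.
    if sys.version_info.major < 3:
        missing.append("Python 3")
    # gzip must be available as a command-line tool on the PATH.
    if shutil.which("gzip") is None:
        missing.append("gzip")
    # pandas must be importable; find_spec checks this without importing it.
    if importlib.util.find_spec("pandas") is None:
        missing.append("pandas")
    return missing


problems = missing_prerequisites()
if problems:
    print("Missing prerequisites:", ", ".join(problems))
else:
    print("All prerequisites found.")
```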

Instructions

First, run the download script. Then run the Python script to clean the data.
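
To give a rough idea of what a cleaning step with pandas can look like, here is a minimal sketch. It is not the repository's actual script. The four column names, the tab-separated layout, the use of `+` as a token separator inside names, and the cleaning rules themselves are all assumptions made for illustration, and the sample rows are made up.

```python
import csv
import gzip
import io

import pandas as pd

# Hypothetical column names; the real file layout may differ.
COLUMNS = ["entity_id", "entity_type", "language", "name"]


def load_names(path_or_buffer):
    """Read a gzip-compressed, tab-separated file into a DataFrame of strings."""
    with gzip.open(path_or_buffer, "rt", encoding="utf-8") as handle:
        return pd.read_csv(
            handle,
            sep="\t",
            names=COLUMNS,
            header=None,
            dtype=str,
            keep_default_na=False,   # keep names such as "NA" as plain text
            quoting=csv.QUOTE_NONE,  # treat quote characters as ordinary text
        )


def clean_names(raw):
    """Return a tidied copy of `raw` (illustrative cleaning rules only)."""
    df = raw.copy()
    # Assumed convention: '+' separates the tokens of a name.
    df["name"] = df["name"].str.replace("+", " ", regex=False).str.strip()
    # Drop rows whose name is empty after trimming.
    df = df[df["name"] != ""]
    # Remove exact duplicate rows and renumber the index.
    return df.drop_duplicates().reset_index(drop=True)


# Made-up sample data standing in for the downloaded file.
sample = (
    "1\tP\tu\tAda+Lovelace\n"
    "1\tP\tu\tAda+Lovelace\n"
    "2\tP\tfr\t Marie+Curie \n"
    "3\tO\tu\t\n"
)
buffer = io.BytesIO(gzip.compress(sample.encode("utf-8")))
cleaned = clean_names(load_names(buffer))
print(cleaned)
```

Reading every column as a string and disabling pandas' default missing-value markers avoids silently turning a name like "NA" into a missing value, which matters for a dataset made up of names.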

Rationale

JRC-Names is a great resource: it provides many different names from many different locales. Unfortunately, it is also a bit dirty. This repository aims to fix that.
