DupPub (Duplicate Publication)

Detect duplicate or similar publications in a database. This project aims to reduce the size of the database by listing pairs of suspected duplicates, which makes citing easier and cleaner.

Export the database as a CSV file without a header row, with these fields in this order:

  1. ID
  2. Authors
  3. Title of the article
  4. Year
  5. Abstract
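
Because the file has no header row, each field is identified only by its position. The sketch below shows one way to read such a file with Python's standard `csv` module and to catch rows with the wrong number of columns. The names `FIELDS` and `read_publications` and the sample row are illustrative assumptions, not part of `report.py`.

```python
import csv
import io

# Column order taken from the field list above; the key names are illustrative.
FIELDS = ["id", "authors", "title", "year", "abstract"]

def read_publications(handle):
    """Yield one dict per CSV row, mapping the five expected fields by position."""
    reader = csv.reader(handle)
    for row_no, row in enumerate(reader, start=1):
        if not row:
            continue  # skip blank lines
        if len(row) != len(FIELDS):
            raise ValueError(
                f"row {row_no}: expected {len(FIELDS)} fields, got {len(row)}"
            )
        yield dict(zip(FIELDS, row))

# Made-up sample row; fields containing commas must be quoted.
sample = io.StringIO(
    '1,"Doe, J.; Roe, R.",An Example Title,2015,"Abstract text, with a comma."\n'
)
records = list(read_publications(sample))
print(records[0]["title"])
```

Quoting matters here: author lists and abstracts usually contain commas, so the export should wrap those fields in double quotes.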

Run with

python3 report.py publications.csv
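
This README does not describe how `report.py` decides that two records are suspect. As a rough illustration only, the sketch below compares normalized titles pairwise with `difflib.SequenceMatcher` from the standard library. The function names, the 0.9 threshold, and the sample records are all assumptions and may differ from what the script actually does.

```python
from difflib import SequenceMatcher
from itertools import combinations

def normalize(title):
    """Lowercase and collapse whitespace so trivial differences do not matter."""
    return " ".join(title.lower().split())

def suspect_pairs(records, threshold=0.9):
    """Return (id_a, id_b, ratio) for each pair whose titles reach the threshold."""
    pairs = []
    for a, b in combinations(records, 2):
        ratio = SequenceMatcher(
            None, normalize(a["title"]), normalize(b["title"])
        ).ratio()
        if ratio >= threshold:
            pairs.append((a["id"], b["id"], ratio))
    return pairs

# Made-up records for illustration.
records = [
    {"id": "1", "title": "Deep learning for image segmentation"},
    {"id": "2", "title": "Deep Learning for  Image Segmentation"},
    {"id": "3", "title": "A survey of graph databases"},
]
pairs = suspect_pairs(records)
for id_a, id_b, ratio in pairs:
    print(id_a, id_b, round(ratio, 2))
```

Comparing every pair is quadratic in the number of records, so a large database would likely need a pre-filter, such as grouping by year, before the pairwise step.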