PDF tables extractor.

  • This script converts the tables in a PDF file into CSV files.
  • A new CSV file is generated for each table in the PDF.
  • Technology used: tabula-py
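
The one-CSV-per-table behaviour described above can be sketched as follows. This is a minimal illustration, not the contents of pdf_to_csv.py: the function names, the `stem` parameter, and the output file naming are assumptions made for the example. It relies on `tabula.read_pdf`, which returns a list of pandas DataFrames when `multiple_tables=True`.

```python
from pathlib import Path


def extract_tables(pdf_path, pages="all"):
    """Return a list of pandas DataFrames, one per table tabula-py detects."""
    # Imported lazily so the CSV-writing helper below works without tabula-py.
    import tabula

    return tabula.read_pdf(str(pdf_path), pages=pages, multiple_tables=True)


def write_tables(tables, out_dir, stem="table"):
    """Write each table to its own CSV file and return the paths written.

    The naming scheme (<stem>_<n>.csv) is hypothetical, chosen for this sketch.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for number, table in enumerate(tables, start=1):
        target = out_dir / f"{stem}_{number}.csv"
        table.to_csv(target, index=False)  # one CSV file per table
        written.append(target)
    return written
```

With a hypothetical input file, the two helpers would be combined as `write_tables(extract_tables("report.pdf"), "csv_out")`.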

Setting up:

  • Install the requirements (tabula-py wraps tabula-java, so a Java runtime must also be installed)
  $ pip install tabula-py

Running the script:

  # Specify the number of pages to scan
  $ python3 pdf_to_csv.py <no. of pages>
  # To scan all pages
  $ python3 pdf_to_csv.py all
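
One way the command-line value could be turned into the `pages` argument of `tabula.read_pdf` is sketched below. The helper name `pages_argument` is hypothetical, and the sketch assumes that a number N means "the first N pages"; the actual script may interpret the value differently. tabula-py accepts `"all"`, an int, a range string, or an iterable of ints for `pages`.

```python
def pages_argument(raw):
    """Translate the command-line value into a tabula-py `pages` value."""
    if raw.strip().lower() == "all":
        return "all"
    count = int(raw)  # raises ValueError for non-numeric input
    if count < 1:
        raise ValueError("the number of pages must be at least 1")
    # Assumption: N means "scan pages 1 through N".
    return list(range(1, count + 1))
```

The result can be passed directly as `tabula.read_pdf(path, pages=pages_argument(value))`, where `value` is the string taken from the command line.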