PySummarize

An NLP-based text summarizer. It can also summarize PDF documents and Wikipedia articles.

Uses NLTK for tokenisation and core NLP features in extractive summarisation, Hugging Face Transformers for abstractive summarisation, and Streamlit for the front end.
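To give a feel for what extractive summarisation does, here is a minimal sketch that scores sentences by word frequency and keeps the top ones. It is not the code in app.py: the regex tokeniser and the tiny stopword set are stand-ins for NLTK's tokenisers and stopword corpus, and the names split_sentences and summarize are hypothetical.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption); NLTK's corpus is far larger.
STOPWORDS = {"a", "an", "the", "and", "or", "of", "to", "in", "is", "are",
             "it", "on", "for", "with", "as", "by", "that", "this", "was"}


def split_sentences(text):
    """Naive splitter: break after ., ! or ? when followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def summarize(text, max_sentences=2):
    """Return the highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    # Count how often each non-stopword appears in the whole text.
    words = [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]
    freq = Counter(words)

    def score(sentence):
        # Sum the frequencies of the sentence's non-stopwords.
        tokens = re.findall(r"[a-z']+", sentence.lower())
        return sum(freq[w] for w in tokens if w in freq)

    # Rank sentence indices by score, then restore document order.
    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in chosen)
```

Calling summarize(text, max_sentences=1) returns the single sentence whose words are most frequent across the text. Swapping the regex helpers for NLTK's sentence and word tokenisers should give better results on real prose.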

PDF Summariser

Uses Streamlit's file upload feature and pdfplumber to parse the text in the PDF. Academic papers can cause problems, with some of the extracted text coming out garbled. Works well on non-technical text.
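The upload-and-parse step can be sketched roughly as follows. This is an illustration rather than the code in app.py: the helper names join_pages and extract_pdf_text are hypothetical, and the Streamlit wiring in the trailing comment is an assumption about how the upload might be connected.

```python
def join_pages(page_texts):
    """Join per-page text, skipping pages where extraction returned nothing."""
    cleaned = [t.strip() for t in page_texts if t and t.strip()]
    return "\n\n".join(cleaned)


def extract_pdf_text(pdf_file):
    """Extract text from a path or file-like object (e.g. a Streamlit upload)."""
    # Imported here so join_pages can be used without pdfplumber installed.
    import pdfplumber

    with pdfplumber.open(pdf_file) as pdf:
        # extract_text() may return None or "" for pages with no text layer.
        return join_pages(page.extract_text() for page in pdf.pages)


# Hypothetical wiring inside a Streamlit app:
#   uploaded = st.file_uploader("Upload a PDF", type="pdf")
#   if uploaded is not None:
#       text = extract_pdf_text(uploaded)
```

Skipping empty pages keeps scanned or image-only pages from breaking the pipeline. It does not fix the garbling seen with academic papers, which likely comes from multi-column layouts and equations.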

Wikipedia Summariser

Uses BeautifulSoup to extract the article text from the page HTML before passing it to the text summarisation engine.
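A rough sketch of the HTML-to-text step is shown below. It assumes the article body lives in <p> elements and that bracketed reference markers such as [1] should be dropped. Both are assumptions, the function names are hypothetical, and fetching the page itself is left out.

```python
import re


def strip_citations(text):
    """Remove bracketed reference markers such as [1] or [23]."""
    return re.sub(r"\[\d+\]", "", text)


def article_text_from_html(html):
    """Collect the text of all <p> elements and drop citation markers."""
    # Imported here so strip_citations can be used without BeautifulSoup installed.
    from bs4 import BeautifulSoup

    soup = BeautifulSoup(html, "html.parser")
    # get_text() flattens nested tags (links, superscripts) into plain text.
    paragraphs = [p.get_text().strip() for p in soup.find_all("p")]
    return strip_citations(" ".join(p for p in paragraphs if p))
```

The returned string can then be handed to whichever summariser is in use, in the same way as text from the PDF or textbox inputs.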

Textbox Summariser

A basic textbox for copying and pasting text to summarise.

Installation Instructions

Install the requirements - pip install -r requirements.txt
Run the Streamlit app - streamlit run app.py

In the demo, you can test out extractive summarisation.

Live demo here: https://suyesha07-pysummarize-app-kwx0pp.streamlitapp.com/

