Text Summarization Project

🌟 Overview

This project focuses on text summarization. It uses Natural Language Processing (NLP) and Hugging Face Transformers to condense lengthy documents into concise, meaningful summaries, and it includes pipelines for data ingestion, preprocessing, model fine-tuning, and performance evaluation.

A FastAPI-based API exposes the summarization service so that other applications can call it directly.

🔍 Features

Extractive Summarization

Utilizes algorithms to identify and extract key sentences from the original text.
Highlights the most important information while maintaining context.
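
The README does not say which extraction algorithm the project uses, so the following is only a minimal sketch of one common approach: score each sentence by the average frequency of its words and keep the highest-scoring sentences in their original order. The function names, the tiny stop-word list, and the regex-based sentence splitter are all assumptions made for illustration; NLTK or spaCy would provide sturdier tokenization.

```python
import re
from collections import Counter

# Small illustrative stop-word list (an assumption; NLTK and spaCy ship fuller ones).
STOP_WORDS = {
    "a", "an", "the", "and", "or", "of", "to", "in", "on", "is",
    "are", "was", "it", "for", "with", "as", "by", "that", "this",
}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def content_words(text):
    """Lowercase word tokens with stop words removed."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOP_WORDS]


def extractive_summary(text, num_sentences=2):
    """Return the top-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    freq = Counter(content_words(text))

    def score(sentence):
        tokens = content_words(sentence)
        # Average frequency, so long sentences are not favoured automatically.
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:num_sentences])
    return " ".join(sentences[i] for i in chosen)
```

Calling `extractive_summary(document, num_sentences=3)` returns three sentences copied verbatim from `document`, which is the defining property of extractive summarization.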

Abstractive Summarization

Generates new sentences that capture the essence of the original text using transformer-based models.
Incorporates Hugging Face Transformers (e.g., T5, BART) for coherent and human-like summaries.
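
As a rough sketch of how abstractive summarization with a pre-trained BART model might look, the example below loads a checkpoint through the Transformers `AutoTokenizer` and `AutoModelForSeq2SeqLM` classes and generates a summary with beam search. The checkpoint name `facebook/bart-large-cnn`, the generation settings, the word-based chunking, and every function name are assumptions; the project's own fine-tuned model and parameters may differ.

```python
def chunk_words(text, max_words=400):
    """Split text into chunks of at most max_words whitespace-separated words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def load_bart_summarizer(model_name="facebook/bart-large-cnn"):
    """Build a summarize(text) -> str function backed by a seq2seq checkpoint.

    The import is done lazily so the helpers in this module can be used
    without transformers and torch installed. The checkpoint name is an
    assumption, not necessarily the model this project fine-tunes.
    """
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    def summarize(text):
        # Truncate to the encoder's input limit (1024 tokens for BART).
        inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=1024)
        output_ids = model.generate(**inputs, num_beams=4, max_new_tokens=128)
        return tokenizer.decode(output_ids[0], skip_special_tokens=True)

    return summarize


def summarize_document(text, summarize, max_words=400):
    """Summarize each chunk with the given callable and join the partial summaries."""
    return " ".join(summarize(chunk) for chunk in chunk_words(text, max_words))
```

A typical call would be `summarize_document(long_text, load_bart_summarizer())`. Passing the summarizer in as a callable keeps the chunking logic easy to test with a stub and makes it simple to swap BART for T5.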

API Integration

FastAPI-based API for real-time summarization and easy integration into other applications.
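
The README does not document the API's routes or payloads, so the sketch below only illustrates what a FastAPI summarization endpoint could look like. The `/summarize` path, the `SummarizeRequest` model, the response fields, and the helper names are all hypothetical. The summarizer is passed in as a plain callable so the request-handling logic stays independent of any particular model.

```python
def build_response(text, summarize):
    """Validate the input and package the summary; independent of the web framework."""
    cleaned = text.strip()
    if not cleaned:
        raise ValueError("text must not be empty")
    summary = summarize(cleaned)
    return {
        "summary": summary,
        "input_words": len(cleaned.split()),
        "summary_words": len(summary.split()),
    }


def create_app(summarize):
    """Create a FastAPI app exposing POST /summarize (a hypothetical route)."""
    # Imported lazily so build_response can be used without FastAPI installed.
    from fastapi import FastAPI, HTTPException
    from pydantic import BaseModel

    class SummarizeRequest(BaseModel):
        text: str

    app = FastAPI(title="Text Summarization API")

    @app.post("/summarize")
    def summarize_endpoint(request: SummarizeRequest):
        try:
            return build_response(request.text, summarize)
        except ValueError as exc:
            # Report empty input as a client error rather than a server failure.
            raise HTTPException(status_code=422, detail=str(exc))

    return app
```

An application object would then be created with something like `app = create_app(my_summarizer)`, where `my_summarizer` is a placeholder for any function that maps a string to its summary, and served with an ASGI server such as uvicorn.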

🛠️ Technology Stack

Programming Language: Python
Libraries: Hugging Face Transformers, NLTK, spaCy
Web Framework: FastAPI, Streamlit (for an optional user interface)
Models: Pre-trained transformer models like T5 and BART for abstractive summarization.

🧪 Dataset

This project can work with any text data, including:

Articles
Research papers
News

For testing purposes, sample datasets have been provided in the data/ directory.

📈 Results

Extractive Summarization: Delivers summaries with an average ROUGE score of 0.75.
Abstractive Summarization: Achieves natural and concise summaries with fine-tuned transformer models.
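
The README does not state which ROUGE variant or dataset the reported score refers to. For readers unfamiliar with the metric, here is a minimal sketch of ROUGE-1, which measures unigram overlap between a candidate summary and a reference summary. The function name and the simple regex tokenizer are assumptions; published results are normally computed with an established implementation such as the `rouge-score` package, which also handles stemming and other variants.

```python
import re
from collections import Counter


def rouge_1(candidate, reference):
    """Return (precision, recall, f1) based on clipped unigram overlap."""
    cand = Counter(re.findall(r"\w+", candidate.lower()))
    ref = Counter(re.findall(r"\w+", reference.lower()))
    # Counter intersection keeps the minimum count per word (clipping).
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return precision, recall, f1
```

Precision rewards summaries that contain little irrelevant wording, recall rewards coverage of the reference, and the F1 value balances the two.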

✨ Future Enhancements

Add multilingual text summarization.
Incorporate real-time web scraping for dynamic content summarization.
Improve model performance by fine-tuning on domain-specific data.
Expand API functionality for batch summarization.

Transforming Text into Insights with Hugging Face and FastAPI!


jothsnapraveena/Text-Summarization
