GitHub - putrareddy/Abstractive-Summarizer: This file gives an Abstractive Summarized Text with a good accuracy

Techgium Hackathon - Abstractive Text Summarization

This project was developed as part of the Techgium hackathon hosted by L&T Company. It was selected among the top 100 elite teams from over 3,500 competing teams.

Overview

The focus of this project is abstractive text summarization in the field of computer science. The goal was to develop a highly accurate model that can understand text and generate summaries using its own words, without relying on sentence extraction.

Key Features

Leveraged transfer learning techniques, utilizing pre-trained models like BART, to develop a robust system for generating comprehensive summaries.
Employed BERT to construct an advanced Question and Answer module, ensuring the accuracy of the summaries by comparing answers derived from both the original text and the summary.

Implementation Details

Built upon the PyTorch deep learning framework, commonly used in computer science.
Utilized the versatile CNN-DM Dataset, known for its suitability for natural language processing tasks.
Seamlessly integrated the transformative Transformers library into the project for efficient processing.
Created Abstractive Summaries using BART and then extracted all the entities present in Article and Generated Summary to verify whether all the entities present in Generated Summary are present in entities of Article.
After this, QA model is implemented on Generated Summaries and Questions are generated.
Based on this questions, answers are generated as entities(one word answers) from both Article and Generated Summary.
If there is a mismatch, then those entities in Generated Summary are replaced by answer given by Article, Thus improving the accuracy of the model.

Results

Through dedicated effort and strategic utilization of these resources, our team successfully tackled the challenges of abstractive text summarization. The resulting summaries accurately capture the essence of the source material, while the Q&A module demonstrates their unwavering precision. The Accuracy we got is 93.14%

Usage

To use this project, follow the instructions below:

Clone the repository.
Install the required dependencies listed in the script.
Prepare your dataset or use the provided CNN-DM Dataset.
Run the main script to train the model and generate summaries.
Evaluate the generated summaries using the Question and Answer module.
Analyze the results and make improvements based on your requirements.

For more information, please refer to the documentation and code in the repository.

Note: This project was developed as a part of the Techgium hackathon and may require additional optimization and customization for specific use cases.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Abstractive_Summarizer!!.ipynb		Abstractive_Summarizer!!.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Techgium Hackathon - Abstractive Text Summarization

Overview

Key Features

Implementation Details

Results

Usage

About

Releases

Packages

Languages

putrareddy/Abstractive-Summarizer

Folders and files

Latest commit

History

Repository files navigation

Techgium Hackathon - Abstractive Text Summarization

Overview

Key Features

Implementation Details

Results

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages