Bert-small2Bert-small Summarization with EncoderDecoder Framework

Dec 12, 2020 | Educational

homemayankDocumentsarticle-generation-using-llmresized_imagesreadme_6_1018

In the world of Natural Language Processing (NLP), summarizing lengthy text into concise and coherent summaries is a significant task. Python libraries such as Transformers have made this even easier through pre-trained models. Today, we’ll learn how to implement the Bert-small2Bert-small summarization model using the EncoderDecoder framework, specially fine-tuned on the CNNDailymail dataset.

Overview of the Model

The Bert-small2Bert-small model is a warm-started BERT2BERT architecture, specifically designed for summarization tasks. It excels in compressing information while retaining essential details, achieving an impressive 17.37 ROUGE-2 score on the CNNDailymail‘s test dataset.

Results on Test Set

Metric	Value
ROUGE-2	17.37

How to Implement the Model

Follow these steps to implement the Bert-small2Bert-small summarization model with ease:

Install Required Libraries: Make sure you have the transformers library installed:
```
pip install transformers
```
Import Necessary Libraries: Bring in the essential components needed for our summarization model.

from transformers import BertTokenizerFast, EncoderDecoderModel
import torch

Set Up the Device: Specify whether to use GPU or CPU for computation.

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

Load the Tokenizer and Model: Fetch the pre-trained tokenizer and model with the following code:

tokenizer = BertTokenizerFast.from_pretrained("mrm8488/bert-small2bert-small-finetuned-cnn_daily_mail-summarization")
model = EncoderDecoderModel.from_pretrained("mrm8488/bert-small2bert-small-finetuned-cnn_daily_mail-summarization").to(device)

Create the Generation Function: This function takes the input text and generates a summary.

def generate_summary(text):
    # cut off at BERT max length 512
    inputs = tokenizer([text], padding="max_length", truncation=True, max_length=512, return_tensors="pt")
    input_ids = inputs.input_ids.to(device)
    attention_mask = inputs.attention_mask.to(device)
    output = model.generate(input_ids, attention_mask=attention_mask)
    return tokenizer.decode(output[0], skip_special_tokens=True)

Summarize Text: Replace ‘your text to be summarized here…’ with the text you want.

text = "your text to be summarized here..."
generate_summary(text)

Understanding with an Analogy

Imagine you have a huge library of books (the text), and you want to prepare a short summary for each book. The Bert-small2Bert-small model acts like a dedicated librarian who knows exactly how to pick the most significant points from each book and write them into concise paragraphs. Just like the librarian uses her experience to ignore unnecessary details, our model intelligently truncates the text while retaining the essential essence, making it easier for readers to grasp key concepts.

Troubleshooting

In case you run into issues while implementing the model, here are some troubleshooting ideas:

Import Errors: Ensure all necessary libraries are correctly installed. Use pip install transformers to install the Transformers library.
CUDA Not Found: If you receive an error concerning CUDA, make sure you have an NVIDIA GPU and the necessary drivers installed, or switch to CPU by ensuring torch.device("cpu") is set.
Tokenization Errors: If you encounter issues with input text length, remember that BERT can only process up to 512 tokens. Modify your inputs accordingly.

For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.

Conclusion

By implementing the Bert-small2Bert-small summarization model, you can efficiently create concise summaries from larger text bodies, making information retrieval much more manageable.

At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.

Stay Informed with the Newest F(x) Insights and Blogs

Tech News and Blog Highlights, Straight to Your Inbox