How to Use SqueezeBERT with SQuAD for Question Answering

In this guide, we will explore how to leverage the power of SqueezeBERT, fine-tuned on the Stanford Question Answering Dataset (SQuAD v1.1), for your question-answering tasks. Whether you’re a researcher, a developer, or simply curious, this article will walk you through the process step-by-step.

What is SqueezeBERT?

SqueezeBERT is an efficient model for the English language, pretrained with masked language modeling (MLM) and Sentence Order Prediction (SOP) objectives. Think of SqueezeBERT as a speedy little assistant in the world of language models. While its architecture is similar to BERT-base, it gets things done much faster by using grouped convolutions in place of the pointwise fully-connected layers. In fact, it's 4.3 times quicker than BERT-base when running on a Google Pixel 3 smartphone!
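To see why grouped convolutions save so much work, here is a minimal PyTorch sketch. A position-wise fully-connected layer is equivalent to a 1D convolution with kernel size 1; splitting its channels into groups divides the weight count by the number of groups. The group count of 4 below is purely illustrative, not SqueezeBERT's exact configuration:

```python
import torch
import torch.nn as nn

hidden = 768   # BERT-base hidden size
groups = 4     # illustrative group count, not SqueezeBERT's exact setting

# A position-wise fully-connected layer, as in BERT...
fc = nn.Conv1d(hidden, hidden, kernel_size=1, groups=1)
# ...versus a grouped convolution of the kind SqueezeBERT uses instead.
grouped = nn.Conv1d(hidden, hidden, kernel_size=1, groups=groups)

def n_params(m):
    return sum(p.numel() for p in m.parameters())

print(n_params(fc))       # 590592
print(n_params(grouped))  # 148224 -- roughly 1/groups of the weights

x = torch.randn(1, hidden, 128)  # (batch, channels, sequence length)
assert fc(x).shape == grouped(x).shape  # same output shape, far fewer FLOPs
```

The same trick applied throughout the encoder is what buys the speedup on mobile hardware.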

For more detailed information, check out the original research paper on SqueezeBERT.

Understanding SQuAD

The Stanford Question Answering Dataset is a comprehensive reading comprehension dataset consisting of over 100,000 question-answer pairs drawn from more than 500 Wikipedia articles. In SQuAD v1.1, the answer to every question is a span of text within the corresponding passage (unanswerable questions only appear later, in SQuAD v2.0). This dataset is the perfect starting point for training your question-answering models.
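To make the span-based format concrete, here is a tiny hand-made record laid out the way SQuAD v1.1's JSON is structured (the article, question, and IDs below are invented samples, not real dataset entries). Note how answer_start is a character offset into the context:

```python
# A minimal record in SQuAD v1.1's JSON layout (hand-made sample, not real data)
squad = {
    "version": "1.1",
    "data": [{
        "title": "Sample_Article",
        "paragraphs": [{
            "context": "SQuAD was released by Stanford in 2016.",
            "qas": [{
                "id": "0001",
                "question": "Who released SQuAD?",
                "answers": [{"text": "Stanford", "answer_start": 22}],
            }],
        }],
    }],
}

paragraph = squad["data"][0]["paragraphs"][0]
qa = paragraph["qas"][0]
answer = qa["answers"][0]

# answer_start indexes into the context string, marking where the span begins
assert paragraph["context"][answer["answer_start"]:].startswith(answer["text"])
print(qa["question"], "->", answer["text"])
```

The real train-v1.1.json file has exactly this shape, just with hundreds of articles.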

Training SqueezeBERT for QA

To train the model, you will need the following setup:

  • A Tesla P100 GPU.
  • 25GB of RAM.

Your command will look something like this:

```bash
python content/transformers/examples/question-answering/run_squad.py \
   --model_type bert \
   --model_name_or_path squeezebert/squeezebert-uncased \
   --do_eval \
   --do_train \
   --do_lower_case \
   --train_file content/dataset/train-v1.1.json \
   --predict_file content/dataset/dev-v1.1.json \
   --per_gpu_train_batch_size 16 \
   --learning_rate 3e-5 \
   --num_train_epochs 15 \
   --max_seq_length 384 \
   --doc_stride 128 \
   --output_dir content/output_dir \
   --overwrite_output_dir \
   --save_steps 2000
```

By following the command above, you will train your model using SQuAD v1.1 data.
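Before kicking off a multi-hour run, it can save time to confirm that the files named on the command line actually exist. A small sketch (the paths are copied from the command above; adjust them if your layout differs):

```python
from pathlib import Path

def check_paths(paths):
    """Report which of the files passed on the command line actually exist."""
    return {p: Path(p).exists() for p in paths}

# Paths taken from the run_squad.py command above
report = check_paths([
    "content/dataset/train-v1.1.json",
    "content/dataset/dev-v1.1.json",
])
for path, ok in report.items():
    print(("ok     " if ok else "MISSING"), path)
```

A "MISSING" line here means run_squad.py would fail at startup with a file-not-found error.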

Results from Test Set

After training, you can evaluate the model’s performance. The reported metrics are:

  • Exact Match (EM): 76.66
  • F1 Score: 85.83

This model has a size of just 195 MB, making it relatively lightweight while still being powerful.
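If you want to understand what those two numbers measure, here is a minimal sketch of SQuAD-style scoring: Exact Match checks whether the normalized prediction equals the normalized gold answer, and F1 measures token overlap between them. This follows the normalization conventions of the official SQuAD evaluation (lowercasing, stripping punctuation and articles), simplified for illustration:

```python
import re
import string
from collections import Counter

def normalize(text: str) -> str:
    """SQuAD-style normalization: lowercase, drop punctuation and articles."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(pred: str, gold: str) -> bool:
    return normalize(pred) == normalize(gold)

def f1(pred: str, gold: str) -> float:
    p, g = normalize(pred).split(), normalize(gold).split()
    common = Counter(p) & Counter(g)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision, recall = overlap / len(p), overlap / len(g)
    return 2 * precision * recall / (precision + recall)

print(exact_match("the scientists", "Scientists"))        # True (article removed)
print(round(f1("scientists in China", "scientists"), 2))  # 0.5
```

The 76.66 EM and 85.83 F1 above are these two scores averaged over the whole evaluation set.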

Using SqueezeBERT for Question Answering

Now comes the exciting part – using the model! You can simply use the pipeline function from the Transformers library:

```python
from transformers import pipeline

QnA_pipeline = pipeline("question-answering", model="mrm8488/squeezebert-finetuned-squad-v1")

result = QnA_pipeline({
    'context': "A new strain of flu that has the potential to become a pandemic has been identified in China by scientists.",
    'question': "Who identified it?"
})

print(result)
```

The output is a dictionary containing the answer text, its start and end character positions in the context, and a confidence score. You’ll see something like:

```
Answer: scientists
Score: 0.6988425850868225
```
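The start and end fields are character offsets into the context string, so you can recover the answer by slicing. A small sketch using the context from the example above (the score is copied from the run above; the offsets are the ones that locate "scientists" in this particular context):

```python
# 'start' and 'end' are character offsets into the context string
result = {"score": 0.6988425850868225, "start": 96, "end": 106,
          "answer": "scientists"}
context = ("A new strain of flu that has the potential to become a pandemic "
           "has been identified in China by scientists.")

# Slicing the context with those offsets reproduces the answer text
assert context[result["start"]:result["end"]] == result["answer"]
print(context[result["start"]:result["end"]])  # scientists
```

This is handy when you want to highlight the answer span inside the original passage rather than just display the extracted string.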

Troubleshooting

If you encounter any issues while using SqueezeBERT or during training, consider the following troubleshooting options:

  • Ensure that your GPU setup is correctly configured as it might not be activated.
  • Double-check the dataset paths; a common error arises from incorrect file locations.
  • Evaluate any dependencies you might be missing by running pip install -r requirements.txt.
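For the first bullet, a quick way to check whether PyTorch can actually see your GPU is the small diagnostic below (the function name is just illustrative):

```python
import torch

def describe_device() -> str:
    """Report what run_squad.py will train on; a quick first diagnostic."""
    if torch.cuda.is_available():
        props = torch.cuda.get_device_properties(0)
        return f"GPU: {props.name} ({props.total_memory / 1e9:.1f} GB)"
    return "No GPU visible to PyTorch -- check your drivers and CUDA install"

print(describe_device())
```

If this reports no GPU even though one is installed, the usual culprits are a missing CUDA toolkit or a CPU-only build of PyTorch.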

For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.

Wrap Up

At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.
