BART-Squad2: Your Guide to Extractive Question Answering

Welcome to the world of BART-Squad2, a powerful model for extractive (span-based) question answering. This guide takes you from setup to implementation in a user-friendly manner, making the journey into AI more enjoyable.

Model Description

BART-Squad2 is a BART model trained for extractive question answering on the SQuAD 2.0 dataset. With an F1 score of 87.4, it demonstrates strong performance at locating answers within a passage of text. However, it comes with its share of limitations, such as not working with Hugging Face’s auto-inference API.

How to Use BART-Squad2 Locally

To get started with question answering locally, follow these steps:

```python
import torch
from transformers import AutoTokenizer, AutoModelForQuestionAnswering

tokenizer = AutoTokenizer.from_pretrained("Primer/bart-squad2")
model = AutoModelForQuestionAnswering.from_pretrained("Primer/bart-squad2")
model.to("cuda")  # If you have a suitable GPU
model.eval()

def answer(question, text):
    # Join the question and context with BART's special tokens
    seq = "<s>" + question + " </s> </s> " + text + " </s>"
    tokens = tokenizer.encode_plus(
        seq,
        return_tensors="pt",
        padding="max_length",
        max_length=1024,
        add_special_tokens=False,  # special tokens are already in seq
    )
    input_ids = tokens["input_ids"].to("cuda")
    attention_mask = tokens["attention_mask"].to("cuda")

    # Predict the most likely start and end positions of the answer span
    with torch.no_grad():
        outputs = model(input_ids, attention_mask=attention_mask)
    start_idx = int(outputs.start_logits.argmax())
    end_idx = int(outputs.end_logits.argmax())

    print(tokenizer.decode(input_ids[0, start_idx:end_idx]).strip())

# Example usage
question = "Where does Tom live?"
context = "Tom is an engineer in San Francisco."
answer(question, context)
```

If you’re running this on a CPU instead of a GPU, simply remove the model.to("cuda") line and the two .to("cuda") calls on the tensors.
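Alternatively, you can make the snippet device-agnostic with PyTorch’s standard availability check. This is a minimal sketch, not part of the original model card:

```python
import torch

# Pick the GPU when available, otherwise fall back to the CPU
device = "cuda" if torch.cuda.is_available() else "cpu"

model.to(device)
input_ids = tokens["input_ids"].to(device)
attention_mask = tokens["attention_mask"].to(device)
```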

Understanding the Code

Imagine you are a librarian helping someone find a specific book in a vast library. The BART model serves as this librarian, and the following steps illustrate the process:

  • Gathering the Question and Context: You combine a question (like “Where does Tom live?”) with a context about that query (like “Tom is an engineer in San Francisco.”) to form a single request.
  • Encoding the Input: You transform this combined request into token ids, a coded language that machines understand, much like assigning a book a catalog code so it can be sorted and found quickly (see the short sketch after this list).
  • Finding the Answer: The model looks through this coded input just like the librarian scans the shelves, finding the most relevant information (start and end positions of the answer).
  • Returning the Result: Finally, it decodes the answer from this coded language back into a language humans can read and understand.
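To make the encoding and decoding steps concrete, here is a minimal round-trip sketch using the same tokenizer; the example sentence is illustrative only:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Primer/bart-squad2")

# Encoding: text becomes a sequence of integer token ids
ids = tokenizer.encode("Tom is an engineer in San Francisco.")
print(ids)  # a list of ids, starting with BOS (0) and ending with EOS (2)

# Decoding: token ids become human-readable text again
print(tokenizer.decode(ids, skip_special_tokens=True))
# Tom is an engineer in San Francisco.
```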

Troubleshooting

Should you encounter issues while running the BART-Squad2 model, consider the following troubleshooting tips:

  • Ensure you have the necessary dependencies installed, such as the transformers library.
  • Check that your Python environment is compatible with the code you are running.
  • If using GPU, make sure CUDA is correctly set up and the code refers to the right device.
  • In case you receive an unanswerable result, the response may be an empty string, indicating that the model found no suitable answer within the provided context (a simple guard for this case is sketched below).
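For example, the final print in answer could be replaced with a guard like the following; this is a hypothetical variant, not part of the original model card:

```python
# Inside answer(): an empty decoded span means a SQuAD 2.0-style "no answer"
result = tokenizer.decode(input_ids[0, start_idx:end_idx]).strip()
if result:
    print(result)
else:
    print("The model found no answer in the provided context.")
```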

For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.

Limitations and Bias

While BART-Squad2 is an impressive model, it’s vital to acknowledge some limitations:

  • The model weighs in at a hefty 1.6 GB, which may be challenging for systems with limited memory or disk space.
  • No evaluation has been performed beyond SQuAD 2.0, so the model’s biases and failure modes on other domains remain unknown.

Training Procedure

For those interested in the technical side of things, here’s a brief overview of the training parameters used:

  • Batch Size: 8
  • Max Sequence Length: 1024
  • Learning Rate: 1e-5
  • Epochs: 2

The training process involved modifying the run_squad.py script to freeze the shared parameters and encoder embeddings, which reduces the number of trainable parameters during fine-tuning.
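The modified script isn’t reproduced here, but freezing those parameter groups in PyTorch looks roughly like the sketch below. The attribute names follow the transformers BART implementation (model.model.shared, model.model.encoder.embed_tokens, model.model.encoder.embed_positions); treating exactly these groups as frozen is an assumption based on the description above:

```python
from transformers import AutoModelForQuestionAnswering

model = AutoModelForQuestionAnswering.from_pretrained("Primer/bart-squad2")

# Freeze the token embedding table shared between encoder and decoder
for param in model.model.shared.parameters():
    param.requires_grad = False

# Freeze the encoder's token and position embeddings
for param in model.model.encoder.embed_tokens.parameters():
    param.requires_grad = False
for param in model.model.encoder.embed_positions.parameters():
    param.requires_grad = False

# Only parameters with requires_grad=True are updated during fine-tuning
trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"Trainable parameters: {trainable:,}")
```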

At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.
