How to Use the GTE Multilingual Reranker Base Model

Welcome to the world of advanced text retrieval with the gte-multilingual-reranker-base model, a powerful tool for multilingual tasks. In this article, we’ll guide you through its features, usage, and how to troubleshoot common issues efficiently.

Introduction to GTE Multilingual Reranker

The gte-multilingual-reranker-base is a state-of-the-art model designed to excel in multilingual retrieval tasks. Imagine it as a highly skilled librarian who fluently speaks over 70 languages, efficiently sorting through vast volumes of text to provide you with the most relevant information in record time.

Key Features

  • High Performance: Achieves state-of-the-art results in various retrieval tasks.
  • Efficient Architecture: Trained with an encoder-only transformer architecture, resulting in a smaller model size and lower hardware requirements for faster inference.
  • Long Context Support: Handles text lengths of up to 8192 tokens.
  • Multilingual Capability: Supports over 70 languages, allowing global accessibility.

Model Information

  • Model Size: 306M parameters
  • Max Input Tokens: 8192
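
If you want to verify these figures yourself, here is a minimal sketch (an illustrative check; it assumes the published config exposes max_position_embeddings, as the GTE custom architecture does):

from transformers import AutoConfig, AutoModelForSequenceClassification

model_name_or_path = "Alibaba-NLP/gte-multilingual-reranker-base"

# trust_remote_code is required because GTE ships a custom architecture.
config = AutoConfig.from_pretrained(model_name_or_path, trust_remote_code=True)
print(config.max_position_embeddings)  # expected: 8192

model = AutoModelForSequenceClassification.from_pretrained(model_name_or_path, trust_remote_code=True)
n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params / 1e6:.0f}M parameters")  # roughly 306M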

How to Use the Model

Using the gte-multilingual-reranker-base model is straightforward with the Hugging Face transformers library. Make sure you have transformers version >= 4.36.0 installed. Here's a step-by-step guide:

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_name_or_path = "Alibaba-NLP/gte-multilingual-reranker-base"
tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)
# trust_remote_code is required because GTE uses a custom model implementation.
model = AutoModelForSequenceClassification.from_pretrained(model_name_or_path, trust_remote_code=True)
model.eval()

# Each pair is [query, candidate document]; query and document can be in different languages.
pairs = [
    ["中国的首都在哪儿", "北京"],  # "Where is the capital of China?" / "Beijing"
    ["what is the capital of China?", "北京"],
    ["how to implement quick sort in python?", "Introduction of quick sort"],
]

with torch.no_grad():
    inputs = tokenizer(pairs, padding=True, truncation=True, return_tensors='pt', max_length=512)
    scores = model(**inputs, return_dict=True).logits.view(-1).float()
    print(scores)  # Output will be like tensor([1.2315, 0.5923, 0.3041])
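
These scores are raw logits and are not bounded to any fixed range. If you prefer relevance values in (0, 1), one common convention (our assumption, not something the model card prescribes) is to pass them through a sigmoid:

probs = torch.sigmoid(scores)
print(probs)  # each value now lies between 0 and 1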

Explaining the Code

Let’s break down the code above with an analogy: Think of the entire process as a chef preparing three meal options based on ingredients (inputs):

  • The import torch and from transformers lines are like the chef gathering all necessary tools and ingredients from the pantry.
  • Defining model_name_or_path and creating the tokenizer and model are akin to selecting a recipe and prepping different components for cooking.
  • The pairs list represents the menu items the chef needs to serve—questions and answers from different cuisines (languages).
  • Using torch.no_grad() ensures the chef is focused on cooking without worrying about the calories being counted (no gradients needed for inference).
  • Finally, the model scores each meal option based on taste, outputting a tensor that reflects how well each dish is received by diners (scores will vary with input). Turning those scores into a ranking is sketched just below.
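
To turn scores into an actual ranking, sort the candidates for a single query by their scores. The sketch below reuses the model and tokenizer loaded earlier; the candidate documents are purely illustrative:

query = "how to implement quick sort in python?"
candidates = [
    "Introduction of quick sort",
    "The capital of China is Beijing.",
    "Quick sort partitions a list around a pivot element.",
]
pairs = [[query, doc] for doc in candidates]

with torch.no_grad():
    inputs = tokenizer(pairs, padding=True, truncation=True, return_tensors='pt', max_length=512)
    scores = model(**inputs, return_dict=True).logits.view(-1).float()

# Print candidates from most to least relevant.
for score, doc in sorted(zip(scores.tolist(), candidates), reverse=True):
    print(f"{score:.4f}  {doc}")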

How to Use It Offline

If you want to use the model offline, refer to the Disable trust_remote_code section of the model card for guidelines. This ensures the model runs smoothly without needing a live network connection.
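
As an alternative, you can download the repository once while online and load it from disk afterwards. Here is a minimal sketch (assuming the huggingface_hub package is installed; snapshot_download also fetches the custom modeling code the repository ships, which trust_remote_code then loads locally):

from huggingface_hub import snapshot_download
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# One-time download while online; returns the local cache path.
local_path = snapshot_download("Alibaba-NLP/gte-multilingual-reranker-base")

# Later, fully offline: load strictly from the local files.
tokenizer = AutoTokenizer.from_pretrained(local_path, local_files_only=True)
model = AutoModelForSequenceClassification.from_pretrained(
    local_path, trust_remote_code=True, local_files_only=True
)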

Evaluation

The reranking results from multiple text retrieval datasets are promising! For a visual overview, check out the evaluation result image:

[Figure: reranking evaluation results across multiple text retrieval datasets]

More detailed experimental results can be found in the paper.

Cloud API Services

The GTE series not only offers open-source models but also provides commercial API services on Alibaba Cloud. Here’s what’s available:

  • Embedding Models: Three versions available—text-embedding-v1/v2/v3, with v3 being the latest.
  • ReRank Models: Access the gte-rerank model service for your commercial applications.

Note that these commercial APIs may differ from their open-source counterparts. An illustrative sketch of calling the rerank service follows.
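
The sketch below shows how a call to the gte-rerank service through Alibaba Cloud's DashScope Python SDK might look. Treat it as an assumption-laden example: the TextReRank interface, parameter names, and response fields shown here may differ across SDK versions, so consult the official DashScope documentation before relying on it.

import dashscope
from dashscope import TextReRank

dashscope.api_key = "YOUR_DASHSCOPE_API_KEY"  # assumed configuration style

# Assumed interface; verify against the current DashScope docs.
resp = TextReRank.call(
    model="gte-rerank",
    query="what is the capital of China?",
    documents=["北京 (Beijing)", "Introduction of quick sort"],
    top_n=2,
)
if resp.status_code == 200:
    for result in resp.output.results:
        print(result.index, result.relevance_score)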

Troubleshooting

If you encounter issues while implementing the model, consider these suggestions:

  • Ensure you have a compatible version of transformers installed (>= 4.36.0); a quick version check is sketched after this list.
  • Double-check that your input pairs are well-formed [query, document] lists, and that inputs stay within the max_length passed to the tokenizer (512 in the example above; the model itself supports up to 8192 tokens).
  • If you’re using the model offline, verify that all necessary dependencies are properly loaded.
  • For persistent issues, reach out to the community or refer to documentation for additional support.
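
For the first item, here is a quick sanity check of the installed transformers version (the packaging module ships as a transformers dependency):

import transformers
from packaging import version

print(transformers.__version__)
assert version.parse(transformers.__version__) >= version.parse("4.36.0"), \
    "Upgrade with: pip install -U 'transformers>=4.36.0'"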

For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.

Conclusion

At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.
