Are you ready to dive into the innovative world of text embeddings and retrieval? Welcome to the BGE-M3 project, where versatility meets efficiency in handling multi-functionality, multi-linguality, and multi-granularity. This guide will walk you through utilizing BGE-M3 in your own projects, with a friendly exploration of its features, usage instructions, and troubleshooting tips.
What is BGE-M3?
BGE-M3 is an advanced embedding model designed for various retrieval functionalities. It boasts:
- Multi-Functionality: Execute dense retrieval, multi-vector retrieval, and sparse retrieval simultaneously.
- Multi-Linguality: Support for over 100 languages.
- Multi-Granularity: Handle inputs of varying lengths, from brief sentences to lengthy documents up to 8192 tokens.
Installing BGE-M3
To get started with BGE-M3, follow these commands:
git clone https://github.com/FlagOpen/FlagEmbedding.git
cd FlagEmbedding
pip install -e .
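If you'd rather skip the source checkout, FlagEmbedding is also published on PyPI, so a plain pip install works too:
pip install -U FlagEmbedding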
Generating Embeddings
BGE-M3 can produce three kinds of representations: dense, sparse, and multi-vector embeddings. Here's how each works:
Dense Embedding
from FlagEmbedding import BGEM3FlagModel
# use_fp16=True speeds up inference at a slight cost in precision
model = BGEM3FlagModel("BAAI/bge-m3", use_fp16=True)
sentences = ["What is BGE M3?", "Definition of BM25"]
# encode() returns a dict; the dense vectors live under "dense_vecs"
embeddings = model.encode(sentences, batch_size=12, max_length=8192)["dense_vecs"]
# the vectors are normalized, so a dot product gives cosine similarity
similarity = embeddings[0] @ embeddings[1]
print(similarity)
This method maps each text to a single dense vector. Think of it as taking a snapshot of a landscape: it captures the essence but not every detail.
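To see how this plays out in retrieval, here is a minimal sketch that ranks a tiny corpus against a query using the dense vectors; the corpus contents and variable names are purely illustrative:
queries = ["What is BGE M3?"]
corpus = [
    "BGE M3 is an embedding model supporting dense, sparse and multi-vector retrieval.",
    "BM25 is a bag-of-words ranking function used in classical search engines.",
]
q_vecs = model.encode(queries, max_length=8192)["dense_vecs"]
d_vecs = model.encode(corpus, max_length=8192)["dense_vecs"]
scores = q_vecs @ d_vecs.T  # one row of similarities per query
best = scores[0].argmax()
print(corpus[best], scores[0][best])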
Sparse Embedding
output = model.encode(sentences, return_dense=True, return_sparse=True)
# "lexical_weights" holds one {token_id: weight} mapping per sentence
print(model.convert_id_to_token(output["lexical_weights"]))
In sparse retrieval, weights are computed only for tokens that actually appear in the text. Imagine sorting through a pantry: you only take note of the ingredients that are there, leaving the others aside.
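FlagEmbedding also includes a helper that turns these token weights into a similarity score between two texts. A minimal sketch, reusing the output from above:
# score the token-weight overlap between the two sentences
lexical_score = model.compute_lexical_matching_score(
    output["lexical_weights"][0], output["lexical_weights"][1]
)
print(lexical_score)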
Multi-Vector Retrieval
output = model.encode(sentences, return_dense=True, return_sparse=True, return_colbert_vecs=True)
# "colbert_vecs" holds one matrix of token-level vectors per sentence
print(model.colbert_score(output["colbert_vecs"][0], output["colbert_vecs"][1]))
This method represents the text with multiple token-level vectors rather than a single one, allowing for a more nuanced, fine-grained comparison: like listening to a symphony rather than a single note.
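Because a single encode call can return all three signals, a common pattern is to combine them with a weighted sum. Here is a minimal sketch reusing the output from above; the 0.4/0.2/0.4 weights are illustrative, not prescriptive:
dense_score = float(output["dense_vecs"][0] @ output["dense_vecs"][1])
sparse_score = model.compute_lexical_matching_score(
    output["lexical_weights"][0], output["lexical_weights"][1]
)
colbert = float(model.colbert_score(output["colbert_vecs"][0], output["colbert_vecs"][1]))
# weighted sum of the three retrieval signals
hybrid = 0.4 * dense_score + 0.2 * sparse_score + 0.4 * colbert
print(hybrid)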
Using BGE-M3 in Your Projects
To incorporate BGE-M3 into your existing projects, use it much like any other embedding model. One simplification worth noting: unlike some earlier BGE models, BGE-M3 does not require an instruction prefix on queries.
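As a sketch, a thin wrapper like the hypothetical embed() below is often all that's needed to swap BGE-M3 into an existing pipeline:
def embed(texts):
    """Return one dense vector per input text (no query prefix required)."""
    return model.encode(texts, batch_size=12, max_length=8192)["dense_vecs"]

query_vec = embed(["What is BGE M3?"])[0]
doc_vecs = embed(["BGE M3 supports dense, sparse and multi-vector retrieval."])
print(query_vec @ doc_vecs.T)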
Troubleshooting Tips
If you encounter issues while using BGE-M3, here are some common problems and solutions:
- Issue: Installation fails
  - Solution: Make sure you're on a compatible Python version and that all dependencies installed cleanly.
- Issue: Inconsistent results
  - Solution: Check the input data for errors or inconsistencies; input quality can drastically affect output.
- Issue: Memory errors during embedding generation
  - Solution: Reduce the batch size or max_length, or switch to a smaller model if one fits your needs (see the sketch after this list).
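For instance, a more memory-friendly call might look like the sketch below; the exact values are illustrative and depend on your hardware and input lengths:
output = model.encode(sentences, batch_size=4, max_length=1024)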
For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.
Conclusion
With the flexibility of BGE-M3 in your toolkit, you can explore new avenues in text embeddings and retrieval tasks, providing multilingual support and accommodating various text lengths. At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.

