How to Use Granite-3B-Code-Instruct for Text Generation

The world of AI is continually evolving, and tools like the Granite-3B-Code-Instruct model are paving the way for more intuitive coding assistants. This versatile model, developed by IBM Research, can generate code snippets, solve programming problems, and explain its solutions. In this blog post, we will walk you through running this powerful model and offer troubleshooting tips to enhance your experience.

Understanding Granite-3B-Code-Instruct

Granite-3B-Code-Instruct is built upon the foundation of the Granite-3B-Code-Base and has been fine-tuned with a diverse set of instruction data. Imagine it as a brilliant chef, trained by watching countless cooking shows. This chef not only knows how to follow a recipe but can also recommend variations and solve cooking challenges, making the user experience richer and more informative.

Getting Started with the Model

To utilize the Granite-3B-Code-Instruct model, follow these steps:

  • Install the necessary Python libraries, primarily torch and transformers from Hugging Face (see the install command below).
  • Import the required classes and load the model and tokenizer.
  • Apply the model's chat template to format your prompt.
  • Generate text based on your input instructions.
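
A one-line install covers the example in this post; note that accelerate is only needed because the loading code below passes device_map:

pip install torch transformers accelerate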

Example Code

Here’s a simple example demonstrating how to generate code using the Granite-3B-Code-Instruct model:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

device = 'cuda'  # or 'cpu'

model_path = 'ibm-granite/granite-3b-code-instruct'
tokenizer = AutoTokenizer.from_pretrained(model_path)

model = AutoModelForCausalLM.from_pretrained(model_path, device_map=device)
model.eval()

# Format the user message with the model's chat template
chat = [{"role": "user", "content": "Write a code to find the maximum value in a list of numbers."}]
chat = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
input_tokens = tokenizer(chat, return_tensors="pt")

# Move the input tensors to the same device as the model
for i in input_tokens:
    input_tokens[i] = input_tokens[i].to(device)

# Generate up to 100 new tokens and decode them back to text
output = model.generate(**input_tokens, max_new_tokens=100)
output_text = tokenizer.batch_decode(output, skip_special_tokens=True)

for text in output_text:
    print(text)

How This Works

Think of the above code as setting up a conversation with your AI assistant:

  • You invite the assistant by loading the model and tokenizer, much like preparing the kitchen.
  • By defining a chat input, you’re giving the assistant a recipe to follow. The input asks for a specific task—finding the maximum value in a list.
  • After processing, the assistant presents you with a beautifully crafted code snippet, akin to serving a delicious dish. Note that batch_decode returns your prompt along with the reply; the sketch below shows how to print only the newly generated text.
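
If you want only the model's reply rather than the full decoded sequence, here is a minimal sketch, assuming output and input_tokens come from the example above. Causal language models return the prompt tokens followed by the newly generated ones, so slicing by the prompt length isolates the reply:

# Sketch: decode only the newly generated tokens
# (uses `output` and `input_tokens` from the example above)
prompt_length = input_tokens["input_ids"].shape[1]
new_tokens = output[0][prompt_length:]
print(tokenizer.decode(new_tokens, skip_special_tokens=True))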

Model Training Data

Granite-3B-Code-Instruct was fine-tuned on several families of datasets, including:

  • Code Commits Datasets: Sourced from [CommitPackFT](https://huggingface.co/datasets/bigcode/commitpackft), covering 92 programming languages.
  • Math Datasets: High-quality math-focused datasets including [MathInstruct](https://huggingface.co/datasets/TIGER-Lab/MathInstruct) and [MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA).
  • Code Instruction Datasets: Including [Glaive-Code-Assistant-v3](https://huggingface.co/datasets/glaiveai/glaive-code-assistant-v3) and many others.

Troubleshooting Tips

If you encounter issues while using the Granite-3B-Code-Instruct model, consider the following steps:

  • Check Dependencies: Ensure that all required libraries are installed and up to date, especially torch and transformers.
  • Device Compatibility: Ensure the correct device is set (CPU or GPU) and verify that your hardware meets the model's requirements; the sketch after this list shows one defensive pattern.
  • Input Format: Validate that the chat input follows the expected structure (a list of {"role": ..., "content": ...} dictionaries) to avoid format errors.
  • Resource Utilization: Monitor your system's resources to prevent overloading during model inference; loading in half precision can substantially reduce memory use.
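
As a starting point for the device and memory tips above, here is a hedged sketch: it falls back to CPU when CUDA is unavailable and loads the weights in half precision on GPU to roughly halve memory use. Whether float16 is appropriate depends on your hardware:

import torch
from transformers import AutoModelForCausalLM

# Fall back to CPU automatically when no GPU is available
device = 'cuda' if torch.cuda.is_available() else 'cpu'

# Half precision on GPU reduces memory pressure; keep float32 on CPU,
# where float16 inference is poorly supported
model = AutoModelForCausalLM.from_pretrained(
    'ibm-granite/granite-3b-code-instruct',
    device_map=device,
    torch_dtype=torch.float16 if device == 'cuda' else torch.float32,
)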

For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.

Ethical Considerations

As you incorporate Granite-3B-Code-Instruct into your projects, it is essential to perform safety testing and consider the ethical implications involved, particularly because the model's behavior and output quality can vary across programming languages and usage contexts.

At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.
