If you’re excited about diving into the world of AI and want to utilize text generation capabilities, you’ve come to the right place! In this article, we’ll guide you through the steps of accessing and using the Gemma model hosted on Hugging Face. Think of it as planting a seed that can grow into a big tree of knowledge and creativity!
What is Gemma?
Gemma is a family of state-of-the-art open models from Google, designed for text generation tasks like question answering, summarization, and more! Its lightweight nature allows you to deploy it even on devices with limited resources, like laptops or desktops.
Steps to Access Gemma on Hugging Face
- Step 1: Ensure you have a Hugging Face account. Sign up or log in if you haven’t already! You’ll need this to access the model.
- Step 2: Review and agree to Google’s usage license. You can do this by clicking the appropriate button on the Gemma page on Hugging Face. Because the model is gated, your environment will also need to be authenticated with your account, as shown after these steps.
- Step 3: Install the Transformers library if you haven’t done so already, by running the following command in your terminal:
```bash
pip install -U transformers
```
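Since Gemma is a gated model, your environment also needs to be logged in to the Hugging Face account that accepted the license before the weights can download. Here is a minimal sketch using the `huggingface_hub` library; the token placeholder is yours to fill in from your account settings:

```python
# Log this environment in to Hugging Face so that gated repositories
# such as google/gemma-2-2b can be downloaded.
from huggingface_hub import login

login(token="hf_...")  # paste your own access token, or call login() for an interactive prompt
```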
How to Use the Gemma Model
Now that you have access, let’s see how you can run some exciting text generation tasks using Gemma. Imagine directing a majestic play where you tell the AI what to perform; you’ll be amazed at the script it generates!
Running with the `pipeline` API
```python
import torch
from transformers import pipeline

# Build a text-generation pipeline around Gemma 2 2B.
pipe = pipeline(
    "text-generation",
    model="google/gemma-2-2b",
    device="cuda",  # replace with "mps" to run on a Mac device
)

text = "Once upon a time,"
outputs = pipe(text, max_new_tokens=256)  # generate up to 256 new tokens
response = outputs[0]["generated_text"]  # includes the prompt plus the continuation
print(response)
```
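The pipeline also accepts the standard `generate()` keyword arguments, so you can experiment with sampling to vary the output. The values below are illustrative defaults rather than tuned recommendations:

```python
# Sampling makes the output less deterministic; temperature and top_p
# control how adventurous the model is allowed to be.
outputs = pipe(
    text,
    max_new_tokens=256,
    do_sample=True,   # sample instead of greedy decoding
    temperature=0.7,  # lower = more focused, higher = more random
    top_p=0.9,        # nucleus sampling cutoff
)
print(outputs[0]["generated_text"])
```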
Running the Model on Single or Multiple GPUs
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

# Load the tokenizer and model; device_map="auto" spreads the weights
# across whatever GPUs (or CPU memory) are available.
tokenizer = AutoTokenizer.from_pretrained("google/gemma-2-2b")
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2-2b",
    device_map="auto",
)

input_text = "Write me a poem about Machine Learning."
input_ids = tokenizer(input_text, return_tensors="pt").to("cuda")

outputs = model.generate(**input_ids, max_new_tokens=32)
print(tokenizer.decode(outputs[0]))
```
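If the full-precision weights don’t fit in your GPU memory, Transformers can also load the model with bitsandbytes quantization. This is a minimal sketch rather than an official recipe, and it assumes the `bitsandbytes` and `accelerate` packages are installed:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Ask Transformers to store the weights in 4-bit precision;
# computations still run in bfloat16 for accuracy.
quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2-2b")
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2-2b",
    device_map="auto",
    quantization_config=quantization_config,
)
```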
CLI Option
To run Gemma through the command-line interface, use the local-gemma repository. After installing it, run:

```bash
local-gemma --model "google/gemma-2-2b" --prompt "What is the capital of Mexico?"
```
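If you haven’t installed it yet, local-gemma is published as a Python package, so a plain pip install is the typical route; the project’s README also lists hardware-specific extras worth checking:

```bash
pip install local-gemma
```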
Troubleshooting
Even if you follow the steps, challenges may arise. Here are some troubleshooting tips:
- Model Not Found: Double-check that you’re logged in to Hugging Face and have acknowledged the necessary licenses.
- Dependencies: Ensure that all required libraries and packages are installed. Sometimes running `pip install -U transformers` and `pip install -U accelerate` can solve dependency issues.
- Resource Issues: If you encounter problems related to system resources, consider generating fewer tokens or switching to a lower precision, as shown in the sketch after this list.
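For the lower-precision route, Transformers accepts a dtype at load time. A minimal sketch, assuming a GPU that supports bfloat16 (use `torch.float16` on older hardware):

```python
import torch
from transformers import AutoModelForCausalLM

# Half-precision weights roughly halve GPU memory use compared to float32.
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2-2b",
    device_map="auto",
    torch_dtype=torch.bfloat16,
)
```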
For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.
Conclusion
Congratulations! You’ve unlocked the door to using one of the finest AI text generation models, Gemma! Remember, experimentation is key, so feel free to tweak the prompts and settings to get the best results.
At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.
Final Thoughts
The world of AI and machine learning opens vast possibilities. With tools like Gemma, you’re sure to be at the forefront of this exciting journey. Happy experimenting!