If you’re passionate about artificial intelligence and image generation, you’ve probably heard about Stable Diffusion. In this guide, we’ll explore how to efficiently use the OnnxStream library to run Stable Diffusion models, specifically on low-resource hardware like the Raspberry Pi Zero 2. With its recent updates, OnnxStream has introduced features such as WebAssembly support and compatibility with various large models, making it a valuable tool for developers and enthusiasts alike.
What is OnnxStream?
OnnxStream is a lightweight inference library designed to reduce memory consumption when running complex machine learning models. It allows you to run models that normally require a significant amount of RAM on devices that have very little, like the Raspberry Pi Zero 2, which has only 512MB of RAM. The magic lies in its decoupled architecture: the inference engine is separated from the weights provider, so weights can be streamed from disk on demand instead of being loaded into memory all at once, making it both customizable and efficient.
Getting Started with OnnxStream
Follow these steps to get up and running:
- Install Dependencies: Ensure you have the necessary packages installed, including Python, ONNX, and the ONNX Simplifier.
- Clone and Build the Repository: Use git to clone the OnnxStream repository, then configure and build it with CMake, pointing XNNPACK_DIR at your local XNNPACK build (OnnxStream depends on XNNPACK):
git clone https://github.com/vitoplantamura/OnnxStream.git
cd OnnxStream/src && mkdir build && cd build
cmake -DXNNPACK_DIR=PATH_TO_XNNPACK ..
cmake --build . --config Release
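The CMake step above assumes XNNPACK has already been cloned and built. Here is a rough sketch of that prerequisite; note that OnnxStream's README pins a specific XNNPACK commit that changes between releases, so check it before building:

```shell
# Build XNNPACK, which OnnxStream links against.
# Note: consult the OnnxStream README for the exact XNNPACK commit to check out.
git clone https://github.com/google/XNNPACK.git
cd XNNPACK
mkdir build && cd build
cmake -DXNNPACK_BUILD_TESTS=OFF -DXNNPACK_BUILD_BENCHMARKS=OFF ..
cmake --build . --config Release
```

The resulting XNNPACK directory is what PATH_TO_XNNPACK should point to when configuring OnnxStream.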
Using OnnxStream with Stable Diffusion Models
OnnxStream now supports multiple versions of Stable Diffusion, including 1.5, XL 1.0, and Turbo 1.0, allowing for flexibility based on your needs. The following steps will guide you on how to deploy Stable Diffusion models:
- Download Weights: Depending on the model you want to use, download the respective weights. For instance, to use Stable Diffusion 1.5, run:
git lfs install
git clone --depth=1 https://huggingface.co/vitoplantamura/stable-diffusion-1.5-onnxstream
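Once the weights are downloaded, image generation is launched from the compiled binary. The invocation below is a sketch: the flag names are assumptions based on a typical OnnxStream setup, so confirm them against the program's help output before relying on them:

```shell
# Hypothetical invocation; flag names are assumptions, verify with ./sd --help.
./sd --models-path ./stable-diffusion-1.5-onnxstream \
     --prompt "an astronaut riding a horse on mars" \
     --steps 10 \
     --output result.png
```

On a Raspberry Pi Zero 2, expect generation to take a long time; fewer steps produce faster, if rougher, results.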
Understanding the Code: An Analogy
Imagine you are hosting a dinner party but you have limited kitchen space (your Raspberry Pi). You have a great recipe that requires many pots on the stove (RAM), and usually you’d need a big oven (high-powered hardware) to bake all the dishes (process data) at once. With OnnxStream, you instead cook in small batches, fetching each ingredient from the pantry only when a dish actually needs it (streaming weights from disk), rather than trying to cram every ingredient and pot into the kitchen at the same time.
In programming terms, the way you “decouple” the components allows you to work within the tight kitchen space without sacrificing the quality of the dishes prepared (data output). This is the essence of how OnnxStream works—it takes complex, memory-hungry models and breaks them into manageable pieces that can coexist efficiently in your limited environment.
Troubleshooting Common Issues
Sometimes, you may face issues while running models with OnnxStream. Here are some troubleshooting ideas:
- Issue: Model performance is slow.
Ensure you have compiled OnnxStream with the MAX_SPEED option enabled. This typically improves performance significantly.
- Issue: Insufficient memory errors.
Consider dynamically quantizing your model or adjusting the input sizes to fit within the hardware capabilities.
- Issue: Download problems.
If you face issues downloading model weights, check your internet connection or try downloading from a different network.
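For the slow-performance case, the MAX_SPEED fix corresponds to a CMake option at configure time. A minimal sketch, run from the OnnxStream build directory (PATH_TO_XNNPACK stands for your local XNNPACK build, as in the setup steps):

```shell
# Reconfigure with MAX_SPEED enabled; this trades longer compile times
# and larger binaries for faster inference.
cmake -DMAX_SPEED=ON -DXNNPACK_DIR=PATH_TO_XNNPACK ..
cmake --build . --config Release
```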
For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.
The Future of AI with OnnxStream
At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.