How to Build a Cat vs. Dog Classifier Using TensorFlow

Apr 3, 2022 | Educational

homemayankDocumentsarticle-generation-using-llmresized_imagesreadme_21_1359

Welcome to this insightful guide where we’ll walk you through the exciting process of creating a machine learning model to classify images of cats and dogs using TensorFlow. Let’s dive into the details step by step!

Step 1: Setting Up Your Environment

Before we dive into coding, we need to set up our environment. This consists of two main tasks: uploading the necessary files and installing the required packages.

Upload your kaggle.json file which contains your Kaggle API credentials.
Install the Kaggle CLI to download datasets.

Here’s how you can do this:

from google.colab import files
files.upload()  # Upload kaggle.json file
!pip install kaggle  # Install the Kaggle CLI
!cp kaggle.json ~/.kaggle/  # Move the json to the Kaggle directory
!chmod 600 ~/.kaggle/kaggle.json  # Set permissions

Step 2: Downloading the Dataset

We will download the well-known Microsoft Cats vs. Dogs dataset from Kaggle.

!kaggle datasets download -d shaunthesheep/microsoft-catsvsdogs-dataset
!unzip microsoft-catsvsdogs-dataset

Step 3: Preparing the Data

Now that we have the data, it’s time to organize it. Think of this step as setting up different rooms for a party; you have a room for cats and another for dogs. We’ll create training and testing folders.

import os
image_dir = 'content/PetImages/Cat/'

# Create train and test folders
!mkdir train_folder
!mkdir test_folder

# Directory paths for training images
dir_upside_down = 'upside_down'
dir_normal = 'normal'

training_normal = os.path.join(image_dir, dir_normal)
training_upside = os.path.join(image_dir, dir_upside_down)
os.mkdir(training_normal)
os.mkdir(training_upside)

We then need to copy a set number of images into the respective folders for both training and testing.

import shutil

# Copying cat images for training
fnames = ['cat{}.jpg'.format(i) for i in range(2000)]
for fname in fnames:
    src = os.path.join(image_dir, fname)
    dst = os.path.join(training_normal, fname)
    shutil.copyfile(src, dst)

# Copying cat images for testing
fnames = ['cat{}.jpg'.format(i) for i in range(2000, 4000)]
for fname in fnames:
    src = os.path.join(image_dir, fname)
    dst = os.path.join(testing_normal, fname)
    shutil.copyfile(src, dst)

Step 4: Data Augmentation

Before we train our model, we can enhance our dataset through data augmentation. This is like adding a splash of creativity to our images by flipping them upside down. This helps the model generalize better.

import numpy as np
from PIL import Image
from scipy import ndimage
import imageio

# Inverting Training Images
outPath = 'content/train_folder/upside_down/'
path = 'content/train_folder/normal/'

for image_path in os.listdir(path):
    input_path = os.path.join(path, image_path)
    image_to_rotate = plt.imread(input_path)
    rotated = np.flipud(image_to_rotate)
    fullpath = os.path.join(outPath, 'rotated_' + image_path)
    imageio.imwrite(fullpath, rotated)

Step 5: Creating the Model

Now we’re ready to create our TensorFlow model. Imagine this model as the brain of your application; it’s what will learn from the data and make predictions. Here’s how we do it:

import tensorflow as tf
from tensorflow.keras.preprocessing.image import ImageDataGenerator

train_dir = 'content/train_folder'
train_gen = ImageDataGenerator(rescale=1./255)
train_images = train_gen.flow_from_directory(
    train_dir,
    target_size=(250, 250),
    batch_size=50,
    class_mode='binary'
)

validation_dir = 'content/test_folder'
test_gen = ImageDataGenerator(rescale=1./255)
test_images = test_gen.flow_from_directory(
    validation_dir,
    target_size=(250, 250),
    batch_size=50,
    class_mode='binary'
)

# Sequential model
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(16, (3, 3), activation='relu', input_shape=(250, 250, 3)),
    tf.keras.layers.MaxPooling2D(2, 2),
    tf.keras.layers.Conv2D(32, (3, 3), activation='relu'),
    tf.keras.layers.MaxPooling2D(2, 2),
    tf.keras.layers.Conv2D(64, (3, 3), activation='relu'),
    tf.keras.layers.MaxPooling2D(2, 2),
    tf.keras.layers.Conv2D(128, (3, 3), activation='relu'),
    tf.keras.layers.MaxPooling2D(2, 2),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(512, activation='relu'),
    tf.keras.layers.Dense(1, activation='sigmoid')
])

Step 6: Compiling and Training the Model

Next, we’ll compile our model and begin the training process. This is akin to teaching your brain using trial and error until it gets the right answer!

from tensorflow.keras.optimizers import RMSprop

model.compile(optimizer=RMSprop(learning_rate=0.001),
              loss=tf.keras.losses.BinaryCrossentropy(),
              metrics=['accuracy'])

history = model.fit(train_images, validation_data=test_images, epochs=5, steps_per_epoch=40)

Troubleshooting Tips

If you hit any bumps along the road, here are some troubleshooting tips to guide you:

Ensure that your file paths are correct. A misplaced image can throw everything off!
Check if all required libraries are installed and up to date.
If the training process seems slow, consider using a machine with better GPU capabilities.

For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.

Conclusion

Congratulations! You’ve just built a cat vs. dog classifier using TensorFlow! At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.

Stay Informed with the Newest F(x) Insights and Blogs

Tech News and Blog Highlights, Straight to Your Inbox