PicPredict: Google Quick, Draw Image Recognition

Overview

This is a machine learning project designed to classify hand-drawn images from Google's Quick, Draw dataset into 15 distinct categories. Utilizing a Convolutional Neural Network (CNN) implemented with TensorFlow and Keras, the model is built to achieve high accuracy in recognizing and categorizing various sketches.

Features

Image Classification: Recognizes and classifies hand-drawn images into 15 distinct categories such as accessories, fruits, vehicles, and more, leveraging the QuickDraw dataset.
Dataset Customization: Downloads and organizes QuickDraw data into major and minor class structures, enabling flexible dataset management and efficient storage.
Model Architecture: A deep convolutional neural network (CNN) with multiple convolutional layers, pooling layers, and fully connected dense layers, incorporating Batch Normalization and Dropout for improved regularization and performance.
Dataset Splitting: Automatically splits the dataset into training and validation subsets with an 80/20 split ratio for robust model evaluation.
TensorBoard Integration: Tracks and visualizes training and validation metrics, including accuracy and loss, in real time via TensorBoard.
Dynamic Image Preprocessing: Rescales input grayscale images to ensure consistency in pixel value ranges for effective training.
Scalable Dataset Management: Efficiently handles large datasets, organizing them into major classes, automatically creating and managing directories.
Optimized Training: Utilizes Adam optimizer and Sparse Categorical Crossentropy loss for efficient training on multi-class classification tasks.
Model Export: Saves trained models with timestamped filenames, making them ready for future deployment in real-world applications.

Model Performance Graph

Running Predictions with the Pre-Trained Model

The pre-trained model is saved as model_20240912-174238.keras. You can run predictions on sample images by executing the run.py script located in the unit-test directory.

Make sure you have the necessary Python packages installed:

pip install -r requirements.txt

Running the Evaluation

The evaluator script used to assess the model's performance was provided by TAMU Datathon organizers as part of the competition framework. Since the Datathon, the project has been revamped with an improved structure and a new, higher-performing model architecture.

# Navigate to the evaluator-Datathon directory:
cd evaluator-Datathon

# Run the run.py script to start the evaluation server:
python run.py

# After the server is running, open another terminal window and run Runner.py:
python Runner.py

# Enter the port number when prompted:
Enter your port number: 5555

This script will launch a Flask application that serves the evaluator on a local address. By default, it will run on port 5555. Follow the prompts to enter the port number (in this case, 5555) to start the evaluation process. For each image evaluated, the system will output a message indicating the class it guessed and update the score accordingly.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
bulk-convert		bulk-convert
code-ML		code-ML
evaluator-Datathon		evaluator-Datathon
results		results
unit-test		unit-test
.gitignore		.gitignore
README.md		README.md
performance-graph.png		performance-graph.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PicPredict: Google Quick, Draw Image Recognition

Overview

Features

Model Performance Graph

Running Predictions with the Pre-Trained Model

Running the Evaluation

About

Releases

Packages

Languages

NikolaSlv/PicPredict

Folders and files

Latest commit

History

Repository files navigation

PicPredict: Google Quick, Draw Image Recognition

Overview

Features

Model Performance Graph

Running Predictions with the Pre-Trained Model

Running the Evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages