Global-Wheat-Detection

Showcases the use of deep learning to detect wheat heads from crops. The project is based on this Kaggle Competition: https://www.kaggle.com/c/global-wheat-detection.

Here's a description of the prediction task:

In this competition, you'll detect wheat heads from outdoor images of wheat plants, including wheat datasets from around the globe. Using worldwide data, you will focus on a generalized solution to estimate the number and size of wheat heads. To better gauge the performance for unseen genotypes, environments, and observational conditions, the training dataset covers multiple regions. You will use more than 3,000 images from Europe (France, UK, Switzerland) and North America (Canada). The test data includes about 1,000 images from Australia, Japan, and China.

- Source

Data

An overview is available here: https://www.kaggle.com/c/global-wheat-detection/data.

The dataset includes images that either have wheat heads or do not have them. Here are some examples:

(The following ones do not have any wheat heads)

I used the following command to obtain the data:

$ kaggle competitions download -c global-wheat-detection

This is an object detection task, and the project uses the TensorFlow Object Detection (TFOD) API.

About the files & directories

 faster_rcnn_resnet101_coco_11_06_2017: Contains the pre-trained checkpoints and frozen inference graph.
    saved_model
       variables
       saved_model.pb
    checkpoint
    frozen_inference_graph.pb
    model.ckpt.data-00000-of-00001
    model.ckpt.index
    model.ckpt.meta
    pipeline.config
 test: Contains the test images of the competition. 
    2fd875eaa.jpg
    348a992bb.jpg
    51b3e36ab.jpg
    51f1be19e.jpg
    53f253011.jpg
    796707dd7.jpg
    aac893a91.jpg
    cb8d261a3.jpg
    cc3532ff6.jpg
    f5a1f0358.jpg
 train: Contains the training images of the competition. 
 Basic_EDA.ipynb: Performs basic data visualization on the provided dataset.
 Data_Prep.ipynb: Prepares the data in a TFOD API compatible format.
 faster_rcnn_resnet101_pets.config: Training configuration file.
 generate_tfrecord.py: Utility script for generating TFRecords from `.csv` files.
 label_map.pbtxt: Label map file.
 new_train_df.csv: The newly created partial training set.
 train.csv: Comes with the initial dataset and contains information about the bounding boxes.
 train_df.csv: Expanded version of the initial `train.csv` file. 
 train.record: TFRecord file of the partial training set. 
 valid_df.csv: The newly created validation set.
 valid.record: TFRecord file of the validation set.
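
As a rough illustration of what "expanded" could mean for `train_df.csv`, here is a minimal sketch. It assumes that `train.csv` stores each bounding box as a string of the form `[xmin, ymin, width, height]` in a `bbox` column; the output column names and the `expand_bboxes` helper are hypothetical and may differ from what `Data_Prep.ipynb` actually produces.

```python
import ast
import csv
import io


def expand_bboxes(rows):
    """Turn each row's `bbox` string into explicit corner coordinates."""
    expanded = []
    for row in rows:
        # Assumed layout: "[xmin, ymin, width, height]" in pixels.
        xmin, ymin, width, height = ast.literal_eval(row["bbox"])
        expanded.append({
            "image_id": row["image_id"],
            "xmin": xmin,
            "ymin": ymin,
            "xmax": xmin + width,
            "ymax": ymin + height,
        })
    return expanded


# Tiny in-memory stand-in for train.csv (values are made up for illustration).
sample_csv = io.StringIO(
    "image_id,width,height,bbox,source\n"
    'img_a,1024,1024,"[834.0, 222.0, 56.0, 36.0]",site_1\n'
)
boxes = expand_bboxes(csv.DictReader(sample_csv))
print(boxes[0])
```

Storing corners rather than width and height keeps the later conversion to TFRecords a simple division by the image size.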

Note

Some files listed above are intentionally not included in the repository because of their size.

Results

The following results come from TensorBoard while the model was training. The images are from the validation set I prepared:

Steps to reproduce the results

Follow the instructions in the Data_Prep.ipynb notebook.
Once the new training and validation splits are generated, run the generate_tfrecord.py script to generate the TFRecords.
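
The two steps above can be sketched roughly as follows. This is not the code from `Data_Prep.ipynb` or `generate_tfrecord.py`; the helper names, the 20% validation fraction, and the seed are hypothetical. The sketch only shows two ideas: splitting by image so that all boxes of one image land in the same split, and scaling pixel coordinates to the [0, 1] range that TFOD API TFRecords use for box corners.

```python
import random


def split_by_image(image_ids, valid_fraction=0.2, seed=42):
    """Split unique image ids so every box of an image stays in one split."""
    unique_ids = sorted(set(image_ids))
    random.Random(seed).shuffle(unique_ids)
    n_valid = int(len(unique_ids) * valid_fraction)
    return set(unique_ids[n_valid:]), set(unique_ids[:n_valid])


def normalize_box(xmin, ymin, xmax, ymax, img_width, img_height):
    """Scale pixel corners to the [0, 1] range."""
    return (xmin / img_width, ymin / img_height,
            xmax / img_width, ymax / img_height)


# One entry per bounding box, so image ids repeat.
ids = ["a", "a", "b", "c", "d", "e", "e"]
train_ids, valid_ids = split_by_image(ids)
print(len(train_ids), len(valid_ids))
print(normalize_box(256, 512, 768, 1024, 1024, 1024))
```

Splitting on unique image ids rather than on rows avoids leaking boxes from a validation image into the training set.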

Download the pre-trained checkpoints of Faster R-CNN with a ResNet-101 backbone by running:

wget http://download.tensorflow.org/models/object_detection/faster_rcnn_resnet101_coco_2018_01_28.tar.gz
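
The downloaded archive has to be unpacked before the checkpoint files can be used. A minimal sketch using Python's standard `tarfile` module is shown below; the `extract_checkpoint` helper is hypothetical and not part of this repository.

```python
import tarfile
from pathlib import Path


def extract_checkpoint(archive_path, dest_dir="."):
    """Unpack a .tar.gz archive and list the extracted checkpoint files."""
    dest = Path(dest_dir)
    with tarfile.open(archive_path, "r:gz") as archive:
        archive.extractall(dest)
    # Fine-tuning needs the model.ckpt.* files from the archive.
    return sorted(p.name for p in dest.rglob("model.ckpt*"))
```

For example, `extract_checkpoint("faster_rcnn_resnet101_coco_2018_01_28.tar.gz")` would unpack the archive into the current directory and return the names of the checkpoint files it finds.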

Follow instructions from this piece on how to package an object detection application in TensorFlow Object Detection API and submit a training job to AI Platform. It also shows how to monitor performance with TensorBoard and export the trained model checkpoints as a frozen inference graph.

This project uses GCS buckets for storing intermediate training checkpoints along with all the other files necessary to run a TFOD API model on AI Platform. Following are the initial files from my GCS bucket:

$ gsutil ls gs://global_wd_faster_rcnn/data
gs://global_wheat_detection/data/label_map.pbtxt
gs://global_wheat_detection/data/model.ckpt.data-00000-of-00001
gs://global_wheat_detection/data/model.ckpt.index
gs://global_wheat_detection/data/model.ckpt.meta
gs://global_wheat_detection/data/faster_rcnn_resnet101_pets.config
gs://global_wheat_detection/data/train.record
gs://global_wheat_detection/data/valid.record
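
The training configuration has to point at these files. The sample configs that ship with the TFOD API mark such paths with a `PATH_TO_BE_CONFIGURED` placeholder, so one way to fill them in is a plain text substitution. The sketch below assumes the config started from such a sample; `fill_config_paths` is a hypothetical helper, the excerpt is shortened for illustration, and `gs://your-bucket/data` stands in for your own bucket.

```python
def fill_config_paths(config_text, data_dir, placeholder="PATH_TO_BE_CONFIGURED"):
    """Replace the sample config's placeholder prefix with a data directory."""
    return config_text.replace(placeholder, data_dir.rstrip("/"))


# Shortened, illustrative excerpt; a real config has many more fields.
excerpt = (
    'fine_tune_checkpoint: "PATH_TO_BE_CONFIGURED/model.ckpt"\n'
    'label_map_path: "PATH_TO_BE_CONFIGURED/label_map.pbtxt"\n'
)
print(fill_config_paths(excerpt, "gs://your-bucket/data/"))
```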

Acknowledgement

ML-GDE program for granting me GCP credits.
TensorFlow Research Cloud.

The project extensively uses GCP's AI offerings such as AI Platform. Specifically, I used AI Platform Notebooks and AI Platform Jobs. For storage, I used GCS buckets.
