Clomask is a product that uses object detection, segmentation and classification for retail products. The goal is to semantically segment retail products such as soda bottles, candy bags and cereal boxes in images using deep-learning based mask-detection.
Sample Input | Sample Output |
---|---|
.
├── data-acquisition
├── research
├── mrcnn
├── model_server
└── webapp
A major challenge was to acquire enough data without spending too much time/effort in manual labeling. The data acquisition module attempts to achieve the generation of large-scale, reproducible image/mask datasets with a high degree of variance.
The COCO dataset only has bottle
(out of the other retail products we needed to segment, such as bag
and box
), and thus we needed to retrain the model on newer data. This section covers our experiments, learnings and results from the training and validation process.
This is a sub-repo that contains the Mask RCNN implementation files, modified for Clomask's use.
In order to serve the model over the web, we use AWS to accept requests and generate the mask files on a private S3 bucket. These can be accessed through pre-signed URLs.
The web app simply deploys and serves our final model in an end-to-end fashion where a user can upload an image and get the view the annotated image as the result.