Journey into Computer Vision and Machine Learning Part II
Welcome to the second part of our ongoing journey into computer vision and machine learning. In this blog post, I am thrilled to share an update on our progress in learning annotation techniques for creating segmentation masks for instance segmentation. Instance segmentation, which involves detecting and outlining each individual object instance in an image, has been a central focus of our exploration. Over the past month, our team has been working together, using CVAT as our annotation tool, to produce datasets enriched with segmentation masks. Our ultimate objective is to accurately detect the specific objects our project targets.
Learning Annotation Techniques with CVAT:
In our pursuit of instance segmentation, we have been learning annotation techniques with the versatile tool CVAT. This robust annotation tool has given us the skills needed to create precise segmentation masks. Through collaborative effort and continuous practice, we have become proficient at outlining and labeling the objects of interest in our datasets. These segmentation masks play a crucial role in training our models to detect and analyze objects accurately.
With the aid of CVAT, we have made significant strides in producing datasets brimming with detailed segmentation masks. These masks provide invaluable information about object boundaries and outlines within the images. By leveraging these datasets, we are poised to train our models to identify and classify specific objects with enhanced precision and accuracy. Our project’s overarching goal revolves around detecting and analyzing these objects, and the creation of these high-quality datasets marks a significant milestone in our journey.
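To make the training step concrete, here is a minimal sketch of how a dataset like this could be used to fine-tune an instance segmentation model with the Ultralytics YOLOv8 API. The dataset file "data.yaml", the model size, and the hyperparameters below are placeholder assumptions for illustration, not our exact configuration.

```python
# Minimal sketch: training a YOLOv8 segmentation model on an annotated dataset.
# "data.yaml" and the hyperparameters are placeholders, not our exact setup.
from ultralytics import YOLO

# Start from a pretrained segmentation checkpoint
model = YOLO("yolov8n-seg.pt")

# data.yaml points to the train/val image folders and lists the class names
model.train(data="data.yaml", epochs=100, imgsz=640)

# Evaluate on the validation split
metrics = model.val()
print(metrics)
```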
Overcoming Challenges:
Throughout our exploration, we encountered a challenge with converting segmentation masks to polygons. The tutorials and training materials we found online provided scripts designed for single-class projects only. To address this limitation, I developed my own version of the script, capable of handling multiple classes. The code is available on my GitHub repository and is tailored for CVAT users annotating for segmentation in YOLOv8 format. I encourage you to explore and use it for your own projects.
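To give a flavor of what such a conversion involves, below is a simplified sketch of the general idea rather than the exact script from my repository. It assumes masks saved as grayscale PNGs in which each pixel value encodes a class ID (0 = background, class IDs starting at 1); the directory names are placeholders.

```python
# Simplified sketch: converting multi-class segmentation masks into YOLOv8
# polygon labels. The "pixel value = class ID" convention and paths are
# assumptions for illustration; the actual script may differ.
import os
import cv2
import numpy as np

def masks_to_yolo_polygons(mask_dir, label_dir):
    os.makedirs(label_dir, exist_ok=True)
    for name in os.listdir(mask_dir):
        mask = cv2.imread(os.path.join(mask_dir, name), cv2.IMREAD_GRAYSCALE)
        h, w = mask.shape
        lines = []
        # Treat each non-zero pixel value as a separate class
        for class_value in np.unique(mask):
            if class_value == 0:  # skip background
                continue
            binary = (mask == class_value).astype(np.uint8)
            contours, _ = cv2.findContours(binary, cv2.RETR_EXTERNAL,
                                           cv2.CHAIN_APPROX_SIMPLE)
            for contour in contours:
                if len(contour) < 3:  # a polygon needs at least 3 points
                    continue
                # YOLOv8 format: class_id x1 y1 x2 y2 ... normalized to [0, 1]
                coords = (contour.reshape(-1, 2) / [w, h]).reshape(-1)
                class_id = int(class_value) - 1  # mask class IDs start at 1
                lines.append(f"{class_id} " + " ".join(f"{c:.6f}" for c in coords))
        out_path = os.path.join(label_dir, os.path.splitext(name)[0] + ".txt")
        with open(out_path, "w") as f:
            f.write("\n".join(lines))

masks_to_yolo_polygons("masks", "labels")
```

The key difference from the single-class scripts we found is the loop over the unique pixel values, which lets one mask file contribute polygons for several classes at once.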
In our pursuit of efficient annotation, we have embraced cutting-edge technology. Thanks to Meta AI's Segment Anything Model (SAM), annotation has become significantly faster and more streamlined: SAM uses a promptable AI model to generate masks automatically, which has led to cleaner and more efficient dataset creation. This advancement has accelerated our progress and freed us to focus on other aspects of our project.
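For readers who want to experiment with SAM outside of an annotation tool, here is a minimal standalone sketch using Meta AI's segment_anything package. The checkpoint filename, image path, and click coordinates are placeholder assumptions; in our workflow SAM runs inside CVAT rather than as a separate script.

```python
# Standalone sketch: prompting SAM (Segment Anything) with a single click to
# get a mask. Paths and the click point below are placeholders.
import cv2
import numpy as np
from segment_anything import sam_model_registry, SamPredictor

# Load a pretrained SAM checkpoint (downloaded separately from Meta AI)
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

# SAM expects an RGB image
image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# A single foreground click on the object of interest (x, y)
point = np.array([[400, 300]])
label = np.array([1])  # 1 = foreground, 0 = background

masks, scores, _ = predictor.predict(point_coords=point,
                                     point_labels=label,
                                     multimask_output=True)
best_mask = masks[np.argmax(scores)]  # boolean mask of the clicked object
cv2.imwrite("mask.png", best_mask.astype(np.uint8) * 255)
```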
Future Prospects and Continued Progress:
As we forge ahead in our computer vision and machine learning journey, we remain passionate about the incredible possibilities that instance segmentation holds. Our expertise in annotation techniques, bolstered by the powerful capabilities of CVAT and our customized conversion script, empowers us to unlock new levels of object detection accuracy. Additionally, the integration of Meta AI’s SAM has transformed the annotation process, elevating our efficiency and productivity. We are excited to continue this journey and share more updates in the future.