YOLOv8 Instance Segmentation Large

Our new model ZOO works with DepthAI V3. Find out more in our documentation.

Model Details

Model Description

YOLOv8 Instance Segmentation is a YOLOv8-based convolutional neural network designed to identify individual objects in an image and segment each of them from the rest of the image. It remains accurate on challenging images, such as those with many or overlapping objects. This is the large version of the model.

Developed by: Ultralytics

Shared by:

Model type: Computer Vision

License: GNU Affero General Public License v3.0

Resources for more information:

Training Details

Training Data

COCO is a large-scale object detection, segmentation, and captioning dataset.

Testing Details

Metrics

mAP and speed are evaluated on the COCO dataset at an input resolution of 640×640. Results are taken from .

Model	mAP (box)	mAP (mask)	Params (M)
	52.3	42.6	46.0

Technical Specifications

Input/Output Details

Input:

Name: image

Info: NCHW BGR un-normalized image
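As a minimal sketch of what "NCHW BGR un-normalized" means in practice, the snippet below rearranges a height × width × channel BGR frame (the layout OpenCV typically produces) into a batch of one channel-first tensor without scaling the pixel values. The helper name `to_nchw` and the dummy frame are illustrative assumptions; on an OAK device the DepthAI pipeline normally handles this conversion for you.

```python
import numpy as np


def to_nchw(frame_bgr: np.ndarray) -> np.ndarray:
    """Convert an HxWx3 BGR frame to a 1x3xHxW tensor, keeping raw pixel values."""
    if frame_bgr.ndim != 3 or frame_bgr.shape[2] != 3:
        raise ValueError("expected an HxWx3 BGR image")
    chw = np.transpose(frame_bgr, (2, 0, 1))  # HWC -> CHW
    return np.ascontiguousarray(chw[np.newaxis, ...])  # add the batch dimension


# Dummy 640x352 frame (width x height) standing in for a camera image.
frame = np.zeros((352, 640, 3), dtype=np.uint8)
tensor = to_nchw(frame)
print(tensor.shape)  # (1, 3, 352, 640)
```

No mean subtraction or division by 255 is applied, which is what "un-normalized" refers to here.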

Outputs:

Name: multiple (see NN archive)

Info: Raw (unparsed) outputs of multiple heads: detections, masks, and protos
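The output shapes listed in the Throughput section follow a regular pattern that can be reproduced from the input resolution. The sketch below assumes, based only on those listed shapes, that detection and mask outputs are produced at strides 8, 16, and 32, and that the protos are produced at stride 4. The reading of the 85 channels as 4 box values, 1 confidence, and 80 COCO classes is likewise an assumption, and the function name is hypothetical.

```python
def expected_output_shapes(height, width, det_channels=85, mask_channels=32):
    """Return the seven output shapes implied by an input of size height x width."""
    strides = (8, 16, 32)  # assumed feature-map strides, inferred from the listed shapes
    proto_stride = 4       # assumed stride of the protos output
    detections = [[1, det_channels, height // s, width // s] for s in strides]
    masks = [[1, mask_channels, height // s, width // s] for s in strides]
    protos = [[1, mask_channels, height // proto_stride, width // proto_stride]]
    return detections + masks + protos


for h, w in [(352, 640), (480, 640)]:
    print(f"{w}x{h}: {expected_output_shapes(h, w)}")
```

For the 640x352 and 640x480 variants this reproduces the shapes stated on this page, which is a quick sanity check when wiring up a custom parser.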

Model Architecture

Backbone: CSPDarknet53

Head: Anchor-free object segmentation head (pruned of concatenation)

Consult the for more information.

Throughput

Model variant: yolov8-instance-segmentation-large:coco-640x352

• Input shape: [1, 3, 352, 640]
• Output shapes: [[1, 85, 44, 80], [1, 85, 22, 40], [1, 85, 11, 20], [1, 32, 44, 80], [1, 32, 22, 40], [1, 32, 11, 20], [1, 32, 88, 160]]
• Params (M): 45.974
• GFLOPs: 62.223

Platform	Precision	Throughput (infs/sec)	Power Consumption (W)
RVC4	INT8	148.56	4.95

Model variant: yolov8-instance-segmentation-large:coco-640x480

• Input shape: [1, 3, 480, 640]
• Output shapes: [[1, 85, 60, 80], [1, 85, 30, 40], [1, 85, 15, 20], [1, 32, 60, 80], [1, 32, 30, 40], [1, 32, 15, 20], [1, 32, 120, 160]]
• Params (M): 45.974
• GFLOPs: 84.850

Platform	Precision	Throughput (infs/sec)	Power Consumption (W)
RVC4	INT8	126.35	6.73

* Benchmarked with , using 2 threads (and the DSP runtime in balanced mode for RVC4).

* Parameters and FLOPs are obtained from the package.
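To compare the two variants on efficiency rather than raw speed, the throughput and power figures above can be combined into an approximate energy cost per inference. This is a rough sketch that assumes the reported power is the average draw while sustaining the reported throughput; the function name is illustrative.

```python
def energy_per_inference_mj(power_w: float, throughput_ips: float) -> float:
    """Approximate energy per inference in millijoules (watts / inferences per second)."""
    return 1000.0 * power_w / throughput_ips


# (throughput in infs/sec, power in W), copied from the RVC4 INT8 tables above.
benchmarks = {
    "coco-640x352": (148.56, 4.95),
    "coco-640x480": (126.35, 6.73),
}

for name, (ips, watts) in benchmarks.items():
    print(f"{name}: {energy_per_inference_mj(watts, ips):.1f} mJ per inference")
# coco-640x352: 33.3 mJ per inference
# coco-640x480: 53.3 mJ per inference
```

Under that assumption, the 640x480 variant costs roughly 60% more energy per frame than the 640x352 variant.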

Quantization

The RVC4 version of the model was quantized using a custom dataset, created by taking a full 128-image dataset.

Utilization

Models converted for RVC Platforms can be used for inference on OAK devices. DepthAI pipelines are used to define the information flow linking the device, inference model, and the output parser (as defined in model head(s)). Below, we present the most crucial utilization steps for the particular model. Please consult the docs for more information.

Install DAIv3 and depthai-nodes libraries:

pip install depthai
pip install depthai-nodes

Define model:

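import depthai as dai
from depthai_nodes.node import ParsingNeuralNetwork

# Describe which model (and variant) to fetch from the model zoo
model_description = dai.NNModelDescription(
    "luxonis/yolov8-instance-segmentation-large:coco-640x352"
)

# `pipeline` is an existing dai.Pipeline; replace <CameraNode> with your camera node
nn = pipeline.create(ParsingNeuralNetwork).build(
    <CameraNode>, model_description
)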

Inspect model head(s):

YOLOExtendedParser that outputs an ImgDetectionsExtended message (bounding boxes and segmentation masks of detected objects).
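For intuition about how masks and protos relate, the sketch below shows the prototype-mask combination commonly used by YOLOv8-style segmentation heads: each detection carries a vector of mask coefficients, and its mask is a sigmoid of the coefficient-weighted sum of the prototype maps. This is an illustrative NumPy version under that assumption, not the parser's actual implementation; the function name, threshold, and random inputs are hypothetical.

```python
import numpy as np


def assemble_masks(coeffs: np.ndarray, protos: np.ndarray, threshold: float = 0.5) -> np.ndarray:
    """Combine per-detection coefficients (N, C) with prototypes (C, H, W) into boolean masks (N, H, W)."""
    n, c = coeffs.shape
    c_protos, h, w = protos.shape
    if c != c_protos:
        raise ValueError("coefficient and prototype channel counts differ")
    logits = coeffs @ protos.reshape(c, h * w)  # weighted sum of prototype maps
    probs = 1.0 / (1.0 + np.exp(-logits))       # sigmoid
    return (probs > threshold).reshape(n, h, w)


# Random stand-ins shaped like two detections and the 640x352 protos output.
rng = np.random.default_rng(0)
masks = assemble_masks(rng.normal(size=(2, 32)), rng.normal(size=(32, 88, 160)))
print(masks.shape)  # (2, 88, 160)
```

In a full pipeline the resulting masks would usually also be cropped to each bounding box and resized to the input resolution; those steps are omitted here.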

Get parsed output(s):

# parser_output_queue is an output queue created from the parser output
while pipeline.isRunning():
    parser_output: ImgDetectionsExtended = parser_output_queue.get()
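Once a parsed message is available, a typical next step is to summarize the segmentation result per object. The sketch below assumes the masks are available as a single 2D integer array in which each pixel holds the index of the detection it belongs to and -1 marks background. That layout is an assumption to verify against the depthai-nodes documentation, and the helper name is hypothetical.

```python
import numpy as np


def instance_pixel_counts(instance_mask: np.ndarray) -> dict:
    """Count the pixels covered by each instance index, ignoring background (-1)."""
    foreground = instance_mask[instance_mask >= 0]
    ids, counts = np.unique(foreground, return_counts=True)
    return {int(i): int(c) for i, c in zip(ids, counts)}


# Tiny hand-made mask: two instances (0 and 1) and two background pixels.
demo_mask = np.array([[-1, 0, 0],
                      [1, 1, -1]])
print(instance_pixel_counts(demo_mask))  # {0: 2, 1: 2}
```

The pixel counts can be used, for example, to filter out very small segments before visualization.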

Example

You can quickly run the model using our script. It automatically downloads the model, creates a DepthAI pipeline, runs the inference, and displays the results using our DepthAI visualizer tool. To try it out, run:

python3 main.py \
    --model luxonis/yolov8-instance-segmentation-large:coco-640x352 \
    -overlay

General instance segmentation model
License	GNU Affero General Public License v3.0 Commercial use
Downloads	18421
Tasks	Instance Segmentation
Model Types	ONNX
