Python GStreamer YOLO Object Detection

View source on GitHub

This guide walks you through building and running the YOLO object detection reference on Avocado OS. The app captures video from a USB camera on the Jetson Orin Nano, runs GPU-accelerated object detection, and serves the annotated feed as an MJPEG stream.

Prerequisites

macOS 10.12+ or Linux (Ubuntu 22.04+, Fedora 39+)
Docker Desktop installed and running
The latest version of the Avocado CLI
NVIDIA Jetson Orin Nano Developer Kit
USB (UVC) camera (e.g., Opal Tadpole)
SD card or USB cable for flashing

Initialize

Clone the reference or initialize a new project from it:

avocado init --reference python-gstreamer-yolo python-gstreamer-yolo
cd python-gstreamer-yolo

Install

Install the SDK toolchain, extension dependencies, and runtime packages:

avocado install -f

This pulls the SDK container image and installs nativesdk-uv for pip package compilation. Runtime packages include OpenCV with CUDA support, cuDNN, GStreamer plugins, and the UVC camera driver.

Build

Build the extensions and assemble the runtime image:

avocado build

The build step runs app-compile.sh inside the SDK container, which uses uv pip install to download Flask. The YOLO11n model (yolo11n.onnx) is already checked into the overlay at app/overlay/usr/lib/app/models/. Then app-install.sh copies the packages into the extension sysroot.

Deploy

Flash the image to an SD card or eMMC, connect a USB camera, and boot the Jetson Orin Nano.

avocado provision -r dev

Verify

Open your browser to http://<device-ip>:5000 to view the dashboard.

systemctl status app
journalctl -u app -f

You should see output like:

app starting
  device: avocado-jetson-orin-nano-devkit
  camera: /dev/video0 (1280x720@30fps)
  dashboard: http://0.0.0.0:5000
  loading model: /usr/lib/app/models/yolo11n.onnx (10.2 MB)
  CUDA backend active
  model loaded successfully
  trying pipeline 1/4 [nvidia-mjpeg-decode]...
  pipeline started: [nvidia-mjpeg-decode] (GPU=True)

The dashboard shows a live annotated video feed, detected objects with confidence scores, inference FPS, and device metrics.

API Endpoints

GET /api/stats — device metrics, model status, and current detections
GET /api/detections — current detections only
GET /stream — live MJPEG stream with bounding boxes
GET / — web dashboard

Customize

Configure your camera

Check what your camera supports and adjust the settings:

v4l2-ctl --list-formats-ext -d /dev/video0

Edit the environment variables in app/overlay/usr/lib/systemd/system/app.service:

Environment=CAMERA_DEVICE=/dev/video0
Environment=CAMERA_WIDTH=1280
Environment=CAMERA_HEIGHT=720
Environment=CAMERA_FRAMERATE=30

Change the model

Replace yolo11n.onnx with a different YOLO11 variant for accuracy vs. speed:

Model	Size	Speed	Accuracy
`yolo11n.onnx`	11 MB	Fastest	Good
`yolo11s.onnx`	37 MB	Fast	Better
`yolo11m.onnx`	77 MB	Medium	High

Export a new model with opset 12 for OpenCV 4.9 compatibility:

uv run --with ultralytics python3 -c "
from ultralytics import YOLO
model = YOLO('yolo11s.pt')
model.export(format='onnx', opset=12)
"
cp yolo11s.onnx app/overlay/usr/lib/app/models/

Adjust detection sensitivity

Edit app/overlay/usr/local/bin/app.py:

CONFIDENCE_THRESHOLD = 0.3    # lower = more detections (default 0.5)
NMS_THRESHOLD = 0.45          # non-maximum suppression threshold

Rebuild after changes

After any change, rebuild and reprovision:

avocado build
avocado provision -r dev

Prerequisites​

Initialize​

Install​

Build​

Deploy​

Verify​

API Endpoints​

Customize​

Configure your camera​

Change the model​

Adjust detection sensitivity​

Rebuild after changes​