Evaluation and Testing#
The evaluation for each model can be found on the GitHub repo of the PoV.
System Evaluation Results#
The section shows the benchmark results of the VisionPilot model on different hardware environment as the references. There are two procedures to conduct the benchmark:
Two sets of computation configurations are used to benchmark the pipile:
ADLINK AVA-3510#
Hardware Spec#
- CPU: Intel Xeon E-2278GE (16 cores)
- GPU: NVIDIA Quadro RTX 5000.
- Memory: 64GB LPDDR5/16G on RTX 5000
- Driver: Driver Version: 580.82.07 & CUDA Version: 13.0
- ROS: ROS Jazzy & Zenoh
- Runtime: TensorRT
- OS: Ubuntu 24.04.3
link to AL30 ( Autonomous Driving Solutions)
Benchmark Result#
-
Zenoh:
Model CPU Utilization GPU Utilization Peak Memory Usage Frame Rate SceneSeg 13% 65% 20G 58 DomainSeg 14% 62% 21G 58 Scene3D 15% 69% 19G 57 EgoSpace - SceneSeg
Current FPS: 58 --- Per-frame Timing (microseconds) --- * Total processing time: 17229 us * Preprocessing time: 1200 us * Inference time: 15829 us * Output time: 199 us- DomainSeg
Current FPS: 58 --- Per-frame Timing (microseconds) --- * Total processing time: 16625 us * Preprocessing time: 1203 us * Inference time: 15250 us * Output time: 171 us- Scene3D
Current FPS: 57 --- Per-frame Timing (microseconds) --- * Total processing time: 17484 us * Preprocessing time: 204 us * Inference time: 16817 us * Output time: 462 us -
ROS 2:
Model CPU Utilization GPU Utilization Peak Memory Usage Frame Rate SceneSeg 17% 72% 6G 60 DomainSeg 16% 68% 6G 60 Scene3D 16% 67% 5.9G 60 EgoSpace - SceneSeg
* Current FPS: 60 --- Per-frame Timing (microseconds) --- * Total processing time: 16272 us * Preprocessing time: 178 us * Inference time: 15713 us * Output time: 381 us- DomainSeg
* Current FPS: 60.00 --- Per-frame Timing (microseconds) --- * Total processing time: 15393 us * Preprocessing time: 253 us * Inference time: 14672 us * Output time: 467 us --------------------------
ARM processors and nVidia AGX Orin#
Hardware spec:#
- CPU: 12-core ARM Cortex-A78AE CPU at 2.2GHz.
- GPU: NVIDIA Ampere GPU with 2048 CUDA Cores.
- Memory: 64GB LPDDR5. The system and GPU memories are shared.
- Driver: The NVIDIA JetPack 6.0 (Ubuntu 22.04 LTS based) was used.
- ROS: ROS Humble with Autoware recommended Cyclone DDS settings.
- Runtime: ONNX runtime 1.19.0 or TensorRT
link to nVidia Jetson Orin AGX
Benchmark results:#
| Model | CPU Utilization | GPU Utilization | Peak Memory Usage | Frame Rate |
|---|---|---|---|---|
| SceneSeg (ONNX runtime) |
91% ~ 99% | 99% | 45G including network model (~30G) + other process (15G) |
8 |
| SceneSeg (TensorRT runtime - FP16) |
57 ~ 66 % | 74 % | 0.8 % (~0.50 GB) | 29.12 |
| DomainSeg (TensorRT runtime - FP16) |
56 ~ 60 % | 88 % | 0.8 % (~0.50 GB) | 29.85 |
| Scene3D (TensorRT runtime - FP16) |
53 ~ 56 % | 82 % | 0.6 % (~0.38 GB) | 29.90 |
| SceneSeg (TensorRT runtime - FP32) |
42 ~ 49 % | 99 % | 0.6 % (~0.38 GB) | 17.10 |
| DomainSeg (TensorRT runtime - FP32) |
43 ~ 47 % | 99 % | 0.6 % (~0.38 GB) | 17.07 |
| Scene3D (TensorRT runtime - FP32) |
44 ~ 46 % | 99 % | 0.6 % (~0.38 GB) | 17.03 |
link to the instructions and complete results.
- Demo Video: link