A detailed technical schematic diagram in the style of a computer vision architecture flowchart, illustrating the VICAD "Human-Vehicle-Road-Cloud" collaborative architecture. The diagram features rectangular boxes connected by arrows, with labels in sans-serif font. On the left: a box labeled "Human Input" connected to an image of a person interacting with a device. Flowing to a central "Vehicle Backbone" box, then splitting to "Road Feature Extraction" and "Cloud Context Fusion" boxes. These connect to a "Synergistic Depth Net" with a depth distribution visualization. Further to an "Efficient Collaborative Pooling" box leading to a "BEV-Like Synergy Feature" plane. Finally, to a "Decision Head" with prediction results visualization. Below: multi-view images of human, vehicle, road, and cloud icons. Include dotted lines for supervision signals. Clean, professional style with blue accents, white background, similar to BEV detection model diagrams. See more