EVP
Official demo for EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment. EVP is a deep learning model for metric depth estimation from a single image and for referring segmentation. For more details, see our project page, paper, or GitHub repository.
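To make "metric depth" concrete: a metric depth map assigns each pixel a distance in metres, and a demo typically rescales those distances to a display range before showing them. The following is a minimal sketch of that rescaling only, not EVP's code. The function name `depth_to_gray`, the nested-list input format, and the near-is-dark convention are all assumptions made for illustration.

```python
def depth_to_gray(depth_m, d_min=None, d_max=None):
    """Map a 2D grid of metric depths (metres) to 0-255 grey levels.

    Hypothetical helper for illustration; not part of EVP.
    Near pixels map to dark values and far pixels to bright values.
    """
    flat = [d for row in depth_m for d in row]
    # Fall back to the observed range when no display range is given.
    lo = min(flat) if d_min is None else d_min
    hi = max(flat) if d_max is None else d_max
    span = hi - lo
    if span <= 0:
        # A constant depth map carries no contrast; return all zeros.
        return [[0 for _ in row] for row in depth_m]
    out = []
    for row in depth_m:
        # Clamp each depth into [lo, hi], then scale linearly to 0..255.
        out.append([round(255 * (min(max(d, lo), hi) - lo) / span) for d in row])
    return out
```

Passing a fixed `d_min` and `d_max` (for example 0 and 10 metres) keeps the grey scale comparable across images, whereas the default per-image range maximizes contrast but discards the absolute scale.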
Depth Prediction demo
Examples
Referring Segmentation demo
Examples
| Input Image | Prompt |
|---|---|