@stevemacenski makes a lot of good points; this is a very difficult set of tasks.
How can I measure the distance between door and the camera.
What exactly does this mean? The distance between the centre of the door and the camera, the distance to the closest point on the door, or the distance to the plane the door lies in? For all of these you will need to estimate the shape and pose of the door. YOLO isn't doing this for you, so you will have to build something that does.
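To make the difference between those three distances concrete, here is a minimal sketch. It assumes you already have the door's 3D points as an (N, 3) NumPy array in the camera frame (camera at the origin, units in metres). The function name `door_distances` is made up for illustration, and the centroid is only a stand-in for the true door centre.

```python
import numpy as np

def door_distances(points):
    """Return (centre, closest point, plane) distances from the camera origin.

    points: (N, 3) array of door points in the camera frame, in metres.
    """
    points = np.asarray(points, dtype=float)
    centroid = points.mean(axis=0)

    # 1) Distance to the door centre, approximated by the centroid of the points.
    d_centre = np.linalg.norm(centroid)

    # 2) Distance to the closest measured point on the door.
    d_closest = np.linalg.norm(points, axis=1).min()

    # 3) Perpendicular distance to the least-squares plane through the points.
    #    The plane normal is the right singular vector with the smallest
    #    singular value of the centred point set.
    _, _, vt = np.linalg.svd(points - centroid)
    normal = vt[-1]
    d_plane = abs(normal @ centroid)

    return d_centre, d_closest, d_plane
```

If the camera looks at the door at an angle, the three numbers can differ quite a bit, so it is worth deciding which one you actually need.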
How can I measure the height and width of the door.
For this you will need to estimate the location of the two vertical edges and the two horizontal edges of the door. This seems obvious, I know, but it is the real question here.
One approach would be to use a neural network to segment the door once YOLO has detected it. This would tell you which of the 3D points from the depth camera actually belong to the door. The estimate will probably be a bit noisy, but it should be good enough to make a start. You can then fit a plane to this set of 3D points, which should lie fairly accurately over the door.
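As a rough sketch of how the segmentation mask turns into door points: assuming a pinhole camera model, a depth image in metres that is registered to the image the mask was computed on, and intrinsics `fx`, `fy`, `cx`, `cy` (in ROS these would normally come from the `K` matrix of the camera info message), you can back-project only the masked pixels. The function name and argument layout here are my own, not from any library.

```python
import numpy as np

def masked_depth_to_points(depth, mask, fx, fy, cx, cy):
    """Back-project the masked pixels of a depth image to 3D camera-frame points.

    depth: (H, W) depth in metres, 0 or NaN where there is no reading.
    mask:  (H, W) boolean array, True on pixels segmented as door.
    Returns an (N, 3) array of [x, y, z] points (z forward, x right, y down).
    """
    depth = np.asarray(depth, dtype=float)
    # Keep only door pixels that have a usable depth reading.
    valid = np.asarray(mask, dtype=bool) & np.isfinite(depth) & (depth > 0)
    v, u = np.nonzero(valid)  # row (v) and column (u) indices

    # Pinhole model: pixel offset from the principal point, scaled by depth.
    z = depth[v, u]
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.column_stack((x, y, z))
```

The resulting array is what you would feed into the plane fit.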
Secondly, you'll want some estimate of the orientation of your camera. If it's mounted on a platform with no roll or pitch, that's great: you can assume it's upright. If not, you'll need some TFs or a visual estimate so you know roughly which way is up.
Once you know which way is up in the point cloud and the plane of the door, it is relatively straightforward to remove the outliers and estimate the edges of the door.
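One simple way to do that last step, sketched under some assumptions: you have the door points, the plane normal, and an up vector all expressed in the same frame. For a level camera in the usual ROS optical frame (z forward, y down) the up vector would be (0, -1, 0). The function name `door_size` and the percentile trimming are my own choices for illustration; trimming is a crude outlier filter that slightly underestimates the size, so treat the result as a starting point rather than a finished edge detector.

```python
import numpy as np

def door_size(points, normal, up, trim_percent=2.0):
    """Estimate door (width, height) in metres from points lying near a plane.

    points:       (N, 3) door points.
    normal:       plane normal of the door (need not be unit length).
    up:           vector pointing up, in the same frame as the points.
    trim_percent: percentage of points ignored at each extreme as outliers.
    """
    points = np.asarray(points, dtype=float)
    normal = np.asarray(normal, dtype=float)
    normal = normal / np.linalg.norm(normal)
    up = np.asarray(up, dtype=float)

    # Vertical axis inside the door plane: "up" with its normal component removed.
    vertical = up - (up @ normal) * normal
    vertical = vertical / np.linalg.norm(vertical)
    # Horizontal axis inside the plane, perpendicular to both.
    horizontal = np.cross(vertical, normal)

    # 2D coordinates of every point along the two in-plane axes.
    centred = points - points.mean(axis=0)
    h = centred @ horizontal
    v = centred @ vertical

    # Extents between symmetric percentiles, which discards stray points.
    lo, hi = trim_percent, 100.0 - trim_percent
    width = np.percentile(h, hi) - np.percentile(h, lo)
    height = np.percentile(v, hi) - np.percentile(v, lo)
    return width, height
```

The two pairs of percentile values along `h` and `v` are also your estimates of where the vertical and horizontal edges sit in the door plane.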
What you're asking is possible, but it is by no means a simple task.
What data set have you trained YOLO on to recognize doors and door handles? Thanks!!
I am facing nearly the same problem. Did you find anything?