I don't really get it. Shouldn't the guy who makes the media decide what the dynamic range should be and bake that into the final rendered image? Why do I need the media format and screen to negotiate this?

I've never watched HDR media, btw. I want to understand why it can't be done with existing tech, because it sounds like it can.