论文标题
物理基础图像编辑的逆渲染技术
Inverse Rendering Techniques for Physically Grounded Image Editing
论文作者
论文摘要
从一张场景的一张图片中,人们通常可以立即掌握空间布局,甚至可以很好地猜测材料属性以及光线来照亮场景的地方。例如,我们可以可靠地分辨出哪些对象会阻塞其他物体,一个对象是由什么组成的,其粗糙的形状,被照亮或在阴影中的区域等等。有趣的是,我们做出这些决定的能力知之甚少。因此,我们仍然无法稳健地“教”计算机与人进行相同的高级观察。本文档提出了用于了解单个图像的固有场景属性的算法。这些反向渲染技术的目的是估算仅使用图像中可见的信息(几何,材料,照明器,相机参数等)的场景元素的配置。此类算法在机器人技术和计算机图形方面具有应用。一种这样的应用程序是在物理接地的图像编辑中:通过利用物理空间的知识使照片编辑更容易。这些应用程序允许在几秒钟内执行复杂的编辑操作,从而使图像中对象的无缝添加,删除或搬迁。
From a single picture of a scene, people can typically grasp the spatial layout immediately and even make good guesses at materials properties and where light is coming from to illuminate the scene. For example, we can reliably tell which objects occlude others, what an object is made of and its rough shape, regions that are illuminated or in shadow, and so on. It is interesting how little is known about our ability to make these determinations; as such, we are still not able to robustly "teach" computers to make the same high-level observations as people. This document presents algorithms for understanding intrinsic scene properties from single images. The goal of these inverse rendering techniques is to estimate the configurations of scene elements (geometry, materials, luminaires, camera parameters, etc) using only information visible in an image. Such algorithms have applications in robotics and computer graphics. One such application is in physically grounded image editing: photo editing made easier by leveraging knowledge of the physical space. These applications allow sophisticated editing operations to be performed in a matter of seconds, enabling seamless addition, removal, or relocation of objects in images.
