Demo Instructions.
View in widescreen or zoom out until all images fit on one line.
Look around by clicking and dragging on an image above or using [W,A,S,D] keys.
Use the buttons above or the shift key to toggle between translation and rotation.
Click "Granular Scene Movement" to load more images for smoother movement.
This probably won't work on Internet Explorer or on mobile.

Fast Scene Loading Granular Scene Movement

Abstract

Recent advancements in differentiable rendering and 3D reasoning have driven exciting results in novel view synthesis from a single image. Despite realistic results, methods are limited to relatively small view change. In order to synthesize immersive scenes, models must also be able to extrapolate. We present an approach that fuses 3D reasoning with autoregressive modeling to outpaint large view changes in a 3D-consistent manner, enabling scene synthesis. We demonstrate considerable improvement in single image large-angle view synthesis results compared to a variety of methods and possible variants across simulated and real datasets. In addition, we show increased 3D consistency compared to alternative accumulation methods.

Video Results

We display rendered scenes from PixelSynth and baselines on RealEstate10K and Matterport.

Recent & Concurrent Work

There has been a variety of exciting recent and concurrent work on single-image novel view synthesis. In addition to SynSin, here is a partial list:

Jing Yu Koh, Honglak Lee, Yinfei Yang, Jason Baldridge and Peter Anderson. Pathdreamer: A World Model for Indoor Navigation [PDF]
Robin Rombach*, Patrick Esser* and Bjorn Ommer. Geometry-Free View Synthesis: Transformers and no 3D Priors [PDF]
Ronghang Hu, Nikhila Ravi, Alex Berg and Deepak Pathak. Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image [PDF]
Andrew Liu*, Richard Tucker*, Varun Jampani, Ameesh Makadia, Noah Snavely and Angjoo Kanazawa. Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image [PDF]
Meng-Li Shih, Shih-Yang Su, Johannes Kopf and Jia-Bin Huang. 3D Photography using Context-aware Layered Depth Inpainting [PDF]
Richard Tucker and Noah Snavely. Single-View View Synthesis with Multiplane Images [PDF]

Acknowledgements

Thanks to Angel Chang, Angela Dai, Richard Tucker and Noah Snavely for allowing us to share frames from their datasets. Thanks Olivia Wiles and Ajay Jain for polished model repositories which were so helpful in this work. Thanks to Shengyi Qian, Karan Desai, Mohamed El Banani, Linyi Jin, and Richard Higgins for the helpful discussions. Special thanks to the Michigan Help Desk (DCO) for after-hours help with machines. The webpage template originally came from some colorful folks.

Interactive Demo

Input

PixelSynth

SynSin (6x)

No 3D Accum.

Abstract

Paper and Supplemental Material

Video Results

Recent & Concurrent Work

Acknowledgements