skip to main content
10.1145/3474085.3475194acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

MageAdd: Real-Time Interaction Simulation for Scene Synthesis

Published: 17 October 2021 Publication History

Abstract

While recent researches on computational 3D scene synthesis have achieved impressive results, automatically synthesized scenes do not guarantee satisfaction of end users. On the other hand, manual scene modelling can always ensure high quality, but requires a cumbersome trial-and-error process. In this paper, we bridge the above gap by presenting a data-driven 3D scene synthesis framework that can intelligently infer objects to the scene by incorporating and simulating user preferences with minimum input. While the cursor is moved and clicked in the scene, our framework automatically selects and transforms suitable objects into scenes in real time. This is based on priors learnt from the dataset for placing different types of objects, and updated according to the current scene context. Through extensive experiments we demonstrate that our framework outperforms the state-of-the-art on result aesthetics, and enables effective and efficient user interactions.

Supplementary Material

ZIP File (mfp0204aux.zip)
Including results, a video and a document.
MP4 File (MM21-mfp0204.mp4)
This video contains a presentation of the Paper "MageAdd: Real-Time Interaction Simulation for Scene Synthesis". It focuses on motivation, interactive demo, general workflow, experiments, etc. For more technical details, please refer to the paper. MageAdd is an interactive framework for synthesizing 3D scenes by iteratively inferring objects and their transformations based on cursor movements and clicks. Given a cursor movement at any moment, our framework automatically selects, translates and rotates an object plausibly into the scene. Mouse click would end an iteration and refresh priors, which introduces more potential objects and constraints. While the cursor moves, we achieve a real-time arrangement of objects with proper transformations.

References

[1]
autodesk.com. 2020. Autodesk Revit. https://www.autodesk.com/ Retrieved Dec 8, 2020 from
[2]
Angel Chang, Will Monroe, Manolis Savva, Christopher Potts, and Christopher D Manning. 2015. Text to 3d scene generation with rich lexical grounding. arXiv preprint arXiv:1505.06289 (2015).
[3]
Kang Chen, Yukun Lai, Yu-Xin Wu, Ralph Robert Martin, and Shi-Min Hu. 2014. Automatic semantic modeling of indoor scenes from low-quality RGB-D data using contextual information. ACM Transactions on Graphics, Vol. 33, 6 (2014).
[4]
Kang Chen, Yu-Kun Lai, and Shi-Min Hu. 2015. 3D indoor scene modeling from RGB-D data: a survey. Computational Visual Media, Vol. 1, 4 (2015), 267--278.
[5]
Francis DK Ching and Corky Binggeli. 2018. Interior design illustrated .John Wiley & Sons.
[6]
ea.com. 2020. The Sims 4. https://www.ea.com/zh-cn/games/the-sims/the-sims-4 Retrieved Dec 8, 2020 from
[7]
Matthew Fisher, Daniel Ritchie, Manolis Savva, Thomas Funkhouser, and Pat Hanrahan. 2012. Example-based synthesis of 3D object arrangements. ACM Transactions on Graphics (TOG), Vol. 31, 6 (2012), 135.
[8]
Matthew Fisher, Manolis Savva, Yangyan Li, Pat Hanrahan, and Matthias Nießner. 2015. Activity-centric scene synthesis for functional 3D scene modeling. ACM Transactions on Graphics (TOG), Vol. 34, 6 (2015), 179.
[9]
Huan Fu, Bowen Cai, Lin Gao, Lingxiao Zhang, Cao Li, Qixun Zeng, Chengyue Sun, Yiyun Fei, Yu Zheng, Ying Li, Yi Liu, Peng Liu, Lin Ma, Le Weng, Xiaohang Hu, Xin Ma, Qian Qian, Rongfei Jia, Binqiang Zhao, and Hao Zhang. 2020 a. 3D-FRONT: 3D Furnished Rooms with layOuts and semaNTics. arXiv preprint arXiv:2011.09127 (2020).
[10]
Huan Fu, Rongfei Jia, Lin Gao, Mingming Gong, Binqiang Zhao, Steve Maybank, and Dacheng Tao. 2020 b. 3D-FUTURE: 3D Furniture shape with TextURE. arXiv preprint arXiv:2009.09633 (2020).
[11]
Qiang Fu, Xiaowu Chen, Xiaotian Wang, Sijia Wen, Bin Zhou, and Hongbo Fu. 2017. Adaptive synthesis of indoor scenes via activity-associated object relation graphs. ACM Transactions on Graphics (TOG), Vol. 36, 6 (2017), 1--13.
[12]
Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, and Roberto Cipolla. 2016. Understanding real world indoor scenes with synthetic data. In Proceedings of the IEEE CVPR. 4077--4085.
[13]
kujiale.com. 2020. Kujiale. https://www.kujiale.com/ Retrieved Dec 8, 2020 from
[14]
Manyi Li, Akshay Gadi Patil, Kai Xu, Siddhartha Chaudhuri, Owais Khan, Ariel Shamir, Changhe Tu, Baoquan Chen, Daniel Cohen-Or, and Hao Zhang. 2019. Grains: Generative recursive autoencoders for indoor scenes. ACM Transactions on Graphics (TOG), Vol. 38, 2 (2019), 1--16.
[15]
Wanwan Li, Javier Talavera, Amilcar Gomez Samayoa, Jyh-Ming Lien, and Lap-Fai Yu. 2020. Automatic Synthesis of Virtual Wheelchair Training Scenarios. In 2020 IEEE Conference on Virtual Reality and 3D User Interfaces (VR). IEEE, 539--547.
[16]
Yuan Liang, Song-Hai Zhang, and Ralph Robert Martin. 2017. Automatic data-driven room design generation. In International Workshop on Next Generation Computer Animation Techniques. Springer, 133--148.
[17]
Tianqiang Liu, Aaron Hertzmann, Wilmot Li, and Thomas Funkhouser. 2015. Style compatibility for 3D furniture models. ACM Transactions on Graphics (TOG), Vol. 34, 4 (2015), 1--9.
[18]
Hong-Li Lu. 2010. Residential Interior Design .Liaoning Science and Technology Publishing House.
[19]
Andrew Luo, Zhoutong Zhang, Jiajun Wu, and Joshua B Tenenbaum. 2020. End-to-End Optimization of Scene Layout. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3754--3763.
[20]
Rui Ma, Akshay Gadi Patil, Matthew Fisher, Manyi Li, Sören Pirk, Binh-Son Hua, Sai-Kit Yeung, Xin Tong, Leonidas Guibas, and Hao Zhang. 2018. Language-driven synthesis of 3D scenes from scene databases. In SIGGRAPH Asia 2018 Technical Papers. ACM, 212.
[21]
Paul Merrell, Eric Schkufza, Zeyang Li, Maneesh Agrawala, and Vladlen Koltun. 2011. Interactive furniture layout using interior design guidelines. In ACM transactions on graphics (TOG), Vol. 30. ACM, 87.
[22]
Maureen Mitton and Courtney Nystuen. 2016. Residential interior design: A guide to planning spaces .John Wiley & Sons.
[23]
planner5d.com. 2020. Planner5d. https://planner5d.com/ Retrieved Dec 8, 2020 from
[24]
Siyuan Qi, Yixin Zhu, Siyuan Huang, Chenfanfu Jiang, and Song-Chun Zhu. 2018. Human-centric indoor scene synthesis using stochastic grammar. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5899--5908.
[25]
Manolis Savva, Angel X Chang, and Maneesh Agrawala. 2017. Scenesuggest: Context-driven 3d scene design. arXiv preprint arXiv:1703.00061 (2017).
[26]
Anna Bonarou Vasiliki Asaroglou. 2013. Furniture arrangement: in Residential spaces .CreateSpace Independent Publishing Platform.
[27]
Kai Wang, Yu-An Lin, Ben Weissmann, Manolis Savva, Angel X Chang, and Daniel Ritchie. 2019. Planit: Planning and instantiating indoor scenes with relation graph and spatial prior networks. ACM Transactions on Graphics (TOG), Vol. 38, 4 (2019), 132.
[28]
Kai Wang, Manolis Savva, Angel X Chang, and Daniel Ritchie. 2018. Deep convolutional priors for indoor scene synthesis. ACM Transactions on Graphics (TOG), Vol. 37, 4 (2018), 70.
[29]
Tomer Weiss, Alan Litteneker, Noah Duncan, Masaki Nakada, Chenfanfu Jiang, Lap-Fai Yu, and Demetri Terzopoulos. 2018. Fast and scalable position-based layout synthesis. arXiv preprint arXiv:1809.10526 (2018).
[30]
Guoming Xiong, Qiang Fu, Hongbo Fu, Bin Zhou, Guoliang Luo, and Zhigang Deng. 2020. Motion Planning for Convertible Indoor Scene Layout Design. IEEE Transactions on Visualization and Computer Graphics (2020).
[31]
Kun Xu, Kang Chen, Hongbo Fu, Wei-Lun Sun, and Shi-Min Hu. 2013. Sketch2Scene: sketch-based co-retrieval and co-placement of 3D models. ACM Transactions on Graphics (TOG), Vol. 32, 4 (2013), 123.
[32]
Meng Yan, Xuejin Chen, and Jie Zhou. 2017. An interactive system for efficient 3D furniture arrangement. In Proceedings of the Computer Graphics International Conference. 1--6.
[33]
Yi-Ting Yeh, Lingfeng Yang, Matthew Watson, Noah D Goodman, and Pat Hanrahan. 2012. Synthesizing open worlds with constraints using locally annealed reversible jump mcmc. ACM Transactions on Graphics (TOG), Vol. 31, 4 (2012), 56.
[34]
Lap-Fai Yu, Sai Kit Yeung, Chi-Keung Tang, Demetri Terzopoulos, Tony F Chan, and Stanley Osher. 2011. Make it home: automatic optimization of furniture arrangement. ACM Trans. Graph., Vol. 30, 4 (2011), 86.
[35]
Lap-Fai Yu, Sai-Kit Yeung, and Demetri Terzopoulos. 2015. The clutterpalette: An interactive tool for detailing indoor scenes. IEEE transactions on visualization and computer graphics, Vol. 22, 2 (2015), 1138--1148.
[36]
Suiyun Zhang, Zhizhong Han, and Hui Zhang. 2016. User guided 3D scene enrichment. In VRCAI. 353--362.
[37]
Song-Hai Zhang, Shao-Kui Zhang, Yuan Liang, and Peter Hall. 2019. A Survey of 3D Indoor Scene Synthesis. Journal of Computer Science and Technology, Vol. 34, 3, Article 594 (2019), 14 pages. https://doi.org/10.1007/s11390-019--1929--5
[38]
Song-Hai Zhang, Shao-Kui Zhang, Wei-Yu Xie, Cheng-Yang Luo, Yong-Liang Yang, and Hongbo Fu. 2021 b. Fast 3D Indoor Scene Synthesis by Learning Spatial Relation Priors of Objects. IEEE Transactions on Visualization and Computer Graphics (2021). https://doi.org/10.1109/TVCG.2021.3050143
[39]
Shao-Kui Zhang, Wei-Yu Xie, and Song-Hai Zhang. 2021 a. Geometry-Based Layout Generation with Hyper-Relations AMONG Objects. Graphical Models (2021), 101104. https://doi.org/10.1016/j.gmod.2021.101104
[40]
Tao-Kai Zheng. 2011. Interior Design For Home .Huazhong University of Science & Technology Press.
[41]
Yang Zhou, Zachary While, and Evangelos Kalogerakis. 2019. Scenegraphnet: Neural message passing for 3d indoor scene augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 7384--7392.
[42]
Fan Zhu, Li Liu, Jin Xie, Fumin Shen, Ling Shao, and Yi Fang. 2018. Learning to synthesize 3d indoor scenes from monocular images. In Proceedings of the 26th ACM international conference on Multimedia. 501--509.

Cited By

View all
  • (2024)Controllable Procedural Generation of LandscapesProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681129(6394-6403)Online publication date: 28-Oct-2024
  • (2024)ScenePhotographer: Object-Oriented Photography for Residential ScenesProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680942(7843-7851)Online publication date: 28-Oct-2024
  • (2024)SceneExpander: Real-Time Scene Synthesis for Interactive Floor Plan EditingProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680798(6232-6240)Online publication date: 28-Oct-2024
  • Show More Cited By

Index Terms

  1. MageAdd: Real-Time Interaction Simulation for Scene Synthesis

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '21: Proceedings of the 29th ACM International Conference on Multimedia
    October 2021
    5796 pages
    ISBN:9781450386517
    DOI:10.1145/3474085
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. 3D indoor scene synthesis
    2. spatial inference
    3. user interaction

    Qualifiers

    • Research-article

    Funding Sources

    • National Key Technology R&D Program
    • National Natural Science Foundation of China
    • Tsinghua?Tencent Joint Laboratory for Internet Innovation Technology
    • Adobe
    • Research Grant of Beijing Higher Institution Engineering Research Center
    • RCUK grant CAMERA

    Conference

    MM '21
    Sponsor:
    MM '21: ACM Multimedia Conference
    October 20 - 24, 2021
    Virtual Event, China

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)52
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 13 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media