skip to main content
10.1109/IROS51168.2021.9636041guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
research-article

Assembly Planning by Recognizing a Graphical Instruction Manual

Published: 27 September 2021 Publication History

Abstract

This paper proposes a robot assembly planning method by automatically reading the graphical instruction manuals designed for humans. Essentially, the method generates an Assembly Task Sequence Graph (ATSG) by recognizing a graphical instruction manual. An ATSG is a graph describing the assembly task procedure by detecting types of parts included in the instruction images, completing the missing information automatically, and correcting the detection errors automatically. To build an ATSG, the proposed method first extracts the information of the parts contained in each image of the graphical instruction manual. Then, by using the extracted part information, it estimates the proper work motions and tools for the assembly task. After that, the method builds an ATSG by considering the relationship between the previous and following images, which makes it possible to estimate the undetected parts caused by occlusion using the information of the entire image series. Finally, by collating the total number of each part with the generated ATSG, the excess or deficiency of parts are investigated, and task procedures are removed or added according to those parts. In the experiment section, we build an ATSG using the proposed method to a graphical instruction manual for a chair and demonstrate the action sequences found in the ATSG can be performed by a dual-arm robot execution. The results show the proposed method is effective and simplifies robot teaching in automatic assembly.

References

[1]
J. Hatori, Y. Kikuchi, S. Kobayashi, K. Takahashi, Y. Tsuboi, Y. Unno, W. Ko, and J. Tan,. Interactively picking real-world objects with unconstrained spoken language instructions. In Proceedings of IEEE International Conference on Robotics and Automation (ICRA), 2018.
[2]
A. E. Erdal, O. Ekaterina, O. Adil, Y. Yezhou, and A. Tamim,. Unsupervised linking of visual features to textual descriptions in long manipulation activities. IEEE Robotics and Automation Letters, 2(3):1397–1404, 2017.
[3]
L. S. H. de Mello and A. C. Sanderson, . And/or graph representation of assembly plans. IEEE Transactions on Robotics and Automation, 6(2):188–199, 1990.
[4]
S. J. Kwak, K. Hasegawa, and S. Y. Chung,. A framework for automatic generation of a contact state graph for robotic assembly. Advanced Robotics, 25(13-14):1603–1625, 2011.
[5]
R. H. Wilson and J. C. Latombe,. Geometric reasoning about mechanical assembly. Artificial Intelligence, 71(2):371–396, 1994.
[6]
H. Fakhurldeen, F. Dailami, and A. G. Pipe,. Cara system architecture - a click and assemble robotic assembly system. In Proceedings of International Conference on Robotics and Automation (ICRA), pages 5830–5836, 2019.
[7]
U. Thomas, T. Stouraitis, and M. A. Roa,. Flexible assembly through integrated assembly sequence planning and grasp planning. Proceedings of IEEE International Conference on Automation Science and Engineering (CASE), pages 586–592, 2015.
[8]
M. Dogar, A. Spielberg, S. Baker, and D. Rus,. Multi-robot grasp planning for sequential assembly operations. Proceedings of IEEE International Conference on Robotics and Automation (ICRA), 2015.
[9]
Weiwei Wan, Kensuke Harada, and Kazuyuki Nagata,. Assembly sequence planning for motion planning. Assembly Automation, 2018.
[10]
Ryota Moriyama, Weiwei Wan, and Kensuke Harada,. Dual-arm assembly planning considering gravitational constraints. In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 5566–5572, 2019.
[11]
T. Nishimura, A. Hashimoto, Y. Ushiku, H. Kameko, Y. Yamakata, and S. Mori,. Structure-aware procedural text generation from an image sequence. IEEE Access, 9:2125–2141, 2021.
[12]
M. Nicolescu, N. Arnold, J. Blankenburg, D. Feil-Seifer, S. Banisetty, M. Nicolescu, A. Palmer, and T. Monteverde,. Learning of complex-structured tasks from verbal instruction. In Proceedings of IEEE-RAS International Conference on Humanoid Robots (Humanoids), pages 747–754, 2019.
[13]
D. Paulius, Y. Huang, R. Milton, W. D. Buchanan, J. Sam, and Y. Sun,. Functional object-oriented network for manipulation learning. In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 2655–2662, 2016.
[14]
D. Paulius, A. B. Jelodar, and Y. Sun,. Functional object-oriented network: Construction expansion. In Proceedings of IEEE International Conference on Robotics and Automation (ICRA), pages 5935–5941, 2018.
[15]
D. Paulius, K. ShengP. Dong, and Y. Sun,. Functional object-oriented network: Considering robot’s capability in human-robot collaboration. CoRR, abs/1905.00502, 2019.
[16]
G. Kazhoyan and M. Beetz,. Programming robotic agents with action description. In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 103–108, 2017.
[17]
R. Shaw and NJ J. Bransford, Hillsdale,. The theory of affordances, in Perceiving, Acting, and Knowing. Towards an Ecological Psychology. Lawrence Erlbaum, 1977. pp 67-82; repr. as ch. 8 in Gibson.
[18]
M. Schoeler and F. Wörgötter,. Bootstrapping the semantics of tools: Affordance analysis of real world objects on a per-part basis. IEEE Transactions on Cognitive and Developmental Systems, 8(2):84–98, 2016.
[19]
L. S. H. Mello and A. C. Sanderson, . And/or graph representation of assembly plans. IEEE Transactions on Robotics and Automation, 6(2), 1990.
[20]
J. Redmon and A. Farhadi,. Yolov3: An incremental improvement. CoRR, abs/1804.02767, 2018.
[22]
K. Fukuda, I. G. Ramirez-Alpizar, N. Yamanobe, D. Petit, K. Nagata, and K. Harada,. Recognition of assembly tasks based on the actions associated to the manipulated objects. In Proceedings of IEEE/SICE International Symposium on System Integration (SII), pages 193–198, 2019.
[23]
Z. Hu, W. Wan, K. Koyama, and K. Harada,. A mechanical screwing tool for 2-finger parallel grippers – design, optimization, and manipulation policies, 2020.
[24]
J. J. Kuffner and S. M. LaValle,. Rrt-connect: An efficient approach to single-query path planning. In Proceedings of IEEE International Conference on Robotics and Automation (ICRA), volume 2, pages 995–1001, 2000.

Cited By

View all
  • (2024)GIMNet: Assembly Plan Generation from Graphical Instruction ManualProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632467(422-429)Online publication date: 4-Jan-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Sep 2021
7915 pages

Publisher

IEEE Press

Publication History

Published: 27 September 2021

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)GIMNet: Assembly Plan Generation from Graphical Instruction ManualProceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)10.1145/3632410.3632467(422-429)Online publication date: 4-Jan-2024

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media