Latest Trends in Visual Manipulation and Navigation in Robotics

Latest Trends in Visual Manipulation and Navigation in Robotics

Authors

DOI:

https://doi.org/10.56741/jnest.v2i01.253

Keywords:

visual manipulation, visual navigation, robotics, robot vision, machine vision

Abstract

In recent decades, the term Robot has become more and more popular. A robot can be defined as a machine that is specifically built to complete certain tasks to help human-being. In order to successfully accomplish its task, the robot needs to receive input data and process it. Then, the processed data is used for manipulator-actions decision-making. The input data can vary from sound, temperature, vibration, touch, vision, etc. Among those input data, vision is arguably one of the most challenging data. This is because vision often needs detailed and complicated preprocessing before it can be used. In addition, vision data size is relatively larger compared to the other type of input data, making it more challenging to process considering the computational resources. In this paper, current research and future development trend of robotic vision were reviewed and discussed. Further, challenges and potential issues about robot vision, such as safety and privacy concerns, were also discussed.

Downloads

Download data is not yet available.

Author Biographies

Muhammad Miftahul Amri, Universitas Ahmad Dahlan, Indonesia

Muhammad Miftahul Amri received his B.S. from the Department of Computer Science and Electronics, Universitas Gadjah Mada Indonesia, in 2018, and an M.S. from the Department of Electrical and Computer Engineering, Sungkyunkwan University South Korea in 2021, where he is currently pursuing his Ph.D. In 2022, he received his M.M. and professional engineer degrees from Universitas Terbuka Indonesia and Universitas Muhammadiyah Yogyakarta Indonesia, respectively. In 2021, he joined the faculty at Universitas Ahmad Dahlan Indonesia, where he is currently a lecturer in the Department of Electrical Engineering. His research interests include wireless communication and reconfigurable intelligent surface. He can be contacted by email: muhammad.amri@te.uad.ac.id.

Franklin Ore Areche, National University of Huancavelica

Franklin Ore Areche is a professor at the Professional School of Agroindustrial Engineering at the National University of Huancavelica, Peru. He conducted research on the various Andean agricultural products along the lines of food science and technology, industrial fermentations, medicinal plants, and essential oils, among others. He can be contacted at: franklin.ore@unh.edu.pe

Amar Ratnakar Naik, PES University, Bangalore, India

Amar Ratnakar Naik completed his Bachelor’s in   Computer   Science from the College of Engineering (Goa, India) and his MBA from IIM Lucknow (India). He is currently pursuing his Ph.D. degree from PES University (Bangalore, India). He has been working for the last 15+ years in the IT industry in the BI and analytics space. He can be contacted at: amar.r.naik@gmail.com.

References

Owen-Hill, Alex. "Robot Vision vs Computer Vision: What's the Difference?" Robot Vision vs Computer Vision: What's the Difference?, blog.robotiq.com/robot-vision-vs-computer-vision-whats-the-difference.

Hashimoto, Manabu, Yukiyasu Domae, and Shun'ichi Kaneko. "Current status and future trends on robot vision technology." Journal of Robotics and Mechatronics 29.2 (2017): 275-286.

H. Kayaba, S. Kaneko, H. Takauji, M. Toda, K. Kuno, and H. Suganuma, "Robust Matching of Dot Cloud Data Based on Model Shape Evaluation Oriented to 3D Defect Recognition," IEICE Trans. D, Vol.J95-D, No.1, pp. 97-110, 2012.

S. Kaneko, T. Kondo, and A. Miyamoto, "Robust matching of 3D contours using iterative closest point algorithm improved by Mestimation," Pattern Recognition, Vol.36, pp. 2041-2047, 2003.

Y. Domae, S. Kawato et al., "Self-calibration of Hand-eye Coordinate Systems by Five Observations of an Uncalibrated Mark," CIEEJ Trans. On Electronics, Information and Systems, Vol.132, No.6, pp. 968-974, 2011.

R. Haraguchi, Y. Domae et al., "Development of Production Robot System that can Assemble Products with Cable and Connector," J. of Robotics and Mechatronics, Vol.23, No.6, pp. 939-950, 2011.

A. Noda, Y. Domae et al., "Bin-picking System for General Objects," J. of the Robotics Society of Japan, Vol.33, No.5, pp. 387- 394, 2015.

T. Komuro, Y. Senjo, K. Sogen, S. Kagami, and M. Ishikawa, "RealTime Shape Recognition Using a Pixel-Parallel Processor," J. of Robotics and Mechatronics, Vol.17, No.4, pp. 410-419, 2005.

T. Senoo, Y. Yamakawa, Y. Watanabe, H. Oku, and M. Ishikawa, "High-Speed Vision and its Application Systems," J. of Robotics and Mechatronics, Vol.26, No.3, pp. 287-301, 2014.

Yanming Guo, Yu Liu, Ard Oerlemans, Songyang Lao, Song Wu, Michael S. Lew. Deep learning for visual understanding: A review. Neurocomputing, 187 (2016), pp. 27-48. 10.

Soren Goyal, Paul Benjamin. Object Recognition Using Deep Neural Networks: A Survey. http://arxiv.org/abs/1412.3684. Dec 2014.

Jiuxiang Gu, Zhenhua Wang, Jason Kuen, Lianyang Ma, Amir Shahroudy, Bing Shuai, Ting Liu, Xingxing Wang, Gang Wang. Recent Advances in Convolutional Neural Networks. http://arxiv.org/abs/1512.07108. Dec 2015.

Yann LeCun, Yoshua Bengio, Geoffrey Hinton. Deep learning. Nature 521, 436–444 (28 May 2015), doi:10.1038/nature14539.

Li Deng. A Tutorial Survey of Architectures, Algorithms, and Applications for Deep Learning. APSIPA Transactions on Signal and Information Processing.

J. Schmidhuber. Deep Learning in Neural Networks: An Overview. Neural Networks, Volume 61, January 2015, Pages 85-117 (DOI: 10.1016/j.neunet.2014.09.003), published online in 2014.

Ivan Bogun, Anelia Angelova, Navdeep Jaitly. Object Recognition from Short Videos for Robotic Perception. arXiv:1509.01602v1 [cs.CV] 4 Sep 2015.

Joseph Redmon, Anelia Angelova. Real-Time Grasp Detection Using Convolutional Neural Networks. arXiv:1412.3128v2 [cs.RO] 28 Feb 2015.

Jaeyong Sung, Seok Hyun Jin, Ian Lenz, and Ashutosh Saxena. Robobarista: Learning to Manipulate Novel Objects via Deep Multimodal Embedding. arXiv:1601.02705v1 [cs.RO] 12 Jan 2016.

Pablo F. Alcantarilla, Simon Stent, German Ros, Roberto Arroyo and Riccardo Gherardi. Street-View Change Detection with Deconvolutional Networks. Proceedings of Robotics: Science and System Proceedings. June 2016.

D. Tomè, F. Monti, L. Baroffio, L. Bondi, M. Tagliasacchi, S. Tubaro. Deep Convolutional neural networks for pedestrian detection. arXiv:1510.03608v5 [cs.CV] 7 Mar 2016.

Farzad Husain, Babette Dellen and Carme Torra. Action Recognition based on Efficient Deep Feature Learning in the Spatio-Temporal Domain. IEEE International Conference on Robotics and Automation (ICRA). May 2016.

Yang, Qiang, et al. "Federated machine learning: Concept and applications." ACM Transactions on Intelligent Systems and Technology (TIST) 10.2 (2019): 1-19.

Luo, Jiahuan, et al. "Real-world image datasets for federated learning." arXiv preprint arXiv:1910.11089 (2019).

Liu, Fenglin, et al. "Federated learning for vision-and-language grounding problems." Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 34. No. 07. 2020.

Liu, Yang, et al. "Fedvision: An online visual object detection platform powered by federated learning." Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 34. No. 08. 2020.

Breaking Privacy in Federated Learning." KDnuggets, www.kdnuggets.com/2020/08/breaking-privacy-federated-learning.html.

Goldberg, Ken, and Ben Kehoe. "Cloud robotics and automation: A survey of related work." EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2013-5 (2013).

H. Bistry and J. Zhang, "A cloud computing approach to complex robot vision tasks using smart camera systems," 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010, pp. 3195-3200, doi: 10.1109/IROS.2010.5653660.

J. Salmerón-Garcı´a, P. Íñigo-Blasco, F. Dı´az-del-Rı´o and D. Cagigas-Muñiz, "A Tradeoff Analysis of a Cloud-Based Robot Navigation Assistant Using Stereo Image Processing," in IEEE Transactions on Automation Science and Engineering, vol. 12, no. 2, pp. 444-454, April 2015, doi: 10.1109/TASE.2015.2403593.

"Transportation Planning: Modern Mobility Partners: Atlanta." Modern Mobility Partners: Atalanta 2020, www.modernmobilitypartners.com/.

Koizumi, Norihiro, et al. "Robot Vision Technology for Technologizing and Digitalization of Medical Diagnostic and Therapeutic Skills." IJAT 3.5 (2009): 541-550.

Jeon S, Hwangbo S, Hong J. A Surgical Navigation System to Assist in Chronic Total Occlusion Intervention. In: URAI 2016.

Choi H, Park Y, Cho H, Hong J. An augmented reality based simple navigation system for pelvic tumor resection. Proc. of American Acadamy of Orthopaedic Surgeons, 2016.

Lee S, Yoon H-S, Park J, Chung Y-S, Hong J, Yi B-J. A Surgical Navigation and Endoscope Holder Integrated System for Sinus Surgery. Proc. of the 11th Asian Conference on Computer Aided Surgery, 2015.

B. -E. Byambasuren, T. Baasanjav, T. Myagmarjav and B. Baatar, "Application of Image Processing and Industrial Robot Arm for Quality Assurance Process of Production," 2020 IEEE Region 10 Symposium (TENSYMP), 2020, pp. 526-530, doi: 10.1109/TENSYMP50017.2020.9230977.

S. Chakole and N. Ukani, "Low-Cost Vision System for Pick and Place application using camera and ABB Industrial Robot," 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT), 2020, pp. 1-6, doi: 10.1109/ICCCNT49239.2020.9225522.

L. Sun, "Industrial Robot Control Based on Probablistic Network Image Recognition," 2020 IEEE International Conference on Information Technology, Big Data and Artificial Intelligence (ICIBA), 2020, pp. 1517-1520, doi: 10.1109/ICIBA50161.2020.9276841.

"Google's Self-Driving Car Collides with Public Bus in California." Donga, www.donga.com /en/article/all/20160302/526673/1/Google-s-self-driving-car-collides-with-public-bus-in-California.

"Uber's self-driving operator charged over fatal crash." BBC, www.bbc.com/news/technology-54175359.

"2 Killed in Driverless Tesla Car Crash, Officials Say." The New York Times, www.nytimes.com /2021/04/18/business/tesla-fatal-crash-texas.html.

"First Driverless Car Crash Study: Autonomous Vehicles Crash More, but Injuries Are Less Serious." Government Technology, www.govtech.com/fs/first-driverless-car-crash-study-autonomous-vehicles-crash-more-injuries-are-less-serious.html

Do, Truong-Dong, et al. "Real-time self-driving car navigation using deep neural network." 2018 4th International Conference on Green Technology and Sustainable Development (GTSD). IEEE, 2018.

Suresh, Aswath, et al. "Design and Development of Intelligent Self-driving Car Using ROS and Machine Vision Algorithm." International Conference on Robot Intelligence Technology and Applications. Springer, Cham, 2017.

Raviteja, Tirumalapudi. "Global Image Segmentation Process using Machine Learning algorithm & Convolution Neural Network method for Self-Driving Vehicles." arXiv preprint arXiv:2010.13294 (2020).

Shamrat, FM Javed Mehedi, et al. "A smart automated system model for vehicles detection to maintain traffic by image processing." International Journal of Scientific and Technology Research 9.2 (2020): 2921-2928.

Shaikh, Safiya Naaz, Ramijraja Zamadar, and Harish Kumar Sharma. "Identification and Recognition of Text and Face Using Image Processing for Visually Impaired." Available at SSRN 3682074 (2020).

Rizki, Rai Purnama, et al. "Image Processing Technique for Smart Home Security Based On the Principal Component Analysis (PCA) Methods." 2020 6th International Conference on Wireless and Telematics (ICWT). IEEE, 2020.

Kong, Xiangbo, Lin Meng, and Hiroyuki Tomiyama. "A self-learning fall detection system for elderly persons using depth camera." International Journal of Advanced Mechatronic Systems 8.1 (2020): 16-25.

"Computational Challenges in Image Processing." Newton Gateway to Mathematics, www.gateway.newton.ac.uk/event/ofbw32.

Ruiz-del-Solar, Javier, Patricio Loncomilla, and Naiomi Soto. "A survey on deep learning methods for robot vision." arXiv preprint arXiv:1803.10862 (2018).

S. Han, H. Mao, W.J. Dally: Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding (ICLR'16, best paper award)

Han S, Liu X, Mao H, Pu J, Pedram A, Horowitz MA, Dally WJ. EIE: Efficient inference engine on compressed deep neural network. ACM SIGARCH Computer Architecture News. 2016 Jun 18;44(3):243-54.

Kumar, PS Jagadeesh, et al. "Advanced Robot Vision for Medical Surgical Applications."

Sivak, Michael, and Brandon Schoettle. Road safety with self-driving vehicles: General limitations and road sharing with conventional vehicles. University of Michigan, Ann Arbor, Transportation Research Institute, 2015.

Chowdhury, Abdullahi, et al. "Attacks on Self-Driving Cars and Their Countermeasures: A Survey." IEEE Access 8 (2020): 207308-207342.

Hisan, Urfa Khairatun, and Muhammad Miftahul Amri. "Artificial Intelligence for Human Life: A Critical Opinion from Medical Bioethics Perspective–Part I." Journal of Public Health Sciences 1.02 (2022): 100-111.

Hisan, Urfa Khairatun, and Muhammad Miftahul Amri. "Artificial Intelligence for Human Life: A Critical Opinion from Medical Bioethics Perspective–Part II." Journal of Public Health Sciences 1.02 (2022): 112-130.

"Driverless cars: Who should die in a crash?." BBC, www.bbc.com/news/technology-45991093.

Downloads

Published

2023-02-27

How to Cite

Miftahul Amri, M., Areche, F. O., & Ratnakar Naik, A. (2023). Latest Trends in Visual Manipulation and Navigation in Robotics. Journal of Novel Engineering Science and Technology, 2(01), 1–8. https://doi.org/10.56741/jnest.v2i01.253

Plaudit

Loading...