iBet uBet web content aggregator. Adding the entire web to your favor.

Link to original content: https://api.crossref.org/works/10.1145/3446370

{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,23]],"date-time":"2024-09-23T04:23:37Z","timestamp":1727065417764},"reference-count":181,"publisher":"Association for Computing Machinery (ACM)","issue":"3","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2022,4,30]]},"abstract":"Nowadays, robots are dominating the manufacturing, entertainment, and healthcare industries. Robot vision aims to equip robots with the capabilities to discover information, understand it, and interact with the environment, which require an agent to effectively understand object affordances and functions in complex visual domains. In this literature survey, first, \u201cvisual affordances\u201d are focused on and current state-of-the-art approaches for solving relevant problems as well as open problems and research gaps are summarized. Then, sub-problems, such as affordance detection, categorization, segmentation, and high-level affordance reasoning, are specifically discussed. Furthermore, functional scene understanding and its prevalent descriptors used in the literature are covered. This survey also provides the necessary background to the problem, sheds light on its significance, and highlights the existing challenges for affordance and functionality learning.<\/jats:p>","DOI":"10.1145\/3446370","type":"journal-article","created":{"date-parts":[[2021,4,17]],"date-time":"2021-04-17T10:09:06Z","timestamp":1618654146000},"page":"1-35","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":42,"title":["Visual Affordance and Function Understanding"],"prefix":"10.1145","volume":"54","author":[{"given":"Mohammed","family":"Hassanin","sequence":"first","affiliation":[{"name":"University of New South Wales Canberra, Australia"}]},{"given":"Salman","family":"Khan","sequence":"additional","affiliation":[{"name":"Inception Institute of Artificial Intelligence, IIAT, UAE"}]},{"given":"Murat","family":"Tahtali","sequence":"additional","affiliation":[{"name":"University of New South Wales Canberra, Australia"}]}],"member":"320","published-online":{"date-parts":[[2021,4,17]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the International Conference on Computer, Communication and Signal Processing (ICCCSP\u201917)","author":"Aarthi S.","year":"2017","unstructured":"S. Aarthi and S. Chitrakala . 2017. Scene understanding; A survey . In Proceedings of the International Conference on Computer, Communication and Signal Processing (ICCCSP\u201917) . 1--4. DOI:https:\/\/doi.org\/10.1109\/ICCCSP. 2017 .7944094 10.1109\/ICCCSP.2017.7944094 S. Aarthi and S. Chitrakala. 2017. Scene understanding; A survey. In Proceedings of the International Conference on Computer, Communication and Signal Processing (ICCCSP\u201917). 1--4. DOI:https:\/\/doi.org\/10.1109\/ICCCSP.2017.7944094"},{"key":"e_1_2_1_2_1","unstructured":"Paulo Abelha and Frank Guerin. 2017. Transfer of tool affordance and manipulation cues with 3D vision data. Retrieved from https:\/\/arXiv:1710.04970. Paulo Abelha and Frank Guerin. 2017. Transfer of tool affordance and manipulation cues with 3D vision data. Retrieved from https:\/\/arXiv:1710.04970."},{"key":"e_1_2_1_3_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201916)","author":"Abelha P.","unstructured":"P. Abelha , F. Guerin , and M. Schoeler . 2016. A model-based approach to finding substitute tools in 3D vision data . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201916) . 2471--2478. P. Abelha, F. Guerin, and M. Schoeler. 2016. A model-based approach to finding substitute tools in 3D vision data. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201916). 2471--2478."},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of IEEE International Conference on Intelligent Robots and Systems (IROS\u201917)","author":"Ferreira Paulo Abelha","year":"2017","unstructured":"Paulo Abelha Ferreira and Frank Guerin . 2017 . Learning how a tool affords by simulating 3D models from the web . In Proceedings of IEEE International Conference on Intelligent Robots and Systems (IROS\u201917) . IEEE Press. Paulo Abelha Ferreira and Frank Guerin. 2017. Learning how a tool affords by simulating 3D models from the web. In Proceedings of IEEE International Conference on Intelligent Robots and Systems (IROS\u201917). IEEE Press."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2012.6224931"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12369-015-0281-3"},{"key":"e_1_2_1_7_1","volume-title":"Experimental Robotics","author":"Bo Liefeng","unstructured":"Liefeng Bo , Xiaofeng Ren , and Dieter Fox . 2013. Unsupervised feature learning for RGB-D based object recognition . In Experimental Robotics . Springer , 387--402. Liefeng Bo, Xiaofeng Ren, and Dieter Fox. 2013. Unsupervised feature learning for RGB-D based object recognition. In Experimental Robotics. Springer, 387--402."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2013.2289018"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2009.5459303"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAMD.2011.2106782"},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201915)","author":"Chao Y. W.","year":"2015","unstructured":"Y. W. Chao , Z. Wang , R. Mihalcea , and J. Deng . 2015. Mining semantic affordances of visual object categories . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201915) . 4259--4267. DOI:https:\/\/doi.org\/10.1109\/CVPR. 2015 .7299054 10.1109\/CVPR.2015.7299054 Y. W. Chao, Z. Wang, R. Mihalcea, and J. Deng. 2015. Mining semantic affordances of visual object categories. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201915). 4259--4267. DOI:https:\/\/doi.org\/10.1109\/CVPR.2015.7299054"},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201915)","author":"Chao Y. W.","unstructured":"Y. W. Chao , Z. Wang , R. Mihalcea , and J. Deng . 2015. Mining semantic affordances of visual object categories . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201915) . 4259--4267. Y. W. Chao, Z. Wang, R. Mihalcea, and J. Deng. 2015. Mining semantic affordances of visual object categories. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201915). 4259--4267."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1207\/S15326969ECO1502_5"},{"key":"e_1_2_1_14_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201915)","author":"Chen C.","year":"2015","unstructured":"C. Chen , A. Seff , A. Kornhauser , and J. Xiao . 2015. DeepDriving: Learning affordance for direct perception in autonomous driving . In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201915) . 2722--2730. DOI:https:\/\/doi.org\/10.1109\/ICCV. 2015 .312 10.1109\/ICCV.2015.312 C. Chen, A. Seff, A. Kornhauser, and J. Xiao. 2015. DeepDriving: Learning affordance for direct perception in autonomous driving. In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201915). 2722--2730. DOI:https:\/\/doi.org\/10.1109\/ICCV.2015.312"},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the International Conference on Learning Representations.","author":"Chen Liang-Chieh","unstructured":"Liang-Chieh Chen , George Papandreou , Iasonas Kokkinos , Kevin Murphy , and Alan L. Yuille . 2015. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs . In Proceedings of the International Conference on Learning Representations. Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille. 2015. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. In Proceedings of the International Conference on Learning Representations."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0779-4"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2019.2930364"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2019.2894439"},{"key":"e_1_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Ching-Yao Chuang Jiaman Li Antonio Torralba and Sanja Fidler. 2017. Learning to act properly: Predicting and explaining affordances from images. Retrieved from https:\/\/arXiv:1712.07576. Ching-Yao Chuang Jiaman Li Antonio Torralba and Sanja Fidler. 2017. Learning to act properly: Predicting and explaining affordances from images. Retrieved from https:\/\/arXiv:1712.07576.","DOI":"10.1109\/CVPR.2018.00108"},{"key":"e_1_2_1_20_1","volume-title":"R-fcn: Object detection via region-based fully convolutional networks. In Advances in Neural Information Processing Systems","author":"Dai Jifeng","year":"2016","unstructured":"Jifeng Dai , Yi Li , Kaiming He , and Jian Sun . 2016 . R-fcn: Object detection via region-based fully convolutional networks. In Advances in Neural Information Processing Systems . MIT Press , 379--387. Jifeng Dai, Yi Li, Kaiming He, and Jian Sun. 2016. R-fcn: Object detection via region-based fully convolutional networks. In Advances in Neural Information Processing Systems. MIT Press, 379--387."},{"key":"e_1_2_1_21_1","volume-title":"Muggleton","author":"Raedt Luc De","year":"2008","unstructured":"Luc De Raedt , Paolo Frasconi , Kristian Kersting , and Stephen H . Muggleton . 2008 . Probabilistic Inductive Logic Programming. Vol. 4911 . Springer . Luc De Raedt, Paolo Frasconi, Kristian Kersting, and Stephen H. Muggleton. 2008. Probabilistic Inductive Logic Programming. Vol. 4911. Springer."},{"key":"e_1_2_1_22_1","volume-title":"IJCAI","volume":"7","author":"Raedt Luc De","year":"2007","unstructured":"Luc De Raedt , Angelika Kimmig , and Hannu Toivonen . 2007 . ProbLog: A probabilistic Prolog and its application in link discovery . In IJCAI , Vol. 7 . Hyderabad, 2462--2467. Luc De Raedt, Angelika Kimmig, and Hannu Toivonen. 2007. ProbLog: A probabilistic Prolog and its application in link discovery. In IJCAI, Vol. 7. Hyderabad, 2462--2467."},{"key":"e_1_2_1_23_1","volume-title":"Efros","author":"Delaitre Vincent","year":"2012","unstructured":"Vincent Delaitre , David F. Fouhey , Ivan Laptev , Josef Sivic , Abhinav Gupta , and Alexei A . Efros . 2012 . Scene semantics from long-term observation of people. In Proceedings of the European Conference on Computer Vision. Springer , 284--298. Vincent Delaitre, David F. Fouhey, Ivan Laptev, Josef Sivic, Abhinav Gupta, and Alexei A. Efros. 2012. Scene semantics from long-term observation of people. In Proceedings of the European Conference on Computer Vision. Springer, 284--298."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298916"},{"key":"e_1_2_1_25_1","volume-title":"Tsagarakis","author":"Do Thanh-Toan","year":"2017","unstructured":"Thanh-Toan Do , Anh Nguyen , Ian Reid , Darwin G. Caldwell , and Nikos G . Tsagarakis . 2017 . Affordancenet : An end-to-end deep learning approach for object affordance detection. Retrieved from https:\/\/arXiv:1709.07326. Thanh-Toan Do, Anh Nguyen, Ian Reid, Darwin G. Caldwell, and Nikos G. Tsagarakis. 2017. Affordancenet: An end-to-end deep learning approach for object affordance detection. Retrieved from https:\/\/arXiv:1709.07326."},{"key":"e_1_2_1_26_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201913)","author":"Doll\u00e1r Piotr","year":"1841","unstructured":"Piotr Doll\u00e1r and C. Lawrence Zitnick . 2013. Structured forests for fast edge detection . In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201913) . IEEE, 1841 --1848. Piotr Doll\u00e1r and C. Lawrence Zitnick. 2013. Structured forests for fast edge detection. In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201913). IEEE, 1841--1848."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/RoMoCo.2017.8003891"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/362007.362035"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.304"},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.","author":"Farhadi A.","unstructured":"A. Farhadi , I. Endres , D. Hoiem , and D. Forsyth . 2009. Describing objects by their attributes . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. A. Farhadi, I. Endres, D. Hoiem, and D. Forsyth. 2009. Describing objects by their attributes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2009.167"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROBOT.2003.1242073"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0710-z"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364913491297"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.5555\/2787405.2787406"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2716262"},{"key":"e_1_2_1_37_1","unstructured":"James Jerome Gibson. 1966. The senses considered as perceptual systems. (1966). James Jerome Gibson. 1966. The senses considered as perceptual systems. (1966)."},{"key":"e_1_2_1_38_1","volume-title":"Place, Space Reader","author":"Gibson James J.","year":"1979","unstructured":"James J. Gibson . 1979. The theory of affordances. People , Place, Space Reader ( 1979 ), 56--60. James J. Gibson. 1979. The theory of affordances. People, Place, Space Reader (1979), 56--60."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995327"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2014.6943119"},{"key":"e_1_2_1_41_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1--8.","author":"Gupta A.","unstructured":"A. Gupta and L. S. Davis . 2007. Objects in action: An approach for combining action understanding and object perception . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1--8. A. Gupta and L. S. Davis. 2007. Objects in action: An approach for combining action understanding and object perception. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1--8."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2009.83"},{"key":"e_1_2_1_43_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201911). 1961","author":"Gupta A.","year":"1968","unstructured":"A. Gupta , S. Satkin , A. A. Efros , and M. Hebert . 2011. From 3D scene geometry to human workspace . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201911). 1961 -- 1968 . A. Gupta, S. Satkin, A. A. Efros, and M. Hebert. 2011. From 3D scene geometry to human workspace. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201911). 1961--1968."},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0777-6"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10584-0_23"},{"key":"e_1_2_1_46_1","volume-title":"Attribute Based Affordance Detection from Human-Object Interaction Images","author":"Hassan Mahmudul","unstructured":"Mahmudul Hassan and Anuja Dharmaratne . 2016. Attribute Based Affordance Detection from Human-Object Interaction Images . Springer International Publishing , Cham , 220--232. Mahmudul Hassan and Anuja Dharmaratne. 2016. Attribute Based Affordance Detection from Human-Object Interaction Images. Springer International Publishing, Cham, 220--232."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2958608"},{"key":"e_1_2_1_48_1","volume-title":"Recent Advances in Natural Language Processing. John Benjamins","author":"Havasi Catherine","unstructured":"Catherine Havasi , Robert Speer , and Jason Alonso . 2007. ConceptNet 3: A flexible, multilingual semantic network for common sense knowledge . In Recent Advances in Natural Language Processing. John Benjamins , Philadelphia, PA , 27--29. Catherine Havasi, Robert Speer, and Jason Alonso. 2007. ConceptNet 3: A flexible, multilingual semantic network for common sense knowledge. In Recent Advances in Natural Language Processing. John Benjamins, Philadelphia, PA, 27--29."},{"key":"e_1_2_1_49_1","volume-title":"Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201917)","author":"He Kaiming","year":"2017","unstructured":"Kaiming He , Georgia Gkioxari , Piotr Dollar , and Ross Girshick . 2017 . Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201917) . Kaiming He, Georgia Gkioxari, Piotr Dollar, and Ross Girshick. 2017. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201917)."},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_1_51_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201911)","author":"Hermans Tucker","year":"2011","unstructured":"Tucker Hermans , James M. Rehg , and Aaron Bobick . 2011 . Affordance prediction via learned object attributes . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201911) . 181--184. Tucker Hermans, James M. Rehg, and Aaron Bobick. 2011. Affordance prediction via learned object attributes. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201911). 181--184."},{"key":"e_1_2_1_52_1","volume-title":"Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems. 2784--2790","author":"Hinkle L.","unstructured":"L. Hinkle and E. Olson . 2013. Predicting object functionality using physical simulations . In Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems. 2784--2790 . L. Hinkle and E. Olson. 2013. Predicting object functionality using physical simulations. In Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems. 2784--2790."},{"key":"e_1_2_1_53_1","volume-title":"Affordances for robots: A brief survey. Avant 3, 2","author":"Horton Thomas E.","year":"2012","unstructured":"Thomas E. Horton , Arpan Chakraborty , and Robert St. Amant . 2012. Affordances for robots: A brief survey. Avant 3, 2 ( 2012 ). Thomas E. Horton, Arpan Chakraborty, and Robert St. Amant. 2012. Affordances for robots: A brief survey. Avant 3, 2 (2012)."},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/2897824.2925870"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766914"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.573"},{"key":"e_1_2_1_57_1","first-page":"1","article-title":"Bayesian learning of tool affordances based on generalization of functional feature to estimate effects of unseen tools","volume":"18","author":"Jain Raghvendra","year":"2013","unstructured":"Raghvendra Jain and Tetsunari Inamura . 2013 . Bayesian learning of tool affordances based on generalization of functional feature to estimate effects of unseen tools . Artific. Life Robot. 18 , 1 -- 2 (2013), 95--103. Raghvendra Jain and Tetsunari Inamura. 2013. Bayesian learning of tool affordances based on generalization of functional feature to estimate effects of unseen tools. Artific. Life Robot. 18, 1--2 (2013), 95--103.","journal-title":"Artific. Life Robot."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-94-015-9456-1"},{"key":"e_1_2_1_59_1","volume-title":"The ecological approach to visual perception. Houghtom Mifflin","author":"James Gibson","unstructured":"Gibson James . 1979. The ecological approach to visual perception. Houghtom Mifflin , Dallas, TX . Gibson James. 1979. The ecological approach to visual perception. Houghtom Mifflin, Dallas, TX."},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCDS.2016.2594134"},{"key":"e_1_2_1_61_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201914)","author":"Kim David Inkyu","unstructured":"David Inkyu Kim and Gaurav S. Sukhatme . 2014. Semantic labeling of 3d point clouds with object affordance for robot manipulation . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201914) . IEEE, 5578--5584. David Inkyu Kim and Gaurav S. Sukhatme. 2014. Semantic labeling of 3d point clouds with object affordance for robot manipulation. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201914). IEEE, 5578--5584."},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/2601097.2601117"},{"key":"e_1_2_1_63_1","volume-title":"Kingma and Max Welling","author":"Diederik","year":"2013","unstructured":"Diederik P. Kingma and Max Welling . 2013 . Auto-encoding variational bayes. Retrieved from https:\/\/arXiv:1312.6114. Diederik P. Kingma and Max Welling. 2013. Auto-encoding variational bayes. Retrieved from https:\/\/arXiv:1312.6114."},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2010.08.002"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364913478446"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2430335"},{"key":"e_1_2_1_67_1","volume-title":"Proc. of the 2nd Int. Workshop on Epigenetics Robotics. Citeseer, 59--61","author":"Kozima Hideki","year":"2002","unstructured":"Hideki Kozima , Cocoro Nakagawa , and Hiroyuki Yano . 2002 . Emergence of imitation mediated by objects . In Proc. of the 2nd Int. Workshop on Epigenetics Robotics. Citeseer, 59--61 . Hideki Kozima, Cocoro Nakagawa, and Hiroyuki Yano. 2002. Emergence of imitation mediated by objects. In Proc. of the 2nd Int. Workshop on Epigenetics Robotics. Citeseer, 59--61."},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/2516971.2516975"},{"key":"e_1_2_1_69_1","volume-title":"Toward Category-level Object Recognition","author":"Leibe Bastian","unstructured":"Bastian Leibe , Ales Leonardis , and Bernt Schiele . 2006. An implicit shape model for combined object categorization and segmentation . In Toward Category-level Object Recognition . Springer , 508--524. Bastian Leibe, Ales Leonardis, and Bernt Schiele. 2006. An implicit shape model for combined object categorization and segmentation. In Toward Category-level Object Recognition. Springer, 508--524."},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.01265"},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46466-4_41"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.472"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298958"},{"key":"e_1_2_1_74_1","volume-title":"Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. 3418--3424","author":"Liang Wei","year":"2016","unstructured":"Wei Liang , Yibiao Zhao , Yixin Zhu , and Song-Chun Zhu . 2016 . What is where: Inferring containment relations from videos . In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. 3418--3424 . Wei Liang, Yibiao Zhao, Yixin Zhu, and Song-Chun Zhu. 2016. What is where: Inferring containment relations from videos. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. 3418--3424."},{"key":"e_1_2_1_75_1","volume-title":"Proceedings of the 37th Annual Meeting of the Cognitive Science Society.","author":"Liang Wei","year":"2015","unstructured":"Wei Liang , Yibiao Zhao , Yixin Zhu , and Song-Chun Zhu . 2015 . Evaluating human cognition of containing relations with physical simulation .. In Proceedings of the 37th Annual Meeting of the Cognitive Science Society. Wei Liang, Yibiao Zhao, Yixin Zhu, and Song-Chun Zhu. 2015. Evaluating human cognition of containing relations with physical simulation.. In Proceedings of the 37th Annual Meeting of the Cognitive Science Society."},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2017.2662238"},{"key":"e_1_2_1_77_1","volume-title":"Berg","author":"Liu Wei","year":"2016","unstructured":"Wei Liu , Dragomir Anguelov , Dumitru Erhan , Christian Szegedy , Scott Reed , Cheng-Yang Fu , and Alexander C . Berg . 2016 . SSD : Single shot multibox detector. In Proceedings of the European Conference on Computer Vision. Springer , 21--37. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C. Berg. 2016. SSD: Single shot multibox detector. In Proceedings of the European Conference on Computer Vision. Springer, 21--37."},{"key":"e_1_2_1_78_1","volume-title":"Text classification using string kernels. J. Mach. Learn. Res. 2 (Feb","author":"Lodhi Huma","year":"2002","unstructured":"Huma Lodhi , Craig Saunders , John Shawe-Taylor , Nello Cristianini , and Chris Watkins . 2002. Text classification using string kernels. J. Mach. Learn. Res. 2 (Feb . 2002 ), 419--444. Huma Lodhi, Craig Saunders, John Shawe-Taylor, Nello Cristianini, and Chris Watkins. 2002. Text classification using string kernels. J. Mach. Learn. Res. 2 (Feb. 2002), 419--444."},{"key":"e_1_2_1_79_1","volume-title":"Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems. 1015--1021","author":"Lopes M.","unstructured":"M. Lopes , F. S. Melo , and L. Montesano . 2007. Affordance-based imitation learning in robots . In Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems. 1015--1021 . M. Lopes, F. S. Melo, and L. Montesano. 2007. Affordance-based imitation learning in robots. In Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems. 1015--1021."},{"key":"e_1_2_1_80_1","first-page":"3","article-title":"Visual learning by imitation with motor representations","volume":"35","author":"Lopes M.","year":"2005","unstructured":"M. Lopes and J. Santos-Victor . 2005 . Visual learning by imitation with motor representations . IEEE Trans. Syst., Man, Cybernet., Part B (Cybernet.) 35 , 3 (June 2005), 438--449. M. Lopes and J. Santos-Victor. 2005. Visual learning by imitation with motor representations. IEEE Trans. Syst., Man, Cybernet., Part B (Cybernet.) 35, 3 (June 2005), 438--449.","journal-title":"IEEE Trans. Syst., Man, Cybernet., Part B (Cybernet.)"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCVW.2017.96"},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980179.2980237"},{"key":"e_1_2_1_83_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation. 1716--1723","author":"Madry M.","unstructured":"M. Madry , D. Song , and D. Kragic . 2012. From object categories to grasp transfer using probabilistic reasoning . In Proceedings of the IEEE International Conference on Robotics and Automation. 1716--1723 . M. Madry, D. Song, and D. Kragic. 2012. From object categories to grasp transfer using probabilistic reasoning. In Proceedings of the IEEE International Conference on Robotics and Automation. 1716--1723."},{"key":"e_1_2_1_84_1","volume-title":"Proceedings of the IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids\u201915)","author":"Mar T.","unstructured":"T. Mar , V. Tikhanoff , G. Metta , and L. Natale . 2015. Multi-model approach based on 3D functional features for tool affordance learning in robotics . In Proceedings of the IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids\u201915) . 482--489. T. Mar, V. Tikhanoff, G. Metta, and L. Natale. 2015. Multi-model approach based on 3D functional features for tool affordance learning in robotics. In Proceedings of the IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids\u201915). 482--489."},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2017.7989110"},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1145\/219717.219748"},{"key":"e_1_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCDS.2016.2614992"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10514-017-9637-x"},{"key":"e_1_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2012.6225042"},{"key":"e_1_2_1_90_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201914)","author":"Moldovan B.","year":"2014","unstructured":"B. Moldovan and L. De Raedt . 2014. Occluded object search by relational affordances . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201914) . 169--174. DOI:https:\/\/doi.org\/10.1109\/ICRA. 2014 .6906605 10.1109\/ICRA.2014.6906605 B. Moldovan and L. De Raedt. 2014. Occluded object search by relational affordances. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201914). 169--174. DOI:https:\/\/doi.org\/10.1109\/ICRA.2014.6906605"},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2007.914848"},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2007.4399511"},{"key":"e_1_2_1_93_1","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2007.914848"},{"key":"e_1_2_1_94_1","volume-title":"Numerical Analysis","author":"Mor\u00e9 Jorge J.","unstructured":"Jorge J. Mor\u00e9 . 1978. The levenberg-marquardt algorithm: Implementation and theory . In Numerical Analysis . Springer , 105--116. Jorge J. Mor\u00e9. 1978. The levenberg-marquardt algorithm: Implementation and theory. In Numerical Analysis. Springer, 105--116."},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.207"},{"key":"e_1_2_1_96_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2013.6631175"},{"key":"e_1_2_1_97_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201915)","author":"Myers A.","year":"2015","unstructured":"A. Myers , C. L. Teo , C. Fermuller , and Y. Aloimonos . 2015. Affordance detection of tool parts from geometric features . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201915) . 1374--1381. DOI:https:\/\/doi.org\/10.1109\/ICRA. 2015 .7139369 10.1109\/ICRA.2015.7139369 A. Myers, C. L. Teo, C. Fermuller, and Y. Aloimonos. 2015. Affordance detection of tool parts from geometric features. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201915). 1374--1381. DOI:https:\/\/doi.org\/10.1109\/ICRA.2015.7139369"},{"key":"e_1_2_1_98_1","volume-title":"Proceedings of the 28th International Conference on Machine Learning (ICML\u201911)","author":"Ngiam Jiquan","unstructured":"Jiquan Ngiam , Aditya Khosla , Mingyu Kim , Juhan Nam , Honglak Lee , and Andrew Y. Ng . 2011. Multimodal deep learning . In Proceedings of the 28th International Conference on Machine Learning (ICML\u201911) . 689--696. Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and Andrew Y. Ng. 2011. Multimodal deep learning. In Proceedings of the 28th International Conference on Machine Learning (ICML\u201911). 689--696."},{"key":"e_1_2_1_99_1","volume-title":"Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS\u201916)","author":"Nguyen A.","unstructured":"A. Nguyen , D. Kanoulas , D. G. Caldwell , and N. G. Tsagarakis . 2016. Detecting object affordances with convolutional neural networks . In Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS\u201916) . 2765--2770. A. Nguyen, D. Kanoulas, D. G. Caldwell, and N. G. Tsagarakis. 2016. Detecting object affordances with convolutional neural networks. In Proceedings of the IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS\u201916). 2765--2770."},{"key":"e_1_2_1_100_1","volume-title":"Proceedings of the International Conference on Intelligent Robots and Systems (IROS\u201917)","author":"Nguyen Anh","unstructured":"Anh Nguyen , Dimitrios Kanoulas , Darwin G. Caldwell , and Nikos G. Tsagarakis . 2017. Object-based affordances detection with convolutional neural networks and dense conditional random fields . In Proceedings of the International Conference on Intelligent Robots and Systems (IROS\u201917) . Anh Nguyen, Dimitrios Kanoulas, Darwin G. Caldwell, and Nikos G. Tsagarakis. 2017. Object-based affordances detection with convolutional neural networks and dense conditional random fields. In Proceedings of the International Conference on Intelligent Robots and Systems (IROS\u201917)."},{"key":"e_1_2_1_101_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-15549-9_40"},{"key":"e_1_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126281"},{"key":"e_1_2_1_103_1","volume-title":"Visual Attributes","author":"Patterson Genevieve","unstructured":"Genevieve Patterson and James Hays . 2017. The SUN attribute database: Organizing scenes by affordances, materials, and layout . In Visual Attributes . Springer , 269--297. Genevieve Patterson and James Hays. 2017. The SUN attribute database: Organizing scenes by affordances, materials, and layout. In Visual Attributes. Springer, 269--297."},{"key":"e_1_2_1_104_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2007.06.002"},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1006\/gmod.1999.0521"},{"key":"e_1_2_1_106_1","volume-title":"Robotics: Science and Systems","author":"Phillips Cody J.","unstructured":"Cody J. Phillips , Matthieu Lecce , and Kostas Daniilidis . 2016. Seeing glassware: From edge detection to pose estimation and shape recovery . In Robotics: Science and Systems , Vol. 3 . MIT Press , Cambridge, MA . Cody J. Phillips, Matthieu Lecce, and Kostas Daniilidis. 2016. Seeing glassware: From edge detection to pose estimation and shape recovery. In Robotics: Science and Systems, Vol. 3. MIT Press, Cambridge, MA."},{"key":"e_1_2_1_107_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2013.6630736"},{"key":"e_1_2_1_108_1","volume-title":"Proceedings of the IEEE International Conference on Intelligent Robots and Systems Workshop. IEEE.","author":"Pieropan Alessandro","year":"2015","unstructured":"Alessandro Pieropan , Carl Henrik Ek , and Hedvig Kjellstr\u00f6m . 2015 . Functional descriptors for object affordances . In Proceedings of the IEEE International Conference on Intelligent Robots and Systems Workshop. IEEE. Alessandro Pieropan, Carl Henrik Ek, and Hedvig Kjellstr\u00f6m. 2015. Functional descriptors for object affordances. In Proceedings of the IEEE International Conference on Intelligent Robots and Systems Workshop. IEEE."},{"key":"e_1_2_1_109_1","volume-title":"Proceedings of the IEEE-RAS International Conference on Humanoid Robots. 52--58","author":"Pieropan A.","unstructured":"A. Pieropan , C. H. Ek , and H. Kjellstrom . 2014. Recognizing object affordances in terms of spatio-temporal object-object relationships . In Proceedings of the IEEE-RAS International Conference on Humanoid Robots. 52--58 . A. Pieropan, C. H. Ek, and H. Kjellstrom. 2014. Recognizing object affordances in terms of spatio-temporal object-object relationships. In Proceedings of the IEEE-RAS International Conference on Humanoid Robots. 52--58."},{"key":"e_1_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46448-0_5"},{"key":"e_1_2_1_111_1","doi-asserted-by":"publisher","DOI":"10.1145\/3072959.3083725"},{"key":"e_1_2_1_112_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.132"},{"key":"e_1_2_1_113_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_2_1_114_1","volume-title":"Advances in Neural Information Processing Systems 28","author":"Ren Shaoqing","unstructured":"Shaoqing Ren , Kaiming He , Ross Girshick , and Jian Sun . 2015. Faster R-CNN: Towards real-time object detection with region proposal networks . In Advances in Neural Information Processing Systems 28 , C. Cortes, N. D. Lawrence, D. D. Lee, M. Sugiyama, and R. Garnett (Eds.). Curran Associates, 91--99. Retrieved from http:\/\/papers.nips.cc\/paper\/5638-faster-r-cnn-towards-real-time-object-detection-with-region-proposal-networks.pdf. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Advances in Neural Information Processing Systems 28, C. Cortes, N. D. Lawrence, D. D. Lee, M. Sugiyama, and R. Garnett (Eds.). Curran Associates, 91--99. Retrieved from http:\/\/papers.nips.cc\/paper\/5638-faster-r-cnn-towards-real-time-object-detection-with-region-proposal-networks.pdf."},{"key":"e_1_2_1_115_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 580--588","author":"Rhinehart Nicholas","unstructured":"Nicholas Rhinehart and Kris M. Kitani . 2016. Learning action maps of large environments via first-person vision . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 580--588 . Nicholas Rhinehart and Kris M. Kitani. 2016. Learning action maps of large environments via first-person vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 580--588."},{"key":"e_1_2_1_116_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 580--588","author":"Rhinehart Nicholas","unstructured":"Nicholas Rhinehart and Kris M. Kitani . 2016. Learning action maps of large environments via first-person vision . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 580--588 . Nicholas Rhinehart and Kris M. Kitani. 2016. Learning action maps of large environments via first-person vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 580--588."},{"key":"e_1_2_1_117_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-006-5833-1"},{"key":"e_1_2_1_118_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995627"},{"key":"e_1_2_1_119_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46493-0_12"},{"key":"e_1_2_1_120_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0816-y"},{"key":"e_1_2_1_121_1","doi-asserted-by":"publisher","DOI":"10.1177\/1059712307084689"},{"key":"e_1_2_1_122_1","volume-title":"Proceedings of the International Conference on Collaboration Technologies and Systems (CTS\u201913)","author":"Saponaro G.","unstructured":"G. Saponaro , G. Salvi , and A. Bernardino . 2013. Robot anticipation of human intentions through continuous gesture recognition . In Proceedings of the International Conference on Collaboration Technologies and Systems (CTS\u201913) . 218--225. G. Saponaro, G. Salvi, and A. Bernardino. 2013. Robot anticipation of human intentions through continuous gesture recognition. In Proceedings of the International Conference on Collaboration Technologies and Systems (CTS\u201913). 218--225."},{"key":"e_1_2_1_123_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661230"},{"key":"e_1_2_1_124_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661229.2661230"},{"key":"e_1_2_1_125_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201917)","author":"Sawatzky Johann","year":"2017","unstructured":"Johann Sawatzky and Jurgen Gall . 2017 . Adaptive binarization for weakly supervised affordance segmentation . In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201917) . Johann Sawatzky and Jurgen Gall. 2017. Adaptive binarization for weakly supervised affordance segmentation. In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201917)."},{"key":"e_1_2_1_126_1","volume-title":"the IEEE International Conference on Computer Vision (ICCV'17)","author":"Sawatzky Johann","year":"2017","unstructured":"Johann Sawatzky and Jurgen Gall . 2017 . Adaptive binarization for weakly supervised affordance segmentation . In the IEEE International Conference on Computer Vision (ICCV'17) . Johann Sawatzky and Jurgen Gall. 2017. Adaptive binarization for weakly supervised affordance segmentation. In the IEEE International Conference on Computer Vision (ICCV'17)."},{"key":"e_1_2_1_127_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299157"},{"key":"e_1_2_1_128_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAMD.2015.2488284"},{"key":"e_1_2_1_129_1","volume-title":"Proceedings of the 23rd IEEE International Symposium on Robot and Human Interactive Communication. 750--755","author":"Shiraki Y.","unstructured":"Y. Shiraki , K. Nagata , N. Yamanobe , A. Nakamura , K. Harada , D. Sato , and D. N. Nenchev . 2014. Modeling of everyday objects for semantic grasp . In Proceedings of the 23rd IEEE International Symposium on Robot and Human Interactive Communication. 750--755 . Y. Shiraki, K. Nagata, N. Yamanobe, A. Nakamura, K. Harada, D. Sato, and D. N. Nenchev. 2014. Modeling of everyday objects for semantic grasp. In Proceedings of the 23rd IEEE International Symposium on Robot and Human Interactive Communication. 750--755."},{"key":"e_1_2_1_130_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201917)","author":"Shu T.","unstructured":"T. Shu , X. Gao , M. S. Ryoo , and S. C. Zhu . 2017. Learning social affordance grammar from videos: Transferring human interactions to human-robot interactions . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201917) . 1669--1676. T. Shu, X. Gao, M. S. Ryoo, and S. C. Zhu. 2017. Learning social affordance grammar from videos: Transferring human interactions to human-robot interactions. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201917). 1669--1676."},{"key":"e_1_2_1_131_1","volume-title":"Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI\u201916)","author":"Shu Tianmin","year":"2016","unstructured":"Tianmin Shu , M. S. Ryoo , and Song-Chun Zhu . 2016 . Learning social affordance for human-robot interaction . In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI\u201916) . AAAI Press, 3454--3461. Retrieved from http:\/\/dl.acm.org\/citation.cfm?id=3061053.3061104. Tianmin Shu, M. S. Ryoo, and Song-Chun Zhu. 2016. Learning social affordance for human-robot interaction. In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI\u201916). AAAI Press, 3454--3461. Retrieved from http:\/\/dl.acm.org\/citation.cfm?id=3061053.3061104."},{"key":"e_1_2_1_132_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33715-4_54"},{"key":"e_1_2_1_133_1","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. Retrieved from https:\/\/arXiv:1409.1556. Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. Retrieved from https:\/\/arXiv:1409.1556."},{"key":"e_1_2_1_134_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASE.2015.2396014"},{"key":"e_1_2_1_135_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCV\u201911)","author":"Song Hyun Oh","unstructured":"Hyun Oh Song , M. Fritz , C. Gu , and T. Darrell . 2011. Visual grasp affordances from appearance-based cues . In Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCV\u201911) . 998--1005. Hyun Oh Song, M. Fritz, C. Gu, and T. Darrell. 2011. Visual grasp affordances from appearance-based cues. In Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCV\u201911). 998--1005."},{"key":"e_1_2_1_136_1","doi-asserted-by":"publisher","DOI":"10.1006\/ciun.1994.1001"},{"key":"e_1_2_1_137_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-79547-6_42"},{"key":"e_1_2_1_138_1","doi-asserted-by":"publisher","DOI":"10.1207\/S15326969ECO1201_1"},{"key":"e_1_2_1_139_1","doi-asserted-by":"publisher","DOI":"10.1207\/S15326969ECO1502_2"},{"key":"e_1_2_1_140_1","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1177\/0278364909356602","article-title":"Learning visual object categories for robot affordance prediction","volume":"29","author":"Sun Jie","year":"2010","unstructured":"Jie Sun , Joshua L. Moore , Aaron Bobick , and James M. Rehg . 2010 . Learning visual object categories for robot affordance prediction . Int. J. Robot. Res. 29 , 2 -- 3 (2010), 174--197. Jie Sun, Joshua L. Moore, Aaron Bobick, and James M. Rehg. 2010. Learning visual object categories for robot affordance prediction. Int. J. Robot. Res. 29, 2--3 (2010), 174--197.","journal-title":"Int. J. Robot. Res."},{"key":"e_1_2_1_141_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.robot.2013.12.005"},{"key":"e_1_2_1_142_1","volume-title":"Proceedings of the 10th IEEE-RAS International Conference on Humanoid Robots. 430--435","author":"Tenorth M.","unstructured":"M. Tenorth , L. Kunze , D. Jain , and M. Beetz . 2010. KNOWROB-MAP\u2014Knowledge-linked semantic object maps . In Proceedings of the 10th IEEE-RAS International Conference on Humanoid Robots. 430--435 . M. Tenorth, L. Kunze, D. Jain, and M. Beetz. 2010. KNOWROB-MAP\u2014Knowledge-linked semantic object maps. In Proceedings of the 10th IEEE-RAS International Conference on Humanoid Robots. 430--435."},{"key":"e_1_2_1_143_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.13"},{"key":"e_1_2_1_144_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-15558-1_26"},{"key":"e_1_2_1_145_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-15552-9_48"},{"key":"e_1_2_1_146_1","doi-asserted-by":"publisher","DOI":"10.1207\/s15326969eco0403_3"},{"key":"e_1_2_1_147_1","doi-asserted-by":"publisher","DOI":"10.1109\/ROBOT.2007.363571"},{"key":"e_1_2_1_148_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation. 4768--4773","author":"Ugur E.","unstructured":"E. Ugur , E. Oztop , and E. Sahin . 2011. Going beyond the perception of affordances: Learning how to actualize them through behavioral parameters . In Proceedings of the IEEE International Conference on Robotics and Automation. 4768--4773 . E. Ugur, E. Oztop, and E. Sahin. 2011. Going beyond the perception of affordances: Learning how to actualize them through behavioral parameters. In Proceedings of the IEEE International Conference on Robotics and Automation. 4768--4773."},{"key":"e_1_2_1_149_1","doi-asserted-by":"publisher","DOI":"10.1109\/DEVLRN.2014.6983026"},{"key":"e_1_2_1_150_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-013-0620-5"},{"key":"e_1_2_1_151_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICAR.2011.6088647"},{"key":"e_1_2_1_152_1","volume-title":"Proceedings of the Asian Conference on Computer Vision. Springer, 512--523","author":"Varadarajan Karthik Mahesh","year":"2012","unstructured":"Karthik Mahesh Varadarajan and Markus Vincze . 2012 . AfNet: The affordance network . In Proceedings of the Asian Conference on Computer Vision. Springer, 512--523 . Karthik Mahesh Varadarajan and Markus Vincze. 2012. AfNet: The affordance network. In Proceedings of the Asian Conference on Computer Vision. Springer, 512--523."},{"key":"e_1_2_1_153_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2012.6386232"},{"key":"e_1_2_1_154_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-39402-7_36"},{"key":"e_1_2_1_155_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10602-1_28"},{"key":"e_1_2_1_156_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.319"},{"key":"e_1_2_1_157_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.65"},{"key":"e_1_2_1_158_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2011.5995698"},{"key":"e_1_2_1_159_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.359"},{"key":"e_1_2_1_160_1","doi-asserted-by":"crossref","unstructured":"Zhe Wang Liyan Chen Shaurya Rathore Daeyun Shin and Charless Fowlkes. 2019. Geometric pose affordance: 3D human pose with scene constraints. Retrieved from https:\/\/arXiv:1905.07718. Zhe Wang Liyan Chen Shaurya Rathore Daeyun Shin and Charless Fowlkes. 2019. Geometric pose affordance: 3D human pose with scene constraints. Retrieved from https:\/\/arXiv:1905.07718.","DOI":"10.1007\/978-3-031-25075-0_1"},{"key":"e_1_2_1_161_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2014.16"},{"key":"e_1_2_1_162_1","unstructured":"Bernhard Wymann Eric Espi\u00e9 Christophe Guionneau Christos Dimitrakakis R\u00e9mi Coulom and Andrew Sumner. 2014. TORCS the open racing car simulator. Retrieved from http:\/\/www.torcs.org. Bernhard Wymann Eric Espi\u00e9 Christophe Guionneau Christos Dimitrakakis R\u00e9mi Coulom and Andrew Sumner. 2014. TORCS the open racing car simulator. Retrieved from http:\/\/www.torcs.org."},{"key":"e_1_2_1_163_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201913)","author":"Xie Dan","year":"2013","unstructured":"Dan Xie , Sinisa Todorovic , and Song-Chun Zhu . 2013 . Inferring \u201cDark Matter\u201d and \u201cDark Energy\u201d from videos . In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201913) . Dan Xie, Sinisa Todorovic, and Song-Chun Zhu. 2013. Inferring \u201cDark Matter\u201d and \u201cDark Energy\u201d from videos. In Proceedings of the IEEE International Conference on Computer Vision (ICCV\u201913)."},{"key":"e_1_2_1_164_1","doi-asserted-by":"publisher","DOI":"10.1080\/01691864.2017.1394912"},{"key":"e_1_2_1_165_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5540234"},{"key":"e_1_2_1_166_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision. 2512--2519","author":"Yao B.","year":"2013","unstructured":"B. Yao , J. Ma , and L. Fei-Fei . 2013. Discovering object functionality . In Proceedings of the IEEE International Conference on Computer Vision. 2512--2519 . DOI:https:\/\/doi.org\/10.1109\/ICCV. 2013 .312 10.1109\/ICCV.2013.312 B. Yao, J. Ma, and L. Fei-Fei. 2013. Discovering object functionality. In Proceedings of the IEEE International Conference on Computer Vision. 2512--2519. DOI:https:\/\/doi.org\/10.1109\/ICCV.2013.312"},{"key":"e_1_2_1_167_1","volume-title":"Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201917)","author":"Ye C.","year":"2017","unstructured":"C. Ye , Y. Yang , R. Mao , C. Fermuller , and Y. Aloimonos . 2017. What can I do around here? Deep functional scene understanding for cognitive robots . In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201917) . 4604--4611. DOI:https:\/\/doi.org\/10.1109\/ICRA. 2017 .7989535 10.1109\/ICRA.2017.7989535 C. Ye, Y. Yang, R. Mao, C. Fermuller, and Y. Aloimonos. 2017. What can I do around here? Deep functional scene understanding for cognitive robots. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA\u201917). 4604--4611. DOI:https:\/\/doi.org\/10.1109\/ICRA.2017.7989535"},{"key":"e_1_2_1_168_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.88"},{"key":"e_1_2_1_169_1","volume-title":"Proceedings of the 21st International Conference on Pattern Recognition (ICPR\u201912)","author":"Zen Gloria","year":"2012","unstructured":"Gloria Zen , Negar Rostamzadeh , Jacopo Staiano , Elisa Ricci , and Nicu Sebe . 2012 . Enhanced semantic descriptors for functional scene categorization . In Proceedings of the 21st International Conference on Pattern Recognition (ICPR\u201912) . IEEE, 1985--1988. Gloria Zen, Negar Rostamzadeh, Jacopo Staiano, Elisa Ricci, and Nicu Sebe. 2012. Enhanced semantic descriptors for functional scene categorization. In Proceedings of the 21st International Conference on Pattern Recognition (ICPR\u201912). IEEE, 1985--1988."},{"key":"e_1_2_1_170_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2014.2330794"},{"key":"e_1_2_1_171_1","doi-asserted-by":"publisher","DOI":"10.1145\/2574860"},{"key":"e_1_2_1_172_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.401"},{"key":"e_1_2_1_173_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision. 1529--1537","author":"Zheng Shuai","unstructured":"Shuai Zheng , Sadeep Jayasumana , Bernardino Romera-Paredes , Vibhav Vineet , Zhizhong Su , Dalong Du , Chang Huang , and Philip H. S. Torr . 2015. Conditional random fields as recurrent neural networks . In Proceedings of the IEEE International Conference on Computer Vision. 1529--1537 . Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, and Philip H. S. Torr. 2015. Conditional random fields as recurrent neural networks. In Proceedings of the IEEE International Conference on Computer Vision. 1529--1537."},{"key":"e_1_2_1_174_1","doi-asserted-by":"crossref","unstructured":"Bolei Zhou Hang Zhao Xavier Puig Sanja Fidler Adela Barriuso and Antonio Torralba. 2016. Semantic understanding of scenes through the ADE20K dataset. Retrieved from https:\/\/arXiv:1608.05442. Bolei Zhou Hang Zhao Xavier Puig Sanja Fidler Adela Barriuso and Antonio Torralba. 2016. Semantic understanding of scenes through the ADE20K dataset. Retrieved from https:\/\/arXiv:1608.05442.","DOI":"10.1109\/CVPR.2017.544"},{"key":"e_1_2_1_175_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972825.35"},{"key":"e_1_2_1_176_1","volume-title":"Advances in Neural Information Processing Systems","author":"Zhou Zhi-Hua","unstructured":"Zhi-Hua Zhou and Min-Ling Zhang . 2007. Multi-instance multi-label learning with application to scene classification . In Advances in Neural Information Processing Systems . MIT Press , 1609--1616. Zhi-Hua Zhou and Min-Ling Zhang. 2007. Multi-instance multi-label learning with application to scene classification. In Advances in Neural Information Processing Systems. MIT Press, 1609--1616."},{"key":"e_1_2_1_177_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2011.10.002"},{"key":"e_1_2_1_178_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10605-2_27"},{"key":"e_1_2_1_179_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.415"},{"key":"e_1_2_1_180_1","unstructured":"Yuke Zhu Ce Zhang Christopher R\u00e9 and Li Fei-Fei. 2015. Building a large-scale multimodal knowledge base system for answering visual queries. Retrieved from https:\/\/arXiv:1507.05670. Yuke Zhu Ce Zhang Christopher R\u00e9 and Li Fei-Fei. 2015. Building a large-scale multimodal knowledge base system for answering visual queries. Retrieved from https:\/\/arXiv:1507.05670."},{"key":"e_1_2_1_181_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298903"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3446370","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,2]],"date-time":"2023-11-02T07:51:22Z","timestamp":1698911482000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3446370"}},"subtitle":["A Survey"],"short-title":[],"issued":{"date-parts":[[2021,4,17]]},"references-count":181,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,4,30]]}},"alternative-id":["10.1145\/3446370"],"URL":"http:\/\/dx.doi.org\/10.1145\/3446370","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,4,17]]},"assertion":[{"value":"2018-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-12-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-04-17","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}