{"id":687401,"date":"2024-03-10T16:10:00","date_gmt":"2024-03-10T23:10:00","guid":{"rendered":"https:\/\/makerfaire.com\/yearbook\/-projects\/embodied-ai-agent-with-a-real-robotic-platform\/"},"modified":"2024-03-10T16:10:00","modified_gmt":"2024-03-10T23:10:00","slug":"embodied-ai-agent-with-a-real-robotic-platform-2023","status":"publish","type":"projects","link":"https:\/\/makerfaire.com\/yearbook\/projects\/embodied-ai-agent-with-a-real-robotic-platform-2023\/","title":{"rendered":"Embodied AI Agent with a real robotic platform"},"content":{"rendered":"<p>Maker Names (not publicly visible) Machine Learning Reply<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The talk will focus on a real implementation of Embodied AI agent. We will start with an overview of the Machine Learning models covered within Reply R&amp;D, therefore DinoV2 for Object Detection (https:\/\/dinov2.metademolab.com\/), PALM (https:\/\/palm-e.github.io\/ ) as a starting point for VLMs (Visual Language Models) and be able to generalize a large number of tasks that require multimodal input (both with images and text). We will then move on to a focus on a robotic agent such as SPOT by Boston Dynamics, therefore its architecture, the potential of this agent and the sensors present in stock. From here we will have the basis to move on to an implementation of Embodied AI Agents controlled completely with voice in natural language. We will show an orchestrator who, by receiving voice commands in natural language as input, will be able to control a robotic agent such as SPOT by Boston Dynamics and use the Machine Learning models necessary to complete the individual tasks within the episode initiated by the user. We will then show current developments in the way related to the use of Visual Language Models, such as RT-2(https:\/\/robotics-transformer2.github.io\/) for robotic agents and LINGO-1(https:\/\/wayve.ai\/ ) for autonomous driving.<\/p>","protected":false},"featured_media":687410,"parent":0,"menu_order":0,"template":"","meta":{"_acf_changed":false},"mf-project-cat":[21679,21661,21662,21663],"class_list":["post-687401","projects","type-projects","status-publish","has-post-thumbnail","hentry","mf-project-cat-artificial-intelligence","mf-project-cat-education","mf-project-cat-flying-aeronautics","mf-project-cat-robotics","mf-year-tax-21649"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Embodied AI Agent with a real robotic platform - Maker Faire<\/title>\n<meta name=\"description\" content=\"Maker Faire Rome 2023 - Embodied AI Agent with a real robotic platform - The talk will focus on a real implementation of Embodied AI agent. We will start with an overview of the Machine Learning models covered within Reply R&amp;D, therefore DinoV2 for Object Detection (https:\/\/dinov2.metademolab.com\/), PALM (https:\/\/palm-e.github.io\/ ) as a starting point for VLMs (Visual Language Models) and be able to generalize a large number of tasks that require multimodal input (both with images and text). We will then move on to a focus on a robotic agent such as SPOT by Boston Dynamics, therefore its architecture, the potential of this agent and the sensors present in stock. From here we will have the basis to move on to an implementation of Embodied AI Agents controlled completely with voice in natural language. We will show an orchestrator who, by receiving voice commands in natural language as input, will be able to control a robotic agent such as SPOT by Boston Dynamics and use the Machine Learning models necessary to complete the individual tasks within the episode initiated by the user. We will then show current developments in the way related to the use of Visual Language Models, such as RT-2(https:\/\/robotics-transformer2.github.io\/) for robotic agents and LINGO-1(https:\/\/wayve.ai\/ ) for autonomous driving.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Embodied AI Agent with a real robotic platform - Maker Faire\" \/>\n<meta property=\"og:description\" content=\"The talk will focus on a real implementation of Embodied AI agent. We will start with an overview of the Machine Learning models covered within Reply R&amp;D, therefore DinoV2 for Object Detection (https:\/\/dinov2.metademolab.com\/), PALM (https:\/\/palm-e.github.io\/ ) as a starting point for VLMs (Visual Language Models) and be able to generalize a large number of tasks that require multimodal input (both with images and text). We will then move on to a focus on a robotic agent such as SPOT by Boston Dynamics, therefore its architecture, the potential of this agent and the sensors present in stock. From here we will have the basis to move on to an implementation of Embodied AI Agents controlled completely with voice in natural language. We will show an orchestrator who, by receiving voice commands in natural language as input, will be able to control a robotic agent such as SPOT by Boston Dynamics and use the Machine Learning models necessary to complete the individual tasks within the episode initiated by the user. We will then show current developments in the way related to the use of Visual Language Models, such as RT-2(https:\/\/robotics-transformer2.github.io\/) for robotic agents and LINGO-1(https:\/\/wayve.ai\/ ) for autonomous driving.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/\" \/>\n<meta property=\"og:site_name\" content=\"Maker Faire\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/makerfaire\" \/>\n<meta property=\"og:image\" content=\"https:\/\/makerfaire.com\/wp-content\/uploads\/2024\/03\/embodied-ai-agent-with-a-real-robotic-platform-1024x541.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1024\" \/>\n\t<meta property=\"og:image:height\" content=\"541\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:site\" content=\"@makerfaire\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/\",\"url\":\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/\",\"name\":\"Embodied AI Agent with a real robotic platform - Maker Faire\",\"isPartOf\":{\"@id\":\"https:\/\/makerfaire.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2024\/03\/embodied-ai-agent-with-a-real-robotic-platform.png?fit=1918%2C1014&ssl=1\",\"datePublished\":\"2024-03-10T23:10:00+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#primaryimage\",\"url\":\"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2024\/03\/embodied-ai-agent-with-a-real-robotic-platform.png?fit=1918%2C1014&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2024\/03\/embodied-ai-agent-with-a-real-robotic-platform.png?fit=1918%2C1014&ssl=1\",\"width\":1918,\"height\":1014,\"caption\":\"Embodied AI Agent with a real robotic platform - Maker Faire\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/makerfaire.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Embodied AI Agent with a real robotic platform\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/makerfaire.com\/#website\",\"url\":\"https:\/\/makerfaire.com\/\",\"name\":\"Maker Faire\",\"description\":\"The Greatest Show (&amp; Tell) on Earth. Maker Faire is part science fair, part county fair, and part something entirely new! As a celebration of the Maker Movement, it\u2019s a family-friendly showcase of invention, creativity, and resourcefulness.\",\"publisher\":{\"@id\":\"https:\/\/makerfaire.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/makerfaire.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/makerfaire.com\/#organization\",\"name\":\"Make: Community\",\"alternateName\":\"Make: Community\",\"url\":\"https:\/\/makerfaire.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/makerfaire.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2019\/08\/makecommunity_logo.png?fit=1050%2C680&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2019\/08\/makecommunity_logo.png?fit=1050%2C680&ssl=1\",\"width\":1050,\"height\":680,\"caption\":\"Make: Community\"},\"image\":{\"@id\":\"https:\/\/makerfaire.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/makerfaire\",\"https:\/\/x.com\/makerfaire\",\"https:\/\/instagram.com\/makerfaire\",\"https:\/\/www.pinterest.com\/makemagazine\/maker-faire\",\"https:\/\/www.youtube.com\/user\/MakerFaireVideo\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Embodied AI Agent with a real robotic platform - Maker Faire","description":"Maker Faire Rome 2023 - Embodied AI Agent with a real robotic platform - The talk will focus on a real implementation of Embodied AI agent. We will start with an overview of the Machine Learning models covered within Reply R&D, therefore DinoV2 for Object Detection (https:\/\/dinov2.metademolab.com\/), PALM (https:\/\/palm-e.github.io\/ ) as a starting point for VLMs (Visual Language Models) and be able to generalize a large number of tasks that require multimodal input (both with images and text). We will then move on to a focus on a robotic agent such as SPOT by Boston Dynamics, therefore its architecture, the potential of this agent and the sensors present in stock. From here we will have the basis to move on to an implementation of Embodied AI Agents controlled completely with voice in natural language. We will show an orchestrator who, by receiving voice commands in natural language as input, will be able to control a robotic agent such as SPOT by Boston Dynamics and use the Machine Learning models necessary to complete the individual tasks within the episode initiated by the user. We will then show current developments in the way related to the use of Visual Language Models, such as RT-2(https:\/\/robotics-transformer2.github.io\/) for robotic agents and LINGO-1(https:\/\/wayve.ai\/ ) for autonomous driving.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/","og_locale":"en_US","og_type":"article","og_title":"Embodied AI Agent with a real robotic platform - Maker Faire","og_description":"The talk will focus on a real implementation of Embodied AI agent. We will start with an overview of the Machine Learning models covered within Reply R&amp;D, therefore DinoV2 for Object Detection (https:\/\/dinov2.metademolab.com\/), PALM (https:\/\/palm-e.github.io\/ ) as a starting point for VLMs (Visual Language Models) and be able to generalize a large number of tasks that require multimodal input (both with images and text). We will then move on to a focus on a robotic agent such as SPOT by Boston Dynamics, therefore its architecture, the potential of this agent and the sensors present in stock. From here we will have the basis to move on to an implementation of Embodied AI Agents controlled completely with voice in natural language. We will show an orchestrator who, by receiving voice commands in natural language as input, will be able to control a robotic agent such as SPOT by Boston Dynamics and use the Machine Learning models necessary to complete the individual tasks within the episode initiated by the user. We will then show current developments in the way related to the use of Visual Language Models, such as RT-2(https:\/\/robotics-transformer2.github.io\/) for robotic agents and LINGO-1(https:\/\/wayve.ai\/ ) for autonomous driving.","og_url":"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/","og_site_name":"Maker Faire","article_publisher":"https:\/\/www.facebook.com\/makerfaire","og_image":[{"width":1024,"height":541,"url":"https:\/\/makerfaire.com\/wp-content\/uploads\/2024\/03\/embodied-ai-agent-with-a-real-robotic-platform-1024x541.png","type":"image\/png"}],"twitter_card":"summary_large_image","twitter_site":"@makerfaire","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/","url":"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/","name":"Embodied AI Agent with a real robotic platform - Maker Faire","isPartOf":{"@id":"https:\/\/makerfaire.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#primaryimage"},"image":{"@id":"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2024\/03\/embodied-ai-agent-with-a-real-robotic-platform.png?fit=1918%2C1014&ssl=1","datePublished":"2024-03-10T23:10:00+00:00","breadcrumb":{"@id":"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#primaryimage","url":"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2024\/03\/embodied-ai-agent-with-a-real-robotic-platform.png?fit=1918%2C1014&ssl=1","contentUrl":"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2024\/03\/embodied-ai-agent-with-a-real-robotic-platform.png?fit=1918%2C1014&ssl=1","width":1918,"height":1014,"caption":"Embodied AI Agent with a real robotic platform - Maker Faire"},{"@type":"BreadcrumbList","@id":"https:\/\/makerfaire.com\/yearbook\/2023-projects\/embodied-ai-agent-with-a-real-robotic-platform\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/makerfaire.com\/"},{"@type":"ListItem","position":2,"name":"Embodied AI Agent with a real robotic platform"}]},{"@type":"WebSite","@id":"https:\/\/makerfaire.com\/#website","url":"https:\/\/makerfaire.com\/","name":"Maker Faire","description":"The Greatest Show (&amp; Tell) on Earth. Maker Faire is part science fair, part county fair, and part something entirely new! As a celebration of the Maker Movement, it\u2019s a family-friendly showcase of invention, creativity, and resourcefulness.","publisher":{"@id":"https:\/\/makerfaire.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/makerfaire.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/makerfaire.com\/#organization","name":"Make: Community","alternateName":"Make: Community","url":"https:\/\/makerfaire.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/makerfaire.com\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2019\/08\/makecommunity_logo.png?fit=1050%2C680&ssl=1","contentUrl":"https:\/\/i0.wp.com\/makerfaire.com\/wp-content\/uploads\/2019\/08\/makecommunity_logo.png?fit=1050%2C680&ssl=1","width":1050,"height":680,"caption":"Make: Community"},"image":{"@id":"https:\/\/makerfaire.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/makerfaire","https:\/\/x.com\/makerfaire","https:\/\/instagram.com\/makerfaire","https:\/\/www.pinterest.com\/makemagazine\/maker-faire","https:\/\/www.youtube.com\/user\/MakerFaireVideo"]}]}},"_links":{"self":[{"href":"https:\/\/makerfaire.com\/wp-json\/wp\/v2\/projects\/687401","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/makerfaire.com\/wp-json\/wp\/v2\/projects"}],"about":[{"href":"https:\/\/makerfaire.com\/wp-json\/wp\/v2\/types\/projects"}],"version-history":[{"count":0,"href":"https:\/\/makerfaire.com\/wp-json\/wp\/v2\/projects\/687401\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/makerfaire.com\/wp-json\/wp\/v2\/media\/687410"}],"wp:attachment":[{"href":"https:\/\/makerfaire.com\/wp-json\/wp\/v2\/media?parent=687401"}],"wp:term":[{"taxonomy":"mf-project-cat","embeddable":true,"href":"https:\/\/makerfaire.com\/wp-json\/wp\/v2\/mf-project-cat?post=687401"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}