{"id":1057,"date":"2019-12-13T08:44:08","date_gmt":"2019-12-13T08:44:08","guid":{"rendered":"https:\/\/www.danielparente.net\/en\/2019\/12\/13\/how-advanced-ai-built-microsofts-vision-for-ubiquitous-computing-microsoft-latinx\/"},"modified":"2019-12-13T08:44:08","modified_gmt":"2019-12-13T08:44:08","slug":"how-advanced-ai-built-microsofts-vision-for-ubiquitous-computing-microsoft-latinx","status":"publish","type":"post","link":"https:\/\/www.danielparente.net\/en\/2019\/12\/13\/how-advanced-ai-built-microsofts-vision-for-ubiquitous-computing-microsoft-latinx\/","title":{"rendered":"How advanced AI built Microsoft\u2019s vision for ubiquitous computing \u2013 Microsoft Latinx"},"content":{"rendered":"<div>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-7376\" src=\"https:\/\/1gew6o3qn6vx9kp3s42ge0y1-wpengine.netdna-ssl.com\/wp-content\/uploads\/prod\/sites\/166\/2019\/11\/HoloLens.png\" alt=\"HoloLens\" srcset=\"https:\/\/1gew6o3qn6vx9kp3s42ge0y1-wpengine.netdna-ssl.com\/wp-content\/uploads\/prod\/sites\/166\/2019\/11\/HoloLens.png 1691w, https:\/\/1gew6o3qn6vx9kp3s42ge0y1-wpengine.netdna-ssl.com\/wp-content\/uploads\/prod\/sites\/166\/2019\/11\/HoloLens-300x142.png 300w, https:\/\/1gew6o3qn6vx9kp3s42ge0y1-wpengine.netdna-ssl.com\/wp-content\/uploads\/prod\/sites\/166\/2019\/11\/HoloLens-768x363.png 768w, https:\/\/1gew6o3qn6vx9kp3s42ge0y1-wpengine.netdna-ssl.com\/wp-content\/uploads\/prod\/sites\/166\/2019\/11\/HoloLens-1024x484.png 1024w\" sizes=\"(max-width: 1691px) 100vw, 1691px\"\/><\/p>\n<p class=\"\">The first time people don the new HoloLens 2 on their heads, the device automatically gets to know them: It measures everything from the precise shape of their hands to the exact distance between their eyes.<\/p>\n<p>The artificial intelligence research and development that enabled those capabilities \u201cwas astonishingly complicated\u201d but essential to making the experience of using the device 
\u201cinstinctual,\u201d said\u00a0<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/jamiesho\/\" target=\"_blank\" rel=\"noopener\">Jamie Shotton<\/a>, a partner scientist who leads the HoloLens science team in Cambridge, United Kingdom.<\/p>\n<p>\u201cWe want you to know how to use HoloLens without having to be taught how to use it,\u201d he said. \u201cWe know how to interact with things in the real, physical world: We pick things up, we press buttons, we point to things. We aim, as far as possible, to translate that directly into mixed reality.\u201d<\/p>\n<p>Microsoft today announced the\u00a0<a href=\"https:\/\/www.microsoft.com\/en-us\/hololens\" target=\"_blank\" rel=\"noopener\">HoloLens 2<\/a>\u00a0is now shipping to customers. The sensor-packed holographic computing headset uses AI to displace space and time, creating a mixed reality of people, places and things in order to facilitate one of our most basic human impulses: exchanging knowledge.<\/p>\n<p>Microsoft Technical Fellow Alex Kipman said the headset defines the highest watermark of intelligent edge devices \u2013 AI-capable technologies that can collect and process data even without a reliable internet connection \u2013 and that can then share some or all of that data with the intelligent cloud when connected.<\/p>\n<p>On a recent day, Kipman sketched a diagram of this ubiquitous computing fabric on a digital whiteboard in his office.<\/p>\n<p>\u201cHoloLens,\u201d he said, \u201cis the first native device to be invented from the ground up with this worldview in mind.\u201d<\/p>\n<p>The marriage of the AI in HoloLens 2 with the AI capabilities of Azure, Microsoft\u2019s cloud computing platform,\u00a0<a href=\"https:\/\/news.microsoft.com\/innovation-stories\/hololens-2\/\" target=\"_blank\" rel=\"noopener\">allows heads-up, hands-on workers to learn skills<\/a>\u00a0that advance their careers, and makes it possible for people on opposite ends of the Earth who speak different 
languages to\u00a0<a href=\"https:\/\/www.youtube.com\/watch?time_continue=32&amp;v=auJJrHgG9Mc\" target=\"_blank\" rel=\"noopener\">collaborate with a shared sense of physical presence<\/a>.<\/p>\n<p>\u201cYou can do really interesting things with HoloLens, and you can do really interesting things with the cloud,\u201d said Julia White, Microsoft corporate vice president of Azure marketing. \u201cBut when you see these two things come together, it changes the game in terms of what people can actually do.\u201d<\/p>\n<h2><strong>Delivering mixed reality with AI<\/strong><\/h2>\n<p>To enable instinctual interactions with HoloLens 2, Shotton and his colleagues developed, trained and deployed AI models onto the device that track people\u2019s hand motions and eye gaze so that, for example, they can perceive a hologram floating in front of them and reach out to resize it or reposition it.<\/p>\n<p>To build the hand tracking system, the team built a rig with a dome of cameras pointing inward that they used to record a diverse range of people\u2019s hands. 
Then the team used offline cloud processing to build a 3D model capable of representing all human hand shapes and motions.<\/p>\n<p>From this 3D model, the team was able to use computer graphics to render realistic, synthetic images of hands along with synthetic labels to make the model robust across a variety of hand shapes, poses and movements.<\/p>\n<p>\u201cYou can generate effectively unlimited quantities of training data,\u201d Shotton said.<\/p>\n<p>The team used this data to train a compact deep neural network, a type of AI algorithm, that fits on the HoloLens\u2019s onboard processor and runs efficiently on every frame coming from the device\u2019s depth sensor.<\/p>\n<p>When a new customer puts on a HoloLens 2, the system uses this neural network to help fit a personalized 3D model to the customer\u2019s hands, enabling the precise tracking required to allow instinctual interaction with holograms.<\/p>\n<p>Shotton\u2019s team took a similar approach to build and train the eye tracking model, paying close attention to what\u2019s called interpupillary distance, or the distance between the centers of the pupils of the eyes, which varies across people and affects how a person sees near or distant objects.<\/p>\n<p>The result is an eye tracking model that allows the HoloLens 2 to precisely display holograms in front of customers for interaction and manipulation with their hands and eyes.<\/p>\n<p>\u201cWithout eye tracking, aligning holograms to the real world \u2013 especially the person\u2019s physical hand \u2013 would just be impossible to the level of precision needed to allow instinctual interaction,\u201d Shotton said.<\/p>\n<h2><strong class=\"\">AI on the edge to the cloud<\/strong><\/h2>\n<p class=\"\">The hand and eye tracking capabilities, along with other intelligent features such as simultaneous localization and mapping that\u2019s necessary to make holograms appear pinned to the world as a person moves around, are embedded on the HoloLens 2 in 
the second-generation custom chip called a holographic processing unit, or HPU 2.0.<\/p>\n<p>Kipman calls this class of on-device AI capability perception AI.<\/p>\n<p>\u201cPerception is like reptile brain,\u201d he said. \u201cIt is performing those operations that your brain performs that are instinctual, that you don\u2019t think about.\u201d<\/p>\n<p>For people, this type of intelligence keeps our hearts beating, our lungs breathing and our eyes performing microsaccades to gauge depth of field, for example. When we\u2019re thirsty and want a sip of water, our eyes instinctively gauge the distance to the water glass that our hands lift to our lips.<\/p>\n<p>Perception AI on the HoloLens 2 enables people to manipulate and interact with holograms without worrying about what\u2019s called latency \u2013 typically the hundreds of milliseconds it takes for data to travel to the cloud, be processed and returned to the edge.<\/p>\n<p>\u201cEven tens of milliseconds make a significant perceptual difference\u201d when pressing a button on a hologram, for example, or scrolling through text on a hologram with your eyes, noted Shotton. \u201cThat turnaround time is critical.\u201d<\/p>\n<p>Privacy concerns are another reason to do AI calculations locally on a device; the iris scans that HoloLens 2 performs to authenticate customers are the type of personal data people may not want sent to the cloud.<\/p>\n<p>For many other types of data, however, there\u2019s an advantage to sending them to the cloud: Once there, the customer can use Azure AI and mixed reality services and combine the data from their device with data from throughout the ubiquitous computing fabric. 
That allows for more advanced computation or cognition, Kipman said.<\/p>\n<h2><strong class=\"\">Cloud collaboration<\/strong><\/h2>\n<p>A key advantage of intelligent cloud-powered holographic computing is the ability to share information with others who have a HoloLens or another device with similar capabilities, said\u00a0<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/mapoll\/\" target=\"_blank\" rel=\"noopener\">Marc Pollefeys<\/a>, the director of Microsoft\u2019s\u00a0<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/lab\/mixed-reality-ai-zurich\/\" target=\"_blank\" rel=\"noopener\">Mixed Reality and AI Zurich Lab<\/a>\u00a0in Switzerland.<\/p>\n<p>Pollefeys is leading a team that develops core computer vision algorithms for a mixed reality cloud service called\u00a0<a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/spatial-anchors\/\" target=\"_blank\" rel=\"noopener\">Azure Spatial Anchors<\/a>\u00a0that allows holograms to persist, locked in the real world, for anyone with the appropriate level of access to view.<\/p>\n<p>For example, spatial anchor technology allows a manager in a factory to place holograms next to equipment on an assembly line that contain vital, real-time operating and maintenance information that any credentialed worker with a mixed reality capable device can access.<\/p>\n<p>\u201cIf I can only place information that I will see back on my device, it\u2019s probably never worth placing holograms in the world, but if I can annotate the world and afterward anyone else in the company that has the right access can see all of the information, it is suddenly much more valuable,\u201d Pollefeys said.<\/p>\n<p>To create this capability, Pollefeys and his team developed AI computer vision algorithms that process data from sensors to extract 3D geometric information about the environment and piece it together in the cloud to create a digital twin, or map, of the area of interest.<\/p>\n<p>HoloLens has always built 
up a 3D or spatial understanding of its environment to function. Azure Spatial Anchors creates, refines and shares these maps across devices, Pollefeys noted. That\u2019s why the maps from individual devices are pieced together and stored in the cloud.<\/p>\n<p>\u201cIt doesn\u2019t make sense to have that data only on an individual device,\u201d he said. \u201cIt is one of those things where I have a little piece of the puzzle, and somebody else has a little piece of the puzzle, and all of the devices together have covered the whole space of interest.\u201d<\/p>\n<p>These maps get denser, more precise and robust over time as different mixed reality capable devices \u2013 HoloLenses as well as properly equipped phones, tablets and laptops \u2013 map their environment and share the data with the cloud.<\/p>\n<p>For example, the map of the factory floor where the manager left holograms floating over pieces of equipment on the assembly line is steadily refined as more and more credentialed workers view the holograms with their devices.<\/p>\n<p>This capability also enables scenarios such as a meeting between architects and clients to view and interact with a holographic 3D blueprint of a building, each of them with mixed reality capable devices looking at the blueprint from their own point of view as they sit around a table.<\/p>\n<p>Azure contains pre-built services to write applications for these types of experiences on HoloLens and any other mixed reality device, including smartphones and tablets running the iOS and Android operating systems, noted White.<\/p>\n<p>\u201cThat collaboration experience isn\u2019t just locked to HoloLens,\u201d she said. 
\u201cAnd, the cost and complexity and skillset required to make an application that does something amazing is far down.\u201d<\/p>\n<p>The cross-device and platform capability, for example, enables experiences such as\u00a0<a href=\"https:\/\/www.minecraft.net\/en-us\/earth\" target=\"_blank\" rel=\"noopener\">Minecraft Earth<\/a>, which merges the popular video game with mixed reality in a way that players can build and place virtual structures in the real world that persist so that other players can interact with them on their devices.<\/p>\n<p>\u201cWe all get to participate because it is based on using cloud technology that can be understood and interpreted by all different devices,\u201d said White.<\/p>\n<h2><strong>Technology that is designed for people<\/strong><\/h2>\n<p>For HoloLens to work as envisioned, the technology that underpins the experience needs to understand the world in ways that are similar to the way people do, Kipman noted.<\/p>\n<p>That\u2019s why he and his collaborators across Microsoft have developed, deployed and leveraged AI solutions throughout the ubiquitous computing fabric, from the silicon in the headset of HoloLens 2 to Azure AI and mixed reality services.<\/p>\n<p>Back at his digital whiteboard, Kipman has now sketched out a vision for ubiquitous computing that is rife with words, boxes, arrows \u2013 and a stick-figure picture of two people locked in conversation next to an intelligent device.<\/p>\n<p>That, he says, is the ultimate goal of ubiquitous computing \u2013 to get people to interact with other people in natural ways.<\/p>\n<p>To drive home the point, he establishes a moment of intense, deliberate eye contact and says, \u201cHopefully, you are getting more out of this conversation because you are physically present with me.\u201d<\/p>\n<p>\u201cWe could have done this over the phone,\u201d he continues. \u201cWe could have done it over Skype. I could have recorded it and sent you a tape. 
You didn\u2019t choose to do that. You chose to be physically present with me. Why? Because that\u2019s how we do human things.\u201d<\/p>\n<p>\u201cThe con is you have to be here at the same time I am here, and we have to be in the same location. The power of this technology is it gives us the ability to displace space and time.\u201d<\/p>\n<p><em>Top image: Microsoft Technical Fellow Alex Kipman models the HoloLens 2, a sensor-packed holographic computing headset. Photo by Microsoft.<\/em><\/p>\n<p><em>John Roach writes about Microsoft research and innovation. Follow him on\u00a0<\/em><a href=\"https:\/\/twitter.com\/byjohnroach\" target=\"_blank\" rel=\"noopener\"><em>Twitter<\/em><\/a><em>.<\/em><\/p>\n<p class=\"tag-list\">Tags: <a aria-label=\"See more stories about HoloLens\" href=\"https:\/\/blogs.microsoft.com\/latinx\/tag\/hololens\/\" rel=\"tag noopener\" target=\"_blank\">HoloLens<\/a>, <a aria-label=\"See more stories about Microsoft Latinx\" href=\"https:\/\/blogs.microsoft.com\/latinx\/tag\/microsoft-latinx\/\" rel=\"tag noopener\" target=\"_blank\">Microsoft Latinx<\/a><\/p>\n<\/p><\/div>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><a href=\"https:\/\/blogs.microsoft.com\/latinx\/2019\/11\/20\/the-making-of-the-hololens-2-how-advanced-ai-built-microsofts-vision-for-ubiquitous-computing\/\" target=\"_blank\" rel=\"noopener\">Source link<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>The first time people don the new HoloLens 2 on their heads, the device automatically gets to know them: It measures everything from the precise shape of their hands to the exact distance between their eyes. 
The artificial intelligence research and development that enabled those capabilities \u201cwas astonishingly complicated\u201d but essential to making the [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":1058,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":"","jetpack_post_was_ever_published":false},"categories":[168],"tags":[],"class_list":["post-1057","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-xr"],"blocksy_meta":[],"jetpack_featured_media_url":"https:\/\/e928cfdc7rs.exactdn.com\/info\/uploads\/sites\/3\/2019\/12\/How-advanced-AI-built-Microsoft\u2019s-vision-for-ubiquitous-computing-\u2013.png?strip=all","jetpack_shortlink":"https:\/\/wp.me\/p2TFCd-h3","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/posts\/1057","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/comments?post=1057"}],"version-history":[{"count":0,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/posts\/1057\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/media\/1058"}],"wp:attachment":[{"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/media?parent=1057"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/categories?post=1057"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/tags?post=1057"}],"curies":[{"name":"wp","href":"ht
tps:\/\/api.w.org\/{rel}","templated":true}]}}