{"id":562,"date":"2019-12-04T20:00:48","date_gmt":"2019-12-04T20:00:48","guid":{"rendered":"https:\/\/www.danielparente.net\/en\/2019\/12\/04\/how-to-succeed-with-machine-learning-and-data-science\/"},"modified":"2019-12-04T20:00:48","modified_gmt":"2019-12-04T20:00:48","slug":"how-to-succeed-with-machine-learning-and-data-science","status":"publish","type":"post","link":"https:\/\/www.danielparente.net\/en\/2019\/12\/04\/how-to-succeed-with-machine-learning-and-data-science\/","title":{"rendered":"How to Succeed With Machine Learning and Data Science"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div>\n<div>\n<div class=\"ep\">\n<div class=\"n eq er es et\">\n<div class=\"o n\">\n<div><a href=\"https:\/\/towardsdatascience.com\/@mdeerfreelance?source=post_page-----2a356c2040b7----------------------\" rel=\"noopener\" target=\"_blank\"><\/p>\n<div class=\"eu ev ew\"><img decoding=\"async\" alt=\"Marcel Deer\" class=\"r fe ew ev\" src=\"https:\/\/miro.medium.com\/fit\/c\/96\/96\/2*Aghq2jgtZepKIyOWy-MDVg.jpeg\" width=\"48\" height=\"48\"\/><\/div>\n<p><\/a><\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p id=\"6056\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">AI, Machine Learning, IoT and Cloud-Based Services Must Deliver Value From Their Data.<\/p>\n<figure class=\"hf hg hh hi hj hk do dp paragraph-image\"><figcaption class=\"ax fi ia ib ic dq do dp id ie as cx\">Image Source: <a href=\"https:\/\/pixabay.com\/illustrations\/iot-internet-of-things-internet-4085382\/\" class=\"dc by if ig ih ii\" target=\"_blank\" rel=\"noopener nofollow noreferrer\">Pixabay<\/a><\/figcaption><\/figure>\n<p id=\"5602\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">With growing attention devoted to AI, machine learning, and IoT, what we\u2019ve come to know as big data has become an even broader version of itself. In recent years, big data was seen as an unstoppable force of nature that would either overwhelm enterprises or propel them to new heights.<\/p>\n<h2 id=\"26ed\" class=\"ij ik ef at as il im in io ip iq ir is it iu iv iw\">The Expansive Data Generation<\/h2>\n<p id=\"7c77\" class=\"gq gr ef at gs b gt ix gv iy gx iz gz ja hb jb hd\">This next generation of big data \u2014 we\u2019ll call it expansive data, pulsing through systems in real-time, powering processes unseen to human eyes, and adapting and learning as it goes along \u2014 is going to reshape enterprises in ways not even anticipated.<\/p>\n<p id=\"323c\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">This requires attention to new types of tools, platforms, and approaches to deliver value to today\u2019s data-hungry businesses. Expansive data will represent ever-growing volumes of information, potentially increasing within enterprises at a rate of up to 36% a year, according to <a href=\"http:\/\/dresneradvisory.com\/\" class=\"dc by if ig ih ii\" target=\"_blank\" rel=\"noopener nofollow noreferrer\">Dresner Advisory Services<\/a>.<\/p>\n<blockquote class=\"jc jd je\">\n<p id=\"88f0\" class=\"gq gr ef jf gs b gt gu gv gw gx gy gz ha hb hc hd\"><em class=\"at\">This next generation of big data \u2014 we\u2019ll call it expansive data, pulsing through systems in real-time, powering processes unseen to human eyes, and adapting and learning as it goes along \u2014 is going to reshape enterprises in ways not even anticipated.<\/em><\/p>\n<\/blockquote>\n<p id=\"9e13\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">Platforms supporting this growth include <a href=\"https:\/\/aws.amazon.com\/s3\/\" class=\"dc by if ig ih ii\" target=\"_blank\" rel=\"noopener nofollow noreferrer\">Amazon Web Services S3<\/a>, <a href=\"https:\/\/spark.apache.org\/sql\/\" class=\"dc by if ig ih ii\" target=\"_blank\" rel=\"noopener nofollow noreferrer\">Spark SQL<\/a>, <a href=\"https:\/\/hive.apache.org\/\" class=\"dc by if ig ih ii\" target=\"_blank\" rel=\"noopener nofollow noreferrer\">Hive<\/a>, and <a href=\"https:\/\/hadoop.apache.org\/\" class=\"dc by if ig ih ii\" target=\"_blank\" rel=\"noopener nofollow noreferrer\">Hadoop<\/a>. Additional tools popular in enterprises are Apache Spark and Tensorflow. Expansive data places even higher demands on enterprise infrastructures, processes, and the managers and administrators responsible for making it all work.<\/p>\n<p id=\"e092\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">Organizations are Leaning Heavily on Data Assets<\/p>\n<p id=\"79ab\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">That\u2019s because organizations are leaning more heavily than ever before on their data assets and analytics capabilities, and initiatives such as AI and machine learning, to help them compete. Edge computing is also a defining factor in expansive data.<\/p>\n<p id=\"ec1b\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">There is likely to be greater activity at the edges \u2014 expansive data means more processing may be distributed across IoT networks. Data can be ingested, processed, and even stored within edge devices and systems, and, if it is deemed critical on an enterprise-scale, moved to centralized data centers or clouds.<\/p>\n<p id=\"6a83\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">Edge computing continues to extend its capabilities. It encompasses a broad assortment of devices and systems that may require real-time interactions and responsiveness, including kiosks, autonomous cars and trucks, and sensors embedded across IoT. With comprehensive data surging across all points of the enterprise, infrastructures could be quickly overwhelmed with ingestion, processing, and storage demands.<\/p>\n<p id=\"d069\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">Expansive data could also be valuable data without proper preparation. Fortunately, none of this is happening in a vacuum, and other developments may be helping organizations manage the challenge. Thanks to the ubiquity of cloud-based services, from infrastructure to platform to applications, the power, and capacity to support even bigger data environments are readily available. A new generation of database platforms and tools \u2014 led and enabled by machine-learning initiatives \u2014 is supporting the continuous, relentless data growth.<\/p>\n<p id=\"2ba2\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">Hadoop is a big data framework that made huge-scale data analytics a reality for every company that will benefit from processing data. However, the software is beginning to show its age. While Hadoop was once seen as the single cure-all for significant data challenges ten years ago, today\u2019s expansive data calls for a variety of tools, platforms, and frameworks to help enterprises better manage their data. Nonetheless, the Hadoop Distributed File System can either support or be a part of data lake architectures, opening up a new mission for these environments.<\/p>\n<p id=\"2e0d\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">According to a 2018 survey conducted by <a href=\"http:\/\/www.unisphereresearch.com\/\" class=\"dc by if ig ih ii\" target=\"_blank\" rel=\"noopener nofollow noreferrer\">Unisphere Research<\/a>, a division of Information Today, Inc., 44% of enterprises had Hadoop in production, which represents a downward shift from 2016, in which 55% reported using the framework (\u201c2018 Next-Generation Data Deployment Strategies Report\u201d). Also, the survey found general satisfaction levels with Hadoop are mixed:<\/p>\n<p id=\"1981\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">Only 14% consider themselves to be \u201cextremely satisfied\u201d with Hadoop, while 64% are either dissatisfied or lukewarm toward the framework. While Hadoop provided one-of-a-kind functionality in its early days \u2014 such as parallel processing and management of a variety of data types \u2014 other technologies and solutions also now share these capabilities without the skill levels that Hadoop demands.<\/p>\n<h2 id=\"404f\" class=\"ij ik ef at as il im in io ip iq ir is it iu iv iw\">Big Data and the IoT<\/h2>\n<figure class=\"hf hg hh hi hj hk do dp paragraph-image\"><figcaption class=\"ax fi ia ib ic dq do dp id ie as cx\">Image Source: <a href=\"https:\/\/pixabay.com\/illustrations\/analytics-information-innovation-3088958\/\" class=\"dc by if ig ih ii\" target=\"_blank\" rel=\"noopener nofollow noreferrer\">Pixabay<\/a><\/figcaption><\/figure>\n<p id=\"9202\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">Predictably, the growth of expansive data is likely to track closely to that of IoT itself. Accordingly, next-generation data technology initiatives represent new approaches to data management. The Unisphere Research survey found notable growth in the adoption of data lakes \u2014 places to store diverse datasets without requiring to build a model first. Their adoption continues to rise as data management personnel seek to develop ways to quickly capture and store their data from a myriad of sources and in various formats.<\/p>\n<p id=\"b5ea\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\"><a href=\"https:\/\/mapr.com\/whitepapers\/getting-value-big-data-and-data-science-enterprise\/assets\/getting-value-big-data-and-data-science-enterprise.pdf\" class=\"dc by if ig ih ii\" target=\"_blank\" rel=\"noopener nofollow noreferrer\">Overall, 38% <\/a>of organizations employ data lakes as part of their data architecture, up from 20% in a survey conducted two years prior. Another 15% said they were considering adoption. Data lakes are growing to impressive levels as well \u2014 close to one-third, 32%, support more than 100TB of data, the survey found.<\/p>\n<p id=\"2394\" class=\"gq gr ef at gs b gt gu gv gw gx gy gz ha hb hc hd\">With the relentless rise of IoT, AI, machine learning, and cloud-based services, enterprises are now challenged with accommodating and delivering value from the expansive data that surges through their systems. Data warehouses and Hadoop represented solutions for the pre-IoT, pre-AI enterprises. Today\u2019s opportunities and challenges call for the next generation of platforms and tools to bring it all together.<\/p>\n<\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/towardsdatascience.com\/how-to-succeed-with-machine-learning-and-data-science-2a356c2040b7?gi=44df0b77efe6\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] AI, Machine Learning, IoT and Cloud-Based Services Must Deliver Value From Their Data. Image Source: Pixabay With growing attention devoted to AI, machine learning, and IoT, what we\u2019ve come to know as big data has become an even broader version of itself. In recent years, big data was seen as an unstoppable force of [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":563,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":"","jetpack_post_was_ever_published":false},"categories":[1],"tags":[],"class_list":["post-562","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"blocksy_meta":[],"jetpack_featured_media_url":"https:\/\/e928cfdc7rs.exactdn.com\/info\/uploads\/sites\/3\/2019\/12\/How-to-Succeed-With-Machine-Learning-and-Data-Science.jpeg?strip=all","jetpack_shortlink":"https:\/\/wp.me\/p2TFCd-94","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/posts\/562","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/comments?post=562"}],"version-history":[{"count":0,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/posts\/562\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/media\/563"}],"wp:attachment":[{"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/media?parent=562"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/categories?post=562"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.danielparente.net\/en\/wp-json\/wp\/v2\/tags?post=562"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}