Inhalt: Businesses thrive by making informed decisions that target the needs of their customers and users. To make such strategic decisions, they rely on data. Hive is a tool of choice for many data scientists because it allows them to work with SQL, a familiar syntax, to derive insights from Hadoop, reflecting the information that businesses seek to plan effectively. This course shows how to use Hive to process data. Instructor Ben Sullins starts by showing you how to structure and optimize your data. Next, he explains how to get Hue, the Hadoop user interface, to leverage HiveQL when analyzing data. Using the newly configured option, he then demonstrates how to load data, create aggregate tables for fast query access, and run advanced analytics. He also takes you through managing tables and putting functions to use. This course is designed to help you find new ways to work with datasets so you can answer the tough data science questions that come your way. Umfang: 01:53:06.00
Inhalt: Apache Spark is a powerful platform that provides users with new ways to store and make use of big data. In this course, get up to speed with Spark, and discover how to leverage this popular processing engine to deliver effective and comprehensive insights into your data. Instructor Ben Sullins provides an overview of the platform, going into the different components that make up Apache Spark. He shows how to analyze data in Spark using PySpark and Spark SQL, explores running machine learning algorithms using MLib, demonstrates how to create a streaming analytics application using Spark Streaming, and more. Umfang: 01:27:18.00
Inhalt: Approach big data with confidence by mastering the core skills needed to put data to work for your business. This course covers the basics of data engineering, system design, analytics, and business intelligence. Data science expert Ben Sullins explains how to collect and organize your data so you can deliver results that your organization can leverage. Ben starts by examining the modern data ecosystem and how it relates to running a smart and efficient data hub. Then, he shows you how to perform the principle tasks involved in managing, loading, extracting, and transforming data. He also takes you through staging, profiling, cleansing, and migrating data. Along the way, he provides actionable recommendations that applicable to data experts throughout an organization-analysts, engineers, scientists, modelers, and more. Umfang: 00:53:24.00
Inhalt: Elasticsearch has been widely adopted in search engine platforms for modern web and mobile applications. Combined with the power of Kibana-which can help to provide analytical solutions on top of your Elasticsearch cluster-this powerful platform adds the capability to answer complex business questions about your data and your customers, as well as serve up relevant results in your applications. Beyond just search, the Elastic Stack provides companies with complex analytics and advanced features like machine learning. Companies large and small use Elasticsearch to identify potential fraud, machines that aren''t operating properly, and what users are doing in their apps. In this course, join Ben Sullins as he dives into the inner workings of Elasticsearch combined with Kibana. Ben provides an overview of the architecture, and then goes over the different deployment methods, and how to best structure your data. From there, he demonstrates how to query data, and how to work with Kibana to present your insights. Umfang: 01:31:19.00
Inhalt: Hadoop-the hugely popular big data platform-offers a vast array of capabilities designed to help data scientists deliver their insights. In this course, Ben Sullins helps you get up to speed with Hadoop by sharing a series of tips and tricks for doing data science work in this powerful platform. He starts by looking at how to work with Hadoop data in HDFS, and then explores using Hive-the Hadoop SQL engine-where a lot of data science work happens. To wrap up the course, Ben covers techniques for running fast queries in the Hive engine. Umfang: 01:12:30.00
Inhalt: If you're interested in working in the field of data or looking to advance in the field, you need a foundational knowledge of several key areas of data science. Not only that, you need to be able to demonstrate that knowledge. In this four-part, hands-on series, Ben Sullins shows how to build four distinct data science projects using SQL, Tableau, Python, and Spark. In this first installment, Ben uses SQL to analyze employee data, which is notoriously difficult to analyze given its structure. He breaks down the specific structure of employee data, and the best way to track this kind of information, then covers how you can start answering specific questions utilizing SQL. Finally, he gives advice on how to present your data, considering both your audience and the visuals you use, in order to convey your knowledge of the subject. Note: This course was created by Free the Data Academy. We are pleased to host this training in our library. Umfang: 00:44:56
Inhalt: If you're interested in working in data or looking to advance in the field, you need a foundational knowledge of several key areas of data science. Not only that, you need to be able to demonstrate that knowledge. In this four-part series, Ben Sullins shows how to build four distinct data science projects using SQL, Tableau, Python, and Spark. In this second installment, Ben details the steps in building a sales dashboard with Tableau, the popular data visualization platform favored by organizations worldwide. Ben starts by breaking down the different aspects of Tableau, from working on a desktop, to sharing data over the web, to using the Tableau Public platform to publicly share your data visualizations. He then shows Tableau in action, looking at how it facilitates a deep dive into your data, before demonstrating how to build out an exploratory dashboard with your data. At the end of the course, you'll be able to give a live demo of your data visualizations in Tableau Public on the web. Note: This course was created by Free the Data Academy. We are pleased to host this training in our library. Umfang: 01:01:15
Inhalt: Python is a popular programming language in the field of data science, used for engineering and analytics, as well as data science itself. In this course, instructor Ben Sullins focuses on teaching you how to use Python using notebooks set up on the Jupyter platform. Ben walks through installing Jupyter using Anaconda, then shows you how to navigate the user interface and get Python running on a notebook in Jupyter. He covers how to import pandas, use pandas to explore sample data, and use data frames. Ben steps you through a number of functions, then concludes with some of the unique ways to present your data using Jupyter notebooks and Python. Note: This course was created by Ben Sullins and Free the Data Academy. We are pleased to host this training in our library. Umfang: 00:27:57
Inhalt: Are you considering working in data science, and would you like to try out some popular tools first? This course focuses on what you can do with Apache Spark. Instructor Ben Sullins shows you how to set up Spark on Databricks. Ben goes over how to import your data and start working with it, using both Python and SQL languages in Spark. He steps through taking your project to the next level with some easy data visualizations. After explaining some tips and tricks to present your data using Spark, Ben concludes with some additional resources that you can use to pursue your data science journey. Note: This course was created by Ben Sullins and Free the Data Academy. We are pleased to host this training in our library. Umfang: 00:29:27
Inhalt: Apache HBase is the Hadoop database-a NoSQL database management system that runs on top of HDFS (Hadoop Distributed File System). Like Hadoop, HBase is an open-source, distributed, versioned, column-oriented store. Companies such as Facebook, Adobe, and Twitter are using HBase to facilitate random, real-time read/write access to big data. Any data scientist or database engineer who wants a job at these top-tier organizations needs to master HBase to make it in the door. This course can help professionals further their career in big data analytics using HBase and the Hadoop framework. Learn to describe HBase in the context of the NoSQL landscape, build simple architecture models, and explore basic HBase commands. Instructor Ben Sullins shows how all the concepts fit together, resulting in the kind of distributed big data storage you need for scalable, enterprise-level applications. Umfang: 01:20:07.00
Inhalt: R is known as one of the most robust statistical computing solutions out there. Tableau-a leading business intelligence platform-provides excellent data visualization and exploration capabilities. When combined, Tableau and R offer one of the most powerful and complete data analytics solutions in the industry today, providing businesses with unparalleled abilities to see and understand their data. In this course, learn how to integrate these two platforms, as well as how to determine when each one is a better choice. Instructor Ben Sullins explains how to connect Tableau to R, and covers geocoding, running linear regression models, clustering, and more. Umfang: 01:10:38.00
Inhalt: Developed at LinkedIn, Apache Kafka is a distributed streaming platform that provides scalable, high-throughput messaging systems in place of traditional messaging systems like JMS. In this course, examine all the core concepts of Kafka. Ben Sullins kicks off the course by making the case for Kafka, and explaining who's using this efficient platform and why. He then shares Kafka workflows to provide context for core concepts, explains how to install and test Kafka locally, and dives into real-world examples. By the end of this course, you'll be prepared to achieve scalability, fault tolerance, and durability with Apache Kafka. Umfang: 01:20:48.00
Inhalt: Looker-a powerful data analytics platform-can help both large and small companies glean value from their data. In this short course, get up to speed with Looker, and learn how to leverage this platform to make collecting, visualizing, and analyzing data a bit easier. Ben Sullins begins by explaining how and why Looker is used, and exploring the Looker ecosystem. He also dives into how Looker organizes its data using LookML, how to visualize data in the Looker platform, and how to create a web-based dashboard. Umfang: 00:47:49.00
Inhalt: If you're looking for work as a junior data analyst, engineer, or scientist, this course gives you the best techniques to land jobs in data science. Instructor Ben Sullins explores a hiring manager's mindset and shows you how to prepare a demo that you can bring to your job interview. Ben explains some best practices for being physically and mentally healthy and well-prepared for your interview. He covers what to bring to your interview, then goes over follow-up methods and steps you should take to negotiate what you want most from an offer. Ben's methods give you the confidence and practice that you need to land a job in data science. Note: This course was created by Ben Sullins and Free the Data Academy. We are pleased to host this training in our library. Umfang: 00:31:57
Inhalt: Netflix and Airbnb both use Presto-an open-source SQL query engine developed by Facebook-for their ever-expanding big data querying needs. In this course, learn how to harness the power of your big data system using the Presto platform, which breaks the false dilemma of having to choose between an expensive commercial solution that offers fast analytics, and a slow, ostensibly free solution that requires excessive hardware. Data science expert Ben Sullins helps you get up to speed with Presto, and leverage it to accomplish a wide-range of data science and analytics tasks. He uses different interfaces with Presto-such as R and Tableau-and digs into the expressive SQL language that Presto offers for your analysis. At the end of this course, you'll know the key concepts of Presto and how to use them to take full advantage of your modern big data system. Umfang: 01:48:51.00
Inhalt: Modern work in data science requires skilled professionals versed in analysis workflows and using powerful tools. Python can play an integral role in nearly every aspect of working with data-from ingest, to querying, to extracting and visualizing. This course highlights twelve tips and tricks you can put into practice to improve your skills in Python. These techniques are readily applied and in common data management tasks and include the following: how to ingest data using CSV, JSON, and TXT files; how to explore data using libraries like Pandas; how to organize and join data using DataFrames; how to create charts and graphic representations of data using ggplot in Python; and more. Umfang: 00:47:46.00
Inhalt: Get Ben Sullins's 12 must-have SQL techniques for data science pros-engineers, DevOps, data miners, programmers, and other systems specialists. Ben's tips focus on practical applications of SQL queries for data analysis. Learn how to retrieve data, join tables, calculate rolling averages and rankings, work with dates and times, use window functions, aggregate and filter data, and much more. Each tip is short, relevant, and up to date with current industry best practices-making this the perfect course for busy analysts who normally struggle to find time to build their skills. Umfang: 00:59:23.00
Inhalt: In this course, Ben Sullins debunks 12 common misconceptions within the field of data science. Busy engineers, data miners, programmers, and other systems specialists who want to bolster their skills can benefit from Ben's succinct, practical insights. Separate data science fact from fiction, and learn what big data actually is, and why-contrary to what media coverage often suggests-it's not a singular thing. Ben also explains why big data can't instantly yield great insights, how to make analytics clearer, when to replace your relational databases, and more. Umfang: 00:36:05.00
Programm Findus Internet-OPAC findus.pl V20.235/8 auf Server windhund2.findus-internet-opac.de,
letztes Datenbankupdate: 08.05.2024, 18:40 Uhr. 907 Zugriffe im Mai 2024. Insgesamt 511.137 Zugriffe seit Januar 2009
Mobil - Impressum - Datenschutz - CO2-Neutral