January 
10
, 
2019

00:00 AM

Agenda item

November
 
22
 at 
7:00pm

About

Unifying Data Pipelines and Machine Learning with Apache Spark™ and Amazon SageMaker

The widespread adoption of Apache Spark™, the first unified analytics engine, has helped data professionals make great strides in data science and machine learning. Yet, their upstream data lakes still face reliability challenges when it comes to building production data pipelines at scale to power these initiatives.

 

Delta Lake is an open source storage layer that brings reliability to data lakes. It has numerous reliability features including ACID transactions, scalable metadata handling, and unified streaming and batch data processing. It also offers DML commands to update, delete, and merge data for your data lifecycle, such as for GDPR/CCPA. Delta Lake runs on top of your existing data lake, such as on Azure Data Lake Storage, AWS S3, Hadoop HDFS, or on-premise, and is fully compatible with Apache Spark APIs. 

 

Join this hands-on lab to learn how Delta Lake can help you build robust production data pipelines at scale. This event will give you the opportunity to:

 

Gain an understanding of the Delta Lake open source project
Learn how to build highly scalable and reliable data pipelines using Delta Lake
See Delta Lake in action with a demo and hands-on code walkthrough
Ask Databricks experts your most challenging data questions 
Network and learn from your data engineering and data science peers



Space is limited! RSVP now to save your spot.

Every enterprise today wants to accelerate innovation by building Data and ML into their business. However, most companies struggle with preparing large datasets for analytics, managing the proliferation of Data and ML frameworks, and moving models in development to production.

 

In this live workshop, we’ll cover best practices for enterprises to use powerful open source technologies to simplify and scale your Data and ML efforts. We’ll discuss how to leverage Apache Spark™, the de-facto data processing and analytics engine in enterprises today, for data preparation as it unifies data at massive scale across various sources, as well as Delta Lake to make your data lake machine learning ready. You’ll also learn how to use Data and ML frameworks (i.e. TensorFlow, XGBoost, Scikit-Learn, etc.) to train models based on different requirements. And finally, you can learn how to use MLflow to track experiment runs between multiple users within a reproducible environment, and manage the deployment of models to production on Amazon SageMaker.

 

Join this live workshop to learn how Unified Data Analytics can bring Data Science, Business Analytics and engineering together to accelerate your Data and ML efforts. This live workshop will give you the opportunity to:

 

  • Learn how to build highly scalable and reliable pipelines for analytics
  • Deeper insight into Apache Spark and Databricks, including the latest updates with Delta Lake
  • Train a model against data and learn best practices for working with ML frameworks (i.e. - TensorFlow, XGBoost, Scikit-Learn, etc.)
  • Learn about MLflow to track experiments, share projects and deploy models in the cloud with Amazon SageMaker
  • Network and learn from your ML and Apache Spark peers


We will use Zoom for a virtual meeting environment. Your Zoom link will be sent to you upon registration. 

 

We look forward to seeing you on November 10th at 9am CDT. 


Slalom Privacy Policy 

Date & Time

11
/
10
/
2020
 
9:00am 
CST
RSVP
Text goes here
X


RSVP now to save your spot.

Agenda

9:00 AM

Databricks Keynote

9:15 AM

Customer Presentation - Outreach

9:30 AM

Technical Hands On Workshop

10:50 AM

Partner Presentation/Demo - Slalom

11:00 AM

Wrap up

RSVP
Text goes here
X

Privacy Policy

Terms of Use 

[confirmation_headline]
[confirmation_messaging]

 


Thank you for registering for the AWS | Databricks ML Dev Day Live Workshop: Unifying Data Pipelines and Machine Learning with Apache Spark™ and Amazon SageMaker. 


Link to join live workshop:  https://databricks.zoom.us/webinar/register/WN_oBtw7WAzT4ao1IjWS8NV6Q

 
You should be receiving an email shortly with additional details. 


If you’d like to follow along during the technical demos, please sign up for Databricks Community Edition in advance to the workshop -- no credit card needed. 

For more information about what to expect at the event or to refer colleagues to attend, please visit [event_url].


If you have any other questions at all, please email fieldmarketing@databricks.com.




Vish Gupta is a host of exceptional ability. Studies show that a vast majority of guests attending events by Vish have been known to leave more elated than visitors to Santa's Workshop, The Lost of Continent of Atlantis, and the Fountain of Youth.

Add to Calendar
Text goes here
X
[confirmation_headline]
[confirmation_messaging]
Add to Calendar
Text goes here
X
[confirmation_headline]
[confirmation_messaging]
Add to Calendar
Text goes here
X
[confirmation_headline]
[confirmation_messaging]
Add to Calendar
Text goes here
X
[confirmation_headline]
[confirmation_messaging]
Add to Calendar
Text goes here
X
CONTACT THE ORGANIZER
Google   Outlook   iCal   Yahoo

Get Tickets

Add to my Calendar
  • Google  Outlook  iCal  Yahoo