Warehouse Stock Clearance Sale

Grab a bargain today!


Sign Up for Fishpond's Best Deals Delivered to You Every Day
Go
Databricks Certified ­Associate Developer for ­Apache Spark Using Python
The ultimate guide to getting certified in Apache Spark using practical examples with Python

Rating
Format
Paperback, 274 pages
Published
United Kingdom, 14 June 2024

Learn the concepts and exercises needed to get certified as a Databricks Associate Developer for Apache Spark 3.0 and validate your skills as a Spark expert with an industry-recognized credential

Key Features

Understand the fundamentals of Apache Spark to help you design robust and fast Spark applications
Delve into various data manipulation components for each phase of your data engineering project
Prepare for the certification exam with sample questions and mock exams, and get closer to your goal
Purchase of the print or Kindle book includes a free PDF eBook

Book DescriptionWith extensive data being collected every second, computing power cannot keep up with this pace of rapid growth. To make use of all the data, Spark has become a de facto standard for big data processing. Migrating data processing to Spark will not only help you save resources that will allow you to focus on your business, but also enable you to modernize your workloads by leveraging the capabilities of Spark and the modern technology stack for creating new business opportunities.
This book is a comprehensive guide that lets you explore the core components of Apache Spark, its architecture, and its optimization. You’ll become familiar with the Spark dataframe API and its components needed for data manipulation. Next, you’ll find out what Spark streaming is and why it’s important for modern data stacks, before learning about machine learning in Spark and its different use cases. What’s more, you’ll discover sample questions at the end of each section along with two mock exams to help you prepare for the certification exam.
By the end of this book, you’ll know what to expect in the exam and how to pass it with enough understanding of Spark and its tools. You’ll also be able to apply this knowledge in a real-world setting and take your skillset to the next level.What you will learn

Create and manipulate SQL queries in Spark
Build complex Spark functions using Spark UDFs
Architect big data apps with Spark fundamentals for optimal design
Apply techniques to manipulate and optimize big data applications
Build real-time or near-real-time applications using Spark Streaming
Work with Apache Spark for machine learning applications

Who this book is forThis book is for you if you’re a professional looking to venture into the world of big data and data engineering, a data professional who wants to endorse your knowledge of Spark, or a student. Although working knowledge of Python is required, no prior Spark knowledge is needed. Additionally, experience with Pyspark will be beneficial.

Show more

Our Price
HK$330
Ships from UK Estimated delivery date: 6th Jun - 13th Jun from UK
Free Shipping Worldwide

Buy Together
+
Buy together with Dispossessed [Large Print] at a great price!
Buy Together
HK$470

Product Description

Learn the concepts and exercises needed to get certified as a Databricks Associate Developer for Apache Spark 3.0 and validate your skills as a Spark expert with an industry-recognized credential

Key Features

Understand the fundamentals of Apache Spark to help you design robust and fast Spark applications
Delve into various data manipulation components for each phase of your data engineering project
Prepare for the certification exam with sample questions and mock exams, and get closer to your goal
Purchase of the print or Kindle book includes a free PDF eBook

Book DescriptionWith extensive data being collected every second, computing power cannot keep up with this pace of rapid growth. To make use of all the data, Spark has become a de facto standard for big data processing. Migrating data processing to Spark will not only help you save resources that will allow you to focus on your business, but also enable you to modernize your workloads by leveraging the capabilities of Spark and the modern technology stack for creating new business opportunities.
This book is a comprehensive guide that lets you explore the core components of Apache Spark, its architecture, and its optimization. You’ll become familiar with the Spark dataframe API and its components needed for data manipulation. Next, you’ll find out what Spark streaming is and why it’s important for modern data stacks, before learning about machine learning in Spark and its different use cases. What’s more, you’ll discover sample questions at the end of each section along with two mock exams to help you prepare for the certification exam.
By the end of this book, you’ll know what to expect in the exam and how to pass it with enough understanding of Spark and its tools. You’ll also be able to apply this knowledge in a real-world setting and take your skillset to the next level.What you will learn

Create and manipulate SQL queries in Spark
Build complex Spark functions using Spark UDFs
Architect big data apps with Spark fundamentals for optimal design
Apply techniques to manipulate and optimize big data applications
Build real-time or near-real-time applications using Spark Streaming
Work with Apache Spark for machine learning applications

Who this book is forThis book is for you if you’re a professional looking to venture into the world of big data and data engineering, a data professional who wants to endorse your knowledge of Spark, or a student. Although working knowledge of Python is required, no prior Spark knowledge is needed. Additionally, experience with Pyspark will be beneficial.

Show more
Product Details
EAN
9781804619780
ISBN
1804619787
Age Range
Dimensions
23.5 x 19.1 x 1.5 centimeters (0.48 kg)

Table of Contents

Table of Contents

  • Overview of Certification Guide and Exam
  • Understanding Apache Spark and Its Applications
  • Spark Architecture & Transformations
  • Spark Datarames and its Operations
  • Advanced Operations in Spark
  • SQL Queries in Spark
  • Structured Streaming in Spark
  • Machine Learning with Spark ML
  • Mock Test
  • About the Author

    Saba Shah is a Data and AI Architect and Evangelist with a wide technical breadth and deep understanding of big data and machine learning technologies. She has experience leading data science and data engineering teams in Fortune 500s as well as startups. She started her career as a software engineer but soon transitioned to big data. She is currently a solutions architect at Databricks and works with enterprises building their data strategy and helping them create a vision for the future with machine learning and predictive analytics. Saba graduated with a degree in Computer Science and later earned an MS degree in Advanced Web Technologies. She is passionate about all things data and cricket. She currently resides in RTP, NC.

    Show more
    Review this Product
    Ask a Question About this Product More...
     
    Look for similar items by category
    People also searched for
    Item ships from and is sold by Fishpond World Ltd.

    Back to top