Spark: The Definitive Guide: Big Data Processing Made Simple

Description:

About this item:

Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals. You'll explore the basic operations and common functions of Spark's structured APIs, as well as Structured Streaming, a new high-level API for building end-to-end streaming applications.

Editorial Reviews

About the Author

Bill Chambers is a Product Manager at Databricks focusing on large-scale analytics, strong documentation, and collaboration across the organization to help customers succeed with Spark and Databricks. He has a Master's degree in Information Systems from the UC Berkeley School of Information, where he focused on data science.

Matei Zaharia is an assistant professor of computer science at Stanford University and Chief Technologist at Databricks. He started the Spark project at UC Berkeley in 2009, where he was a PhD student, and he continues to serve as its vice president at Apache. Matei also co-started the Apache Mesos project and is a committer on Apache Hadoop. Matei’s research work was recognized through the 2014 ACM Doctoral Dissertation Award and the VMware Systems Research Award.

Review:

4.7 out of 5

93.85% of customers are satisfied

5.0 out of 5 stars Good condition pre-owned book

a.s. · May 2, 2024

Absolutely loved it. The packaging was good, received the book before time, and condition was good.

5.0 out of 5 stars What a great way to learn Spark (pyspark for me)

P. · October 21, 2021

Love the book. It gets hands on right away and give you both scala and python versions of code. I used databricks community version of spark. Some code is wrong. Python is sometimes but rarely missing. Highly recommend this to anyone who is looking to gain knowledge in Spark

4.0 out of 5 stars Good intro text - *not* a recipes book

J.N. · March 23, 2019

+s:+ Great intro text.+ Very detailed with lots of code samples.+ ML section is thorough (if limited in depth)+ all code is on GitHub :)+ conceptual+ tuning and optimizations sections-s:- Organization is a little choppy - to understand Structured Streamimg aggregations requires jumping back and forth to aggregations section (for example)- Copy-pasting code samples is annoying.- Kindle for Mac is sucky: resizing windows and adjusting text size breaks the flow, sometimes requiring a restart. Indexing is weird and it ”depaginates”- Could use a few sections in wide vs narrow...

5.0 out of 5 stars Better than expected

E. · July 17, 2022

I wasn’t sure about this book initially but as I started to use spark and read the book in parallel I discovered it explained very well the behind the scene that I needed to understand. I would recommend this to people that already program in other languages such as Python and want to start using pyspark

5.0 out of 5 stars Good single source for learning and using Spark in production

R.G. · May 6, 2018

This book presents the main Spark concepts, particularly the v2.x Structured API in tutorial fashion using Scala and Python. Much of this information is available piecemeal online, but I found it valuable to have it ordered and explained thoroughly rather than digging through stackoverflow or trying to make sense of the docs.After presenting how Spark works and the Structured and low level RDD APIs, the book helps you deploy, monitor, and tune your application to run on a cluster. There is a detailed section on Structured Streaming explaining windowing and event time processing, plus a section on advanced machine learning analytics.

5.0 out of 5 stars Very useful book for exploiting the powerful Spark platform

S.P. · August 28, 2018

Apache Spark is a powerful platform for Big Data applications that explores a lot of advanced techniques.The book describes clearly and systematically the Spark architecture and has a lot of outstanding examplesthat help the reader to become familiar with the rather brilliant Spark programming models.The presentation of the material is excellent and the explanations are quite supportive and help the understanding.It is a very nice book on the very admirable Spark system!

5.0 out of 5 stars Far the best Spark book

A.C. · February 13, 2019

Despite big volume - 600 pages this is far the best tech book I have read so far. Very well structured, covers different levels - from beginner to expert, excellent diagrams and code examples.

3.0 out of 5 stars Great info, but can't read on a computer

W.L. · July 11, 2018

This is a great beginner to intermediate book on Spark. The authors did an excellent job explaining concepts and gave a lot of examples (in Scala and Python).My only complaint is that you can't use Kindle Cloud Reader. For a normal book it might not be an issue, but for a programming book, you'd probably want to read it on your computer so you can take notes, type in examples, and search. I've bought other O'Reilly books and haven't had this issue in the past (this book seems to be the exception). Right now you're limited to kindle apps so a table might look like this on your phone or tablet: +----------------- ----------+ | some_field | another_field +----------------- ----------+ | a | bThe more I reference this book, the more I think its a big disadvantage.

Libro obligado

F. · December 26, 2020

Sin duda el mejor libro para comprender cómo funciona el framework de Apache Spark y lo que puedes llegar a hacer con él. Los ejemplos con código e incluso lo que intentan explicarte en las ilustraciones son por demás claros y concisos.Qué mejor que comprar un libro en donde uno de los autores (Matei Zaharia) es uno de los creadores del Framework.Si lo quieren comprar para estudiar y obtener alguna certificación de Databricks, no lo duden, cómprenlo y será la mejor inversión para ese propósito.

The content is awesome

A.Z. · January 17, 2022

The book of course is great, but the quality of the cover I'd wish to be improved. It made from very soft material and almost not protecting the book.

It's a must-have book for people who need to program in SPARK

J. · March 6, 2019

This book is well-structured especially for people who are new to SPARK but do not need to set up things himself. From earlier chapters (page 49) readers can start to do some simple work and learn some programming. this is encouraging for people to keep learning.

Livro sensacional para quem quer ver as funcionalidades do Spark 2.2.X

A.D.S. · June 5, 2018

Uma das únicas referências para quem quer mais sobre o que há de possibilidades além do Spark 1.6 pois em sua maioria ele aborda temas recentes. O livro é muito claro e objetivo, além de conter diversas referências de mateiras complementares. O autor com toda a certeza domina muito bem o assunto! Livro essencial para que quer entrar no mundo do Spark, com segurança e informações confiáveis, todos exemplos de código são dados tanto em Scala quanto em Python .Único ponto negativo, não do livro mas da Amazon, é que no Brasil não temos a opção de comprar em capa comum, não só para este livro mas para outros que abordam o mesmo tema.

Buena introducción para trabajar en SPARK y clarificar conceptos

A.A. · December 6, 2018

Explicaciones claras, con muchos ejemplos en Scala y Python. Temario actualizado a 2018 y por eso no está basado en RDD, aunque también hay un capítulo para ellos.

Spark: The Definitive Guide: Big Data Processing Made Simple

4.5

BHD36495

Quantity:

|

Order today to get by

Free delivery on orders over BHD 20

Return and refund policies

Product origin: United States

Electrical items shipped from the US are by default considered to be 120v, unless stated otherwise in the product description. Contact Bolo support for voltage information of specific products. A step-up transformer is required to convert from 120v to 240v. All heating electrical items of 120v will be automatically cancelled.

All product information listed on the site are from 3rd party sources, including images and reviews. bolo.bh is not liable for any claims or promotions mentioned on the product description or images with textual content. For detailed product information, please contact the manufacturer or Bolo support by logging into your account. Unless stated otherwise during checkout, all import taxes and duty are included in the price mentioned on the product page. bolo.bh follows the rules and regulations of sale in Bahrain and will cancel items in an order that are illegal for sale in Bahrain. We take all the necessary steps to ensure only products for sale in Bahrain are displayed. Product stock and delivery estimate may change with the seller even after placing the order. All items are shipped by air and items marked “Dangerous Goods (DG)” by the IATA will be cancelled from orders. We strive to process your order as soon as it is finalized.

Similar suggestions by Bolo

More from this brand

Similar items from “Databases & Big Data”