Solving Analytical Problems using Apache Spark


Duration: 50 mins
Rohit Bhardwaj
Senior Architect, Cloud-native Expert in cloud-native solutions

In this talk, we will explore why Spark is the most prominent solution as compared to just Hadoop. We will look at MapReduce and how Spark makes the creation of Big Data algorithms simple and faster. Next, we will explore Spark Context and how Resilient Distributed Dataset (RDD) to help with the establishment of Directed Acyclic Graph (DAG); Transformations using map and filter; Actions using collect, count and reduce. Later we will explore the Spark Cassandra connector. We will look at Spark API and Spark SQL. We will also discuss how DataStax helps give a high level of stability to open source Apache Spark and Apache Cassandra projects. Key takeaways from this talk will be for a developer and architect to understand how Apache Spark and Apache Cassandra helps in implementing enterprise level analytical solutions. It is 100x faster than Hadoop!

You may also be interested in

25 mins
Writing Professionally

The most important thing you do in your job is write. It's in every email you send, every commit you...

50 mins
Systems Thinking

Albert Einstein once said — “We cannot solve our problems with the same thinking we used when we created them.” As...

180 mins
Design Principles for the Effective Developer

How many design patterns and principles can you name as developer? Are they important? Should we not rather focus on...

50 mins
Do You Know Da Wae

We build development teams based on individual ability to write code but development of a software project of any significance...

50 mins
All That Glitters Ain't Gold

Let’s use Kafka, everywhere! Let’s try event driven architecture! How about Rust for this service? Let’s use Elixir for this!...

50 mins
Definition of Ready & Done - A Guide to Achieving Predictability

Delivering software often takes longer than we anticipate. Why is that? Part of the reason is not understanding the nuances...